U.S. patent application number 10/538410 was filed with the patent office on 2006-08-17 for protein forming complex with c-fos protein, nucleic acid encoding the same and method of using the same.
Invention is credited to Masamichi Ishizaka, Etsuko Miyamoto, Hiroshi Yanagawa.
Application Number | 20060183885 10/538410 |
Document ID | / |
Family ID | 32500970 |
Filed Date | 2006-08-17 |
United States Patent
Application |
20060183885 |
Kind Code |
A1 |
Miyamoto; Etsuko ; et
al. |
August 17, 2006 |
Protein forming complex with c-fos protein, nucleic acid encoding
the same and method of using the same
Abstract
Proteins that interact with c-Fos, nucleic acids encoding them
and inhibitors utilizing them as well as methods for detecting an
interaction and screening methods utilizing a protein that
interacts with c-Fos are provided. Comprehensive analysis of
transcription control factor complexes in a mouse brain cDNA
library with c-Fos as a bait by using the cotranslation selection
and screening of in vitro virus (IVV) and the C-terminal labeling
method are conducted, thereby to analyze proteins unknown so far,
proteins known so far, but unknown to form a complex with the c-Fos
protein, and so forth.
Inventors: |
Miyamoto; Etsuko;
(Yokohama-shi, JP) ; Ishizaka; Masamichi;
(Yokohama-shi, JP) ; Yanagawa; Hiroshi;
(Yokohama-shi, JP) |
Correspondence
Address: |
WENDEROTH, LIND & PONACK, L.L.P.
2033 K STREET N. W.
SUITE 800
WASHINGTON
DC
20006-1021
US
|
Family ID: |
32500970 |
Appl. No.: |
10/538410 |
Filed: |
November 19, 2003 |
PCT Filed: |
November 19, 2003 |
PCT NO: |
PCT/JP03/14749 |
371 Date: |
February 15, 2006 |
Current U.S.
Class: |
530/350 ;
435/320.1; 435/325; 435/7.1; 506/14; 536/23.5 |
Current CPC
Class: |
G01N 2500/04 20130101;
C07K 14/47 20130101; C12N 15/1055 20130101 |
Class at
Publication: |
530/350 ;
435/006; 435/007.1; 435/320.1; 435/325; 536/023.5 |
International
Class: |
C07K 14/705 20060101
C07K014/705; C12Q 1/68 20060101 C12Q001/68; G01N 33/53 20060101
G01N033/53; C07H 21/04 20060101 C07H021/04 |
Foreign Application Data
Date |
Code |
Application Number |
Dec 11, 2002 |
JP |
2002-360046 |
Claims
1. A protein of the following (a) or (b): (a) a protein comprising
any one of the amino acid sequences of SEQ ID NOS: 1 to 14, SEQ ID
NOS: 15 to 19, SEQ ID NOS: 20 to 22, SEQ ID NOS: 47 to 56, SEQ ID
NOS: 57 to 76, SEQ ID NOS: 77 to 81, SEQ ID NOS: 82 to 84, SEQ ID
NOS: 85 to 86, SEQ ID NOS: 87 to 89, SEQ ID NOS: 90 to 91, SEQ ID
NOS: 92 to 93, SEQ ID NOS: 94 to 95, SEQ ID NOS: 96 to 97, SEQ ID
NOS: 98 to 99, SEQ ID NOS: 100 to 101, SEQ ID NO: 102 and SEQ ID
NO: 103, (b) a protein that comprises any one of the amino acid
sequences of SEQ ID NOS:1 to 14, SEQ ID NOS: 15 to 19, SEQ ID NOS:
20 to 22, SEQ ID NOS: 47 to 56, SEQ ID NOS: 57 to 76, SEQ ID NOS:
77 to 81, SEQ ID NOS: 82 to 84, SEQ ID NOS: 85 to 86, SEQ ID NOS:
87 to 89, SEQ ID NOS: 90 to 91, SEQ ID NOS: 92 to 93, SEQ ID NOS:
94 to 95, SEQ ID NOS: 96 to 97, SEQ ID NOS: 98 to 99, SEQ ID NOS:
100 to 101, SEQ ID NO: 102 and SEQ ID NO: 103, including deletion,
substitution or addition of one or several amino acid residues and
interacts with a c-Fos protein.
2. A protein according to claim 1, which comprises any one of the
amino acid sequences of SEQ ID NOS: 1 to 14, SEQ ID NOS: 15 to 19,
SEQ ID NOS: 20 to 22, SEQ ID NOS: 47 to 56, SEQ ID NOS: 57 to 76,
SEQ ID NOS: 77 to 81, SEQ ID NOS: 82 to 84, SEQ ID NOS: 85 to 86,
SEQ ID NOS: 87 to 89, SEQ ID NOS: 90 to 91, SEQ ID NOS: 92 to 93,
SEQ ID NOS: 94 to 95, SEQ ID NOS: 96 to 97, SEQ ID NOS: 98 to 99,
SEQ ID NOS: 100 to 101, SEQ ID NO: 102 and SEQ ID NO: 103.
3. A nucleic acid encoding the protein according to claim 1.
4. A nucleic acid of the following (a) or (b): (a) a nucleic acid
comprising any one of the nucleotide sequences of SEQ ID NOS: 23 to
38, SEQ ID NOS: 39 to 43, SEQ ID NOS: 44 to 46, SEQ ID NOS: 104 to
118, SEQ ID NOS: 119 to 140, SEQ ID NOS: 141 to 145, SEQ ID NOS:
146 to 148, SEQ ID NOS: 149 to 150, SEQ ID NOS: 151 to 153, SEQ ID
NOS: 154 to 155, SEQ ID NOS: 156 to 157, SEQ ID NOS: 158 to 159,
SEQ ID NOS: 160 to 161, SEQ ID NOS: 162 to 163, SEQ ID NOS: 164 to
165, SEQ ID NO: 166, and SEQ ID NO: 167, (b) a nucleic acid that
hybridizes with a nucleic acid comprising any one of the nucleotide
sequences of SEQ ID NOS: 23 to 38, SEQ ID NOS: 39 to 43, SEQ ID
NOS: 44 to 46, SEQ ID NOS: 104 to 118, SEQ ID NOS: 119 to 140, SEQ
ID NOS: 141 to 145, SEQ ID NOS: 146 to 148, SEQ ID NOS: 149 to 150,
SEQ ID NOS: 151 to 153, SEQ ID NOS: 154 to 155, SEQ ID NOS: 156 to
157, SEQ ID NOS: 158 to 159, SEQ ID NOS: 160 to 161, SEQ ID NOS:
162 to 163, SEQ ID NOS: 164 to 165, SEQ ID NO: 166, and SEQ ID NO:
167, under a stringent condition and encodes a protein that
interacts with a c-Fos protein.
5. A nucleic acid according to claim 4, which comprises any one of
the nucleotide sequences of SEQ ID NOS: 23 to 38, SEQ ID NOS: 39 to
43, SEQ ID NOS: 44 to 46, SEQ ID NOS: 104 to 118, SEQ ID NOS: 119
to 140, SEQ ID NOS: 141 to 145, SEQ ID NOS: 146 to 148, SEQ ID NOS:
149 to 150, SEQ ID NOS: 151 to 153, SEQ ID NOS: 154 to 155, SEQ ID
NOS: 156 to 157, SEQ ID NOS: 158 to 159, SEQ ID NOS: 160 to 161,
SEQ ID NOS: 162 to 163, SEQ ID NOS: 164 to 165, SEQ ID NO: 166, and
SEQ ID NO: 167.
6. An inhibitor for an interaction between a protein that interacts
with a c-Fos protein and the c-Fos protein, which comprises the
protein according to claim 1 or 2 or a protein translated from the
nucleic acid according to any one of claims 3 to 5 or 133 as an
active ingredient.
7. A method for detecting an interaction between a bait and a prey,
which comprises bringing the bait and the prey into contact and
detecting a complex formed by the contact, wherein the bait is the
protein according to claim 1 or 2 or a protein translated from the
nucleic acid according to any one of claims 3 to 5 or 133.
8. A method for screening for a prey that interacts with a bait,
which comprises the step of detecting an interaction between the
bait and a prey by the method according to claim 7 and the step of
selecting a prey for which an interaction was detected.
9-132. (canceled)
133. A nucleic acid encoding the protein according to claim 2.
Description
TECHNICAL FIELD
[0001] The present invention relates to proteins that interact with
c-Fos, nucleic acids encoding them and inhibitors utilizing them as
well as methods for detecting an interaction and screening methods
utilizing a protein that interacts with c-Fos.
[0002] At present, genomic nucleotide sequences of various
organisms are going to be elucidated. In researches of genomic
sequences, there are expected, as post-sequencing researches of the
second act, researches of analyzing meanings of the elucidated
genomic information, i.e., structural and functional analyses of
genes and proteins (Non-patent documents 1 and 2), analyses of
protein/protein and protein/nucleic acid interactions (Non-patent
documents 3 and 4) and so forth.
[0003] On the basis of analyses of networks of interactions between
protein and protein, protein and nucleic acid and so forth in
post-sequencing genomic functional analyses utilizing such
techniques as described above, it is expected to create drugs and
so forth standing on discoveries of novel functions of known
proteins or important biological enzymes such as novel proteins
that have been unknown so far.
[0004] As methods for detecting interactions between proteins, the
immunoprecipitation (Non-patent document 5), pull-down assay based
on a GST fusion protein (Non-patent document 6), TAP method
(Non-patent document 7), yeast two-hybrid method (Non-patent
document 8) and so forth are known so far. Further, as methods for
comprehensive analysis of interactions between proteins in the
post-sequencing genomic functional analyses utilizing the
"assignment of gene (genotype) and protein (phenotype)" born as a
tool of the evolutionary molecular engineering, there are the in
vitro virus method (Non-patent documents 9 and 10, Patent documents
1 and 2), STABLE method (Non-patent document 11), phage display
method (Non-patent document 12), ribosome display method
(Non-patent document 13, Patent document 3), mRNA-peptide fusion
method (mRNA display method, Non-patent document 14), and so
forth.
[0005] Furthermore, the surface plasmon resonance method,
fluorescence resonance energy transfer method, fluorescence
depolarization method, evanescent-field imaging method,
fluorescence correlation spectroscopy, fluorescent imaging method,
enzyme linked immunosorbent assay and so forth are also known.
Moreover, methods of modifying C-terminals of proteins in a
translation system utilizing a nucleic acid derivatives such as
puromycin have been previously proposed (Patent documents 4 and 5).
These methods have advantages that functions of proteins are more
unlikely to be damaged compared with the conventional chemical
modification methods and the fluorescent protein fusion
methods.
[0006] In the field of life science, sequence analysis of the human
genome was completed, and genome researches rush into functional
analyses of genes of the post-genome age. Thus, innovative drug
creation based on comprehensive genomic functional analyses and so
forth are expected. There is desired a technique that enables
comprehensive analysis of genes and proteins, which have been
independently studied so far, for example, a technique of analyzing
various cofactors of transcription control factors as target
proteins of drug creation at once and so forth. As a transcription
control factor, the c-Fos protein is known well.
[0007] The v-fos gene was isolated as an oncogene of an FBJ murine
osteosarcoma virus (Non-patent document 15). c-fos is a
transcription control factor detected in many cell species as a
typical immediate early gene in connection with a proliferation
stimulus. fra-1 and fra-2 were cloned from the Fos-related antigens
(Fra), and fosB was also found as a gene having homology to c-fos
in the nucleotide sequence. These constitute the fos family genes
together with c-fos. It is known that a chimeric mouse and
transgenic mouse expressing c-fos at a high level form chondroma
and bone sarcoma, respectively (Non-patent document 16).
[0008] Various genes such as c-jun, junB and junD, which are jun
family genes, are known so far as genes of proteins that interact
with c-fos (Non-patent document 17), and it was recently found by
the two-hybrid method that the transcription control factor Fos/Jun
(AP-1) forms a complex with BAF60a of SWI/SNF to induce remodeling
of chromatin. Further, it was found that AP-1 binds to NFAT, which
is a protein of the cerebral nerve system, to control expression of
the IL2 gene. The former is involved in oncogenesis and
canceration, and the latter consists of two proteins that are
involved in autoimmune diseases and Alzheimer's disease and induce
completely different diseases. Thus, comprehensive analyses of
various complexes of transcription control factors are very
interesting as a new treasury of target proteins for drug creation.
However, such a thorough 1:1 molecule analysis technique as the
two-hybrid method takes enormous time and labor.
<Non-Patent Document 1>
[0009] Saegusa A., Nature, 401, 6751 (1999) <Non-Patent Document
2> [0010] Dalton R, Abbott A., Nature, 402, 6763 (1999)
<Non-Patent Document 3> [0011] Etsuko Miyamoto, Hiroshi
Yanagawa, Series Genome Science of Post-sequencing 3, Proteomics,
pp. 136-145 (2000) <Non-Patent Document 4> [0012] Etsuko
Miyamoto, Hiroshi Yanagawa, PROTEIN, NUCLEIC ACID AND ENZYME,
46(2), pp. 138-147 (2001) <Non-Patent Document 5> [0013]
Xiong et al., Nature, 366, 701-704 (1993) <Non-Patent Document
6> [0014] Kaelin, et al., Cell, 64, 521-532 (1991)
<Non-Patent Document 7> [0015] Guillaume Rigaut, et al.,
Nature Biotechnology, 17, 1030 (1999) <Non-Patent Document 8>
[0016] Fields S., Song O., Nature, 340, 245 (1989) <Non-Patent
Document 9> [0017] Miyamoto-Sato E., et al., Viva Origino, 25,
35 (1997) <Non-Patent Document 10> [0018] Nemoto N., et al.,
FEBS Lett., 414, 405 (1997) <Patent Document 1> [0019]
International Publication WO98/16636 <Patent Document 2>
[0020] International Publication WO02/46395 <Non-Patent Document
11> [0021] Doi N., Yanagawa H., FEBS Lett., 457, 227 (1999)
<Non-Patent Document 12> [0022] Smith G. P., Science, 228,
1315 (1985) <Non-Patent Document 13> [0023] Mattheakis, L. C.
et al., Proc. Natl. Acad. Sci. USA, 91, 9022-9026 (1994) <Patent
Document 3> [0024] International Publication WO95/11922
<Non-Patent Document 14> [0025] Roberts R. W. and Szostak J.
W., Proc. Natl. Acad. Sci. USA, 94, 12297 (1997) <Patent
Document 4> [0026] U.S. Pat. No. 6,228,994 <Patent Document
5> [0027] International Publication WO02/48347 <Non-Patent
Document 15> [0028] Curran, T. et al., J. Viol., 44:674-682
(1982) <Non-Patent Document 16> [0029] Agamemunon, E. G. et
al., Trends Genet., 11:436-441 (1995) <Non-Patent Document
17> [0030] Yurii Chinenovl and Tom K Kerppola, Oncogene, 20,
2438-2452 (2001)
DISCLOSURE OF THE INVENTION
[0031] An object of the present invention is to provide a complex
that interacts with the c-Fos protein, which is well known as a
transcription control factor, as a target protein.
[0032] The inventors of the present invention conducted
comprehensive analysis of transcription control factor complexes in
a mouse brain cDNA library with c-Fos as a bait by using two of
techniques, the cotranslation selection and screening of in vitro
virus (IVV) and the C-terminal labeling method (U.S. Pat. No.
6,228,994, WO02/48347) named puromycin technologies, which have
been researched on the basis of the aforementioned in vitro virus
method as a method for comprehensive analysis as a one-to-multiple
molecule-analysis method replacing the method, and thereby
attempted to analyze proteins unknown so far, proteins known so
far, but unknown to form a complex with the c-Fos protein, and so
forth. The expression of "a protein that form a complex" used
herein refers to a protein that directly or indirectly interacts
with the c-Fos protein.
[0033] Further objects of the present invention is to provide a
protein that interacts with c-Fos and an inhibitor utilizing it, as
well as a method for detecting an interaction and method for
screening utilizing a protein that interacts with c-Fos.
[0034] The inventors of the present invention found novel proteins
that interact with c-Fos by the cotranslation screening, also found
that certain known proteins interacted with c-Fos, and accomplished
the present invention. The present invention thus provides the
followings.
[0035] 1. A protein of the following (a) or (b):
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 1 to 14,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 1 to 14 including deletion, substitution or addition of
one or several amino acid residues and interacts with a c-Fos
protein.
[0036] 2. A protein according to 1, which comprises any one of the
amino acid sequences of SEQ ID NOS: 1 to 14.
[0037] 3. A nucleic acid encoding the protein according to 1 or
2.
[0038] 4. A nucleic acid of the following (a) or (b):
(a) a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 23 to 38,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 23 to 38 under a
stringent condition and encodes a protein that interacts with a
c-Fos protein.
[0039] 5. A nucleic acid according to 4, which comprises any one of
the nucleotide sequences of SEQ ID NOS: 23 to 38.
[0040] 6. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises the protein according to 1 or 2 or a protein translated
from the nucleic acid according to any one of 3 to 5 as an active
ingredient.
[0041] 7. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
the protein according to 1 or 2 or a protein translated from the
nucleic acid according to any one of 3 to 5.
[0042] 8. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to 7 and the step of
selecting a prey for which an interaction was detected.
[0043] 9. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 15 to 19,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 15 to 19 including deletion, substitution or addition
of one or several amino acid residues and interacts with a c-Fos
protein.
[0044] 10. The inhibitor according to 9, wherein the protein as the
active ingredient comprises any one of the amino acid sequences of
SEQ ID NOS: 15 to 19.
[0045] 11. The inhibitor according to 9, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 39 to 43,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 39 to 43 under a
stringent condition and encodes a protein that interacts with the
c-Fos protein.
[0046] 12. The inhibitor according to 11, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 39 to
43.
[0047] 13. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 15 to 19,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 15 to 19 including deletion, substitution or addition
of one or several amino acid residues and interacts with a c-Fos
protein,
(a') a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 39 to 43,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 39 to 43 under a
stringent condition and encodes a protein that interacts with a
c-Fos protein.
[0048] 14. The method according to 13, wherein the protein
comprises any one of the amino acid sequences of SEQ ID NOS: 15 to
19.
[0049] 15. The method according to 13, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 39 to
43.
[0050] 16. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 13 to 15
and the step of selecting a prey for which an interaction is
detected.
[0051] 17. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 20 to 22,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 20 to 22 including deletion, substitution or addition
of one or several amino acid residues and interacts with the c-Fos
protein.
[0052] 18. The inhibitor according to 17, wherein the protein as
the active ingredient comprises any one of the amino acid sequences
of SEQ ID NOS: 20 to 22.
[0053] 19. The inhibitor according to 17, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 44 to 46,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 44 to 46 under a
stringent condition and encodes a protein that interacts with the
c-Fos protein.
[0054] 20. The inhibitor according to 19, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 44 to
46.
[0055] 21. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 20 to 22,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 20 to 22 including deletion, substitution or addition
of one or several amino acid residues and interacts with a c-Fos
protein,
(a') a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 44 to 46,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 44 to 46 under a
stringent condition and encodes a protein that interacts with a
c-Fos protein.
[0056] 22. The method according to 21, wherein the protein
comprises any one of the amino acid sequences of SEQ ID NOS: 20 to
22.
[0057] 23. The method according to 21, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 44 to
46.
[0058] 24. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 21 to 23
and the step of selecting a prey for which an interaction is
detected.
[0059] 25. A protein of the following (a) or (b):
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 47 to 56,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 47 to 56 including deletion, substitution or addition
of one or several amino acid residues and interacts with a c-Fos
protein.
[0060] 26. A protein according to 25, which comprises any one of
the amino acid sequences of SEQ ID NOS: 47 to 56.
[0061] 27. A nucleic acid encoding the protein according to 25 or
26.
[0062] 28. A nucleic acid of the following (a) or (b):
(a) a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 104 to 118,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 104 to 118 under
a stringent condition and encodes a protein that interacts with a
c-Fos protein.
[0063] 29. A nucleic acid according to 28, which comprises any one
of the nucleotide sequences of SEQ ID NOS: 104 to 118.
[0064] 30. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises the protein according to 25 or 26 or a protein translated
from the nucleic acid according to any one of 27 to 29 as an active
ingredient.
[0065] 31. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
the protein according to 25 or 26 or a protein translated from the
nucleic acid according to any one of 27 to 29 as an active
ingredient.
[0066] 32. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to 31 and the step of
selecting a prey for which an interaction was detected.
[0067] 33. A protein of the following (a) or (b):
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 57 to 76,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 57 to 76 including deletion, substitution or addition
of one or several amino acid residues and interacts with a c-Fos
protein.
[0068] 34. A protein according to 33, which comprises any one of
the amino acid sequences of SEQ ID NOS: 57 to 76.
[0069] 35. A nucleic acid encoding the protein according to 33 or
34.
[0070] 36. A nucleic acid of the following (a) or (b):
(a) a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 119 to 140,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 119 to 140 under
a stringent condition and encodes a protein that interacts with a
c-Fos protein.
[0071] 37. A nucleic acid according to 4, which comprises any one
of the nucleotide sequences of SEQ ID NOS: 119 to 140.
[0072] 38. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises the protein according to 33 or 34 or a protein translated
from the nucleic acid according to any one of 35 to 37 as an active
ingredient.
[0073] 39. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
the protein according to 33 or 34 or a protein translated from the
nucleic acid according to any one of 35 to 37 as an active
ingredient.
[0074] 40. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to 39 and the step of
selecting a prey for which an interaction was detected.
[0075] 41. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 77 to 81,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 77 to 81 including deletion, substitution or addition
of one or several amino acid residues and interacts with the c-Fos
protein.
[0076] 42. The inhibitor according to 41, wherein the protein as
the active ingredient comprises any one of the amino acid sequences
of SEQ ID NOS: 77 to 81.
[0077] 43. The inhibitor according to 41, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 141 to 145,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 141 to 145 under
a stringent condition and encodes a protein that interacts with the
c-Fos protein.
[0078] 44. The inhibitor according to 43, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 141 to
145.
[0079] 45. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 77 to 81,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 77 to 81 including deletion, substitution or addition
of one or several amino acid residues and interacts with a c-Fos
protein,
(a') a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 141 to 145,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 141 to 145 under
a stringent condition and encodes a protein that interacts with a
c-Fos protein.
[0080] 46. The method according to 45, wherein the protein
comprises any one of the amino acid sequences of SEQ ID NOS: 77 to
81.
[0081] 47. The method according to 45, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 141 to
145.
[0082] 48. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 45 to 47
and the step of selecting a prey for which an interaction is
detected.
[0083] 49. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 82 to 84,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 82 to 84 including deletion, substitution or addition
of one or several amino acid residues and interacts with the c-Fos
protein.
[0084] 50. The inhibitor according to 49, wherein the protein as
the active ingredient comprises any one of the amino acid sequences
of SEQ ID NOS: 82 to 84.
[0085] 51. The inhibitor according to 49, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 146 to 148,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 146 to 148 under
a stringent condition and encodes a protein that interacts with the
c-Fos protein.
[0086] 52. The inhibitor according to 51, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 146 to
148.
[0087] 53. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 82 to 84,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 82 to 84 including deletion, substitution or addition
of one or several amino acid residues and interacts with a c-Fos
protein,
(a') a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 146 to 148,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 146 to 148 under
a stringent condition and encodes a protein that interacts with a
c-Fos protein.
[0088] 54. The method according to 53, wherein the protein
comprises any one of the amino acid sequences of SEQ ID NOS: 82 to
84.
[0089] 55. The method according to 53, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 146 to
148.
[0090] 56. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 53 to 55
and the step of selecting a prey for which an interaction is
detected.
[0091] 57. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising the amino acid sequence of SEQ ID NO: 85
or 86,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
85 or 86 including deletion, substitution or addition of one or
several amino acid residues and interacts with the c-Fos
protein.
[0092] 58. The inhibitor according to 57, wherein the protein as
the active ingredient comprises the amino acid sequence of SEQ ID
NO: 85 or 86.
[0093] 59. The inhibitor according to 57, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
149 or 150,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 149 or 150 under a stringent
condition and encodes a protein that interacts with the c-Fos
protein.
[0094] 60. The inhibitor according to 59, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 149 or 150.
[0095] 61. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising the amino acid sequence of SEQ ID NO: 85
or 86,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
85 or 86 including deletion, substitution or addition of one or
several amino acid residues and interacts with a c-Fos protein,
(a') a nucleic acid comprising the nucleotide sequence of SEQ ID
NO: 149 or 150,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 149 or 150 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0096] 62. The method according to 61, wherein the protein
comprises the amino acid sequence of SEQ ID NO: 85 or 86.
[0097] 63. The method according to 61, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 149 or 150.
[0098] 64. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 61 to 63
and the step of selecting a prey for which an interaction is
detected.
[0099] 65. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 87 to 89,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 87 to 89 including deletion, substitution or addition
of one or several amino acid residues and interacts with the c-Fos
protein.
[0100] 66. The inhibitor according to 65, wherein the protein as
the active ingredient comprises any one of the amino acid sequences
of SEQ ID NOS: 87 to 89.
[0101] 67. The inhibitor according to 65, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 151 to 153,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 151 to 153 under
a stringent condition and encodes a protein that interacts with the
c-Fos protein.
[0102] 68. The inhibitor according to 67, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 151 to
153.
[0103] 69. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising any one of the amino acid sequences of SEQ
ID NOS: 87 to 89,
(b) a protein that comprises any one of the amino acid sequences of
SEQ ID NOS: 87 to 89 including deletion, substitution or addition
of one or several amino acid residues and interacts with a c-Fos
protein,
(a') a nucleic acid comprising any one of the nucleotide sequences
of SEQ ID NOS: 151 to 153,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
any one of the nucleotide sequences of SEQ ID NOS: 151 to 153 under
a stringent condition and encodes a protein that interacts with a
c-Fos protein.
[0104] 70. The method according to 69, wherein the protein
comprises any one of the amino acid sequences of SEQ ID NOS: 87 to
89.
[0105] 71. The method according to 70, wherein the nucleic acid
comprises any one of the nucleotide sequences of SEQ ID NOS: 151 to
153.
[0106] 72. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 69 to 71
and the step of selecting a prey for which an interaction is
detected.
[0107] 73. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising the amino acid sequence of SEQ ID NO: 90
or 91,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
90 or 91 including deletion, substitution or addition of one or
several amino acid residues and interacts with the c-Fos
protein.
[0108] 74. The inhibitor according to 73, wherein the protein as
the active ingredient comprises the amino acid sequence of SEQ ID
NO: 90 or 91.
[0109] 75. The inhibitor according to 74, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
154 or 155,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 154 or 155 under a stringent
condition and encodes a protein that interacts with the c-Fos
protein.
[0110] 76. The inhibitor according to 75, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 154 or 155.
[0111] 77. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising the amino acid sequence of SEQ ID NO: 90
or 91,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
90 or 91 including deletion, substitution or addition of one or
several amino acid residues and interacts with a c-Fos protein,
(a') a nucleic acid comprising the nucleotide sequence of SEQ ID
NO: 154 or 155,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 154 or 155 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0112] 78. The method according to 69, wherein the protein
comprises the amino acid sequence of SEQ ID NO: 90 or 91.
[0113] 79. The method according to 70, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 154 or 155.
[0114] 80. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 77 to 79
and the step of selecting a prey for which an interaction is
detected.
[0115] 81. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising the amino acid sequence of SEQ ID NO: 92
or 93,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
92 or 93 including deletion, substitution or addition of one or
several amino acid residues and interacts with the c-Fos
protein.
[0116] 82. The inhibitor according to 81, wherein the protein as
the active ingredient comprises the amino acid sequence of SEQ ID
NO: 92 or 93.
[0117] 83. The inhibitor according to 82, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
156 or 157,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 156 or 157 under a stringent
condition and encodes a protein that interacts with the c-Fos
protein.
[0118] 84. The inhibitor according to 83, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 156 or 157.
[0119] 85. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising the amino acid sequence of SEQ ID NO: 92
or 93,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
92 or 93 including deletion, substitution or addition of one or
several amino acid residues and interacts with a c-Fos protein,
(a') a nucleic acid comprising the nucleotide sequence of SEQ ID
NO: 156 or 157,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 156 or 157 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0120] 86. The method according to 85, wherein the protein
comprises the amino acid sequence of SEQ ID NO: 92 or 93.
[0121] 87. The method according to 85, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 156 or 157.
[0122] 88. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 85 to 87
and the step of selecting a prey for which an interaction is
detected.
[0123] 89. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising the amino acid sequence of SEQ ID NO: 94
or 95,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
94 or 95 including deletion, substitution or addition of one or
several amino acid residues and interacts with the c-Fos
protein.
[0124] 90. The inhibitor according to 89, wherein the protein as
the active ingredient comprises the amino acid sequence of SEQ ID
NO: 94 or 95.
[0125] 91. The inhibitor according to 90, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
158 or 159,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 158 or 159 under a stringent
condition and encodes a protein that interacts with the c-Fos
protein.
[0126] 92. The inhibitor according to 83, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 158 or 159.
[0127] 93. A method for detecting an interaction between a bait and
a prey, which comprises bringing the bait and the prey into contact
and detecting a complex formed by the contact, wherein the bait is
a protein of the following (a) or (b) or a protein translated from
a nucleic acid of the following (a') or (b'):
(a) a protein comprising the amino acid sequence of SEQ ID NO: 94
or 95,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
94 or 95 including deletion, substitution or addition of one or
several amino acid residues and interacts with a c-Fos protein,
(a') a nucleic acid comprising the nucleotide sequence of SEQ ID
NO: 158 or 159,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 158 or 159 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0128] 94. The method according to 93, wherein the protein
comprises the amino acid sequence of SEQ ID NO: 94 or 95.
[0129] 95. The method according to 93, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 158 or 159.
[0130] 96. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 93 to 95
and the step of selecting a prey for which an interaction is
detected.
[0131] 97. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising the amino acid sequence of SEQ ID NO: 96
or 97,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
96 or 97 including deletion, substitution or addition of one or
several amino acid residues and interacts with the c-Fos
protein.
[0132] 98. The inhibitor according to 97, wherein the protein as
the active ingredient comprises the amino acid sequence of SEQ ID
NO: 96 or 97.
[0133] 99. The inhibitor according to 98, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
160 or 161,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 160 or 161 under a stringent
condition and encodes a protein that interacts with the c-Fos
protein.
[0134] 100. The inhibitor according to 99, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 160 or 161.
[0135] 101. A method for detecting an interaction between a bait
and a prey, which comprises bringing the bait and the prey into
contact and detecting a complex formed by the contact, wherein the
bait is a protein of the following (a) or (b) or a protein
translated from a nucleic acid of the following (a') or (b'):
(a) a protein comprising the amino acid sequence of SEQ ID NO: 96
or 97,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
96 or 97 including deletion, substitution or addition of one or
several amino acid residues and interacts with a c-Fos protein,
(a') a nucleic acid comprising the nucleotide sequence of SEQ ID
NO: 160 or 161,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 160 or 161 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0136] 102. The method according to 101, wherein the protein
comprises the amino acid sequence of SEQ ID NO: 96 or 97.
[0137] 103. The method according to 101, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 160 or 161.
[0138] 104. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 101 to
103 and the step of selecting a prey for which an interaction is
detected.
[0139] 105. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising the amino acid sequence of SEQ ID NO: 98
or 99,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
98 or 99 including deletion, substitution or addition of one or
several amino acid residues and interacts with the c-Fos
protein.
[0140] 106. The inhibitor according to 105, wherein the protein as
the active ingredient comprises the amino acid sequence of SEQ ID
NO: 98 or 99.
[0141] 107. The inhibitor according to 98, wherein the protein is a
protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
162 or 163,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 162 or 163 under a stringent
condition and encodes a protein that interacts with the c-Fos
protein.
[0142] 108. The inhibitor according to 107, wherein the nucleic
acid comprises the nucleotide sequence of SEQ ID NO: 162 or
163.
[0143] 109. A method for detecting an interaction between a bait
and a prey, which comprises bringing the bait and the prey into
contact and detecting a complex formed by the contact, wherein the
bait is a protein of the following (a) or (b) or a protein
translated from a nucleic acid of the following (a') or (b'):
(a) a protein comprising the amino acid sequence of SEQ ID NO: 98
or 99,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
98 or 99 including deletion, substitution or addition of one or
several amino acid residues and interacts with a c-Fos protein,
(a') a nucleic acid comprising the nucleotide sequence of SEQ ID
NO: 162 or 163,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 162 or 163 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0144] 110. The method according to 109, wherein the protein
comprises the amino acid sequence of SEQ ID NO: 98 or 99.
[0145] 111. The method according to 109, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 162 or 163.
[0146] 112. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 109 to
111 and the step of selecting a prey for which an interaction is
detected.
[0147] 113. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises a protein of the following (a) or (b) as an active
ingredient:
(a) a protein comprising the amino acid sequence of SEQ ID NO: 100
or 101,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
100 or 101 including deletion, substitution or addition of one or
several amino acid residues and interacts with the c-Fos
protein.
[0148] 114. The inhibitor according to 113, wherein the protein as
the active ingredient comprises the amino acid sequence of SEQ ID
NO: 100 or 101.
[0149] 115. The inhibitor according to 114, wherein the protein is
a protein translated from a nucleic acid of the following (a) or
(b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
164 or 165,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 164 or 165 under a stringent
condition and encodes a protein that interacts with the c-Fos
protein.
[0150] 116. The inhibitor according to 115, wherein the nucleic
acid comprises the nucleotide sequence of SEQ ID NO: 164 or
165.
[0151] 117. A method for detecting an interaction between a bait
and a prey, which comprises bringing the bait and the prey into
contact and detecting a complex formed by the contact, wherein the
bait is a protein of the following (a) or (b) or a protein
translated from a nucleic acid of the following (a') or (b'):
(a) a protein comprising the amino acid sequence of SEQ ID NO: 100
or 101,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
100 or 101 including deletion, substitution or addition of one or
several amino acid residues and interacts with a c-Fos protein,
(a') a nucleic acid comprising the nucleotide sequence of SEQ ID
NO: 164 or 165,
(b') a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 164 or 165 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0152] 118. The method according to 117, wherein the protein
comprises the amino acid sequence of SEQ ID NO: 100 or 101.
[0153] 119. The method according to 117, wherein the nucleic acid
comprises the nucleotide sequence of SEQ ID NO: 164 or 165.
[0154] 120. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to any one of 117 to
119 and the step of selecting a prey for which an interaction is
detected.
[0155] 121. A protein of the following (a) or (b):
(a) a protein comprising the amino acid sequence of SEQ ID NO:
102,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
102 including deletion, substitution or addition of one or several
amino acid residues and interacts with a c-Fos protein.
[0156] 122. A nucleic acid encoding the protein according to
102.
[0157] 123. A nucleic acid of the following (a) or (b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
166,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 166 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0158] 124. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises the protein according to 121 or a protein translated from
the nucleic acid according to 122 or 123 as an active
ingredient.
[0159] 125. A method for detecting an interaction between a bait
and a prey, which comprises bringing the bait and the prey into
contact and detecting a complex formed by the contact, wherein the
bait is the protein according to 121 or a protein translated from
the nucleic acid according to 122 or 123 as an active
ingredient.
[0160] 126. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to 125 and the step of
selecting a prey for which an interaction was detected.
[0161] 127. A protein of the following (a) or (b):
(a) a protein comprising the amino acid sequence of SEQ ID NO:
103,
(b) a protein that comprises the amino acid sequence of SEQ ID NO:
103 including deletion, substitution or addition of one or several
amino acid residues and interacts with a c-Fos protein.
[0162] 128. A nucleic acid encoding the protein according to
127.
[0163] 129. A nucleic acid of the following (a) or (b):
(a) a nucleic acid comprising the nucleotide sequence of SEQ ID NO:
167,
(b) a nucleic acid that hybridizes with a nucleic acid comprising
the nucleotide sequence of SEQ ID NO: 167 under a stringent
condition and encodes a protein that interacts with a c-Fos
protein.
[0164] 130. An inhibitor for an interaction between a protein that
interacts with a c-Fos protein and the c-Fos protein, which
comprises the protein according to 127 or a protein translated from
the nucleic acid according to 128 or 129 as an active
ingredient.
[0165] 131. A method for detecting an interaction between a bait
and a prey, which comprises bringing the bait and the prey into
contact and detecting a complex formed by the contact, wherein the
bait is the protein according to 127 or a protein translated from
the nucleic acid according to 128 or 129 as an active
ingredient.
[0166] 132. A method for screening for a prey that interacts with a
bait, which comprises the step of detecting an interaction between
the bait and a prey by the method according to 131 and the step of
selecting a prey for which an interaction was detected.
BRIEF EXPLANATION OF THE DRAWINGS
[0167] FIGS. 1A and 1B collectively show information of amino acid
sequence, gene sequence and so forth of the protein of the present
invention. Each of numerals within the parentheses after the DNA
sequence numbers indicates the number of amino acid sequence
encoded by it. A number having a subnumber means that the DNA
encodes the same amino acid sequence, but has a different
nucleotide sequence. SEQ ID NOS: 1 to 22 in FIG. 1A: Example 1, SEQ
ID NOS: 47 to 76 in FIG. 1A and SEQ ID NOS: 77 to 103 in FIG. 1B:
Example 2.
[0168] FIG. 2 shows the outline of the cotranslation screening
method using an IVV random library, which is a method for detecting
the proteins and genes of the present invention and the nucleotide
sequences thereof. An IVV random library of mouse brain and c-Fos
as a bait were used to carry out cell-free cotranslation screening,
and the library after the screening was amplified by RT-PCR and
then subjected to the cell-free cotranslation screening again with
the bait. This procedure was repeated 3 times to detect the
proteins and genes of the present invention and the nucleotide
sequences thereof.
[0169] FIG. 3 shows the random priming library of IVV used for
detection of the proteins and genes of the present invention and
the nucleotide sequences thereof, and the outline of the method for
producing it. Using an RNA library as a template and a random
primer including a random sequence consisting of nine nucleotides
and a specific sequence (tag2 sequence) in the random priming
method, a library of single strand cDNAs (ssDNA) complementary to
mRNAs is synthesized by reverse transcription (I). Only RNAs are
decomposed in the double strands of cDNA and RNA with RNaseH, at
the same time, DNAs complementary to cDNAs are synthesized with DNA
polymerase I, and further a nick existing between the DNAs
synthesized with DNA polymerase I is modified by using a DNA ligase
to synthesized a double strand (dsDNA) library (II). The
synthesized double-stranded cDNAs have a phosphate group at the 5'
end only on the side synthesized with the DNA polymerase I, and
therefore this is utilized to ligate an adaptor having a specific
sequence (5' UTR=promoter+enhancer) using a DNA ligase to
synthesize a ligated dsDNA library (III). PCR is carried out by
utilizing the adaptor and the specific sequence of the random
primer to prepare a cDNA library of assigning molecules having the
sequences of a promoter and an enhancer on the 5' end side and an A
tail on the 3' end side (IVV cDNA library) (IV). Then, the IVV cDNA
library is transcribed to form an IVV RNA library (V), a spacer for
preparing IVV is ligated (VI), and further it is translated in a
cell-free translation system or the like to form a library of
assigning molecules (VII).
[0170] FIG. 4 (electrophoretic photographs) shows the result 1 of
the verification of the interactions of the proteins and genes of
the present invention and the nucleotide sequences thereof.
[0171] A: Proteins having the amino acid sequences of SEQ ID NOS: 2
(Fip-cx), 16 (Eef1dTEF-1), and 22 (Schip1) were confirmed by the
C-terminal labeling method in a wheat cell-free translation system.
Lane 1 to 4: c-Jun protein, proteins of SEQ ID NO: 2 (Fip-cx), SEQ
ID NO: 16 (Eef1dTEF-1), and SEQ ID NO: 22 (Schip1).
[0172] B: IVVs having the amino acid sequences of SEQ ID NOS: 2
(Fip-cx), 16 (Eef1dTEF-1), and 22 (Schip1) were confirmed in a
wheat cell-free translation system. Lanes 1 and 2: mRNA and IVV, I
to IV: c-Jun, SEQ ID NO: 2 (Fip-cx), SEQ ID NO: 16 (Eef1dTEF-1),
and SEQ ID NO: 22 (Schip1).
[0173] C: Interactions were confirmed by the pull-down method
utilizing IVVs having the amino acid sequences of SEQ ID NOS: 2
(Fip-cx), 16 (Eef1dTEF-1), and 22 (Schip1). Lanes 1 to 3: IVV,
supernatant, and beads. a and b: with and without the bait c-Fos. I
to IV: c-Jun, SEQ ID NO: 2 (Fip-cx), SEQ ID NO: 16 (Eef1dTEF-1),
and SEQ ID NO: 22 (Schip1).
[0174] FIG. 5 (electrophoretic photographs) shows the result 2 of
the verification of the interactions of the proteins and genes of
the present invention and the nucleotide sequences thereof.
[0175] A: It was confirmed by the C-terminal labeling method in a
wheat cell-free translation system that the proteins of SEQ ID NOS:
48 (Fip-cx.1), 75 (Fip-cx.2), 78 (Optn), 84 (Snapc5), 86
(C130020M04Rik), 88 (FLJ32000), 91 (Rit2), 93 (cytochrome b), 95
(Apoe), 97 (betaAPP), 99 (Hsp40), 101 (Fip-c10), 102 (Fip-c4) and
103 (Fip-c18) (FIGS. 1A and 1B) were expressed form the nucleic
acid sequences of SEQ ID NOS: 105, 139, 142, 148, 150, 152, 155,
157, 159, 161, 163, 165, 166 and 167. Lanes 1 to 14: proteins of
SEQ ID NOS: 48 (Fip-cx.1), 75 (Fip-cx.2), 78 (Optn), 84 (Snapc5),
86 (C130020M04Rik), 88 (FLJ32000), 91 (Rit2), 93 (cytochrome b), 95
(Apoe), 97 (betaAPP), 99 (Hsp40), 101 (Fip-c10), 102 (Fip-c4) and
103 (Fip-c18).
[0176] B: As a verification experiment of the interactions of the
obtained proteins and c-Fos, direct interactions with c-Fos were
confirmed by pull-down using C-terminal labeled proteins having the
amino acid sequences of the proteins of SEQ ID NOS: 48 (Fip-cx.1)
and 75 (Fip-cx.2) based on the nucleic acid sequences of SEQ ID
NOS: 105 and 139. Lane 1: SEQ ID NO: 48 (Fip-cx.1), Lane 2: SEQ ID
NO: 75 (Fip-cx.2), a and b: with and without bait c-Fos (Lanes 1
and 2: translation product and eluted fraction).
[0177] FIG. 6 shows the result of verification of the concentration
rate and indirect interaction of the genes of the present
invention. In order to confirm concentrations of 4 kinds of
proteins of SEQ ID NOS: 78 (Optn), 84 (Snapc5), 86 (C130020M04Rik)
and 88 (FLJ32000), real-time PCR was performed by using the nucleic
acid sequences of SEQ ID NO: 142, 148, 150 and 152.
[0178] FIG. 7 shows outlines of the primary screening and secondary
screening in the analysis of interaction of IVV with substances or
proteins using the proteins and nucleic acid sequences of the
present invention. It is possible to detect interactions with the
substances and proteins in the primary screening using the proteins
and nucleic acid sequences of the present invention, and further
analyze the details of the interaction in the secondary screening
using FCCS, microarray or the like. Further, the proteins and
nucleic acid sequences of the present invention can also be
independently used for analysis of interaction with the substances
or proteins using FCCS, microarray, or the like as IVV or
C-terminal labeled proteins. Furthermore, it is also possible to
apply them to the evolutionary molecular engineering using IVV of
the proteins or nucleic acid sequences of the present invention,
and utilize them to create functional proteins in the primary
screening. In such a case, it is also possible to analyze details
of interactions of the created functional proteins with a
combination of the primary screening and secondary screening.
[0179] FIG. 8 shows configurations of a translation template (A) as
well as a coding molecule (B) and spacer molecule (C), which are
constituents of the template. The translation template consists of
a coding portion derived from the coding molecule and a spacer
portion derived from the spacer molecule. F1 and F2 represent a
fluorescent dye.
[0180] FIG. 9 shows configurations of a protein modified at the
C-terminal (C-terminal labeled protein) (A), translation template
of the present invention (B) and modification agent (C).
[0181] FIG. 10 shows outline of formation of a complex by cell-free
cotranslation.
[0182] A: Both of bait and prey are translated in a cell-free
translation system to interact with each other to form a complex in
the cell-free translation system. The prey may exist in a single
number (I) or multiple number (II), and it may be a polypeptide
itself obtainable by the translation in the cell-free translation
system, or an assigning molecule (bound substance).
[0183] B: In the presence of the bait, the prey is translated in a
cell-free translation system to interact with the bait and thereby
form a complex in the cell-free translation system. The prey may
exist in a single number (I) or multiple number (II), and it may be
a polypeptide itself obtainable by the translation in the cell-free
translation system, or an assigning molecule (bound substance).
[0184] FIG. 11 shows outline of formation of a complex by cell-free
cotranslation using a complexed bait.
[0185] Both of bait as a part constituting the complexed bait and
prey are translated in a cell-free translation system and interact
to form a complex in the cell-free translation system. The prey may
exist in a single number (I) or multiple number (II), and it may be
a polypeptide itself obtainable by the translation in the cell-free
translation system, or an assigning molecule (bound substance).
Further, the complexed bait is not limited to the combination of a
polypeptide translated in a cell-free translation system and DNA
bait shown in the drawing, and it may be, for example, a
combination of multiple or single polypeptide translated in a
cell-free translation system and multiple or single bait coexisting
in the cell-free translation system (e.g., DNA bait etc.), or the
like.
[0186] FIG. 12 shows outline of the method of screening for a
complex based on cell-free cotranslation.
[0187] By the step of forming a complex on the basis of cell-free
cotranslation as shown in FIGS. 10 and 11 (1), the step of
screening the prey of the complex (2), and the step of analyzing
the prey (3), the cell-free cotranslation and screening can be
realized totally in vitro. If the prey is an assigning molecule,
and it exists in a multiple number, the screening can be repeated
again from the step (1) by reconstructing mRNA or DNA encoding the
prey by RT-PCR or PCR. Further, after analyzing the obtained prey,
screening can be newly repeated from the step of (1) using the prey
as a bait.
BEST MODE FOR CARRYING OUT THE INVENTION
<1>Proteins of the Present Invention
[0188] In this specification, proteins found to interact with
c-Fos, including novel proteins, are called "the proteins of the
present invention" for convenience of explanation.
[0189] The first group of the proteins of the present invention
(SEQ ID NOS: 1 to 14 in FIG. 1A) consists of proteins both of which
function of forming a complex with c-Fos and amino acid sequences
are novel (Fos interacting protein chromosome X, Fip-cx). These
proteins are proteins characterized by showing homology to the
nucleotide sequence (275-829 bp) formed by frame shift (+1) of the
MAGE-necdin/trophinin complexes gene (AB032477) of the MAGE/necdin
homologous region contained in the existing genome sequence WGS
supercontig MmX (NW.sub.--042637) at nucleotide sequence level and
the amino acid sequence thereof (184aa), and they are different
from any of the proteins known so far to form a complex with c-Fos
(Yurii Chinenovl and Tom K Kerppola, Oncogene, 20, 2438-2452
(2001)). Further, the MAGE-necdin/trophinin complex is known to be
a tumor-related gene existing in the X chromosome discovered as
MAGE (melanoma-associated antigen) (Sakura S, et al., J. Biol.
Chem., 276, 49378-49389 (2001)). However, as for the present
protein Fip-cx, the frame shifts by +1 with respect to the gene
sequence of MAGE-necdin/trophinin complex, and any frame shift in
the MAGE-necdin/trophinin complex gene is not known. Therefore, the
present protein Fip-cx is a protein novel for both of the amino
acid sequence and function of interacting with c-Fos. It was
confirmed that the amino acid sequence of the present protein
Fip-cx contains a leucine zipper and directly interacts with c-Fos
(FIG. 2).
[0190] The second group of the proteins of the present invention
(SEQ ID NOS: 15 to 19 in FIG. 1A) consists of proteins of which
function of interacting with c-Fos is novel. These proteins are
proteins characterized by showing homology to the gene sequence of
the existing eukaryotic translation elongation factor-1 delta
(Eef1d, TEF-1; NM.sub.--023240) gene at gene sequence level and the
amino acid sequence thereof, and they are different from any of the
proteins that are known so far to form a complex with c-Fos. The
function of forming a complex with c-Fos of the present protein
Eef1d was detected by the present invention for the first time.
Although Eef1d is known as a protein that controls translation
extension, it has recently been shown that it is also a cancer- or
tumor-related gene, and it is reported that with increase of the
expression amount of Fos, the expression amount of Eef1d also
increases as relationship of the translation factor Eef1d and tumor
transformation (Joseph P, et al., J. Biol. Chem., 277, 6131-6136
(2002)). The amino acid sequences of these proteins contain a
leucine zipper.
[0191] The third group of the proteins of the present invention
(SEQ ID NOS: 20 to 22 in FIG. 1A) consists of proteins of which
function of interacting with c-Fos is novel. These protein are
proteins characterized by showing homology to the existing
schwannomin interacting protein 1 (Schip1, NM.sub.--013928) gene
sequence at gene sequence level and amino acid sequence thereof,
and they are different from any of the proteins that are known so
far to form a complex with c-Fos. Although it is known that Schip1
is a cancer- or tumor-related gene (Gouthebroze L, et al., Mol.
Cell. Biol., 20, 1699-1712 (2000)), the function of forming a
complex with c-Fos was detected for the first time by the present
invention. These proteins bind with shwannomin, which is a
regulator gene upstream from AP-1 and inhibits AP-1 activity.
Further, the amino acid sequences of these proteins contain a
leucine zipper.
[0192] The fourth group of the protein of the present invention
(SEQ ID NOS: 47 to 56 in FIG. 1A) are proteins both of which
function of forming a complex with c-Fos and amino acid sequences
are novel (Fos interacting protein chromosome X.1, Fip-cx.1). These
proteins are derived from (+1) frame-shifted gene of the Mage-d3
gene (NM.sub.--019548) of the Mage family. The amino acid sequences
of these proteins contain a leucine zipper.
[0193] The fifth group of the proteins of the present invention
(SEQ ID NOS: 57 to 76 in FIG. 1A) consists of proteins both of
which function of forming a complex with c-Fos and amino acid
sequences are novel (Fos interacting protein chromosome X.2,
Fip-cx.2). These proteins are derived from (+1) frame-shifted gene
of the Magphinin gene (AB032477) of the Mage family. The amino acid
sequences of these proteins contain a leucine zipper.
[0194] The sixth group of the proteins of the present invention
(SEQ ID NOS: 77 to 81 of FIG. 1B) consists of proteins of which
function function of interacting with c-Fos is novel. These
proteins are proteins characterized by showin homology to the
existing Optineurin gene sequence (Optn, NM.sub.--181848) at gene
level and the amino acid sequence thereof, and they are different
from any of the proteins that are known so far to form a complex
with c-Fos. It is said that Optn is a causative gene of visual
disturbance called adult-onset primary open-angle glaucoma (Tayebeh
Rezaie, et al., Science, 295, 1077-1079 (2002)). The amino acid
sequences of these proteins contain a leucine zipper.
[0195] The seventh group (SEQ ID NOS: 82 to 84 of FIG. 1B) of the
proteins of the present invention consists of proteins of which
function of interacting with c-Fos is novel. These proteins are
proteins characterized by showing homology to the existing Snapc5
(Snpap19, XM.sub.--284503.1) gene sequence at gene sequence level
and the amino acid sequence thereof, and they are different from
any of the proteins that are known so far to form a complex with
c-Fos. Snapc5 is one of the subunits of the SNAP complex that binds
to PSE, which is a promoter of snRNA transcribed by pol II and pol
III, to regulate transcription (Henry, R. W., Mittal, V., Ma, B.,
Kobayashi, R., Hernandez, N., Genes Dev. 12:2664-2672 (1998),
PubMed ID: 9732265). The amino acid sequences of these proteins
contain a leucine zipper.
[0196] The eighth group of the proteins of the present invention
(SEQ ID NOS: 85 to 86 in FIG. 1B) consists of proteins of which
function of interacting with c-Fos is novel. These proteins are
proteins characterized by showing homology to the existing
C130020M04Rik (BC026483) gene sequence at gene sequence level and
the amino acid sequence thereof, and they are different from any of
the proteins that are known so far to form a complex with c-Fos.
C130020M04Rik is a gene of which protein frame is expected, but
function is unknown. The annotation thereof is transcription
regulatory factor. The amino acid sequences of these proteins
contain a leucine zipper.
[0197] The ninth group of the proteins of the present invention
(SEQ ID NOS: 87 to 89 in FIG. 1B) consists of proteins of which
function of interacting with c-Fos is novel. These proteins are
proteins characterized by showing homology to the existing FLJ3200
(XM.sub.--342896.1) gene sequence at gene sequence level and the
amino acid sequence thereof, and they are different from any of the
proteins that are known so far to form a complex with c-Fos.
FLJ3200 is a gene of which protein frame is expected, but function
is unknown. They are proteins having a sequence similar to that of
Rattus norvegicus. The amino acid sequences of these proteins
contain a leucine zipper.
[0198] The tenth group of the proteins of the present invention
(SEQ ID NOS: 90 and 91 in FIG. 1B) consists of proteins of which
function of interacting with c-Fos is novel. These proteins are
proteins characterized by showing homology to the existing Rit2
(NM.sub.--009065.2) gene sequence at gene sequence level and the
amino acid sequence thereof, and they are different from any of the
proteins that are known so far to form a complex with c-Fos.
Although Rit2 is an Ras-like protein, and it is a protein of the
Ras family, it does not have a typical CAAX box existing at the
C-terminal, which is known as a platform for the Ras protein on a
membrane. It is known that Ras activates the promoter of the App
gene (Ruiz-Leon, Y. and Pascual, A., 2, 278-285 (2001)).
Furthermore, it is reported that Ras is involved in the control of
the secretion process of the App protein together with Rho
(Maillet, M et al., Nat. Cell Biol., 5, 633-639 (2003)). It has
been reported that the Rho family consists of small GTP binding
proteins and involved in cytoskeleton, transcription, development,
transformation and so forth, and the Rho gene may stimulate the
activity of AP1 to regulate a transcription factor involved in the
activation of T cells (JIN-HONG CHANG, et al., Mil. Cell Biol., 18,
4986-4993 (1998)). The amino acid sequences of these proteins do
not contain a leucine zipper.
[0199] The eleventh group of the proteins of the present invention
(SEQ ID NOS: 92 to 93 in FIG. 1B) consists of proteins of which
function of interacting with c-Fos is novel. These proteins are
proteins characterized by showing homology to the existing
cytochrome b (AF540912.1) gene sequence at gene sequence level and
the amino acid sequence thereof, and they are different from any of
the proteins that are known so far to form a complex with c-Fos.
The registered cytochrome b gene is not cloned in the full length.
The amino acid sequences of these proteins contain a leucine
zipper.
[0200] The twelfth group of the proteins of the present invention
(SEQ ID NOS: 94 and 95 in FIG. 1B) consists of proteins of which
function of interacting with c-Fos is novel. These proteins are
proteins characterized by showing homology to the existing
apolipoprotein E (Apoe; NM.sub.--009696.2) gene sequence at gene
sequence level and the amino acid sequence thereof, and they are
different from any of the proteins that are known so far to form a
complex with c-Fos. Apoe is known as a risk factor gene of
Alzheimer's disease, and interacts with APP (David M. Holtzman, et
al., PNAS, 97, 2892-97 (2000)). The Apoe gene has an AP1 site and
is a gene existing downstream from AP1. The amino acid sequences of
these proteins do not contain a leucine zipper.
[0201] The thirteenth group of the proteins of the present
invention (SEQ ID NOS: 96 and 97 in FIG. 1B) consists of proteins
of which function of interacting with c-Fos is novel. These
proteins are proteins characterized by showing homology to the
existing amyloid beta (A4) precursor protein (App; BC005499.1) gene
sequence at gene sequence level and the amino acid sequence
thereof, and they are different from any of the proteins that are
known so far to form a complex with c-Fos. App is known as a risk
factor gene of Alzheimer's disease, and interacts with Apoe (David
M. Holtzman, et al., PNAS, 97, 2892-97 (2000)). Like the Apoe gene,
the App gene has an AP1 site and is a gene existing downstream from
AP1. In fact, it has been reported that the first cascade of the
series of reactions at the time of formation of memory is
expression of Fos/Jun, and App/Apoe is subsequently expressed at an
early stage of memory formation (Steven P. R. Rose, Learning &
Memory, 7, 1-17 (2000)). Furthermore, it has recently reported that
App is folded during translation by cotranslation by means of the
chaperon function of Apoe (cotranslational folding, Silke Hab and
et al., J. Biol. Chem., 273, 13892-13897 (1998)). It can be said
that this is an example of detection of the App/Apoe complex formed
during cotranslation of IVV that further forms a complex with a
bait Fos. The amino acid sequences of these proteins do not contain
a leucine zipper.
[0202] The fourteenth group of the proteins of the present
invention (SEQ ID NOS: 98 and 99 in FIG. 1B) consists of proteins
of which function of interacting with c-Fos is novel. These
proteins are proteins characterized by showing homology to the
existing Dnaja2 (HSP40; BC003420) gene sequence at gene sequence
level and the amino acid sequence thereof, and they are different
from any of the proteins that are known so far to form a complex
with c-Fos. Dnaja2 is a heat shock protein, and it is known that
the expression amount thereof increases with increase of those of
Fos and Jun when a heat shock is given (Kato N, et. al., Cancer
Science, 97, 644-649 (2000)). The amino acid sequences of these
proteins do not contain a leucine zipper.
[0203] The fifteenth group of the proteins of the present invention
(SEQ ID NOS: 100 and 101 in FIG. 1B) consists of proteins of which
function of interacting with c-Fos is novel. These proteins are
proteins characterized by showing homology to the existing Fip-c10
(KIAA1209, XM.sub.--136911) gene sequence at gene sequence level
and the amino acid sequence thereof, and they are different from
any of the proteins that are known so far to form a complex with
c-Fos. Fip-c10 is a gene of which protein frame is expected, but
function is unknown. The amino acid sequences of these proteins do
not contain a leucine zipper.
[0204] The sixteenth group of the proteins of the present invention
(SEQ ID NO: 102 in FIG. 1B) consists of a protein both of which
function of forming a complex with c-Fos and amino acid sequence
are novel (Fos interacting protein chromosome 4.1, Fip-c4). This
protein is encoded by a region in a genome sequence for which
protein frame has not been expected at all so far. The amino acid
sequence of this protein does not contain a leucine zipper.
[0205] The seventeenth group of the protein of the present
invention (SEQ ID NO: 103 in FIG. 1B) is a protein both of which
function of forming a complex with c-Fos and amino acid sequence
are novel (Fos interacting protein chromosome 18, Fip-c18). This
protein is encoded by a region in a genome sequence for which
protein frame has not been expected at all so far. The amino acid
sequence of this protein does not contain a leucine zipper.
[0206] Hereafter, the proteins of the present invention will be
further explained.
[0207] Among the proteins of the present invention, the proteins
having any one of the amino acid sequences of SEQ ID NOS: 1 to 22
and 47 to 103 are proteins for which it has been found that they
interact with the c-Fos protein, i.e., form a complex, as described
in the examples mentioned below. For proteins, existence of a
mutant having the same function is generally expected. Further, by
suitably modifying an amino acid sequence of a protein, a mutant
having the same function can be obtained. Therefore, proteins that
have any one of the amino acid sequences of SEQ ID NOS: 1 to 22 and
47 to 103 including deletion, substitution or addition of one or
several amino acid residues and interact with the c-Fos protein
also fall within the scope of the proteins of the present
invention. Further, proteins that show a homology of 15% or more to
any one of the amino acid sequences of SEQ ID NOS: 1 to 22 and 47
to 103 and interact with the c-Fos protein also fall within the
scope of the protein of the present invention. Examples of such
proteins of which amino acid residues are modified include, for
example, proteins having any one of the amino acid sequences of SEQ
ID NOS: 2 to 14 for the protein having the amino acid sequence of
SEQ ID NO: 1, proteins having any one of the amino acid sequences
of SEQ ID NOS: 16 to 19 for the protein having the amino acid
sequence of SEQ ID NO: 15, and proteins having any one of the amino
acid sequences of SEQ ID NOS: 21 and 22 for the protein having the
amino acid sequence of SEQ ID NO: 20.
[0208] An amino acid sequence of a protein can be modified by
modifying a nucleotide sequence of DNA encoding the protein using a
well-known means such as site-directed mutagenesis and expressing
DNA of which nucleotide sequence is modified. Among such modified
proteins, those that interact with the c-Fos protein fall within
the scope of the protein of the present invention. The interaction
with the c-Fos protein can be measured by a known method for
measuring an interaction, and examples include the method of
detecting formation of a complex mentioned in the examples
described later.
[0209] The proteins of the present invention may be fused with
another protein and thus provided as a fusion protein.
[0210] The nucleic acids of the present invention are nucleic acids
encoding the proteins of the present invention. The nucleic acids
are usually RNA or DNA. Examples of the nucleic acids of the
present invention include nucleic acids having any one of the
nucleotide sequences of SEQ ID NOS: 23 to 40 and 104 to 167. These
nucleic acids are nucleic acids of which nucleotide sequences were
determined in the examples mentioned below. For a gene, existence
of a gene encoding the same product, but having a different
nucleotide sequence, or a gene encoding a mutant having the same
function is expected. Further, by suitably modifying a nucleotide
sequence, a gene encoding the same product or a mutant having the
same function can be obtained. Therefore, nucleic acids having a
nucleotide sequence similar to any one of the nucleotide sequences
of SEQ ID NOS: 23 to 40 and 104 to 167 and encoding a protein that
interacts with the c-Fos protein also fall within the scope of the
nucleic acids of the present invention. Examples of such nucleic
acids having a similar nucleotide sequence include nucleic acids
that hybridize with a nucleic acid having a nucleotide sequence
complementary to any one of the nucleotide sequences of SEQ ID NOS:
23 to 40 and 104 to 167 under a stringent condition, and nucleic
acids having a nucleotide sequence showing a homology of 16% or
more to any one of the nucleotide sequences of SEQ ID NOS: 23 to 40
and 104 to 167.
[0211] The stringent condition referred to here corresponds to, for
example, that of hybridization using DIG Easy Hyb (Roch
Diagnostics) at 42.degree. C. followed by washing in
0.1.times.SSC/0.1% SDS for 15 minutes at 60.degree. C. Homology of
nucleotide sequences is obtained as a rate of number of nucleotides
matching in alignment of the nucleotide sequences to be compared to
the nucleotide number of the chain length of the nucleotide
sequences. Further, homology of amino acid sequences is obtained as
a rate of number of amino acid residues matching in alignment of
the amino acid sequences to be compared to the amino acid number of
the chain length of the amino acid sequences.
[0212] Whether a DNA encodes a protein that interacts with the
c-Fos protein can be easily confirmed by expressing a protein from
that DNA and confirming whether the expressed protein interacts
with the c-Fos protein using the aforementioned method.
[0213] The nucleic acids of the present invention can be obtained
by a conventional method on the basis of the elucidated nucleotide
sequences. For example, it may be synthesized by a chemical
synthesis method, or it may be obtained by RT-PCR using suitably
designed primers from a mRNA prepared from cells or tissue
expressing a protein that interacts with the c-Fos protein.
<2>Use of the Proteins of the Present Inventions and
Others.
[0214] The proteins and genes of the present invention can be
applied as an inhibitor that blocks transcription function, gene
replication function and so forth as for c-Fos in gene therapy etc.
by utilizing the novel function obtained by the nucleic acid
sequences (function enabling binding with c-Fos in this case). The
basis for this originates in the fact that the genes of the
proteins of the present invention are detected after undergoing
competitive process of screening repeated multiple times. Genes
detected by this method exhibit a certain number distribution, and
a gene having higher competitive power will be detected in a larger
number. This means that a gene of which clones are detected by this
method in a larger number should have a higher competitive power,
and it more effectively functions as a blocking agent or
inhibitor.
[0215] As for use of the proteins of the present invention and
genes encoding them, as in vitro applications, they can be applied
in, for example, evolutionary molecular engineering using a
cell-free protein synthesis system or genomic function analysis by
utilizing the novel function provided by the proteins, genes or
nucleic acid sequences according to the present invention. In this
case, analysis utilizing cotranslation screening and selection of
assigning molecules is extremely effective. This is because the
cotranslation screening/selection method makes it possible to
comprehensively detect proteins that directly or indirectly
interact with a bait protein. Furthermore, in analysis of
interactions between IVVs, IVV and C-terminal labeled protein etc.,
they can also be use as a "target molecule (bait protein)".
Examples of general methods for analyzing an interaction include,
for example, microarray method, fluorescence correlation
spectroscopy (FCS/FCCS), fluorescence imaging analysis,
fluorescence resonance energy transfer method, evanescent-field
molecular imaging method, fluorescence depolarization method,
surface plasmon resonance method, enzyme linked immunosorbent assay
and so forth. Specific examples of the cell-free protein synthesis
system include wheat germ extract, rabbit reticulocyte lysate,
Escherichia coli S30 extract and so forth. By adding a protein,
gene or nucleic acid sequence as a translation template according
to the present invention to any of these cell-free protein
synthesis systems, simultaneously adding 1 to 100 .mu.M of
modification agent in the case of C-terminal labeling, and
maintaining the system at 25 to 37.degree. C. for 1 to several
hours, a C-terminal modified protein is synthesized. In the case of
assigning, only by adding a protein, gene or nucleic acid sequence
as a translation template according to the present invention to the
cell-free protein synthesis system and maintaining the system at 25
to 37.degree. C. for 1 to several hours, an assigning molecule is
synthesized.
[0216] Further, as for in vivo applications, by utilizing the novel
function provided by the proteins, genes or nucleic acid sequences
according to the present invention, for example, a protein modified
for separation and labeled for detection (double modified protein)
synthesized in a cell-free protein synthesis system can be used as
it is for a subsequent purification process, detection process, or
direct introduction into cells. Specific examples of cell
expression system include any kind of cells of from bacteria such
as Escherichia coli, Bacillus subtilis and thermophilic bacteria,
yeast to cultured cells of insects, mammals and so forth, cells of
threadworm, drosophila, zebra fish, mouse and so forth. By directly
introducing the aforementioned C-terminal labeled or assigned
double modified protein into these cells, an objective protein can
be blocked. Alternatively, it is also possible to introduce the
aforementioned gene or nucleic acid of the present invention and
utilize the gene or nucleic acid as it is as an antisense sequence
or RNAi sequence to block expression of an objective nucleic acid,
or they can be expressed in a cell and utilized as a protein or
assigning molecule to block a protein having an interacting action.
When a protein is used in the C-terminal labeling method, by
simultaneously introducing 1 to 100 .mu.M modification agent for
C-terminal labeling into cells using electroporation,
microinjection or the like and maintaining the cells at the optimum
growth temperature of the cells for several hours, a modified
protein is synthesized. In the case of assigning, by introducing a
template of an assigning molecule having the aforementioned gene or
nucleic acid sequence of the protein of the present invention into
cells and maintaining the cells at the optimum growth temperature
of the cells for several hours, an assigning molecule is
synthesized. The synthesized double modified protein can be
collected by disrupting the cells and used for the subsequent
purification process or detection process. Further, it can be used
as it is in the cells for the detection process.
[0217] Hereafter, the use of the proteins of the present invention
and others will be further explained.
[0218] The detection method of the present invention is a method of
utilizing the proteins of the present invention as a bait in
detection of interaction between the bait and a prey.
[0219] Preferably, the method is mainly characterized in that the
bait and prey are modified for separation and labeled for detection
in a specific manner, and the prey is produced by translation in a
cell-free translation system in the presence of the bait to contact
the bait and prey. In this specification, contacting a bait and a
prey by producing the prey by translation in a cell-free
translation system in the presence of the bait is also referred to
as "cell-free cotranslation".
[0220] In this specification, the terms of "bait" and "prey" have
the meanings usually used in the technical field of analysis of
interaction between substances. That is, a protein, nucleic acid or
the like as a known substance is called "bait", and a protein,
nucleic acid or the like as a substance that interacts with the
bait is called "prey". In the present invention, the prey is
preferably a protein.
[0221] The bait used herein may be the protein of the present
invention, or a complex constituted by arbitrary components
including protein (including peptide), nucleic acid, or ligand such
as antibody and hormone, metal and so forth, so long as it contains
the protein of the present invention, and it may be a natural
substance or artificial substance. The bait is not particularly
limited as for the molecular weight and so forth. Examples include,
for example, in the case of protein, a functional domain, a
full-length protein containing a functional domain and so forth. If
a prey library is used, use of full-length proteins enables
comprehensive detection.
[0222] Further, as the prey, a protein is preferably used. The prey
is not particularly limited as for the molecular weight and so
forth.
[0223] Preferably, the detection method of the present invention is
mainly characterized in that, in the detection of an interaction of
the bait and prey, the bait and prey are modified for separation
and labeled for detection in a specific manner, and cell-free
cotranslation is performed as described above. Therefore, a
preferred configuration of the detection method of the present
invention may be the same as that of a usual method for detecting
an interaction between a bait and a prey comprising contacting the
bait and prey and detecting a complex formed by the contact, except
that the bait and prey are modified for separation and labeled for
detection in a specific manner, and cell-free cotranslation is
performed.
[0224] Although the modification for separation and labeling for
detection of the bait and prey are arbitrarily performed so that
they are suitable for the detection of the complex, they should be
performed so that both of the bait and prey should not be labeled
with a label for detection or modified for separation. Therefore,
the prey is used as a fusion protein with a protein that can be
used as a label for detection or an assigning molecule, and the
bait correspondingly has a modification for separation.
[0225] When the prey is used as a fusion protein, the bait should
have a modification for separation. When the bait is a protein, the
bait is preferably produced in a cell-free translation system by
translation of mRNA encoding a fusion protein containing the bait
as a fusion protein with a protein that can be used as a
modification for separation in the cell-free translation
system.
[0226] Examples of the modification for separation in the case
where the bait is a protein include formation of a fusion protein
with the GST protein, CBP used for the TAP method etc. (this can be
separated by using affinity with calmodulin beads), protein A (this
can be separated by using IgG-protein A affinity) as a protein, or
any of various antibody tags etc. as an affinity tag. When the bait
itself has a property that it can be used as a modification for
separation, the bait can be used as it is as a bait having a
modification for separation. Examples of the modification for
detection of the prey include formation of a fusion protein with a
fluorescent protein such as GFP (green fluorescent protein).
[0227] Preparation of mRNA encoding such a fusion protein mentioned
above and translation of this mRNA in a cell-free translation
system can be performed by usual methods. The mRNA may be mRNA
produced by transcription of DNA in a cell-free transcription and
translation system.
[0228] When the prey is an assigning molecule, arbitrary
modification for separation can be added to the bait. When the bait
is a protein, the aforementioned examples of the modification for
separation can be used. In addition, when the bait is a nucleic
acid, drug or the like, examples of the modification for separation
include use of biotin etc. that interact with streptavidin or
avidin. When the bait itself has a property that it can be used as
modification for separation, the bait can be used it is as a bait
having modification for separation.
[0229] An assigning molecule means a molecule assigning a phenotype
and a genotype. The assigning molecule is usually a molecule
comprising a genotype molecule containing a nucleic acid having a
nucleotide sequence reflecting a genotype and a phenotype molecule
containing a protein relating to expression of phenotype, which are
bound to each other. By using a prey as this protein, the prey can
be used as an assigning molecule. Such an assigning molecule can be
formed by performing translation of mRNA encoding a prey in a
cell-free translation system so that the translated prey should
associate with the mRNA, or performing transcription and
translation of DNA encoding a prey in a cell-free transcription and
translation system so that the translated prey should associate
with the DNA. Therefore, by allowing a bait to exist during the
production, cell-free cotranslation can be attained. That is, the
cell-free cotranslation can be perfomed by the following scheme (1)
or (2).
[0230] (1) By performing translation of mRNA encoding the prey in
the presence of the bait in a cell-free translation system so that
the translated prey should associate with the mRNA, the prey is
produced in the cell-free translation system, and thereby the bait
and prey are brought into contact with each other.
[0231] (2) By performing transcription and translation of DNA
encoding the prey in the presence of the bait in a cell-free
transcription and translation system so that the translated prey
should associate with the DNA, the prey is produced in the
cell-free translation system, and thereby the bait and prey are
brought into contact with each other.
[0232] Hereafter, the embodiments of (1) and (2) mentioned above
will be explained.
[0233] In the embodiment of (1), the translated prey preferably
associates with the mRNA, because the mRNA has a spacer region
bound to the 3' end and a peptide acceptor region bound to the
spacer region and containing a group that can bind to a peptide by
transpeptidation reaction. Examples of the method for detecting an
interaction using such an assigning molecule include the in vitro
virus method.
[0234] The mRNA is preferably a nucleic acid containing a 5'
untranslation region including a transcription promoter and a
translation enhancer, an ORF region encoding a prey and binding to
the 3' end side of the 5' untranslation region, and a 3' end region
including a poly-A sequence and binding to 3' end side of the ORF
region. Preferably, an expression amplification sequence containing
an SNNS (S is G or C) sequence on the 5' end side of the poly-A
sequence (for example, a sequence recognizable by the restriction
enzyme XhoI) is further included. The mRNA may or may not have a
Cap structure at the 5' end.
[0235] The poly-A sequence is a poly-A continuous chain of at least
2 or more residues comprising dA and/or rA as single kind of
residues or mixture of two kinds, and the poly-A chain consists of,
preferably 3 or more residues, more preferably 6 or more residues,
still more preferably 8 or more residues.
[0236] One of the factors affecting the translation efficiency is a
combination of the 5' UTR comprising a transcription promoter and a
translation enhancer and the 3' end region including a poly-A
sequence. The effect of the poly-A sequence of the 3' end region is
usually exerted with a length of ten or less residues. As the
transcription promoter of the 5' UTR, T7/T3, SP6, and so forth can
be used, and no particular limitation is imposed. SP6 is preferred,
and it is particularly preferable to use SP6, especially when a
sequence containing an omega sequence or a part of omega sequence
is used as the translation enhancer sequence. The translation
enhancer is preferably a part of the omega sequence, and as the
part of the omega sequence, one containing a part of the omega
sequence of TMV (O29, refer to Gallie D. R., Walbot V., Nucleic
Acids Res., vol. 20, 4631-4638 (1992), and WO02/48347, FIG. 3) is
preferred.
[0237] Further, for the translation efficiency, the combination of
the XhoI sequence and a poly-A sequence is preferred in the 3' end
region. Furthermore, a combination of the downstream portion of the
ORF region, i.e., the upstream region of the XhoI sequence having
an affinity tag, and a poly-A sequence is preferred. The affinity
tag sequence may be any sequence for utilizing a means that can
detect a protein such as an antigen-antibody reaction, and no
limitation is imposed. The affinity tag is preferably the Flag-tag
sequence or His-tag sequence, which is a tag for affinity
separation analysis based on an antigen-antibody reaction. As for
the effect of the poly-A sequence, an affinity tag such as the
Flag-tag attached with the XhoI sequence and further attached with
a poly-A sequence increases the translation efficiency. As for the
His-tag, even a His-tag having a configuration not containing the
XhoI sequence also exhibits sufficient translation efficiency, and
thus is effective.
[0238] Such a configuration effective for improvement of
translation efficiency is also effective for assignment
efficiency.
[0239] If SP6+O29 and Flag+XhoI+A.sub.n (n=8) or His+A.sub.n (n=8),
for example, are used as the 5' UTR and the 3' end region,
respectively, the 5' UTR and the 3' end region would have lengths
of about 49 bp and about 38 or 26 bp, respectively, and thus they
have such a length that they can be incorporated into primers for
PCR as an adaptor region. Therefore, a coding region having such a
5' UTR and 3' end region can be easily produced by PCR from any of
vectors, plasmids and cDNA libraries. In the coding region,
translation may occur beyond the ORF region. That is, there may not
be a stop codon at the end of the ORF region.
[0240] The peptide acceptor region is not particularly limited, so
long as it can bind to the C-terminal of a peptide. For example,
puromycin and 3'-N-aminoacylpuromycin aminonucleosides (PANS-amino
acids) including PANS-amino acids corresponding to all amino acids
such as PANS-Gly in which the amino acid portion is glycine,
PANS-Val in which the amino acid portion is valine, and PANS-Ala in
which the amino acid portion is alanine can be utilized. Further,
3'-N-aminoacyladenosine aminonucleosides (AANS-amino acids), in
which a 3'-aminoacyladenosine and an amino acid are bonded via an
amide bond as a chemical bond formed as a result of dehydration
condensation of the amino group of the 3'-aminoacyladenosine and
the carboxyl group of the amino acid, corresponding to all amino
acids, for example, AANS-Gly in which the amino acid portion is
glycine, AANS-Val in which the amino acid portion is valine,
AANS-Ala in which the amino acid portion is alanine, and so forth
can also be used. Furthermore, nucleosides and nucleosides bound
with an amino acid via an ester bond can also be used. In addition,
any of substances formed with a bonding scheme that can chemically
bond a nucleoside or a substance having a chemical structure
similar to that of nucleoside and an amino acid or a substance
having a chemical structure similar to amino acid can be used.
[0241] The peptide acceptor region preferably comprises puromycin
or a derivative thereof, or puromycin or a derivative thereof and
one or two residues of deoxyribonucleotides or ribonucleotides. The
term "derivative" used in this case means a derivative that can
bind to the C-terminal of peptide in a protein translation system.
The puromycin derivative is not limited to those having the total
puromycin structure, and includes those having the puromycin
structure a part of which is eliminated. Specific examples of the
puromycin derivative include PANS-amino acids, AANS-amino acids and
so forth.
[0242] Although the peptide acceptor region may have a structure
consisting only of puromycin, it preferably has a nucleotide
sequence comprising DNA and/or RNA of one or more residues at the
5' end side. As such a sequence, dC-puromycin, rC-puromycin, and so
forth, more preferably, a CCA sequence comprising dCdC-puromycin,
rCrC-puromycin, rCdC-puromycin, dCrC-puromycin or the like and
imitating the 3' end of aminoacyl-tRNA (Philipps, G. R., Nature
223, 374-377 (1969)), is suitable. As for the type of nucleotide,
preference is higher in the order of C>(U or T)>G>A.
[0243] The spacer region is preferably a PEG region containing
polyethylene glycol as a main component. The spacer region usually
contains, in addition to the PEG region, a donor region that can
bind to the 3' end of a nucleic acid.
[0244] The donor region that can bind to the 3' end of nucleic acid
usually consists of one or more nucleotides. The number of
nucleotides is usually 1 to 15, preferably 1 to 2. The nucleotides
may be a ribonucleotide or a deoxyribonucleotide. The donor region
may have a modification substance.
[0245] The sequence of the 5' end of the donor region affects the
ligation efficiency with the coding region encoding the prey. In
order to attain ligation of the coding region and the spacer
region, it is required to include at least one or more residues,
and at least one residue of dC (deoxycytidylic acid) or two
residues of dCdC (dideoxycytidylic acid) is preferred for an
acceptor having a poly-A sequence. As for the type of nucleotide,
preference is higher in the order of C>(U or T)>G>A.
[0246] The PEG region contains polyethylene glycol as a main
component. The expression "contains polyethylene glycol as a main
component" used herein means that the total number of nucleotides
contained in the PEG region is 20 or less, or the average molecular
weight of the polyethylene glycol is 400 or more. It preferably
means that the total number of nucleotides is 10 or less, or the
average molecular weight of the polyethylene glycol is 1000 or
more.
[0247] The average molecular weight of the polyethylene glycol in
the PEG region is usually 400 to 30,000, preferably 1,000 to
10,000, more preferably 2,000 to 8,000. If the molecular weight of
the polyethylene glycol is lower than about 400, a posttreatment
for assignment translation may be required for assignment
translation of a genotype molecule containing the spacer region
(Liu, R., Barrick, E., Szostak, J. W., Roberts, R. W., Methods in
Enzymology, vol. 318, 268-293 (2000)). However, if PEG having a
molecular weight if 1000 or more, preferably 2000 or more, is used,
highly efficient assignment can be attained only by assignment
translation, and therefore the posttreatment for the translation
becomes unnecessary. Further, when the molecular weight of the
polyethylene glycol increases, stability of the genotype molecule
tends to increase, and in particular, the stability becomes
favorable with a molecular weight of 1000 or more. If the molecular
weight is 400 or less, properties thereof are not different so much
from those of a DNA spacer, and it may become unstable.
[0248] By having a spacer region containing polyethylene glycol as
a main component, it becomes possible to form an assigning molecule
not only in a cell-free translation system of rabbit reticulocytes,
but also in a cell-free translation system of wheat germ, the
stability of the genotype molecule in both translation systems is
markedly improved, and it becomes unnecessary to perform any
treatment after the translation.
[0249] In the embodiment of (2), it is preferred that DNA encodes a
fusion protein of a protein and streptavidin or avidin, DNA is
labeled with bibtin, and a translated prey associates with the DNA
because transcription and translation is carried out in a state
that one DNA molecule is contained in one compartment of emulsion.
Examples of the method for detecting an interaction using such an
assigning molecule include the STABLE method.
[0250] The emulsion is usually a W/O type emulsion formed by mixing
two kinds of surface active agents, mineral oil and a reaction
mixture of cell-free transcription and translation system. In order
to form a W/O type emulsion, it is usually necessary for the
surface active agents to have an HLB (hydrophile-lipophile balance)
value of 3.5 to 6. The HLB value of mixed two kinds of surface
active agents is calculated from the HLB values of the individual
surface active agents by using a simple equation. For example, if
Span 85 (HLB=1.8) and Tween 80 (HLB=15.0) are mixed in volumes of
40.2 .mu.l and 9.8 .mu.l, respectively, the mixture has an HLB
value of 4.4. The ratio of the surface active agents and mineral
oil is usually 1:18 (volume ratio). Further, the ratio of the
reaction mixture is 1 to 50% (volume ratio) with respect to the
whole emulsion, and it is usually 5%. The emulsion can be formed by
adding the reaction mixture as several divided portions to a
mixture of the surface active agents and mineral oil at a low
temperature with stirring and mixing them. The reactions of
transcription and translation can be started by raising the
temperature of the emulsion.
[0251] The preparation of DNA encoding a prey, and transcription
and translation of such a DNA in a cell-free transcription and
translation system can be performed in a usual manner.
[0252] As described above, by labeling the bait and prey for
detection and modifying them for separation in particular schemes,
a complex formed by cell-free cotranslation can be specifically
detected.
[0253] As for the cell-free cotranslation of a bait and a prey, the
cell-free translation system (including cell-free transcription and
translation system) in which the cell-free cotranslation is
performed may be any of systems of E. coli, rabbit reticulocytes,
wheat germs and so forth. Although formation of assigning molecules
is quite unstable with E. coli in the in vitro virus method, it has
been confirmed that it is stable in a system of rabbit
reticulocytes (Nemoto N., Miyamoto-Sato E., and Yanagawa H., FEBS
Lett. 414, 405 (1997); Roberts R. W., Szostak J. W., Proc. Natl.
Acad. Sci. USA, 94, 12297 (1997)), and it has been further
confirmed that it is still more stable in a system of wheat germ
(Japanese Patent Laid-open No. 2002-176987). For the STABLE method,
the system may be any of systems of E. coli, rabbit reticulocyte,
wheat germ and so forth.
[0254] The conditions for the translation and transcription in the
cell-free cotranslation are suitably selected depending on a
cell-free translation system to be used.
[0255] The templates of the bait and prey added to the cell-free
translation system may be either RNA or DNA, so long as the
cell-free translation system is a cell-free transcription and
translation also causing transcription.
[0256] Hereafter, an example of a translation template preferred
for use as a bait will be explained.
[0257] As a bait used in the cotranslation screening of this
embodiment, used is a translation template characterized by
comprising a coding portion having information for translation into
a protein and a PEG spacer portion as shown in FIG. 8. The coding
portion has information for translation into a protein, and it may
be any sequence. However, it is preferably characterized by having
an acceptor (A sequence) in a 3' end region of the coding portion,
or having an acceptor (A sequence) in a 3' end region of the coding
portion and a translation amplification sequence (X sequence)
5'-upstream from the A sequence. It contains a short poly-A
sequence as the A sequence of the coding portion. The short poly-A
sequence is usually a sequence comprising 2 to 10 nucleotides of A.
It is characterized by having a sequence having (C or G)NN(C or G)
sequence, for example, a XhoI sequence, as the X sequence. The PEG
spacer portion has a PEG region containing polyethylene glycol as a
main component, a donor region for ligation with the coding
portion, and a CCA region at the 3' end. Although the PEG spacer
portion may consist only of the donor region or CCA region, it is
preferably has a configuration comprising the PEG region containing
polyethylene glycol as a main component. The CCA region is
characterized by not having a function of binding by a
transpeptidation reaction to a protein translated from the
translation template. The PEG region is characterized by having a
molecular weight of the polyethylene glycol of 500 or more.
Further, it is characterized by containing at least one
function-imparting unit (F) in the donor region and/or the CCA
region. It is characterized in that the function-imparting unit (F1
and/or F2) immobilizes or labels with fluorescence the translation
template and/or a protein translated from the translation template.
As the immobilization substance, biotin and so forth are
contemplated, and as the fluorescent substance, fluorescein, Cy5,
rhodamine green (RhG) and so forth can be contemplated. The present
invention relates to these coding portion, translation template,
and libraries thereof, as well as a protein translated on a
ribosome and library thereof.
[0258] The translation template of a bait (FIG. 8, A) comprises a
coding portion derived from a coding molecule (FIG. 8, B) and a PEG
spacer portion derived from a PEG spacer molecule (FIG. 8, C). In
this embodiment, a PEG spacer portion can be ligated to the coding
portion to improve stability thereof, and thus translation
efficiency can be improved, basically regardless of the sequence of
the coding portion. However, it is further possible to further
improve the translation efficiency depending on the configuration
of the coding portion or the type of the PEG spacer portion. The
details thereof are described below.
[0259] The coding portion of this embodiment (FIG. 8, B) comprises
a 5' end region, an ORF region, and a 3' end region, and it may or
may not have a Cap structure at the 5' end. Further, the sequence
of the coding portion is not particularly limited, and use thereof
as one incorporated into any vector or plasmid can be contemplated.
The 3' end region of the coding portion includes one having a
poly-A x 8 sequence as the A sequence or one having, as the X
sequence, the XhoI sequence or a sequence of SNNS(S is G or C) as a
sequence of 4 or more nucleotides, and XA as a combination of the A
sequence and X sequence. A configuration that a Flag-tag sequence
is included as an affinity tag sequence upstream from the A
sequence, X sequence, or XA sequence is contemplated. The affinity
tag sequence used here may be a sequence for using any means that
enables detection or purification of a protein, for example, those
utilizing an antigen-antibody reaction such as HA-tag and protein A
of IgG (z domain), His-tag, and so forth. As for factors affecting
the translation efficiency, the combination of the XA sequence is
important. The first four nucleotides of the X sequence are
important, and one having a sequence of SNNS is preferred. Further,
the 5' end region comprises a transcription promoter and a
translation enhancer. As the transcription promoter, T7/T3, SP6,
and so forth can be used, and no particular limitation is imposed.
However, for a cell-free translation system of wheat, a sequence
containing an omega sequence or a part of omega sequence is
preferably used as the translation enhancer sequence, and SP6 is
preferably used as the promoter. The part of the omega sequence is
one containing a part of the omega sequence of TMV (O29, refer to
Gallie D. R., Walbot V., Nucleic Acids Res., vol. 20, 4631-4638
(1992) and WO02/48347, FIG. 3). The ORF region of the coding
portion may be any sequence comprising DNA and/or RNA. It may be a
gene sequence, exon sequence, intron sequence, random sequence, or
any natural sequence or artificial sequence, and the sequence is
not limited.
[0260] The PEG spacer molecule of this embodiment (FIG. 8, C)
comprises a CCA region, a PEG region, and a donor region. The
minimum essential component is the donor region. As for the factors
affecting the translation efficiency, one having not only the donor
region but also the PEG region is preferred, and it preferably has
puromycin which does not have an ability to bind with an amino
acid. The molecular weight of the polyethylene glycol in the PEG
region is 400 to 30,000, preferably 1,000 to 10,000, more
preferably 2,000 to 6,000. Further, the CCA region may have a
configuration including puromycin or a configuration not including
puromycin. As puromycin, puromycin and 3'-N-aminoacylpuromycin
aminonucleosides (PANS-amino acids) including PANS-amino acids
corresponding to all amino acids such as PANS-Gly in which the
amino acid portion is glycine, PANS-Val in which the amino acid
portion is valine, and PANS-Ala in which the amino acid portion is
alanine can be utilized. Further, 3'-N-aminoacyladenosine
aminonucleosides (AANS-amino acids), in which a
3'-aminoacyladenosine and an amino acid is bonded via an amide bond
as a chemical bond formed as a result of dehydration condensation
of the amino group of the 3'-aminoacyladenosine and the carboxyl
group of the amino acid, corresponding to all amino acids, for
example, AANS-Gly in which the amino acid portion is glycine,
AANS-Val in which the amino acid portion is valine, AANS-Ala in
which the amino acid portion is alanine, and so forth can also be
used. Furthermore, nucleosides and nucleosides bound with an amino
acid via an ester bond can also be used. In addition, any of
substances formed with a bonding scheme that can chemically bond a
nucleoside or a substance having a chemical structure similar to
that of nucleoside and an amino acid or a substance having a
chemical structure similar to amino acid can be used. For this
translation template, any substances corresponding to the
aforementioned puromycin derivatives of which amino group lacks the
ability to bind to an amino acid, and a CCA region lacking
puromycin are also contemplated. However, by incorporating
puromycin that cannot bind with a protein on a ribosome, the
translation efficiency can be further enhanced. Although the reason
for this is not certain, it may possible that puromycin that cannot
bind with a protein stimulates a ribosome to enhance the turnover.
A nucleotide sequence comprising DNA and/or RNA of one or more
residues is preferably contained on the 5' end side of the CCA
region (CCA). As for the type of nucleotide, preference is higher
in the order of C>(U or T)>G>A. As such a sequence,
dC-puromycin, rC-puromycin, and so forth, more preferably, a CCA
sequence comprising dCdC-puromycin, rCrC-puromycin, rCdC-puromycin,
dCrC-puromycin or the like and imitating the 3' end of
aminoacyl-tRNA (Philipps, G. R., Nature 223, 374-377 (1969)) is
suitable. In one embodiment of the present invention, these
puromycins are made incapable of binding with an amino acid in a
certain manner.
[0261] The PEG spacer portion of this embodiment may have a
configuration containing a modification substance (F1 and/or F2).
With this characteristic, it can be used as a tag for collection,
reuse by purification, or immobilization of translation template.
Those comprising at least one residue of nucleotide of DNA and/or
RNA incorporated with any of various separation tags such as
fluorescent substance, biotin, and His-tag may be possible.
Further, if SP6+O29 and Flag+XhoI+An (n=8) are used as the 5' end
region and the 3' end region of the coding portion, respectively,
for example, the lengths of the 5' end region and the 3' end region
are about 60 bp and about 40 bp, respectively, and thus they have
such a length that they can be designed in primers for PCR as an
adaptor region. This provides a novel advantage. That is, it
becomes possible to easily prepare a coding portion having a 5' end
region and 3' end region according to this embodiment by PCR from
any vector, plasmid, and cDNA library, and by ligating the PEG
spacer portion, instead of a 3' UTR, to this coding portion, a
translation template showing a high translation efficiency can be
obtained.
[0262] The ligation of the PEG spacer molecule and the coding
molecule according to this embodiment may be attained any method
such as usual methods utilizing a DNA ligase or those based on a
photoreaction, and the method is not particularly limited. In the
ligation using an RNA ligase, as factors in the coding portion
affecting the ligation efficiency, the A sequence of the 3' end
region is important. It is a poly-A continuous chain consisting of
at least two, preferably 3 or more, more preferably 6 to 8, of
single kind or mixed kinds of residues selected from dA and/or rA.
The DNA and/or RNA sequence of the 5' end of the donor region of
the PEG spacer portion affects the ligation efficiency. In order to
ligate the coding portion and PEG spacer portion with an RNA
ligase, it is required to contain at least one or more residues,
and for an acceptor having a poly-A sequence, at least 1 residue of
dC (deoxycytidylic acid) or two residues of dCdC (dideoxycytidylic
acid) is preferred. As for the type of nucleotide, preference is
higher in the order of C>(U or T)>G>A. Furthermore, it is
preferable to add polyethylene glycol of the same molecular weight
as the PEG region during the ligation reaction.
[0263] Hereafter, an example of a translation template preferably
used as a prey will be explained.
[0264] As a prey in cotranslation screening according to this
embodiment, a protein of which C-terminal is modified with a
translation template as represented in FIG. 9 (i.e., assigning
molecule) is used. The translation template comprises a coding
portion having information for translation into a protein and a PEG
spacer portion. The coding portion has an A sequence at the 3' end,
and the A sequence comprises a short poly-A sequence. The PEG
spacer portion is characterized in that polyethylene glycol in the
PEG region containing polyethylene glycol as a main component has a
molecular weight of 400 or more, and the donor region and/or the
CCA region contains at least one modification substance (F1 and/or
F2). Further, the CCA region is characterized by having a function
of binding by transpeptidation to a protein translated from the
translation template, and the CCA region typically has puromycin.
Further, it is characterized in that the modification substance (F1
and/or F2) immobilizes or labels with fluorescence the translation
template and/or a protein translated from the translation template.
As the immobilization substance, biotin and so forth are
contemplated, and as the fluorescent substance, fluorescein, Cy5,
rhodamine green (RhG) and so forth can be contemplated. The present
invention relates to these coding portion, translation template,
and libraries thereof, as well as a protein synthesized by
translation on a ribosome (i.e., assigning molecule) and a library
of such proteins (i.e., assigning molecules).
[0265] The prey is a protein synthesized by translation utilizing
the translation template, of which C-terminal is modified with the
translation template (FIG. 9, A, assigning molecule), and has
characteristics in the translation template (FIG. 9, B) and the
configuration of a protein of which C-terminal is modified with PEG
(FIG. 9, C). It will be described in detail below.
[0266] The PEG spacer portion of the translation template (FIG. 9,
B) is the same as that of the aforementioned translation template
preferred for use as an bait except that it is characterized in
that puromycin can bind with an amino acid. Further, the coding
portion is also the same as that of the aforementioned translation
template preferred for use as a bait. However, as for a
configuration suitable for assignment, in particular, it is
important to use an A sequence as the 3' end region, and this
markedly increase the assignment efficiency of the total proteins
and markedly decrease the amount of free proteins. Also in this
case, if SP6+O29 and Flag+XhoI+An (n=8) are used as the 5' end
region and the 3' end region of the coding portion, respectively,
for example, the lengths of the 5' end region and the 3' end region
are about 60 bp and about 40 bp, respectively, and thus they have
such a length that they can be designed in primers for PCR as an
adaptor region. This makes it possible to easily prepare a coding
portion having a 5' end region and 3' end region according to this
embodiment by PCR from any vector, plasmid, and cDNA library, and
by ligating the PEG spacer portion, a translation template showing
a high assignment efficiency can be obtained.
[0267] When the cording portion of the protein of which C-terminal
is modified with PEG according to this embodiment (FIG. 9, C) is
not used in detection of an interaction of proteins, i.e., when the
protein is use for, for example, FCCS measurement, fluorescence
reader, protein chip, and so forth, it may be intentionally cleaved
with an RNase A or the like. By the cleavage, difficulty of
detection of an interaction between proteins due to inhibition by
the coding portion can be eliminated. Further, it is also possible
to immobilize such a simple assigning molecule on a plate, bead, or
slide glass.
[0268] The cell-free cotranslation will be explained with reference
to FIG. 10. As shown in FIG. 10, a prey is translated in vitro in
the presence of a bait. As shown in FIG. 10, A and B, there are a
case where the bait is a protein, and it is translated
simultaneously with the prey in a cell-free translation system, and
a case where the bait is a nucleic acid, hormone or the like, and
it is added to a cell-free translation system. As shown in FIG. 10,
the prey is made into a fusion protein or assigning molecule.
[0269] The complex may be formed by binding of a bait and one prey
(I), or by binding of another prey to a prey binding to a bait
(II).
[0270] Because the detection method of the present invention
enables in vitro formation of the complex, interactions between
proteins, nucleic acid and protein, and so forth can be
consistently detected in vitro.
[0271] When the bait is a protein, examples of the bait include a
protein consisting only of a functional domain for an interaction
with an objective protein, a protein including a functional domain,
a protein of full length, and so forth. If a protein of full length
is used, it is expected that it has multiple functional domains,
and therefore it favorably becomes possible to more comprehensively
detect preys. The protein of full length may be a single protein
having a full length, or an assembly of two or more baits from
which a protein of full length can be reconstructed.
[0272] The bait may be a complex as shown in FIG. 11, and this is
called "complexed bait". By using such a complex, nonspecific
adsorption can be further reduced, and it becomes possible to more
comprehensively detect preys as the same effect as that of the full
length protein.
[0273] As described above, as a complex contemplated for the
cell-free cotranslation, a complex of a single bait and a single
prey, a complex of a complexed bait and a prey, a complex of a bait
and multiple preys, and a complex of a complexed bait and multiple
preys are possible. Therefore, an interaction detectable by the
detection method of the present invention includes not only a
direct interaction between a bait and a prey, but also an indirect
interaction for forming a complex.
[0274] It is considered that the most important factor in the
cell-free cotranslation according to the present invention is that
a protein is folded in a native state and in an undenatured state
immediately after translation, a bait and a prey, a bait and
another bait, or a prey and another prey, which should interact
with each other, should coexist in a cell-free translation system,
and thus they can promptly interact with each other. This is
supported by the fact that a more superior result could be obtained
by cotranslation compared with separate translation followed by
coexistence by mixing. That is, it is considered that this is
because a protein translated in vitro in a native folding state can
encounter a protein, nucleic acid or the like, and therefore prompt
formation of a complex by an interaction becomes possible.
[0275] The conventional methods of detecting an interaction require
expression in E. coli and purification of a bait in a large amount.
For example, when an interaction of a bait and a prey is expressed
in a cell by the TAP method or the like, at least one month of
preparation is needed. Further, the mRNA displaying method
employing the pull-down method based on a GST fusion protein has
problems that it takes at least 2 or 3 weeks because of the large
amount expression in E. coli and purification of the bait, a
substance that cannot be expressed in E. coli cannot be used as the
bait, and so forth, and it further requires addition of bait in an
amount 50 to 100 times the amount of prey to cause interaction with
the prey. In the cell-free cotranslation, it becomes that only
addition of almost equal weight of mRNA or DNA template to a
cell-free translation system is sufficient, and it becomes
completely unnecessary to express the bait in cells. Thus, the
operation time can be markedly shortened. Furthermore, with a
complexed bait or full length protein, an interaction of a bait and
a prey can be further enhanced and made specific, and thus
detection of nonspecific bonds can be avoided. Further, by using a
complexed bait, a larger number of preys that interact with the
second bait thereof can be comprehensively analyzed.
[0276] Although no system realizing complex formation by an
interaction and screening consistently in vitro has existed so far,
by performing translation and screening including those for a bait
completely in vitro according to the detection method of the
present invention described above, a system that can
comprehensively detect interactions between proteins or between a
protein and a nucleic acid with avoiding nonspecific detection can
be constructed. Therefore, the present invention also provides a
screening method utilizing the detection method of the present
invention.
[0277] The screening method of the present invention is
characterized by forming complexes by interactions of a bait and
preys during cell-free cotranslation, and analyzing a prey that
interacts with the bait by screening of the complexes. Therefore,
the screening method of the present invention may be the same as an
ordinary screening method for a prey that interacts with a bait
comprising the detection step of detecting an interaction between a
bait and prey and the selection step of selecting a prey for which
an interaction is detected, except that the method comprises the
detection step of detecting an interaction between a bait and a
prey by the detection method of the present invention.
[0278] The screening method of the present invention further
comprises the preparation step of preparing a prey selected in the
selection step, and it is preferable to repeat the detection step,
selection step and preparation step by using the prepared prey
instead of or together with the bait used in the detection step. In
this embodiment, the method is constituted by, for example, 1) the
step of cell-free cotranslation in a cell-free translation system
in which a prey and a bait cause an interaction, 2) the step of
screening for detecting a prey interacting with the bait, 3) the
step of examining and analyzing the prey, and 4) the step of
repeating the steps from 1) by using the prey examined and analyzed
in 3), as shown in FIG. 12. The steps of 1) and 2) correspond to
the detection step and selection step, and the step of 3)
corresponds to the preparation step. That is, the step of
contacting a prey to the bait in the detection step corresponds to
the step of the cell-free cotranslation, and the steps of detecting
and selecting a complex in the detection step correspond to the
step of screening.
[0279] In the screening method of the present invention, the prey
selected in the selection step may be used again in the detection
step.
[0280] In the screening method of the present invention, the
cell-free cotranslation may be performed with a bait and a prey
library, which is a group of multiple preys, so that two or more
preys may be detected in the step of screening.
[0281] As shown in FIG. 11, a complexed bait and a prey may
coexist, and a complex of the complexed bait and the prey may be
formed by an interaction. By using a prey library in this cell-free
cotranslation so that multiple kinds of preys of the prey library
should coexist with the bait, and multiple complexes of the preys
with the bait should be formed by interactions, multiple kinds of
preys that interacts with the bait can be simultaneously and
comprehensively detected in the screening. Further, if a full
length protein is used as the bait, it becomes possible to
comprehensively detect a larger number of preys, because a full
length protein generally contains multiple functional domains for
interactions.
[0282] Furthermore, as shown in FIG. 11, by forming multiple
complexes of preys that interact with a complexed bait, multiple
preys that interact with the complexed bait can be detected, and
the second bait serves as a reinforcer of the interaction of a bait
and a prey to realize a more specific interaction, which makes it
possible to avoid nonspecific detection in the comprehensive
detection. In evolutionary molecular engineering techniques such as
the vitro virus method and the STABLE method, the prey is an
assigning molecule (fusion). In the formation of complexes using a
prey library or multiple kinds of preys, the preys may or may not
directly interact with the bait.
[0283] When the complex obtained by the screening of complexes is
an assigning molecule, a prey forming the complex may be detected
by RT-PCR or PCR, and the screening may be performed again by using
the PCR product as a prey (reconstruction of prey) or by using a
prey analyzed from the PCR product as a new succeeding bait, as
shown in FIG. 12. The method of performing the screening again by
using the PCR product or performing screening by using a prey
analyzed from the PCR product as a new succeeding bait can be
performed only in the evolutionry molecular engineering techniques
such as the in vitro virus method and the STABLE method, and cannot
be carried out in a method of directly analyzing a protein such as
the pull-down method and the TAP method.
[0284] When an assigning molecule is used, gene sequence of a
proteinic prey can be known by RT-PCR or PCR after the screening.
As shown in FIGS. 10 and 11, the proteinic prey referred to above
is a prey interacting with a bait, a prey interacting that prey or
the like, and all multiple kinds of preys interacting a bait can be
comprehensively analyzed. When rescreening of a prey is further
necessary, a DNA template, which is a product of RT-PCR or PCR, is
transcribed, and the same cycles are repeated. Further, when a prey
is determined by RT-PCR or PCR and the following sequence, it
becomes possible to use that proteinic prey as a bait. If two or
more kinds of preys that interact with the first bait are found, it
becomes possible to form a complexed bait, and thus it becomes
possible to detect a further larger number of preys.
[0285] If the cell-free cotranslation is used, it becomes possible
to detect an interaction between proteins consistently in vitro
even in the pull-down method or the TAP method. However, the
assigning molecule is not formed in the TAP method, and therefore
proteins must be directly analyzed in the analysis of preys. Then,
if the pull-down method or the TAP method is used as the screening
method in the in vitro virus method or the STABLE method, an
assigning molecule is formed, and therefore gene sequence of a prey
that causes interaction can be easily detected by RT-PCR or PCR in
the analysis of the prey. Furthermore, if the cell-free
cotranslation is used, it becomes possible to detect interactions
between proteins consistently in vitro in the in vitro virus method
or the STABLE method. Further, the number of preys is extremely
large, the range of candidate preys can be narrowed down by
rescreening performed by repeating the cycles. Further, the
analyzed prey can be used as a bait in the next analysis, and if
the number of preys increases, complexing of the bait advances,
which results in detection of further preys. As described above,
use of a prey as a bait in the subsequent cycle can be easily
realized only in the in vitro virus method, STABLE method and so
forth, which use an assigning molecule. However, the mRNA display
method and the like requires synthesis in E. coli and purification
of a large amount of GST fusion protein as a new bait, and thus
preparation of the bait takes time, which makes the method
difficult. If the cell-free cotranslation is used, such procedures
are unnecessary, and the cycles can be easily repeated.
[0286] In the screening of complexes after the cell-free
cotranslation, it is preferred that preys can be screened
comprehensively without breaking the complexes produced by the
cell-free cotranslation. For this purpose, a device for
immobilization may be imparted to the bait with an affinity tag or
the like so as to detect a prey that interacts with the bait. Any
kind of such a device for immobilization may be used. Examples
include, for example, a method of performing two-stage screening
using IgG-protein A affinity or calmodulin beads as in the
conventional TAP method, and a method of performing one- or
two-stage screening using streptavidin or avidin/biotin affinity,
GST-tag, Flag-tag, T7-tag, His-tag or the like as in the pull-down
method.
[0287] Examples of the prey library include a cDNA library (random
priming library, dT priming library), random library, peptide
library, hormone library, antibody library, ligand library,
pharmaceutical compound library, and so forth, and any kind of
library may be used. For example, if a random priming cDNA library
is used as the prey library, although a full length prey cannot be
expected for this library, a prey containing a functional domain
can be expected. If such a library is used especially for screening
using combinations with a complexed bait or full length protein, it
becomes effective for comprehensive detection of preys.
[0288] Examples of the random priming library include cDNAs
obtained by random priming and incorporated into multi-cloning
sites (MCS) of vectors having a 5' untranslation region (UTR)
containing the promoter of the RNA polymerase of SP6 (SP6) as a
transcription promoter and a part of TMV omega sequence (O29) of
the tobacco mosaic virus as a translation enhancer on the 5' end
side of MCS, and containing a sequence for the Flag-tag, which is a
tag for affinity separation analysis based on an antigen-antibody
reaction, as an affinity tag sequence on the 3' end side of MCS, so
that the Flag-tag should be added to the C-terminal of a protein
expressed from an insert sequence incorporated into MCS.
[0289] The aforementioned detection method of the present invention
includes the step of contacting a bait and a prey to form a
complex. Therefore, a method of forming a complex of a bait and a
prey that interacts with the bait is provided according to this
step.
[0290] The formation method of the present invention is
characterized by using the protein of the present invention as a
bait in the formation of a complex of a bait and a prey, which is a
protein that interacts with the bait, and preferably further
labeling the bait and prey for detection and modifying them for
separation in particular schemes, to perform cell-free
cotranslation. Therefore, a preferred configuration of the
formation method of the present invention may be the same as that
of an ordinary method of forming a complex of a bait and a prey
comprising contacting a bait and a prey that interacts with the
bait, except that the bait and prey are labeled for detection and
modified for separation in particular schemes, and the cell-free
cotranslation is performed. The labeling for detection and
modification for separation of the bait and prey in particular
schemes as well as the cell-free cotranslation may be the same as
those explained for the detection method of the present
invention.
[0291] In the formation method of the present invention, not only a
complex of a bait and a prey, for which interaction is known, but
also a complex comprising elements for which interaction is unknown
can be formed by performing the step of contacting the bait with a
prey that interacts with the bait by contacting a bait with a prey
library consisting of multiple kinds of preys.
[0292] Other methods for utilizing the protein of the present
invention include the followings:
[0293] a method for analyzing an interaction between a protein and
a substance, which is performed by fluorescence correlation
spectroscopy, fluorescent imaging analysis method, fluorescence
resonance energy transfer method, evanescent field molecular
imaging method, fluorescence depolarization method, surface plasmon
resonance method or enzyme linked immunosorbent assay using the
protein of the present invention,
[0294] a method for detecting an interaction between a protein and
a substance, which uses the protein of the present invention and
detects the interaction by amplification of a nucleotide sequence
of a coding portion bound to the C-terminal of the protein of the
present invention,
[0295] a method for detecting an interaction between a protein and
a substance, which uses the protein of the present invention and
uses the cell-free cotranslation method or the cell-free
cotranslation screening method,
[0296] a method for detecting an interaction between a protein and
a substance, which uses the protein of the present invention and
labels the protein with fluorescence and/or immobilizes the
protein,
[0297] a method for analyzing an interaction of a protein or
substance in vitro by using the protein of the present
invention,
[0298] a method for analyzing an interaction of a protein or
substance, which uses the protein of the present invention, and
utilizes the cotranslation method in vitro,
[0299] a method for analyzing an interaction of a protein or
substance in vivo by using the protein of the present invention,
and
[0300] the aforementioned methods for analyzing an interaction,
which uses a nucleic acid encoding the protein of the present
invention.
[0301] Further, the followings are also mentioned:
[0302] a method for analyzing an interaction between a protein and
a target molecule, which uses a C-terminal modified protein
comprising the protein and a modification agent binding to the
C-terminal of the protein. The analysis of the interaction can be
carried out by fluorescence correlation spectroscopy, fluorescent
imaging analysis method, fluorescence resonance energy transfer
method, evanescent field molecular imaging method, fluorescence
depolarization method, surface plasmon resonance method or enzyme
linked immunosorbent assay. The C-terminal modified protein may be
immobilized. The C-terminal modified protein may be added on an
array on which the target molecule is immobilized, and then the
C-terminal modified protein specifically binding to the target
molecule may be detected.
[0303] In the analysis method of this embodiment, the interaction
is usually analyzed by contacting the modified protein of the
present invention obtained above and the target molecule in a
suitable combination selected depending on the type of the
modification substance or reaction system and measuring change of a
signal generated by the interaction between the modified protein
and the target molecule among signals generated by the modified
protein or the target molecule. The analysis of the interaction is
carried out by, for example, fluorescence correlation spectroscopy,
fluorescent imaging analysis method, fluorescence resonance energy
transfer method, evanescent field molecular imaging method,
fluorescence depolarization method, surface plasmon resonance
method or enzyme linked immunosorbent assay. The details of these
methods are explained below.
[0304] The "target molecule" means a molecule that interacts with
the modified protein of the present invention, and it may be
specifically a protein, nucleic acid, sugar chain, low molecular
weight compound or the like, preferably a protein or DNA.
[0305] The protein is not particularly limited so long as it has an
ability to interact with the modified protein of the present
invention, and it may be a protein of full length or a partial
peptide containing an active site for binding. Further, it may be a
protein of which amino acid sequence or function is known or
unknown. It may be a synthesized peptide chain, a protein purified
from an organism, a protein obtained by translation from a cDNA
library using a suitable translation system and purification, or
the like, and they can be used as the target molecule. The
synthesized peptide chain may be a glycoprotein consisting of a
synthesized peptide chain attached with a sugar chain. Among these,
a purified protein of which amino acid sequence is known or a
protein obtained by translation from a cDNA library and
purification using suitable methods can be preferably used.
[0306] The nucleic acid is not particularly limited so long as it
has an ability to interact with the modified protein of the present
invention, and either DNA or RNA may be used. Further, it may be a
nucleic acid of which nucleotide sequence or function is known or
unknown. Preferably, a nucleic acid of which function as a nucleic
acid having an ability to bind to a protein or nucleotide sequence
is known or a nucleic acid obtained by cleavage with a restriction
enzyme or the like and isolation from a genomic library or the like
can be used.
[0307] The sugar chain is not particularly limited so long as it
has an ability to interact with the modified protein of the present
invention, and it may be a sugar chain of which saccharide sequence
or function is known or unknown. Preferably, an already isolated
and analyzed sugar chain of which saccharide sequence or function
is known is used.
[0308] The low molecular weight compound is not particularly
limited so long as it has an ability to interact with the modified
protein of the present invention. A compound of which function is
unknown or a compound of which ability to bind to a protein is
already known may also be used.
[0309] The "interaction" caused by these targets molecules with the
modified protein of the present invention usually means an action
caused by an intermolecular force generated by at least one of
covalent bond, hydrophobic bond, hydrogen bond, van der Waals
binding and binding caused by electrostatic force between a protein
and a target molecule. However, this term should be construed in
its broadest sense, and it should not be construed in any
limitative way. The covalent bond includes a coordinate bond and
dipole bond. The binding caused by electrostatic force includes,
besides electrostatic bond, electric repulsion. Further, a bonding
reaction, synthetic reaction and decomposition reaction caused as a
result of the aforementioned action are also included in the
interaction.
[0310] Specific examples of the interaction include association and
dissociation of an antigen and an antibody, association and
dissociation of a protein receptor and a ligand, association and
dissociation of an adhesion molecule and a partner molecule,
association and dissociation of an enzyme and a substrate,
association and dissociation of a nucleic acid and a protein
binding to it, association and dissociation of proteins in an
information transmission system, association and dissociation of a
glycoprotein and a protein and association and dissociation of a
sugar chain and a protein.
[0311] The target molecule to be used may be modified with a
modification substance and used depending on embodiments. The
modification substance is usually selected from nonradioactive
modification substances such as fluorescent substances. The
fluorescent substances may be any of various fluorescent dyes of,
for example, fluorescein type, rhodamine type, Cy3, Cy5, eosine
type, NBD type and so forth, which have a free functional group
(e.g., carboxyl group, hydroxyl group, amino group etc.) and can
bind to the aforementioned target substance such as proteins and
nucleic acids. In addition, other compounds such as dyes may be
used, and type and size of the compounds are not critical so long
as they enable the modification.
[0312] Among these modification substances, a substance suitable
for the method of measurement or analysis of change of signal
generated due to an interaction between the target molecule and the
modified protein of the present invention is used.
[0313] The aforementioned modification substance can be bound to
the target molecule by a suitable method known per se.
Specifically, when the target molecule is a protein, the method of
modifying the C-terminal described in WO02/48347 or the like may be
used. Further, when the target molecule is a nucleic acid, it can
by easily modified by a method of performing PCR using an oligo DNA
primer bound with a modification substance beforehand via a
covalent bond or the like.
[0314] Further, the modified protein of the present invention or
the target molecule used for present invention may be bound to a
solid phase (i.e., immobilized) depending on the embodiment. As the
method for binding to a solid phase, there are a method of binding
it via the modification substance and a method of binding it via
another portion.
[0315] The modification substance used in binding via the
modification substance is usually a molecule specifically binding
to a particular polypeptide (henceforth also referred to as a
"ligand"), and a particular polypeptide binding to the ligand
(henceforth also referred to as an "adaptor protein") is bound to
the solid phase. The adaptor protein also includes binding
proteins, receptor proteins constituting receptors, antibodies and
so forth.
[0316] Examples of combinations of the adaptor protein and the
ligand include any of various receptor proteins and a ligand
thereof, for example, a biotin or iminobiotin binding protein such
as avidin and streptavidin and biotin or iminobiotin, maltose
binding protein and maltose, G protein and guanine nucleotide,
polyhistidine peptide and metal ion such as nickel or cobalt ion,
glutathione-S-transferase and glutathione, DNA binding protein and
DNA, antibody and antigen molecule (epitope), calmodulin and
calmodulin binding peptide, ATP binding protein and ATP, estradiol
receptor protein and estradiol and so forth.
[0317] Among these, preferred combinations of the adaptor protein
and the ligand are biotin or iminobiotin binding protein such as
avidin and streptavidin and biotin or iminobiotin, maltose binding
protein and maltose, polyhistidine peptide and metal ion such as
nickel or cobalt ion, glutathione-S-transferase and glutathione,
antibody and antigen molecule (epitope) and so forth, and a
combination of streptavidin and biotin or iminobiotin is the most
preferred. These binding proteins per se are known, and DNAs coding
these proteins have already been cloned.
[0318] The adaptor protein can be bound to a solid phase surface by
using a method known per se. Specifically, for example, there can
be used a method of utilizing tannic acid, formalin,
glutaraldehyde, pyruvic aldehyde, bis-diazotized benzizone,
toluene-2,4-diisocyanate, amino group, carboxyl group that can be
converted into an active ester group, hydroxyl group or amino group
that can be converted into phosphoramidite group, or the like.
[0319] When the binding is attained via a portion other than the
modification substance, there can be used a known method usually
used for binding a protein, nucleic acid, sugar chain or low
molecular weight compound to a solid phase. Specifically, there can
be used, for example, a method of utilizing tannic acid, formalin,
glutaraldehyde, pyruvic aldehyde, bis-diazotized benzizone,
toluene-2,4-diisocyanate, amino group, carboxyl group that can be
converted into an active ester group, hydroxyl group or amino group
that can be converted into phosphoramidite group, or the like.
[0320] The solid phase may be one usually used for immobilizing a
protein, nucleic acid or the like, and material and shape thereof
are not particularly limited. For example, glass plates,
nitrocellulose membranes, nylon membranes, polyvinylidene fluoride
membranes, microplates made of plastics and so forth can be
used.
[0321] The "measurement" is a means for collecting changes of
signals used for analysis, and it should not be construed in any
limitative way. As the measurement method used, any of methods that
can detect an intermolecular interaction can be used, including
fluorescence correlation spectroscopy, fluorescence resonance
energy transfer method, evanescent field molecular imaging method,
fluorescence depolarization method, fluorescence imaging analysis
method, surface plasmon resonance method, enzyme linked
immunosorbent assay and so forth.
[0322] The measurement method includes a method comprising adding
the modified protein of the present invention onto an array on
which a target molecule is immobilized and detecting the modified
protein of the present invention specifically binding to the target
molecule. The array on which the target molecule is immobilized
means a solid phase on which the target molecule is immobilized in
an arrangement enabling identification thereof. The method for
detecting the modified protein of the present invention
specifically binding to the target molecule is not particularly
limited, so long as the method enables detection of the modified
protein of the present invention specifically binding to the target
molecule. However, there is usually used, for example, a method of
removing the modified protein of the present invention not binding
to the target molecule by washing from the array to which the
modified protein of the present invention is added and detecting
the remaining modified protein of the present invention.
[0323] Hereafter, examples of the measurement method will be
explained.
(1) Fluorescence Correlation Spectroscopy
[0324] The fluorescence correlation spectroscopy (FCS, Eigen, M.,
et al., Proc. Natl. Acad. Sci., USA, 91, 5740-5747 (1994)) is a
method of measuring flow rate, diffusion coefficient, volume
shrinkage or the like of particles under a confocal laser
microscope or the like. In the present invention, interacting
molecules can be measured by measuring change of translational
Brownian movement of one original modified molecule of the present
invention (C-terminal modified protein) caused by an interaction
between the modified protein and a target molecule.
[0325] Specifically, fluorescence emitted from sample particles in
a partial volume of a sample solution due to excitation of the
sample particles by an excitation light is measured to obtain a
photon ratio. This value changes with the number of the particles
existing in a space volume observed during a specific period of
time. The aforementioned various parameters can be calculated from
the change of signals using an autocorrelation function.
Apparatuses for carrying out FCS are also marketed from Carl Zeiss
and so forth, and analysis can be performed by using these
apparatuses also in the present invention.
[0326] When a protein-target molecule interaction is measured or
analyzed by using this method, it is required to provide both of
the C-terminal modified protein and the target molecule as
solutions (liquid phase method). The target molecule does not need
to be labeled. Further, a molecule having a molecular weight
extremely smaller than that of the C-terminal modified protein of
which interaction should be investigated is not suitable for this
method, since such a molecule does not affect the Brownian movement
of the C-terminal modified protein.
[0327] However, fluorescence cross-correlation spectroscopy (FCCS)
using two kinds of fluorescent dyes can detect even an interaction
between proteins having molecular weights of similar order, of
which detection is difficult by FCS using one kind of fluorescent
dye. Although the fluorescence resonance energy transfer (FRET)
method is known as another method of using two kinds of fluorescent
dyes, two kinds of fluorescent dyes need to approach each other at
a distance within 40 to 50 .ANG. in order to cause FRET, and there
is a risk in this method that FRET may not be observed depending on
sizes of proteins, locations at which the fluorescent dyes are
attached or the like, even an interaction occurs. On the other
hand, since the detection of cross-correlation does not depend on
the distance between the fluorescent dyes in the FCCS method, it
does not suffer from such a problem. Further, comparing with the
fluorescence depolarization method as another detection system, the
FCCS method has advantages of a smaller amount of required sample,
shorter detection time, easier automatization for HTS and so forth.
Further, since the FCCS method provides extremely fundamental
information such as size and number of fluorescence-labeled
molecules, it may be used for general purpose like the surface
plasmon resonance method. The difference between the both is that,
in the surface plasmon resonance method, an interaction is detected
in the state that proteins are immobilized, whereas the FCCS method
enables observation of interaction in a solution, which is closer
to a natural state. In the FCCS method, although proteins do not
need to be immobilized, the proteins must be labeled with
fluorescent dyes instead. However, it has been made possible by the
present invention to overcome this problem.
[0328] Further, the FCCS method enables investigation of a
protein-protein interaction or protein-nucleic acid interaction in
a state of solution, which is close to the intracellular
environment, and enables convenient calculation of dissociation
constant (binding constant) by one measurement.
[0329] The method for bringing a target molecule into contact with
the C-terminal modified protein in this method may be any method
that allows the contact in a sufficient degree such that they can
interact with each other. However, it is preferably attained by a
method of introducing a solution dissolving the C-terminal modified
protein in a buffer usually used for biochemical purpose or the
like at an appropriate concentration into a well for measurement in
a commercially available FCS apparatus and further introducing a
solution dissolving the target molecule in the same buffer at an
appropriate concentration into the well.
[0330] In this method, as a method of performing multiple analyses,
for example, there is used a method of introducing multiple kinds
of different C-terminal modified proteins into wells for
measurement in the aforementioned FCS apparatus, respectively, and
further introducing a solution of a particular target molecule into
the wells, or introducing a particular C-terminal modified protein
into wells, and further introducing solutions of multiple kinds of
different target molecules into the wells, respectively.
(2) Fluorescence Imaging Analysis Method
[0331] The fluorescence imaging analysis method is a method of
bringing a modifying molecule into contact with an immobilized
molecule and measuring or analyzing fluorescence emitted by the
immobilized modifying molecule remained on the immobilized molecule
due to an interaction between the both molecules using a
commercially available fluorescence imaging analyzer.
[0332] When a protein-target molecule interaction is measured or
analyzed by using this method, one of the C-terminal modified
protein or the target molecule must be immobilized by the
aforementioned method. When an immobilized target molecule is used,
either a modified or unmodified target molecule can be used.
Further, when it is used without immobilization, it must be
modified with the aforementioned modification substance. Either a
C-terminal modified protein immobilized at the modified portion or
a C-terminal modified protein immobilized at a portion other than
the modified portion may be used.
[0333] As a substrate for immobilizing a C-terminal modified
protein or target molecule (solid phase), there can be used glass
plates, nitrocellulose membranes, nylon membranes, microplates made
of plastics and so forth, which are usually used for immobilizing a
protein, nucleic acid or the like. Further, such substrates as
mentioned above of which surfaces are bound with various functional
groups (amino group, carboxyl group, thiol group, hydroxyl group
etc.) or various ligands (biotin, iminobiotin, metal ions such as
nickel or cobalt ion, glutathione, saccharides, nucleotides, DNA,
RNA, antibody, calmodulin, receptor protein etc.) can also be
used.
[0334] The method for bringing a modified target molecule or a
C-terminal modified protein into contact with an immobilized
molecule in this method may be any method that allows the contact
in a sufficient degree such that the both molecules can interact
with each other. However, it is preferably attained by a method of
preparing a solution dissolving the modified target molecule or the
C-terminal modified protein in a buffer usually used for
biochemical purpose at an appropriate concentration and bringing
the solution into contact with the solid phase surface.
[0335] After bringing the both molecules into contact with each
other, a step of washing off excessively existing modified target
molecule or C-terminal modified protein with the same buffer or the
like is preferably performed, and fluorescence signal emitted from
the modification substance of the target molecule or C-terminal
modified protein which remained on the solid phase, or a mixed
signal of fluorescence emitted from the immobilized modified
molecule and fluorescence emitted from the modified molecule
remained on the solid phase can be measured or analyzed by using a
commercially available imaging analyzer to identify the molecule
that interacts with the immobilized molecule.
[0336] In this method, as a method of simultaneously performing
multiple analyses, for example, there is used a method of
immobilizing multiple kinds of C-terminal modified proteins or
modified or unmodified target molecules on the aforementioned solid
phase surface with positioning addresses, a method of bringing
multiple kinds of non-immobilized C-terminal modified proteins or
modified target molecules into contact with one kind of C-terminal
modified protein or modified or unmodified target molecule, or the
like. When multiple kinds of C-terminal modified proteins or
modified target molecules are brought into contact, the molecules
remained on the solid phase can be obtained by dissociating them
using difference of buffer concentration or the like and analyzed
by a known method to identify them.
(3) Fluorescence Resonance Energy Transfer Method
[0337] As another intermolecular interaction detection method using
two kinds of fluorescent dyes, the fluorescence resonance energy
transfer (FRET) method is well known. FRET means a phenomenon that,
if a fluorescence spectrum of one of two kinds of fluorescent dyes
(energy donor) and an absorption spectrum of the other (energy
receptor) overlap, and the distance between two of the fluorescent
dyes is sufficiently small, it becomes more likely that excitation
energy of the donor excites the receptor before the donor emits
fluorescence. Therefore, two kinds of proteins of which interaction
is desired to be detected are labeled with fluorescent dyes serving
as the donor and the receptor, respectively, and the donor is
excited. When the two kinds of proteins do not interact with each
other, FRET is not caused because the distance between the
fluorescence dyes is large, and thus fluorescence spectrum of the
donor is observed. However, if the two kinds of proteins interact
with each other, and hence the distance between the fluorescent
dyes becomes smaller, fluorescence spectrum of the receptor is
observed due to FRET. Therefore, presence or absence of an
interaction between the proteins can be determined on the basis of
difference in wavelengths of fluorescence spectra. As for the
fluorescent dyes, a combination of fluorescein as the donor and
rhodamine as the receptor is frequently used. Further, it is
recently attempted to observe FRET in a cell to detect an
interaction by using combination of mutant green fluorescence
proteins (GFP) emitting fluorescence of different wavelengths. As a
drawback of this method, it is mentioned that since two kinds of
fluorescent dyes need to approach to each other at a distance
within 40 to 50 .ANG. in order to cause FRET, there is a risk that
FRET may not be observed depending on sizes of proteins, locations
at which the fluorescent dyes are attached or the like, even if an
interaction occurs.
(4) Evanescent Field Molecular Imaging Method
[0338] The evanescent field molecular imaging method is a method
described in Funatsu, T., et al., Nature, 374, 555-559 (1995) or
the like, and it is a method of bringing a second molecule as a
solution into contact with a molecule immobilized on a transparent
material such as glass, irradiating them with a laser light or the
like from a light source at such an angle that an evanescent field
should be generated, and measuring or analyzing the generated
evanescent light using a detector. These operations can be
performed by using an evanescent field fluorescence microscope
known per se.
[0339] When a protein-target molecule interaction is measured or
analyzed by using this method, one of the C-terminal modified
protein or the target molecule must be immobilized by the
aforementioned method. When an immobilized target molecule is used,
it does not need to be modified. However, when it is used without
immobilization, it must be modified with the aforementioned
modification substance.
[0340] As the substrate for immobilizing the C-terminal modified
protein or target molecule, a substrate made of a material of glass
or the like is used, and quartz glass is preferably used. Further,
a substrate of which surface is cleaned by ultrasonication is
preferred in order to prevent scatter of laser light or the
like.
[0341] The method for bringing a non-immobilized C-terminal
modified protein or target molecule into contact with an
immobilized molecule in this method may be any method that allows
the contact in a sufficient degree such that the both molecules can
interact with each other. However, a method of preparing a solution
dissolving the non-immobilized C-terminal modified protein or
modified target molecule in a buffer usually used for biochemical
purpose at an appropriate concentration and adding the solution
dropwise to the solid phase surface is preferred.
[0342] After bringing the both molecules into contact with each
other, fluorescence generated through excitation by the evanescent
field illumination can be measured by using a detector such as a
CCD camera to identify the molecule that interacts with the
immobilized molecule.
[0343] In this method, as a method of simultaneously performing
multiple analyses, for example, there is used a method of
immobilizing multiple kinds of C-terminal modified proteins or
modified target molecules on the aforementioned substrate with
positioning addresses, or the like.
(5) Fluorescence Depolarization Method
[0344] The fluorescence polarization method (Perran, J., et al., J.
Phys. Rad., 1, 390-401 (1926)) is a method utilizing the fact that
a fluorescent molecule excited with a polarized fluorescent light
emits fluorescence in the same plane of polarization during the
excited state while it maintains a stationary state, whereas the
emitted fluorescence has a plane different from that of the
excitation light when the excited molecule undergoes rotational
Brownian movement or the like during the excited state. The
movement of molecule is affected by the size thereof, and when the
fluorescent molecule is a macromolecule, the molecule scarcely
shows movement during the excited state, and emitted light is
maintained to be a polarized light. However, in the case of a low
molecular weight fluorescent molecule, since it shows high moving
velocity, the emitted light is depolarized. Therefore, if intensity
of the fluorescence emitted from a fluorescent molecule excited by
a plane polarized light is measured along the original plane and a
plane perpendicular thereto, information of motility and existing
state of the molecule can be obtained from a ratio of the
fluorescence intensities for the both planes. According to this
method, behavior of a target molecule that interacts with a
fluorescence-modified molecule can be traced without being affected
by contaminants, if any. This is because shift of polarization
degree is measured only when the fluorescence-modified molecule and
the target molecule interact with each other.
[0345] As apparatuses for carrying out this method, BECON (produced
by Panyera) and so forth are marketed, and this method can be
carried out by using these apparatuses.
[0346] When a protein-target molecule interaction is measured or
analyzed by using this method, it is required to provide both of
the C-terminal modified protein and the target molecule as
solutions. The target molecule does not need to be modified.
Further, a molecule having a molecular weight extremely smaller
than that of the C-terminal modified protein of which interaction
should be investigated is not suitable for this method, since such
a molecule does not affect the Brownian movement of the C-terminal
modified protein.
[0347] The method for bringing a target molecule into contact with
the C-terminal modified protein in this method may be any method
that allows the contact in sufficient degree such that they should
interact with each other. However, it is preferably attained by a
method of introducing a solution dissolving the C-terminal modified
protein in a buffer usually used for biochemical purpose at an
appropriate concentration into a well for measurement in a
commercially available fluorescence depolarization apparatus and
further introducing a solution dissolving the target molecule in
the same buffer at an appropriate concentration into the well.
[0348] It is expected that specificity of interaction between the
C-terminal modified protein and the target molecules to be measured
in this method is not necessarily so high as that of an
antigen-antibody reaction. Therefore, in order to identify an
optimum combination, it is effective that degree of interaction
should be numerically defined. As an index representing degree of
interaction, for example, a value of the minimum target substance
concentration providing the maximum fluorescence polarization
degree for a C-terminal modified protein of a fixed concentration
or the like can be used.
[0349] In this method, as a method of simultaneously performing
multiple analyses, for example, there is used a method of
introducing multiple kinds of different C-terminal modified
proteins into wells for measurement in the aforementioned
fluorescence depolarization apparatus, respectively, and further
introducing a solution of a particular target molecule into the
wells, or introducing a particular C-terminal modified protein into
wells and further introducing solutions of multiple kinds of
different target molecules into the wells, respectively.
(6) Surface Plasmon Resonance Method
[0350] The surface plasmon resonance method is a method of
measuring surface plasmon excited by a molecule interacting at a
metal/liquid interface as change of intensity of reflected light
(Cullen, D. C., et al., Biosensors, 3 (4), 211-225 (1987-88)). When
a protein-target molecule interaction is measured or analyzed by
using this method, the C-terminal modified protein must be
immobilized by the aforementioned method, but the target molecule
does not need to be modified.
[0351] As a substrate for immobilizing the C-terminal modified
protein, a transparent substrate made of glass or the like on which
a thin film of metal such as gold, silver or platinum is formed is
used. The transparent substrate may be any of those usually used
for surface plasmon resonance apparatuses. It generally consists of
glass as a substrate consisting of a material transparent to a
laser light, and such a substrate having a thickness of about 0.1
to 5 mm is generally used. Further, thickness of the metal thin
film is suitably about 100 to 2000 .ANG.. Those marketed as such
immobilization substrates for surface plasmon resonance apparatuses
can also be used. The C-terminal modified protein can be
immobilized on the substrate by the method described above.
[0352] The method for bringing a target molecule into contact with
the C-terminal modified protein in this method may be any method
that allows the contact in a sufficient degree such that the both
molecules can interact with each other. However, a method of
bringing the immobilized C-terminal modified protein into contact
with a solution dissolving the target molecule in a buffer usually
used for biochemical purpose at an appropriate concentration can be
preferably used.
[0353] These steps may also be performed by using a commercially
available surface plasmon resonance apparatus, for example, BIAcore
2000 (produced by Pharmacia Biosensor). After bringing the both
molecules into contact with each other, change with time of
relative intensity of each reflected light can be measured by using
a surface plasmon resonance apparatus known per se to analyze or
measure an interaction of the immobilized C-terminal modified
protein and the target molecule.
[0354] In this method, as a method of simultaneously performing
multiple analyses, for example, there is used a method of
immobilizing multiple kinds of C-terminal modified proteins on a
substrate used for the surface plasmon resonance apparatus with
positioning addresses, a method of bringing multiple kinds of
target molecules into contact with one kind of immobilized
C-terminal modified protein, or the like.
(7) Enzyme Linked Immunosorbent Assay
[0355] The enzyme linked immunosorbent assay (ELISA, Crowther, J.
R., Methods in Molecular Biology, 42 (1995)) is a method of
bringing a solution containing an antibody into contact with an
antigen immobilized on a solid phase and measuring or analyzing the
antibody remaining on the immobilized antigen due to the
interaction between the both molecules (antigen-antibody reaction)
on the basis of fluorescence emitted from a modification molecule
(IgG etc.) specifically binding to the antibody or a signal emitted
by a dye formed from the modification molecule as a substrate using
a commercially available detector (ELISA reader).
[0356] When a protein-target molecule interaction is measured or
analyzed by using this method, the C-terminal modified protein
serving as the antigen must be immobilized by the aforementioned
method. Further, the target molecule serving as the antibody must
be modified with the aforementioned modification substance.
[0357] As a substrate for immobilizing the C-terminal modified
protein serving as the antigen, microplates made of plastics
usually used for ELISA and so forth can also be used.
[0358] The method for bringing the modified target molecule serving
as the antibody into contact with an immobilized molecule in this
method may be any method that allows the contact in a sufficient
degree such that the both molecules can interact with each other.
However, a method of preparing a solution dissolving the modified
target molecule in a buffer usually used for biochemical purpose at
an appropriate concentration and introducing the solution into a
microplate is preferred.
[0359] After bringing the both molecules into contact with each
other, a step of washing off excessively existing modified molecule
not binding to the immobilized molecule is preferably performed,
and fluorescence emitted from the modified molecule remained on the
solid phase can be measured or analyzed by using a commercially
available ELISA reader or the like to identify the molecule that
interacts with the immobilized antigen molecule.
[0360] In this method, as a method of simultaneously performing
multiple analyses, for example, there is used a method of
immobilizing multiple kinds of different modified target molecules
in each well of the aforementioned microplate.
[0361] Further, the protein of the present invention can also be
used for identification of a molecule that causes an
interaction.
[0362] When primary structure of a target molecule for which an
interaction with a C-terminal modified protein is recognized on the
basis of measurement according to any one of the methods described
above is unknown, the primary structure can be analyzed by a
suitable method known per se. Specifically, when the target
molecule for which an interaction is recognized is a protein, its
amino acid sequence can be analyzed by using an amino acid analyzer
etc. to identify the primary structure. Further, when the target
molecule is a nucleic acid, nucleotide sequence can be determined
by a nucleotide sequence determination method using an automatic
DNA sequencer or the like.
[0363] Furthermore, the protein of the present invention can also
be used for analysis of an interaction with a protein library.
[0364] The present invention provides a gene or nucleic acid
sequence encoding a novel protein that can form a complex with the
c-Fos protein, which was obtained by performing cotranslation
selection/screening of IVV using the c-Fos protein as the bait and
a mouse brain cDNA library as the prey, and methods for utilizing
them. The present invention also provides a method for utilizing a
gene or nucleic acid sequence encoding a known protein, which is
not known to form a complex with the c-Fos protein.
[0365] The present invention not only enables screening of known
gene sequences and known nucleic acid sequences, but also can
provide a novel protein having a novel amino acid sequence formed
by unexpected frame shift, a novel protein having a nucleic acid
sequence for which only the nucleic acid sequence is published on
the basis of genome information, or a novel protein having a
completely novel nucleic acid sequence, further, a protein that
forms a complex by an unexpected indirect interaction in addition
to a direct interaction, a gene or nucleic acid sequence encoding
the protein, and methods for utilizing them.
EXAMPLES
[0366] Hereafter, the amino acid sequences of the proteins of the
present invention and the sequences of the nucleic acids encoding
them will be specifically described. However, the following
examples should be construed as a mere aid for specifically
understanding the present invention, and the scope of the present
invention is no way limited by the following examples.
Example 1
[0367] Cotranslation selection/screening of IVV was carried out by
using the c-Fos protein as a bait and a mouse brain cDNA library as
a prey (FIG. 2), and as a result, genes or nucleic acid sequences
encoding novel proteins that can form a complex with the c-Fos
protein were obtained.
[0368] The preparation method of the bait, c-Fos protein, was as
follows. A DNA template was prepared from a pCMV-FosCBPzz vector
(SEQ ID NO: 168) by PCR (primers 5' SP6(O29)T7-FosCBPzz (SEQ ID NO:
169) and 3' FosCBPzz (SEQ ID NO: 170), and PCR program CYCB1 (refer
to Table 1)) using TaKaRa Ex Taq (Takara Shuzo). The DNA template
was transcribed (37.degree. C., 2 hours) by using RiboMAX.TM. Large
Scale RNA Production Systems (Promega) to prepare a mRNA template
of the bait c-Fos protein. A bait DNA made to coexist was prepared
by PCR (primers 5' DNA (SEQ ID NO: 172) and 3' DNA (SEQ ID NO:
173)) using DNA-Fos/Jun (SEQ ID NO: 171) containing the Fos/Jun
binding sequence as a template according to the PCR program V-2
(refer to Table 1).
[0369] The preparation method of the mouse brain cDNA library as
the prey was as follows. An IVV random library was prepared as
shown in FIG. 3. As an RNA library, a commercially available mouse
brain (polyA+) RNA library (obtained by purifying a tissue
extracted RNA library in an oligo dT column, Clontech) was
purchased. As for design of an adaptor, it was designed so as to
add a 5' UTR sequence suitable for the production of assigning
molecules (promoter SP6+enhancer O29 or O') to the library as a
sequence required for IVV formation. For the mouse brain (polyA+)
RNA library, an adaptor having the enhancer O29 was used. The main
chain (SEQ ID NO: 174 or 175) and the subchain (gaattcgc or
ggaattcg) of the adaptor for the enhancer O29 were each dissolved
in the TE buffer (10 mM Tris-Cl, pH8.0, 1 mM EDTA) at a
concentration of 100 .mu.M, and 10 .mu.l each of the solutions of
the main chain and subchain were mixed so that the main chain and
the subchain were mixed in equimolar amounts. The mixture was
heated at 90.degree. C. for 2 minutes and at 70.degree. C. for 5
minutes, set on a water bath of 60.degree. C., and then slowly
cooled from 60.degree. C. to room temperature by turning off the
heater of the bath. The mixture was divided into 5 .mu.l aliquots,
and stored at -20.degree. C. Then, the mouse brain (polyA+) RNA
library was reverse-transcribed into single stranded DNAs (FIG. 3,
I). 0.5 .mu.g of the mouse brain (polyA+) RNA library (1.4
pmole/0.5 .mu.g), 2 pmol of 3' random primer (SEQ ID NO: 176) and
DEPC water were added to obtain a volume of 12.0 .mu.l, and the
mixture was heated at 70.degree. C. for 10 minutes, and cooled for
1 minute on ice. A reverse transcription reaction was performed at
45.degree. C. for 1 hour by using this mixture and SuperScriptII RT
(SuperScript Double-stranded cDNA Synthesis Kit, Invitrogen). Then,
the total amount of the single stranded DNAs synthesized by the
reverse transcription reaction was used for a reaction with an E.
coli DNA ligase, E. coli Polymerase I and E. coli RNase H
(SuperScript Double-stranded cDNA Synthesis Kit, Invitrogen) at
16.degree. C. for 2 hours, and the product was blunt-ended with T4
DNA polymerase at 16.degree. C. for 5 minutes to synthesize
double-stranded DNAs (FIG. 3, II). Then, the adaptor previously
prepared was ligated by taking advantage of the fact that the 5'
ends of the double-stranded DNAs were phosphorylated (FIG. 3, III).
The synthesized double-stranded DNA library was subjected to
ethanol precipitation, and dissolved in 4 .mu.l of DEPC water. 100
.mu.M of the prepared adaptor in a volume of 1.0 .mu.l and 50 .mu.l
of Ligation High (TOYOBO) were added thereto, reacted overnight at
16.degree. C., purified (DNA purification kit, QIAGEN), and then
adjusted to a volume of 50 .mu.l. Thereafter, PCR (EX Taq Hot Start
Version, TaKaRa) was performed (FIG. 3, IV). Out of 50 .mu.l of the
ligated double-stranded DNA library, 2 .mu.l was used as a template
together with 5' PCR primer (SEQ ID NO: 172) having a specific
sequence required for IVV (O29) and 3' PCR primer (SEQ ID NO: 173)
to prepare an IVV cDNA library. As for the PCR conditions, the
total volume was 100 .mu.l, and 22 cycles of the reactions (each
cycle consists of reactions at 94.degree. C. for 30 seconds, at
60.degree. C. for 30 seconds, and at 72.degree. C. for 90 seconds,
and the final extension reaction was performed at 72.degree. C. for
180 seconds).
[0370] Cotranslation (26.degree. C., 60 minutes) of the mRNA
template of the bait c-Fos protein, the mouse brain cDNA library as
the prey, and the bait DNA made to coexist was carried out in a
cell-free translation system of wheat (Wheat Germ Extract, Promega)
in a volume of 50 .mu.l. To 50 .mu.l of the sample, 50 .mu.l of IgG
binding buffer (10 mM Tris-Cl, pH 8.0, 150 mM NaCl, 0.1% NP40) was
added to obtain the total volume of 100 .mu.l (cotranslation
sample). Then, IgG agarose (Sigma) was washed twice with the IgG
binding buffer, and the cotranslation sample (100 .mu.l) was added
thereto, and the mixture was stirred by rotation at 4.degree. C.
for 2 hours. The IgG agarose was washed 3 times with the binding
buffer and once with a TEV cleaving buffer (10 mM Tris-Cl, pH 8.0,
150 mM NaCl, 0.1% NP40, 0.5 mM EDTA, 1 mM DTT), and the bait/prey
complex binding to the IgG agarose was cleaved with TEV protease
(GIBCO-BRL, 16.degree. C., 2 hours). Further, to 90 .mu.l of the
supernatant, 300 .mu.l of a calmodulin binding buffer, 0.3 .mu.l of
1 M CaCl.sub.2 and 50 .mu.l of calmodulin beads washed twice with
500 .mu.l of the calmodulin binding buffer were added, and the
mixture was stirred by rotation at 4.degree. C. for 1 hour. After
centrifugation, the beads were washed 3 times with 1000 .mu.l the
calmodulin binding buffer. 50 .mu.l of a calmodulin elution buffer
was added, and the mixture was left on ice for 1 to 2 minutes, and
centrifuged to collect 50 .mu.l of a solution. By using the
collected solution as a template, RT-PCR (One step RT-PCR kit
(QIAGEN), primers: SEQ ID NOS: 177 and 178, program: RT-QH30'
(refer to Table 1)). After this screening/selection procedure (FIG.
2) was repeated for 3 rounds, the library was cloned and sequenced
to obtain the sequences of SEQ ID NOS: 1 to 14 (amino acid
sequences of Fip-cx), SEQ ID NOS: 15 to 19 (amino acid sequences of
Eef1dTEF-1), SEQ ID NOS: 20 to 22 (amino acid sequences of Schip1)
and nucleic acid sequences corresponding to them (SEQ ID NOS: 1 to
22 in FIG. 1A). The same results were obtained for both of the
library prepared by using the sequence of SEQ ID NO: 174 as the
main chain of the adaptor for enhancer O29 and the library prepared
by using the sequence of SEQ ID NO: 175 as the same.
[0371] All the proteins had a Leu zipper, and they are proteins
found by the present invention for the first time to directly
interact with c-Fos.
[0372] As a verification experiment of the interactions of the
obtained proteins and c-Fos, expression of the proteins of SEQ ID
NOS: 2 (Fip-cx), 16 (Eef1dTEF-1) and 22 (Schip1) (FIG. 1A) in a
cell-free translation system was experimentally confirmed by using
the DNA sequences of SEQ ID NOS: 2-1, 16-1, and 22-1 according to
the descriptions of WO02/46395, Example 1, (2) Preparation of
coding molecule and (3) Translation of coding molecule, i.e., it
was confirmed by the C-terminal labeling method that the proteins
were expressed in a wheat cell-free translation system (FIG. 4, A).
Further, formation of IVV was also confirmed according to the
descriptions of WO02/46395, Example 1, (4) Binding of spacer
molecule and coding molecule and (5) Formation of assigning
molecule (FIG. 4, B). Furthermore, the interaction with c-Fos was
confirmed for those subjected to the first stage pull-down (FIG. 2,
IgG+TEV) by 8 M urea/10% SDS-PAGE (FIG. 4, C). As a result, it
could be confirmed that the proteins of SEQ ID NOS: 2 (Fip-cx), 16
(Eef1dTEF-1) and 22 (Schip1) interacted with c-Fos.
[0373] Further, the proteins and genes or nucleic acid sequences of
the present invention can be used as an inhibitor for blocking
transcription, gene duplication and so forth as functions of c-Fos
by utilizing the novel function thereof (function of enabling
binding with c-Fos in this case). The basis of the above is
originates in the fact that the genes detected by the IVV method
have been detected through a competitive process constituted by
screening repeated multiple times. Therefore, the genes detected by
the IVV method show a certain number distribution, and a gene
having a stronger competitive power should be detected in a larger
number. This suggests that a larger number of clones corresponds to
stronger competitive power, and thus such a gene acts more
effectively as a blocking agent or inhibitor. In the IVV selection
performed in this example, three (/72) of c-Jun well known as a
prey were detected for the bait c-Fos. Thus, the numbers of clones
(FIG. 1A) detected in the selection indicate that Fip-cx, Eef1 and
Schip1 have extremely stronger competitive power compared with
known proteins, and they can sufficiently compete, and the proteins
can be utilized as an inhibitor for blocking functions of
transcription of a complex, gene duplication and so forth by the
interaction of c-Jun and a known protein.
Example 2
[0374] A prey IVV library was prepared from the bait c-Fos and a
mouse brain cDNA library in the same manner as that used in Example
1, and the screening/selection procedure (FIG. 2) was also
performed in the same manner as that used in Example 1. However, in
this example, the first stage selection using IgG beads in the
two-stage screening was repeated 3 times, and the two-stage
selection was performed only for the 4th time to obtain the
proteins of SEQ ID NOS: 47 to 56 (amino acid sequences of
Fip-cx.1), SEQ ID NOS: 57 to 76 (amino acid sequences of Fip-cx.2),
SEQ ID NOS: 77 to 81 (amino acid sequences of Optin), SEQ ID NOS:
82 to 84 (amino acid sequences of Snap19), SEQ ID NOS: 85 and 86
(amino acid sequences of C130020M04Rik), SEQ ID NOS: 87 to 89
(amino acid sequences of FLJ32000), SEQ ID NOS: 90 and 91 (amino
acid sequences of Rit2), SEQ ID NOS: 92 and 93 (amino acid
sequences of cytocrome b), SEQ ID NOS: 94 and 95 (amino acid
sequences of Apoe), SEQ ID NOS: 96 and 97 (amino acid sequences of
App), SEQ ID NOS: 98 and 99 (amino acid sequences of Dnaja2), SEQ
ID NOS: 100 and 101 (amino acid sequences of Fip-c10), SEQ ID NO:
102 (amino acid sequence of Fip-c4), SEQ ID NO: 103 (amino acid
sequence of Fip-c18), and nucleic acid sequences corresponding to
these proteins (SEQ ID NOS: 47 to 76 in FIG. 1A, and SEQ ID NOS: 77
to 103 in FIG. 1B). The same results were obtained for both of the
library prepared by using the sequence of SEQ ID NO: 174 as the
main chain of the adaptor for enhancer O29 and the library prepared
by using the sequence of SEQ ID NO: 175 as the same.
[0375] Fip-cx.1, Fip-cx.2, Optin, C130020M04Rik, FLJ32000, and
cytocrome b proteins have a Leu zipper, and Rit2, Apoe, App,
Dnaja2, Fip-c10, Fip-c4, and Fip-c18 proteins do not have a Leu
zipper. All the proteins are proteins found by the present
invention for the first time to form a complex with c-Fos.
[0376] As a verification experiment of the interactions of the
obtained proteins and c-Fos, expression of the proteins of SEQ ID
NOS: 48 (Fip-cx.1), 75 (Fip-cx.2), 78 (Optn), 84 (Snapc5), 86
(C13002004Rik), 88 (FLJ32000), 91 (Rit2), 93 (cytochrome b), 95
(Apoe), 97 (betaAPP), 99 (Hsp40), 101 (Fip-c10), 102 (Fip-c4) and
103 (Fip-c18) (FIG. 1) in a cell-free translation system was
experimentally confirmed by using the DNA sequences of SEQ ID NOS:
105, 139, 142, 148, 150, 152, 155, 157, 159, 161, 163, 165, 166 and
167 according to the descriptions of WO02/46395, Example 1, (2)
Preparation of coding molecule and (3) Translation of coding
molecule, i.e., it was confirmed by the C-terminal labeling method
that the proteins were expressed in a wheat cell-free translation
system (FIG. 5, A). Further, among those C-terminal labeled
proteins for which expression was confirmed, the proteins of SEQ ID
NOS: 48 (Fip-cx.1) and 75 (Fip-cx.2), which are completely novel
proteins not registered at any database, were used as a prey
protein to confirm interactions thereof with the bait c-Fos by
pull-down. As for the preparation method of the prey protein,
specifically, PCR cloning kit (QIAGEN) was used to extract a
sequence cloned in the pDrive vector (SEQ ID NO: 179, QIAGEN) from
cells, and a DNA template was prepared by PCR (primers 5' F3 (SEQ
ID NO: 180) and 3' R3 (SEQ ID NO: 181), PCR program: ISHI1562
(refer to Table 1), 100 .mu.l scale) using TaKaRa Ex Taq (Takara
Shuzo). The DNA template was transcribed (37.degree. C., 2 hours,
50 .mu.l scale) by using RiboMAX.TM. Large Scale RNA Production
Systems (Promega) to prepare a mRNA template of the prey
protein.
[0377] The preparation method of the bait c-Fos protein is the same
as that used for the selection/screening.
[0378] Cell-free translation of the prey template (10 .mu.l scale)
was performed for 1 hour by using the C-terminal labeling method to
prepare a prey protein in a C-terminal labeled state. At the same
time, the translation reaction of the bait c-fos template was
performed for 1 hour by the cell-free translation (50 .mu.l scale)
to produce the bait protein. After the translation, the both and
the binding buffer were mixed (prey: 8 .mu.l, bait: 10 .mu.l, IgG
binding buffer: 82 .mu.l), and incubated with 50 .mu.l of IgG
agarose beads for 2 hours, and the beads were washed, then added
with 20 .mu.l of a buffer containing SDS, boiled at 100.degree. C.
for 5 minutes, and eluted. This sample was developed by 17.5%
SDS-PAGE, and the FITC fluorochrome was observed by means of a
fluorescence imager (FIG. 5, B). In addition, a reaction was also
performed without adding the bait c-Fos protein as a control.
[0379] As a result, it could be confirmed that the proteins of SEQ
ID NOS: 48 (Fip-cx.1) and 75 (Fip-cx.2) directly interacted with
c-Fos.
[0380] Furthermore, as shown in FIG. 6, concentration of genes
directly or indirectly interacting with c-Fos was confirmed by real
time PCR using the nucleic acid sequences of SEQ ID NOS: 142
(Optn), 148 (Snapc5), 150 (C130020M04Rik) and 152 (FLJ32000). As
for the specific method of the real time PCR, primers (SEQ ID NOS:
182 to 189) were designed for four kinds of genes (SEQ ID NOS: 142
(Optn), 148 (Snapc5), 150 (C130020M04Rik) and 152 (FLJ32000)) so
that the amplification should be attained in the ranges of
sequences obtained by the screening. For preparation of calibration
curves, a gene comprising a DNA fragment of positive control
incorporated into the pDrive vector was amplified by PCR (5' M13_F
primer (SEQ ID NO: 190) and 3' M13_R primer (SEQ ID NO: 191) were
used, and the PCR program lightcycler of Table 1 was used), which
was controlled so that 1E03, 1E05, 1E07 or 1E09 clones/reaction
should be obtained. The measurement was controlled so that each of
the library DNA before screening, library DNA in each cycle of
screening, and Mock library DNA not added with the bait c-Fos
should be in an amount of 5 ng/reaction. The PCR measurement
reaction was performed in a scale of 20 .mu.l according to the
programs shown in Table 1 by using LightCycler Instrument or
LightCycler FastStart DNA Master SYBR Green I (both are produced by
Roche Diagnostics).
[0381] Further, the proteins and genes or nucleic acid sequences of
the present invention can be used as an inhibitor for blocking
transcription, gene duplication and so forth as functions of c-Fos
by utilizing the novel function thereof (function of enabling
binding with c-Fos in this case). The basis of the above is
originates in the fact that the genes detected by the IVV method
have been detected through a competitive process constituted by
screening repeated multiple times. Therefore, the genes detected by
the IVV method show a certain number distribution, and a gene
having a stronger competitive power should be detected in a larger
number. This suggests that a larger number of clones corresponds to
stronger competitive power, and thus such a gene acts more
effectively as a blocking agent or inhibitor. In the IVV selection
performed in this example, three (/142) of JunD well known as a
prey were detected for the bait c-Fos. Thus, the numbers of clones
(FIGS. 1A and 1B) detected in the selection indicate that Fip-cx.1,
Fip-cx.2, Optn, and so forth have extremely stronger competitive
power compared with known proteins, and Snap19, FLJ32000, and so
forth can sufficiently compete with known proteins, and thus the
proteins can be utilized as an inhibitor for blocking functions of
transcription of a complex, gene duplication and so forth by the
interaction of c-Jun and a known protein. TABLE-US-00001 TABLE 1
PCR programs Program name: CYCB1 Reaction conditions: ##STR1##
Program name: V-2 Reaction conditions: ##STR2## Program name:
RT-QH30' Reaction conditions: ##STR3## Program name: ISHI1562
Reaction conditions: ##STR4## Program name: lightcycler Reaction
condition: ##STR5##
INDUSTRIAL APPLICABILITY
[0382] Because proteins that interact with c-Fos have been
provided, it becomes possible to provide proteins forming a complex
with c-Fos by not only a direct interaction, but also an unexpected
indirect interaction, and nucleic acids encoding these proteins as
well as methods for utilizing them.
Sequence CWU 1
1
191 1 184 PRT Mus musculus 1 Met Pro Leu Arg His Leu Ala Asp Arg
Leu Gly His Leu Ala Asp Arg 1 5 10 15 Leu Arg His Leu Thr Asp Arg
Leu Arg His Leu Ala Asp Arg Leu Arg 20 25 30 His Leu Thr Asp Arg
Leu Arg His Leu Ala Asp Arg Leu Arg His Leu 35 40 45 Ala Asp Arg
Leu Lys His Leu Thr Ser Arg Leu Gly His Leu Thr Asp 50 55 60 Arg
Ser Trp His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu 65 70
75 80 Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Gln Arg
Tyr 85 90 95 Leu Ala Asp Arg Leu Arg His Leu Thr Asp Arg Leu Gly
His Leu Thr 100 105 110 Asp Arg Leu Arg His Leu Thr Asp Arg Leu Gly
His Leu Thr Asp Arg 115 120 125 Leu Arg His Leu Thr Asp Arg Leu Gly
His Leu Thr Asp Arg Leu Met 130 135 140 His Leu Thr Asp Arg Leu Arg
His Leu Ala Asp Arg Gln Arg His Leu 145 150 155 160 Ala Asp Arg Gln
Arg His Leu Ala Asp Arg Leu Arg His Leu Ala Asp 165 170 175 Lys Leu
Arg His Gln Leu Gln Leu 180 2 55 PRT Mus musculus 2 Asp Lys Leu Arg
His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg 1 5 10 15 Leu Arg
His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Met 20 25 30
His Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Arg His Leu 35
40 45 Ala Asp Arg Gln Arg His Asp 50 55 3 55 PRT Mus musculus 3 Asp
Arg Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg 1 5 10
15 Leu Arg His Leu Thr Asp Arg Leu Gly His Val Thr Asp Arg Leu Met
20 25 30 His Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Arg
His Leu 35 40 45 Ala Asp Arg Gln Arg His Asp 50 55 4 38 PRT Mus
musculus 4 Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Arg His Leu
Thr Asp 1 5 10 15 Arg Leu Gly His Leu Thr Asp Arg Leu Met His Leu
Thr Asp Arg Leu 20 25 30 Ser His Pro Thr Gln Thr 35 5 45 PRT Mus
musculus 5 Asp Arg Leu Gly His Leu Thr Asp Arg Leu Lys His Leu Thr
Asp Arg 1 5 10 15 Leu Gly His Leu Thr Asp Arg Leu Val His Leu Thr
Asp Arg Leu Met 20 25 30 His Leu Thr Asp Arg Leu Arg His Leu Ala
Val Arg Gln 35 40 45 6 55 PRT Mus musculus 6 Asp Lys Leu Arg His
Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg 1 5 10 15 Leu Arg His
Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Met 20 25 30 His
Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Arg His Leu 35 40
45 Ala Asp Arg Arg Arg His Asp 50 55 7 55 PRT Mus musculus 7 Asp
Lys Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg 1 5 10
15 Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Met
20 25 30 His Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Pro Arg
His Leu 35 40 45 Ala Asp Arg Gln Arg His Asp 50 55 8 55 PRT Mus
musculus 8 Asp Lys Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr
Asp Arg 1 5 10 15 Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr
Asp Arg Leu Met 20 25 30 His Leu Thr Asp Arg Leu Met His Leu Thr
Asp Arg Leu Gly His Leu 35 40 45 Ala Asp Arg Gln Arg His Asp 50 55
9 45 PRT Mus musculus 9 Asp Arg Leu Gly His Leu Thr Asp Arg Leu Lys
His Leu Thr Asp Arg 1 5 10 15 Leu Gly His Leu Thr Asp Arg Leu Ile
His Leu Thr Asp Arg Leu Met 20 25 30 His Leu Thr Asp Arg Leu Arg
His Leu Ala Val Arg Gln 35 40 45 10 38 PRT Mus musculus 10 Gly His
Leu Thr Asp Arg Leu Arg His Leu Thr Asp Arg Leu Gly His 1 5 10 15
Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Met His Leu Thr 20
25 30 Asp Arg Leu Arg Gln Arg 35 11 31 PRT Mus musculus 11 Gly His
Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Met His 1 5 10 15
Leu Thr Asp Arg Leu Arg His Leu Ala Asp Arg Gln Arg His Asp 20 25
30 12 38 PRT Mus musculus 12 Thr Asp Gly Leu Gly His Leu Thr Asp
Arg Leu Arg His Leu Thr Asp 1 5 10 15 Arg Leu Gly His Leu Thr Asp
Arg Leu Met His Leu Thr Asp Arg Leu 20 25 30 Arg His Leu Ala Asp
Gln 35 13 45 PRT Mus musculus 13 Asp Arg Leu Gly His Leu Thr Asp
Ile Leu Lys His Leu Thr Asp Arg 1 5 10 15 Leu Gly His Leu Thr Asp
Arg Leu Ile His Leu Thr Asp Arg Leu Met 20 25 30 His Leu Thr Asp
Arg Leu Arg His Leu Ala Val Arg Gln 35 40 45 14 55 PRT Mus musculus
14 Asp Lys Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg
1 5 10 15 Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg
Leu Met 20 25 30 Arg Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg
Leu Arg His Leu 35 40 45 Ala Asp Arg Gln Arg His Asp 50 55 15 281
PRT Mus musculus 15 Met Ala Thr Asn Phe Leu Ala His Glu Lys Ile Trp
Phe Asp Lys Phe 1 5 10 15 Lys Tyr Asp Asp Ala Glu Arg Arg Phe Tyr
Glu Gln Met Asn Gly Pro 20 25 30 Val Thr Ser Gly Ser Arg Gln Glu
Asn Gly Ala Ser Val Ile Leu Arg 35 40 45 Asp Ile Ala Arg Ala Arg
Glu Asn Ile Gln Lys Ser Leu Ala Gly Ser 50 55 60 Ser Gly Pro Gly
Ala Ser Ser Gly Pro Gly Gly Asp His Ser Glu Leu 65 70 75 80 Ile Val
Arg Ile Thr Ser Leu Glu Val Glu Asn Gln Asn Leu Arg Gly 85 90 95
Val Val Gln Asp Leu Gln Gln Ala Ile Ser Lys Leu Glu Ala Arg Leu 100
105 110 Ser Ser Leu Glu Lys Ser Ser Pro Thr Pro Arg Ala Thr Ala Pro
Gln 115 120 125 Thr Gln His Val Ser Pro Met Arg Gln Val Glu Pro Pro
Thr Lys Lys 130 135 140 Gly Ala Thr Pro Ala Glu Asp Asp Glu Asp Lys
Asp Ile Asp Leu Phe 145 150 155 160 Gly Ser Asp Glu Glu Glu Glu Asp
Lys Glu Ala Ala Arg Leu Arg Glu 165 170 175 Glu Arg Leu Arg Gln Tyr
Ala Glu Lys Lys Ala Lys Lys Pro Thr Leu 180 185 190 Val Ala Lys Ser
Ser Ile Leu Leu Asp Val Lys Pro Trp Asp Asp Glu 195 200 205 Thr Asp
Met Ala Gln Leu Glu Thr Cys Val Arg Ser Ile Gln Leu Asp 210 215 220
Gly Leu Val Trp Gly Ala Ser Lys Leu Val Pro Val Gly Tyr Gly Ile 225
230 235 240 Arg Lys Leu Gln Ile Gln Cys Val Val Glu Asp Asp Lys Val
Gly Thr 245 250 255 Asp Leu Leu Glu Glu Glu Ile Thr Lys Phe Glu Glu
His Val Gln Ser 260 265 270 Val Asp Ile Ala Ala Phe Asp Lys Ile 275
280 16 54 PRT Mus musculus 16 His Ser Glu Leu Ile Val Arg Ile Thr
Ser Leu Glu Val Glu Asn Gln 1 5 10 15 Asn Leu Arg Gly Val Val Gln
Asp Leu Gln Gln Ala Ile Ser Lys Leu 20 25 30 Glu Ala Arg Leu Ser
Ser Leu Glu Lys Ser Ser Pro Thr Pro Arg Ala 35 40 45 Thr Ala Pro
Gln Thr Arg 50 17 53 PRT Mus musculus 17 Arg Glu Leu Ile Val Arg
Ile Thr Ser Leu Glu Val Glu Asn Gln Asn 1 5 10 15 Leu Arg Gly Val
Val Gln Asp Leu Gln Gln Val Ile Ser Lys Leu Glu 20 25 30 Ala Arg
Leu Ser Ser Leu Glu Lys Ser Ser Pro Thr Pro Arg Ala Thr 35 40 45
Ala Pro Gln Thr Arg 50 18 54 PRT Mus musculus 18 His Ser Glu Leu
Ile Val Arg Ile Asn Ser Leu Glu Val Glu Asn Gln 1 5 10 15 Asn Leu
Arg Gly Val Val Gln Asp Leu Gln Gln Ala Ile Ser Lys Leu 20 25 30
Glu Ala Arg Leu Ser Ser Leu Glu Lys Ser Ser Pro Thr Pro Arg Ala 35
40 45 Thr Ala Pro Arg Thr Arg 50 19 54 PRT Mus musculus 19 His Ser
Glu Leu Ile Val Arg Ile Thr Ser Leu Glu Val Glu Asn Gln 1 5 10 15
Asn Leu Arg Gly Val Val Gln Asp Leu Gln Gln Ala Ile Ser Arg Leu 20
25 30 Glu Ala Arg Leu Ser Ser Leu Glu Lys Ser Ser Pro Thr Pro Arg
Ala 35 40 45 Thr Ala Pro Gln Thr Arg 50 20 268 PRT Mus musculus 20
Met Leu Ser Ala Phe Pro Ala Gln Leu Ala Gln Gln Ser Ser Phe Gly 1 5
10 15 Val Cys Val Leu Gly Cys Thr Glu Met Val His Gln Glu Asn Cys
Ser 20 25 30 Tyr Gln Ala Gln Lys Asn Glu Arg Glu Ser Ile Arg Gln
Lys Leu Ala 35 40 45 Leu Gly Ser Phe Phe Asp Asp Gly Pro Gly Ile
Tyr Thr Ser Cys Ser 50 55 60 Lys Ser Gly Lys Pro Ser Leu Ser Ala
Arg Leu Gln Ser Gly Met Asn 65 70 75 80 Leu Gln Ile Cys Phe Val Asn
Asp Ser Gly Ser Asp Lys Asp Ser Asp 85 90 95 Ala Asp Asp Ser Lys
Thr Glu Thr Ser Leu Asp Thr Pro Leu Ser Pro 100 105 110 Met Ser Lys
Gln Ser Ser Ser Tyr Ser Asp Arg Asp Thr Thr Glu Glu 115 120 125 Glu
Ser Glu Ser Leu Asp Asp Met Asp Phe Leu Thr Arg Gln Lys Lys 130 135
140 Leu Gln Ala Glu Ala Lys Met Ala Leu Ala Met Ala Lys Pro Met Ala
145 150 155 160 Lys Met Gln Val Glu Val Glu Arg Gln Asn Arg Lys Lys
Ser Pro Val 165 170 175 Ala Asp Leu Leu Pro His Met Pro His Ile Ser
Glu Cys Leu Met Lys 180 185 190 Arg Ser Leu Lys Pro Thr Asp Leu Arg
Asp Met Thr Ile Gly Gln Leu 195 200 205 Gln Val Ile Val Asn Asp Leu
His Ser Gln Ile Glu Ser Leu Asn Glu 210 215 220 Glu Leu Val Gln Leu
Leu Leu Ile Arg Asp Glu Leu His Thr Glu Gln 225 230 235 240 Asp Ala
Met Leu Val Asp Ile Glu Asp Leu Thr Arg His Ala Glu Ser 245 250 255
Gln Gln Lys His Met Ala Glu Lys Met Pro Ala Lys 260 265 21 78 PRT
Mus musculus 21 Pro His Thr Pro His Ile Ser Glu Cys Leu Met Lys Arg
Ser Leu Lys 1 5 10 15 Pro Thr Asp Leu Arg Asp Met Thr Ile Gly Gln
Leu Gln Val Ile Val 20 25 30 Asn Asp Leu His Ser Gln Ile Glu Ser
Leu Asn Glu Glu Leu Val Gln 35 40 45 Leu Leu Leu Ile Arg Asp Glu
Leu His Thr Glu Gln Asp Ala Met Leu 50 55 60 Val Asp Ile Glu Asp
Leu Thr Arg His Ala Glu Arg Glu Gln 65 70 75 22 78 PRT Mus musculus
22 Pro His Met Pro His Ile Ser Glu Cys Leu Met Lys Arg Ser Leu Lys
1 5 10 15 Pro Thr Asp Leu Arg Asp Met Thr Ile Gly Gln Leu Gln Val
Ile Val 20 25 30 Asn Asp Leu His Ser Gln Ile Glu Arg Leu Asn Glu
Glu Leu Val Gln 35 40 45 Leu Leu Leu Ile Arg Asp Glu Leu His Thr
Glu Gln Asp Ala Met Leu 50 55 60 Val Asp Ile Glu Asp Leu Thr Arg
His Ala Glu Lys Glu Gln 65 70 75 23 552 DNA Mus musculus 23
atgccattga ggcatctagc agacagattg gggcatctgg cagacagact gaggcatcta
60 acagacagat tgaggcatct agcagacaga ctgaggcatt taacagacag
attgaggcat 120 ctagcagaca gattgaggca tctagcagac agactgaaac
atcttaccag cagattgggg 180 catctaacag acagatcatg gcatctaaca
gacagattgg ggcatctaac agacagattg 240 aggcatctaa cagacagatt
ggggcatcta acagacagac agaggtatct agcagacaga 300 ttgaggcatc
taacagacag attggggcat ctaacagaca gactgaggca tctaacagac 360
agattggggc atctaacaga cagactgagg catctaacag acagattggg gcatctaaca
420 gacagactga tgcatctaac agacagactg aggcatctag cagacagaca
gaggcatcta 480 gcagacagac agaggcatct agcagacaga ctgaggcatc
tagcagacaa attgaggcat 540 cagctgcagc tg 552 24 165 DNA Mus musculus
24 gacaaactga ggcatctaac agacagattg gggcatctaa cagacagact
gaggcatcta 60 acagacagat tggggcatct aacagacaga ctgatgcatc
taacagacag actgatgcat 120 ctaacagaca gactgaggca tctagcagac
agacagaggc acgac 165 25 165 DNA Mus musculus 25 gacaaactga
ggcatctaac agacagattg gggcatctaa cagacaggct gaggcatcta 60
acagacagat tggggcatct aacagacaga ctgatgcatc taacagacag actgatgcat
120 ctaacagaca gactgaggca tctagcagac agacagaggc acgac 165 26 165
DNA Mus musculus 26 gacaaactga ggcatctaac agacagattg gggcatctaa
cagacagact gaggcatcta 60 acagacagat tggggcatct aacagacaga
ctgatgcatc taacagacag actgatgcat 120 ctaacagaca gactgaggca
tctagcagac aggcagaggc acgac 165 27 165 DNA Mus musculus 27
gacagactga ggcatctaac agacagattg gggcatttaa cagacagact gaggcattta
60 acagacagat tggggcatgt aacagacaga ctgatgcatt taacagacag
actgatgcat 120 ctaacagaca gactgaggca tttagcagac agacagaggc acgac
165 28 114 DNA Mus musculus 28 acagacagat tggggcatct aacagacaga
ctgaggcatc taacagacag attggggcat 60 ctaacagaca gactgatgca
tctaacagac agactgagcc atcctacgca gacc 114 29 135 DNA Mus musculus
29 gacagattgg ggcatctaac agacagactg aagcatctaa cagacagatt
ggggcatcta 60 acagacagac tggtccatct aacagacaga ctgatgcatc
taacagacag actgaggcat 120 ctagcagtta gacag 135 30 165 DNA Mus
musculus 30 gacaaactga ggcatctaac agacagattg gggcatctaa cagacagact
gaggcatcta 60 acagacagat tggggcatct aacagacaga ctgatgcatc
taacagacag actgatgcat 120 ctaacagaca gactgaggca tctagcagac
agacggaggc acgac 165 31 165 DNA Mus musculus 31 gacaaactga
ggcatctaac agacagattg gggcatctaa cagacagact gaggcatcta 60
acagacagat tggggcatct aacagacaga ctgatgcatc taacagacag actgatgcac
120 ctaacagaca gaccgaggca tctagcagac agacagaggc acgac 165 32 165
DNA Mus musculus 32 gacaaactga ggcatctaac agacagattg gggcatctaa
cagacagact gaggcatcta 60 acagacagat tggggcatct aacagacaga
ctgatgcatc taacagacag actgatgcat 120 ctaacagaca gactggggca
tctagcagac agacagaggc acgac 165 33 135 DNA Mus musculus 33
gacagattgg ggcatctaac agacagactg aagcatctaa cagacagatt ggggcatcta
60 acagacagac tgatccatct aacagacaga ctgatgcatc taacagacag
actgaggcat 120 ctagcagtca gacag 135 34 114 DNA Mus musculus 34
gggcatctaa cagacagact gaggcatcta acagacagat tggggcatct aacagacaga
60 ctgatgcatc taacagacag actgatgcat ctaacagaca gactgaggca aaga 114
35 93 DNA Mus musculus 35 gggcatctaa cagacagact gatgcatcta
acagacagac tgatgcatct aacagacaga 60 ctgaggcatc tagcagacag
acagaggcac gac 93 36 114 DNA Mus musculus 36 acagacggat tggggcatct
aacagacaga ctgaggcatc taacagacag attggggcat 60 ctaacagaca
gactgatgca tctaacagac agactgaggc atctagcaga ccag 114 37 135 DNA Mus
musculus 37 gacagattgg ggcatctaac agacatactg aagcatctaa cagacagatt
ggggcatcta 60 acagacagac tgatccatct aacagacaga ctgatgcatc
taacagacag actgaggcat 120 ctagcagtca gacag 135 38 165 DNA Mus
musculus 38 gacaaactga ggcatctaac agacagattg gggcatctaa cagacagact
gaggcatcta 60 acagacagat tggggcatct aacagacaga ctgatgcgtc
taacagacag actgatgcat 120 ctaacagaca gactgaggca tctagcagac
agacagaggc acgac 165 39 843 DNA Mus musculus 39 atggctacaa
actttctagc gcatgagaag atctggtttg acaagtttaa atatgatgat 60
gcagaaagga gattctatga gcagatgaac gggcctgtga cctccggctc ccgccaggag
120 aatggtgcca gcgtgatcct ccgagacatt gcaagagcca gagagaacat
ccagaaatcc 180 ttggctggaa gctcaggccc tggagcctcc agtggacctg
gtggagacca cagtgagctc 240 attgtgagga ttaccagtct ggaagtggag
aaccagaacc ttcgaggcgt ggtgcaagat 300 ttgcagcagg ccatttccaa
gttggaggcc cggctgagct ctctagagaa gagttcacct 360 actccccgag
ccacggcccc acagacccaa catgtctctc ctatgcgtca agtggagccc 420
ccaaccaaga aaggagccac accagcagag gacgatgagg acaaggacat tgacctgttc
480 ggcagtgacg aggaggaaga agataaggag gctgcccgac tacgggagga
gaggctacgc 540 cagtacgcag agaagaaggc caagaagccc acactggtgg
ccaaatcctc catccttttg 600 gatgttaaac
cttgggatga tgagactgac atggcccagc tagagacttg tgtgcgttcc 660
atccaattgg acgggctggt ttggggggcc tccaagcttg tgcctgttgg ctatggcatc
720 cggaagctgc agatccagtg tgtggtggag gatgacaaag tgggcaccga
cttgctcgag 780 gaggagatca ccaaatttga ggagcatgtg cagagtgtcg
acatcgcagc tttcgacaag 840 atc 843 40 162 DNA Mus musculus 40
cacagtgagc tcattgtgag gattaccagt ctggaagtgg agaatcagaa ccttcgaggc
60 gtggtgcaag atttgcagca ggccatttcc aagttggagg cccggctgag
ctctctagag 120 aagagttcac ctactccccg agccacggcc ccacagaccc ga 162
41 159 DNA Mus musculus 41 cgtgagctca ttgtgaggat taccagtctg
gaagtggaga atcagaacct tcgaggcgtg 60 gtgcaagatt tgcagcaggt
catttccaag ttggaggccc ggctgagctc tctagagaag 120 agttcaccta
ctccccgagc cacggcccca cagacccga 159 42 162 DNA Mus musculus 42
cacagtgagc tcattgtgag gattaacagt ttggaagtgg agaatcagaa ccttcgaggg
60 gtggtgcaag atttgcagca ggccatttcc aagttggagg cccggctgag
ctctttagag 120 aagagttcac ctactccccg agccacggcc ccacggaccc ga 162
43 162 DNA Mus musculus 43 cacagtgagc tcattgtgag gattaccagt
ctggaagtgg agaatcagaa ccttcggggc 60 gtggtgcaag atttgcagca
ggccatttcc aggttggagg cccggctgag ctctctagag 120 aagagttcac
ctactccccg agccacggcc ccacagaccc ga 162 44 804 DNA Mus musculus 44
atgctcagcg ctttccctgc gcagctcgcc cagcagtcca gctttggggt ctgcgtccta
60 ggatgtactg agatggtaca tcaggagaac tgctcgtacc aggcacagaa
gaatgagaga 120 gagtctatca gacagaagtt ggcactcgga agcttctttg
acgatggccc aggaatctat 180 accagctgca gcaaaagtgg gaagccaagc
ctttctgcaa gactacagag cgggatgaac 240 ctccagatat gctttgtcaa
tgacagcggc agtgacaagg acagcgatgc agatgacagt 300 aagacggaaa
ccagcttgga cacgcccttg tcccccatga gcaagcagag ttcttcctat 360
tcggatagag acacaactga ggaggagtct gaatccctgg atgacatgga cttcctcaca
420 aggcaaaaga agctacaagc tgaagccaaa atggctctgg ccatggccaa
accaatggcc 480 aaaatgcaag tagaagtgga aagacagaac aggaaaaagt
ctcccgtcgc tgatcttctc 540 ccacacatgc ctcacataag cgaatgtttg
atgaaaagaa gcttaaagcc caccgacctg 600 agagacatga ctatcgggca
gctacaagtg atcgtcaatg acctccactc ccagattgaa 660 agtttgaatg
aagagttggt ccagctgctc cttattcgag atgagctgca cacagaacaa 720
gatgccatgc tggtggacat tgaagacttg actagacacg ctgagagtca gcagaagcac
780 atggctgaga aaatgcccgc gaag 804 45 234 DNA Mus musculus 45
ccacacacgc ctcacataag cgaatgtttg atgaaaagaa gcttaaagcc caccgacctg
60 agagacatga ctatcgggca gctacaagtg atcgtcaatg acctccactc
ccagattgaa 120 agtttgaatg aagagttggt ccagctgctc cttattcgag
atgagctgca cacagaacaa 180 gatgccatgc tggtggacat tgaagacttg
actagacacg ctgagaggga gcag 234 46 234 DNA Mus musculus 46
ccacacatgc ctcacataag cgaatgtttg atgaaaagaa gcttaaagcc caccgacctg
60 agagacatga ctatcgggca gctacaagtg atcgtcaatg acctccactc
ccagattgag 120 cgtttgaatg aagagttggt ccagctgctc cttattcgag
atgagctgca cacagaacaa 180 gatgccatgc tggtggacat tgaagacttg
actagacacg ctgagaagga gcag 234 47 191 PRT Mus musculus 47 Met Pro
Leu Arg His Leu Ala Asp Arg Leu Gly His Leu Ala Asp Arg 1 5 10 15
Leu Arg His Leu Thr Asp Arg Leu Arg His Leu Ala Asp Arg Leu Arg 20
25 30 His Leu Thr Asp Arg Leu Arg His Leu Ala Asp Arg Leu Lys His
Leu 35 40 45 Ala Asp Arg Leu Lys His Leu Thr Asp Arg Leu Gly His
Leu Thr Asp 50 55 60 Arg Ser Trp His Leu Thr Asp Arg Leu Gly His
Leu Thr Asp Arg Leu 65 70 75 80 Arg His Leu Thr Asp Arg Leu Gly His
Leu Thr Asp Arg Gln Arg Tyr 85 90 95 Leu Ala Asp Arg Leu Arg His
Leu Thr Asp Arg Leu Gly His Leu Thr 100 105 110 Asp Lys Leu Arg His
Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg 115 120 125 Leu Arg His
Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Met 130 135 140 His
Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Arg His Leu 145 150
155 160 Ala Asp Arg Gln Arg His Leu Ala Asp Arg Gln Arg His Leu Ala
Asp 165 170 175 Arg Leu Arg His Leu Ala Asp Lys Leu Arg His Gln Leu
Gln Leu 180 185 190 48 71 PRT Mus musculus 48 Ile Glu Ala Ser Asn
Arg Gln Ile Gly Ala Ser Asn Arg Gln Thr Glu 1 5 10 15 Ala Ser Asn
Arg Gln Ile Gly His Leu Thr Asp Arg Leu Arg His Leu 20 25 30 Thr
Asp Arg Leu Gly His Leu Thr Asp Arg Leu Met His Leu Thr Asp 35 40
45 Arg Leu Met His Leu Thr Asp Arg Leu Arg His Leu Ala Asp Arg Gln
50 55 60 Arg His Leu Ala Asp Arg Leu 65 70 49 55 PRT Mus musculus
49 Asp Lys Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg
1 5 10 15 Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg
Leu Met 20 25 30 His Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg
Leu Arg His Leu 35 40 45 Ala Asp Arg Gln Arg His Asp 50 55 50 45
PRT Mus musculus 50 Asp Arg Leu Gly His Leu Thr Asp Arg Leu Arg His
Leu Thr Asp Arg 1 5 10 15 Leu Gly His Leu Thr Asp Arg Leu Met His
Leu Thr Asp Arg Leu Met 20 25 30 His Leu Thr Asp Arg Leu Arg His
Leu Ala Val Arg Gln 35 40 45 51 45 PRT Mus musculus 51 Asp Arg Leu
Gly Arg Leu Thr Asp Arg Leu Arg His Leu Thr Asp Arg 1 5 10 15 Leu
Gly His Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Met 20 25
30 His Leu Thr Asp Arg Leu Arg His Leu Ala Val Arg Gln 35 40 45 52
45 PRT Mus musculus 52 Asp Arg Leu Gly His Leu Thr Asp Arg Leu Arg
Tyr Leu Thr Asp Arg 1 5 10 15 Leu Gly His Leu Thr Asp Arg Leu Met
His Leu Thr Asp Arg Leu Met 20 25 30 His Leu Thr Asp Arg Leu Arg
His Leu Ala Val Arg Gln 35 40 45 53 45 PRT Mus musculus 53 Asp Arg
Leu Gly His Leu Thr Asp Arg Leu Arg His Leu Thr Asp Arg 1 5 10 15
Leu Gly His Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Thr 20
25 30 His Leu Thr Asp Arg Leu Arg His Leu Ala Val Arg Gln 35 40 45
54 45 PRT Mus musculus 54 Asp Arg Leu Gly His Leu Thr Asp Arg Leu
Lys His Leu Thr Asp Arg 1 5 10 15 Leu Gly His Leu Thr Asp Arg Leu
Ile His Leu Thr Asp Arg Leu Met 20 25 30 His Leu Thr Asp Arg Leu
Arg His Leu Ala Val Arg Gln 35 40 45 55 45 PRT Mus musculus 55 Gly
Arg Leu Gly His Leu Thr Asp Arg Leu Arg His Leu Thr Asp Arg 1 5 10
15 Leu Gly His Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Met
20 25 30 His Leu Thr Asp Arg Leu Arg His Leu Ala Val Arg Gln 35 40
45 56 38 PRT Mus musculus 56 Asp Arg Leu Arg His Leu Ala Asp Arg
Leu Arg His Leu Thr Asp Arg 1 5 10 15 Leu Arg His Leu Ala Asp Arg
Leu Lys His Leu Ala Asp Arg Leu Lys 20 25 30 His Leu Thr Asn Arg
Lys 35 57 184 PRT Mus musculus 57 Met Pro Leu Arg His Leu Ala Asp
Arg Leu Gly His Leu Ala Asp Arg 1 5 10 15 Leu Arg His Leu Thr Asp
Arg Leu Arg His Leu Ala Asp Arg Leu Arg 20 25 30 His Leu Thr Asp
Arg Leu Arg His Leu Ala Asp Arg Leu Arg His Leu 35 40 45 Ala Asp
Arg Leu Lys His Leu Thr Asp Arg Leu Gly His Leu Thr Asp 50 55 60
Arg Ser Trp His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu 65
70 75 80 Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Gln
Arg Tyr 85 90 95 Leu Ala Asp Arg Leu Arg His Leu Thr Asp Arg Leu
Gly His Leu Thr 100 105 110 Asp Arg Leu Arg His Leu Thr Asp Arg Leu
Gly His Leu Thr Asp Arg 115 120 125 Leu Arg His Leu Thr Asp Arg Leu
Gly His Leu Thr Asp Arg Leu Met 130 135 140 His Leu Thr Asp Arg Leu
Arg His Leu Ala Asp Arg Gln Arg His Leu 145 150 155 160 Ala Asp Arg
Gln Arg His Leu Ala Asp Arg Leu Arg His Leu Ala Asp 165 170 175 Lys
Leu Arg His Gln Leu Gln Leu 180 58 79 PRT Mus musculus 58 Glu Lys
Val Lys Thr Leu Lys Ala Gln Asn Ser Glu Leu Ala Ser Thr 1 5 10 15
Ala Asn Thr Leu Arg Glu Gln Val Ala Leu Leu Lys Gln Lys Val Met 20
25 30 Asn His Val Asn Ser Gly Cys Gln Leu Met Leu Thr Gln Gln Leu
Gln 35 40 45 Thr Phe Trp Glu Gln Thr Val Arg Ala Glu Gly Gln Trp
Lys Lys Lys 50 55 60 Asn Asn Arg Asp Lys Leu Glu Asn Leu Thr Gly
Cys Asp Arg Glu 65 70 75 59 76 PRT Mus musculus 59 Arg Ile Lys Ala
Glu Arg Lys Arg Met Arg Asn Arg Ile Ala Ala Ser 1 5 10 15 Glu Cys
Arg Lys Arg Lys Leu Glu Arg Ile Ala Arg Leu Glu Glu Lys 20 25 30
Val Lys Thr Leu Lys Ala Gln Asn Ser Glu Leu Ala Ser Thr Ala Asn 35
40 45 Met Leu Arg Glu Gln Val Ala Gln Leu Lys Gln Lys Val Met Asn
His 50 55 60 Val Asn Ser Gly Cys Gln Leu Met Leu Thr Gln Gln 65 70
75 60 49 PRT Mus musculus 60 Gly His Leu Ala Asp Arg Leu Arg His
Leu Thr Asp Arg Leu Arg His 1 5 10 15 Leu Ala Asp Arg Leu Arg His
Leu Thr Asp Arg Leu Arg His Leu Ala 20 25 30 Asp Arg Leu Arg His
Leu Ala Asp Arg Leu Lys His Leu Thr Asp Arg 35 40 45 Tyr 61 56 PRT
Mus musculus 61 His Leu Thr Asp Arg Leu Arg His Leu Thr Asp Arg Leu
Gly His Leu 1 5 10 15 Thr Asp Arg Leu Arg His Leu Thr Asp Arg Leu
Arg His Leu Thr Asp 20 25 30 Arg Leu Met His Leu Thr Asp Arg Leu
Gly His Leu Ala Asp Arg Gln 35 40 45 Arg His Leu Ala Asp Arg Gln
Arg 50 55 62 44 PRT Mus musculus 62 Ala Asp Arg Leu Gly His Leu Ala
Asp Arg Leu Arg His Leu Thr Asp 1 5 10 15 Arg Leu Arg His Leu Ala
Asp Arg Leu Arg His Leu Thr Asp Arg Leu 20 25 30 Arg His Leu Ala
Asp Arg Leu Arg His Leu Ala Asp 35 40 63 27 PRT Mus musculus 63 Ala
Asp Arg Leu Arg His Leu Thr Asp Arg Leu Arg His Leu Ala Asp 1 5 10
15 Arg Leu Arg His Leu Thr Asp Arg Leu Arg His 20 25 64 53 PRT Mus
musculus 64 Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Arg His Leu
Thr Asp 1 5 10 15 Arg Leu Gly His Leu Thr Asp Arg Leu Arg His Leu
Thr Asp Arg Leu 20 25 30 Gly His Leu Thr Asp Arg Leu Met His Leu
Thr Asp Arg Leu Arg His 35 40 45 Leu Ala Asp Arg Pro 50 65 46 PRT
Mus musculus 65 Arg Gln Arg His Leu Thr Asp Arg Leu Arg His Leu Thr
Asp Arg Leu 1 5 10 15 Gly His Leu Thr Asp Arg Leu Arg His Leu Thr
Asp Arg Leu Gly His 20 25 30 Leu Thr Asp Arg Leu Met His Leu Thr
Asp Arg Leu Arg Pro 35 40 45 66 39 PRT Mus musculus 66 Asp Arg Leu
Ser His Leu Ala Asp Arg Leu Arg His Leu Thr Asp Arg 1 5 10 15 Leu
Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Arg 20 25
30 His Leu Ala Asp Arg Gln Arg 35 67 39 PRT Mus musculus 67 Asp Arg
Leu Arg His Leu Ala Asp Arg Leu Arg His Leu Thr Asp Arg 1 5 10 15
Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Arg 20
25 30 His Leu Ala Asp Arg Gln Arg 35 68 39 PRT Mus musculus 68 Gly
Arg Leu Arg His Leu Ala Asp Arg Leu Arg His Leu Thr Asp Arg 1 5 10
15 Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu Arg
20 25 30 His Leu Ala Asp Arg Gln Arg 35 69 43 PRT Mus musculus 69
Tyr Leu Ala Asp Arg Leu Arg His Leu Thr Asp Arg Leu Gly His Leu 1 5
10 15 Thr Asp Arg Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr
Asp 20 25 30 Arg Leu Arg His Leu Thr Asp Arg Leu Gly Gln 35 40 70
43 PRT Mus musculus 70 Tyr Leu Ala Asp Arg Leu Arg His Leu Thr Asp
Arg Leu Arg His Leu 1 5 10 15 Thr Asp Arg Leu Arg His Leu Thr Asp
Arg Leu Gly His Leu Thr Asp 20 25 30 Arg Leu Arg His Leu Thr Asp
Arg Leu Gly Gln 35 40 71 76 PRT Mus musculus 71 Arg Ile Lys Ala Glu
Arg Lys Arg Met Arg Asn Arg Ile Ala Ala Ser 1 5 10 15 Lys Cys Arg
Lys Arg Lys Leu Glu Arg Ile Ala Arg Leu Glu Glu Lys 20 25 30 Val
Lys Thr Leu Lys Ala Gln Asn Ser Glu Leu Ala Ser Thr Ala Asn 35 40
45 Met Leu Arg Glu Gln Val Ala Gln Leu Lys Gln Lys Val Met Asn His
50 55 60 Val Asn Ser Gly Cys Gln Leu Met Leu Thr Gln Gln 65 70 75
72 44 PRT Mus musculus 72 Leu Gly His Leu Thr Asp Arg Leu Arg His
Leu Thr Asp Arg Leu Gly 1 5 10 15 His Leu Thr Asp Arg Leu Met His
Leu Thr Asp Arg Leu Arg His Leu 20 25 30 Ala Asp Arg Gln Arg His
Leu Ala Asp Arg Gln Lys 35 40 73 36 PRT Mus musculus 73 Leu Gly His
Leu Thr Asp Arg Leu Arg His Leu Thr Asp Arg Leu Gly 1 5 10 15 His
Leu Thr Asp Arg Leu Met His Leu Thr Asp Arg Leu Arg His Leu 20 25
30 Ala Asp Thr Gln 35 74 44 PRT Mus musculus 74 Gly His Leu Thr Asp
Arg Leu Arg His Leu Thr Asp Arg Leu Gly His 1 5 10 15 Leu Thr Asp
Arg Leu Arg His Leu Thr Asp Arg Leu Gly His Leu Thr 20 25 30 Asp
Arg Leu Met His Leu Thr Asp Arg Leu Arg His 35 40 75 51 PRT Mus
musculus 75 Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu
Arg His 1 5 10 15 Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu
Met His Leu Thr 20 25 30 Asp Arg Leu Arg His Leu Ala Asp Arg Gln
Arg His Leu Ala Asp Arg 35 40 45 Gln Arg His 50 76 51 PRT Mus
musculus 76 Arg His Leu Thr Asp Arg Leu Gly His Leu Thr Asp Arg Leu
Arg His 1 5 10 15 Leu Thr Asp Arg Leu Gly Arg Leu Thr Asp Arg Leu
Met His Leu Thr 20 25 30 Asp Arg Leu Arg His Leu Ala Asp Arg Gln
Arg His Leu Ala Asp Arg 35 40 45 Gln Arg His 50 77 584 PRT Mus
musculus 77 Met Ser His Gln Pro Leu Ser Cys Leu Thr Glu Lys Gly Asp
Ser Pro 1 5 10 15 Cys Glu Thr Pro Gly Asn Gly Pro Ser Asn Met Val
His Pro Ser Leu 20 25 30 Asp Thr Phe Thr Pro Glu Glu Leu Leu Gln
Gln Met Lys Glu Leu Leu 35 40 45 Val Glu Asn His Gln Leu Lys Glu
Ala Met Lys Leu Asn Asn Gln Ala 50 55 60 Met Lys Gly Arg Phe Glu
Glu Leu Ser Ala Trp Thr Glu Lys Gln Lys 65 70 75 80 Glu Glu Arg Leu
Leu Phe Glu Met Gln Ser Lys Glu Val Lys Glu Arg 85 90 95 Leu Lys
Ala Leu Thr His Glu Asn Glu Arg Leu Lys Glu Glu Leu Gly 100 105 110
Lys Phe Lys Glu Lys Ser Glu Lys Pro Leu Glu Asp Leu Thr Gly Gly 115
120 125 Tyr Arg Tyr Pro Arg Ala Leu Glu Glu Glu Val Glu Lys Leu Lys
Thr 130 135 140 Gln Val Glu Gln Glu Val Glu His Leu Lys Ile Gln Val
Met Arg Leu 145 150 155 160 Arg Ala Glu Lys Ala Asp Leu Leu Gly Ile
Val Ser Glu Leu Gln Leu 165 170 175 Lys Leu Asn Ser Gly Gly Ser Ser
Glu Asp Ser Phe Val Glu Ile Arg 180 185 190
Met Thr Glu Gly Glu Thr Glu Gly Ala Met Lys Glu Met Lys Asn Cys 195
200 205 Pro Thr Pro Thr Arg Thr Asp Pro Ile Ser Leu Ser Asn Cys Thr
Glu 210 215 220 Asp Ala Arg Ser Cys Ala Glu Phe Glu Glu Leu Thr Val
Ser Gln Leu 225 230 235 240 Leu Leu Cys Leu Arg Glu Gly Asn Gln Lys
Val Glu Arg Leu Glu Val 245 250 255 Ala Leu Arg Glu Ala Lys Glu Arg
Ile Ser Asp Phe Glu Lys Lys Ala 260 265 270 Asn Gly His Ser Ser Thr
Glu Lys Gln Thr Ala Arg Arg Ala Asp Arg 275 280 285 Glu Lys Glu Asp
Lys Gly Gln Glu Ser Val Gly Ser Glu Val Glu Thr 290 295 300 Leu Ser
Ile Gln Val Thr Ser Leu Phe Lys Glu Leu Gln Glu Ala His 305 310 315
320 Thr Lys Leu Ser Glu Ala Glu Leu Met Lys Lys Arg Leu Gln Glu Lys
325 330 335 Cys Gln Ala Leu Glu Arg Lys Asn Ser Ala Thr Pro Ser Glu
Leu Asn 340 345 350 Glu Lys Gln Glu Leu Val Tyr Ser Asn Lys Lys Leu
Glu Leu Gln Val 355 360 365 Glu Ser Met Arg Ser Glu Ile Lys Met Glu
Gln Ala Lys Thr Glu Glu 370 375 380 Glu Lys Ser Arg Leu Ala Thr Leu
Gln Ala Thr His Asn Lys Leu Leu 385 390 395 400 Gln Glu His Asn Lys
Ala Leu Lys Thr Ile Glu Glu Leu Thr Lys Gln 405 410 415 Gln Ala Glu
Lys Val Asp Lys Met Leu Leu Gln Glu Leu Ser Glu Lys 420 425 430 Leu
Glu Leu Ala Glu Gln Ala Leu Ala Ser Lys Gln Leu Gln Met Asp 435 440
445 Glu Met Lys Gln Thr Leu Ala Lys Gln Glu Glu Asp Leu Glu Thr Met
450 455 460 Ala Val Leu Arg Ala Gln Met Glu Val Tyr Cys Ser Asp Phe
His Ala 465 470 475 480 Glu Arg Ala Ala Arg Glu Lys Ile His Glu Glu
Lys Glu Gln Leu Ala 485 490 495 Leu Gln Leu Ala Ile Leu Leu Lys Glu
Asn Asn Asp Ile Glu Glu Gly 500 505 510 Gly Ser Arg Gln Ser Leu Met
Glu Met Gln Cys Arg His Gly Ala Arg 515 520 525 Thr Ser Asp Ser Asp
Gln Gln Thr Tyr Leu Phe Gln Arg Gly Ala Glu 530 535 540 Asp Arg Ser
Trp Gln His Gly Gln Gln Pro Arg Ser Ile Pro Ile His 545 550 555 560
Ser Cys Pro Lys Cys Gly Glu Val Leu Pro Asp Ile Asp Thr Leu Gln 565
570 575 Ile His Val Met Asp Cys Ile Ile 580 78 108 PRT Mus musculus
78 Leu Lys Thr Gln Val Glu Gln Glu Val Glu His Leu Lys Ile Gln Val
1 5 10 15 Met Arg Leu Arg Ala Glu Lys Ala Asp Leu Leu Gly Ile Val
Ser Glu 20 25 30 Leu Gln Leu Lys Leu Asn Ser Gly Gly Ser Ser Glu
Asp Ser Phe Val 35 40 45 Glu Ile Arg Met Thr Glu Gly Glu Thr Glu
Gly Ala Met Lys Glu Met 50 55 60 Lys Ser Cys Pro Thr Pro Thr Arg
Thr Asp Pro Ile Ser Leu Ser Asn 65 70 75 80 Cys Thr Glu Asp Ala Arg
Ser Cys Ala Glu Phe Glu Glu Leu Thr Val 85 90 95 Ser Gln Leu Leu
Leu Cys Leu Arg Glu Gly Asn Gln 100 105 79 62 PRT Mus musculus 79
His Leu Lys Ile Gln Val Met Arg Leu Arg Ala Glu Lys Ala Asp Leu 1 5
10 15 Leu Gly Ile Val Ser Glu Leu Gln Leu Lys Leu Asn Ser Gly Gly
Ser 20 25 30 Ser Glu Asp Ser Phe Val Glu Ile Arg Met Thr Glu Gly
Glu Thr Glu 35 40 45 Gly Ala Met Lys Glu Met Lys Asn Cys Pro Thr
Pro Thr Arg 50 55 60 80 62 PRT Mus musculus 80 His Leu Lys Ile Gln
Val Met Arg Leu Arg Ala Glu Lys Ala Asp Leu 1 5 10 15 Leu Gly Ile
Val Ser Glu Leu Gln Leu Lys Leu Asn Ser Gly Gly Ser 20 25 30 Ser
Glu Asp Ser Phe Val Glu Ile Arg Met Thr Glu Gly Glu Thr Glu 35 40
45 Gly Ala Met Lys Glu Met Lys Asn Cys Pro Ala Pro Thr Arg 50 55 60
81 62 PRT Mus musculus 81 His Leu Lys Ile Gln Val Met Arg Leu Arg
Ala Glu Lys Ala Asp Leu 1 5 10 15 Leu Gly Ile Val Ser Glu Leu Arg
Leu Lys Leu Asn Ser Gly Gly Ser 20 25 30 Ser Glu Asp Ser Phe Val
Glu Ile Arg Met Thr Glu Gly Glu Thr Glu 35 40 45 Gly Ala Met Lys
Glu Met Lys Asn Cys Pro Thr Pro Thr Arg 50 55 60 82 102 PRT Mus
musculus 82 Met Leu Ser Arg Leu Gln Glu Leu Arg Lys Glu Glu Glu Thr
Leu Leu 1 5 10 15 Arg Leu Lys Ala Ala Leu His Asp Gln Leu Asn Arg
Leu Lys Val Glu 20 25 30 Glu Leu Ala Leu Gln Ser Met Ile Asn Ser
Arg Gly Arg Thr Glu Thr 35 40 45 Leu Ser Ser Gln Pro Ala Pro Glu
Gln Leu Cys Asp Met Ser Leu His 50 55 60 Val Asp Asn Glu Val Thr
Ile Asn Gln Thr Thr Leu Lys Leu Ser Thr 65 70 75 80 Arg Ser Pro Met
Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu Glu 85 90 95 Glu Glu
Glu Ser Asp Ser 100 83 56 PRT Mus musculus 83 Arg Cys Arg Gln Lys
Arg Lys Leu Trp Val Ser Ser Leu Glu Lys Lys 1 5 10 15 Ala Glu Glu
Leu Thr Ser Gln Asn Ile Gln Leu Ser Asn Glu Val Thr 20 25 30 Leu
Leu Arg Asn Glu Val Ala Gln Leu Lys Gln Leu Leu Leu Ala His 35 40
45 Lys Asp Cys Pro Val Thr Ala Gln 50 55 84 79 PRT Mus musculus 84
Arg Lys Trp Lys Gly Thr Leu Ser Arg Leu Gln Glu Leu Arg Lys Glu 1 5
10 15 Val Glu Thr Pro Leu Arg Leu Lys Ala Ala Leu His Asp Gln Leu
Asn 20 25 30 Arg Leu Lys Val Glu Glu Leu Ala Leu Gln Ser Met Ile
Asn Ser Arg 35 40 45 Gly Arg Thr Glu Thr Leu Ser Ser Gln Pro Ala
Pro Glu Gln Leu Cys 50 55 60 Asp Met Ser Leu His Val Asp Asn Glu
Val Thr Ile Asn Gln Thr 65 70 75 85 413 PRT Mus musculus 85 Met Gly
Asp Asp Arg Pro Phe Val Cys Ser Ala Pro Gly Cys Gly Gln 1 5 10 15
Arg Phe Thr Asn Glu Asp His Leu Ala Val His Lys His Lys His Glu 20
25 30 Met Thr Leu Lys Phe Gly Pro Ala Arg Thr Asp Ser Val Ile Ile
Ala 35 40 45 Asp Gln Thr Pro Thr Pro Thr Arg Phe Leu Lys Asn Cys
Glu Glu Val 50 55 60 Gly Leu Phe Asn Glu Leu Ala Ser Ser Phe Glu
His Glu Phe Lys Lys 65 70 75 80 Ala Ser Asp Asp Asp Glu Lys Lys Gly
Ala Ala Gly Pro Leu Asp Met 85 90 95 Ser Leu Pro Ser Thr Pro Asp
Ile Lys Ile Lys Glu Glu Glu Pro Val 100 105 110 Glu Val Asp Ser Ser
Pro Pro Asp Ser Pro Ala Ser Ser Pro Cys Ser 115 120 125 Pro Pro Leu
Lys Glu Lys Glu Val Thr Thr Lys Pro Val Val Ile Ser 130 135 140 Thr
Pro Thr Pro Thr Ile Val Arg Pro Gly Ser Leu Pro Leu His Leu 145 150
155 160 Gly Tyr Asp Pro Leu His Pro Thr Leu Pro Ser Pro Thr Ser Val
Ile 165 170 175 Thr Gln Ala Pro Pro Ser Asn Arg Gln Ile Gly Ser Pro
Thr Gly Ser 180 185 190 Leu Pro Leu Val Met His Leu Ala Asn Gly Gln
Thr Met Pro Met Leu 195 200 205 Pro Gly Pro Pro Val Gln Met Pro Ser
Val Ile Ser Leu Ala Arg Pro 210 215 220 Val Ser Met Val Pro Asn Ile
Pro Gly Ile Pro Gly Pro Pro Val Asn 225 230 235 240 Asn Ser Gly Ser
Ile Ser Pro Ser Gly His Pro Met Pro Ser Glu Ala 245 250 255 Lys Met
Arg Leu Lys Ala Thr Leu Thr His Gln Val Ser Ser Ile Asn 260 265 270
Gly Gly Cys Gly Met Val Val Gly Thr Ala Ser Thr Met Val Thr Ala 275
280 285 Arg Pro Glu Gln Asn Gln Ile Leu Ile Gln His Pro Asp Ala Pro
Ser 290 295 300 Pro Ala Gln Pro Gln Val Ser Pro Ala Gln Pro Thr Pro
Ser Thr Gly 305 310 315 320 Gly Arg Arg Arg Arg Thr Val Asp Glu Asp
Pro Asp Glu Arg Arg Gln 325 330 335 Arg Phe Leu Glu Arg Asn Arg Ala
Ala Ala Ser Arg Cys Arg Gln Lys 340 345 350 Arg Lys Leu Trp Val Ser
Ser Leu Glu Lys Lys Ala Glu Glu Leu Thr 355 360 365 Ser Gln Asn Ile
Gln Leu Ser Asn Glu Val Thr Leu Leu Arg Asn Glu 370 375 380 Val Ala
Gln Leu Lys Gln Leu Leu Leu Ala His Lys Asp Cys Pro Val 385 390 395
400 Thr Ala Leu Gln Lys Lys Thr Gln Gly Tyr Leu Gly Lys 405 410 86
58 PRT Mus musculus 86 Arg Ala Ala Ala Ser Arg Cys Arg Gln Lys Arg
Lys Leu Trp Val Ser 1 5 10 15 Ser Leu Glu Lys Lys Ala Glu Glu Leu
Thr Ser Gln Asn Ile Gln Leu 20 25 30 Ser Asn Lys Val Thr Leu Leu
Arg Asn Glu Val Ala Gln Leu Lys Gln 35 40 45 Leu Leu Leu Ala His
Lys Asp Cys Pro Gly 50 55 87 1031 PRT Mus musculus 87 Met Thr Asn
Pro Lys Gly Lys Arg Arg Gly Thr Gln Ser Met Phe Ser 1 5 10 15 Arg
Pro Phe Arg Lys His Gly Val Val Ser Leu Ala Thr Tyr Met Arg 20 25
30 Ile Tyr Lys Lys Arg Asp Ile Val Asp Ile Lys Gly Met Gly Thr Val
35 40 45 Gln Lys Gly Met Pro Cys Lys Cys Tyr His Gly Lys Thr Gly
Arg Val 50 55 60 Tyr Asn Val Thr Gln His Ala Met Gly Ile Ile Val
Asn Lys Gln Val 65 70 75 80 Lys Gly Lys Ile Leu Ala Lys Arg Ile Asn
Val Gln Ile Glu His Ile 85 90 95 Lys His Ser Lys Ser Arg Asp Gly
Phe Leu Lys Gln Gly Glu Ala Ala 100 105 110 His Phe Glu Tyr Leu Leu
Tyr Pro Leu His Ser Ala Ser Ile Thr Gly 115 120 125 Leu Ala Thr Cys
Ile Arg Lys Pro Leu Ile Ala Thr Cys Ser Leu Asp 130 135 140 Arg Ser
Val Arg Ile Trp Asn Tyr Glu Ser Asn Ser Ser Cys Cys Lys 145 150 155
160 Ala Leu Arg Glu Asp Leu Trp Leu Leu Leu Leu Phe His Ile Thr Ala
165 170 175 Pro Ala Thr Leu Ser Ser Pro Pro Val Ile Phe Phe Cys Thr
Leu Glu 180 185 190 Leu Tyr Lys Glu Tyr Gln Glu Glu Ala Tyr Thr Val
Ser Leu His Pro 195 200 205 Ser Gly His Tyr Ile Val Val Gly Phe Ala
Asp Lys Leu Arg Leu Met 210 215 220 Asn Leu Leu Ile Asp Asp Ile Arg
Ser Phe Lys Glu Tyr Ser Val Arg 225 230 235 240 Gly Cys Lys Glu Cys
Ala Phe Ser Asn Gly Gly His Leu Phe Ala Ala 245 250 255 Val Asn Gly
Asn Val Ile His Ile Phe Thr Thr Thr Asn Leu Glu Asn 260 265 270 Ile
Asn Asn Leu Lys Gly His Thr Gly Lys Arg Glu Thr Glu Cys Val 275 280
285 Leu Lys Val Cys Ser Tyr Asn Ser Val Thr Ile Ser Pro Asp Gly Lys
290 295 300 Val Ile Phe Ala Val Gly Ser Asp Gln Thr Leu Lys Glu Ile
Ala Asp 305 310 315 320 Ser Leu Ile Leu Arg Glu Ile Pro Ala Phe Asp
Val Val Tyr Thr Ala 325 330 335 Ile Thr Ile Ser His Ser Gly Arg Met
Ile Phe Val Gly Thr Ser Val 340 345 350 Gly Thr Ile Arg Ala Met Lys
Tyr Pro Leu Pro Leu Gln Arg Glu Phe 355 360 365 Asn Glu Tyr Gln Ala
His Ala Gly Pro Val Thr Lys Ile Leu Leu Thr 370 375 380 Phe Asp Asp
Gln Phe Leu Leu Thr Val Ser Glu Asp Gly Cys Leu Phe 385 390 395 400
Thr Trp Lys Val Phe Asp Lys Glu Gly Arg Gly Ile Lys Arg Glu Arg 405
410 415 Glu Val Gly Phe Ala Glu Glu Val Leu Val Thr Lys Thr Asp Met
Glu 420 425 430 Glu Lys Ile Leu His Arg Asn Leu Ala Thr Glu Phe Arg
Arg Pro Met 435 440 445 Ser Lys His Leu Glu Cys Pro Thr Ser Glu Thr
Gly Pro Leu Thr Thr 450 455 460 Ile Asn Ile Ser Pro Val Gln Pro Arg
Pro Trp Gly His Val Leu Thr 465 470 475 480 Cys Arg Thr Pro Val Ser
Thr Asp Ser Ala Val Ala Ser Thr Arg Gly 485 490 495 Ser Val Asp Ser
Ala Val Lys Pro Asp Arg Ser Thr Pro Thr Gln Glu 500 505 510 Val Arg
Ile Pro Pro Lys Pro Ala Ser Gly Val His Thr Arg Cys Gln 515 520 525
Leu Gly Val Gln Lys Gln Met Glu His Val Ser Val Val Met Glu Val 530
535 540 Arg Glu Thr Asn Arg Gln Arg Gln Gly Gly Gly Ala Arg Asn Val
Ile 545 550 555 560 Lys Ala Gln Ile Met Leu Glu Leu Lys Thr Arg Val
Glu Glu Leu Lys 565 570 575 Met Glu Asn Glu Tyr Gln Leu Arg Leu Lys
Asp Met Asn Tyr Ser Glu 580 585 590 Lys Ile Lys Glu Leu Thr Asp Lys
Phe Ile Gln Glu Met Glu Ser Leu 595 600 605 Lys Thr Lys Asn Gln Val
Leu Lys Thr Glu Lys Glu Lys Gln Asp Ile 610 615 620 Ser His Arg Glu
His Leu Glu Asp Leu Ile Glu Arg Gln Ser Arg Glu 625 630 635 640 Leu
Gln Asp Leu Glu Cys Cys Asn Asn Gln Lys Leu Leu Leu Glu Tyr 645 650
655 Glu Lys Tyr Gln Glu Leu Gln Leu Lys Ser Gln Arg Met Gln Glu Glu
660 665 670 Tyr Glu Lys Gln Leu Arg Asp Asn Asp Glu Thr Lys Ser Gln
Ala Leu 675 680 685 Glu Glu Leu Thr Glu Phe Tyr Glu Ala Lys Leu Gln
Glu Lys Thr Gly 690 695 700 Leu Leu Glu Glu Ala Leu Ser Thr Ala Ala
Ser Pro Pro Leu Pro Ser 705 710 715 720 Ala His Val Leu Ser Pro Phe
Pro Thr Leu Ser Gln Ala Gln Glu Asp 725 730 735 Val Arg Gln Gln Leu
Arg Glu Phe Glu Glu Thr Lys Lys Gln Ile Glu 740 745 750 Glu Asp Glu
Asp Arg Glu Ile Gln Asp Ile Lys Thr Lys Tyr Glu Arg 755 760 765 Lys
Leu Arg Asp Glu Lys Glu Ser Asn Leu Arg Leu Lys Gly Glu Thr 770 775
780 Gly Ile Met Arg Lys Lys Phe Ser Ser Leu Gln Lys Glu Ile Glu Glu
785 790 795 800 Arg Thr Asn Asp Ile Glu Leu Leu Lys Thr Glu Gln Val
Lys Leu Gln 805 810 815 Gly Val Ile Arg Ser Leu Glu Lys Asp Ile Gln
Gly Leu Lys Arg Glu 820 825 830 Ile Gln Glu Arg Asp Glu Thr Ile Gln
Asp Lys Glu Lys Arg Ile Tyr 835 840 845 Asp Leu Lys Lys Lys Asn Gln
Glu Leu Glu Lys Phe Lys Phe Val Leu 850 855 860 Asp Tyr Lys Ile Lys
Glu Leu Lys Lys Gln Ile Glu Pro Arg Glu Asn 865 870 875 880 Glu Ile
Lys Val Met Lys Glu Gln Ile Gln Glu Asn Pro Val Asn His 885 890 895
Trp Leu Arg Ser Arg Glu Arg Glu Cys Val Thr Gln Pro Arg His Leu 900
905 910 Arg Leu Pro Ala Pro Gln Asn Lys Leu Asp Gly Asn Leu Ala Cys
Gly 915 920 925 Pro Val Arg Gly Arg Leu Cys His Ser Asp Ala Thr Ser
Gly Ala Leu 930 935 940 Asn Val Gln Gly Ile Leu Cys Leu Phe His Leu
Pro Phe Pro Cys Asp 945 950 955 960 Arg Thr Pro Ser Phe Phe Pro Gly
Glu Ala Cys Leu Leu Val Phe Ser 965 970 975 Leu Leu Ile Asp Val Leu
Cys Arg Pro Thr Ser Asp Val Pro Val Ala 980 985 990 Ala Gly Asp Phe
Leu Pro Cys Gly Gly Pro Leu His Leu Pro Pro Glu 995 1000 1005 Leu
His His Leu Thr Val Ile Arg Thr Asn Ala Ser Pro Gln Lys 1010 1015
1020 Cys Tyr Pro Pro Thr Ser Pro Leu
1025 1030 88 70 PRT Mus musculus 88 Lys Lys Phe Ser Ser Leu Gln Lys
Glu Ile Glu Glu Arg Thr Asn Asp 1 5 10 15 Ile Glu Leu Leu Lys Ser
Glu Arg Met Lys Leu Gln Gly Ile Ile Arg 20 25 30 Ser Leu Glu Lys
Asp Ile Gln Gly Leu Lys Arg Glu Ile Gln Glu Arg 35 40 45 Asp Glu
Thr Ile Gln Asp Met Glu Lys Leu Asp Tyr Lys Asp Asp Tyr 50 55 60
Asn Ser Asn Leu Glu Ile 65 70 89 56 PRT Mus musculus 89 Lys Lys Phe
Ser Ser Leu Gln Lys Glu Ile Glu Glu Arg Thr Asn Asp 1 5 10 15 Ile
Glu Leu Leu Lys Ser Glu Arg Met Lys Leu Gln Gly Ile Ile Arg 20 25
30 Ser Leu Glu Lys Asp Ile Gln Gly Leu Lys Arg Glu Ile Gln Glu Arg
35 40 45 Asp Glu Thr Ile Gln Asp Met Glu 50 55 90 217 PRT Mus
musculus 90 Met Glu Val Glu Asn Glu Ala His Cys Cys Pro Gly Ser Ser
Ser Gly 1 5 10 15 Gly Ser Arg Glu Tyr Lys Val Val Met Leu Gly Ala
Gly Gly Val Gly 20 25 30 Lys Ser Ala Val Thr Met Gln Phe Ile Ser
His Gln Phe Pro Asp Tyr 35 40 45 His Asp Pro Thr Ile Glu Asp Ala
Tyr Lys Thr Gln Val Arg Ile Asp 50 55 60 Asn Glu Pro Ala Tyr Leu
Asp Ile Leu Asp Thr Ala Gly Gln Ala Glu 65 70 75 80 Phe Thr Ala Met
Arg Glu Gln Tyr Met Arg Gly Gly Glu Gly Phe Ile 85 90 95 Ile Cys
Tyr Ser Val Thr Asp Arg Gln Ser Phe Gln Glu Ala Ala Lys 100 105 110
Phe Lys Glu Leu Ile Phe Gln Val Arg His Thr Tyr Glu Ile Pro Leu 115
120 125 Val Leu Val Gly Asn Lys Ile Asp Leu Glu Gln Phe Arg Gln Val
Ser 130 135 140 Thr Glu Glu Gly Met Asn Leu Ala Arg Asp Tyr Asn Cys
Ala Phe Phe 145 150 155 160 Glu Thr Ser Ala Ala Leu Arg Phe Gly Ile
Asp Asp Ala Phe Gln Gly 165 170 175 Leu Val Arg Glu Ile Arg Arg Lys
Glu Ser Met Leu Ser Leu Val Glu 180 185 190 Arg Lys Leu Lys Arg Lys
Asp Ser Leu Trp Lys Lys Ile Lys Ala Ser 195 200 205 Leu Lys Lys Lys
Arg Glu Asn Met Leu 210 215 91 50 PRT Mus musculus 91 Ala Ala Leu
Arg Phe Gly Ile Asp Asp Ala Leu Gln Gly Leu Val Arg 1 5 10 15 Glu
Ile Arg Arg Lys Glu Ser Met Leu Pro Leu Val Glu Arg Lys Leu 20 25
30 Lys Arg Lys Asp Ser Leu Trp Lys Lys Ile Lys Ala Ser Leu Lys Lys
35 40 45 Lys Arg 50 92 140 PRT Mus musculus 92 Gly Ala Thr Val Ile
Thr Asn Leu Leu Ser Ala Ile Pro Tyr Ile Gly 1 5 10 15 Thr Thr Leu
Val Glu Trp Ile Trp Gly Gly Phe Ser Val Asp Lys Ala 20 25 30 Thr
Leu Thr Arg Phe Phe Ala Phe His Phe Ile Leu Pro Phe Ile Ile 35 40
45 Ala Ala Leu Ala Ile Val His Leu Leu Phe Leu His Glu Thr Gly Ser
50 55 60 Asn Asn Pro Thr Gly Leu Asn Ser Asp Ala Asp Lys Ile Pro
Phe His 65 70 75 80 Pro Tyr Tyr Thr Ile Lys Asp Ile Leu Gly Ile Leu
Ile Met Phe Leu 85 90 95 Ile Leu Met Thr Leu Val Leu Phe Phe Pro
Asp Met Leu Gly Asp Pro 100 105 110 Asp Asn Tyr Met Pro Ala Asn Pro
Leu Asn Thr Pro Pro His Ile Lys 115 120 125 Pro Glu Trp Tyr Phe Leu
Phe Ala Tyr Ala Ile Leu 130 135 140 93 41 PRT Mus musculus 93 Ser
Asp Ala Asp Lys Ile Pro Phe His Pro Tyr Tyr Thr Ile Lys Asn 1 5 10
15 Ile Leu Gly Ile Leu Ile Ile Phe Leu Ile Leu Ile Thr Leu Val Leu
20 25 30 Phe Phe Pro Asp Ile Leu Gly Asp Pro 35 40 94 311 PRT Mus
musculus 94 Met Lys Ala Leu Trp Ala Val Leu Leu Val Thr Leu Leu Thr
Gly Cys 1 5 10 15 Leu Ala Glu Gly Glu Pro Glu Val Thr Asp Gln Leu
Glu Trp Gln Ser 20 25 30 Asn Gln Pro Trp Glu Gln Ala Leu Asn Arg
Phe Trp Asp Tyr Leu Arg 35 40 45 Trp Val Gln Thr Leu Ser Asp Gln
Val Gln Glu Glu Leu Gln Ser Ser 50 55 60 Gln Val Thr Gln Glu Leu
Thr Ala Leu Met Glu Asp Thr Met Thr Glu 65 70 75 80 Val Lys Ala Tyr
Lys Lys Glu Leu Glu Glu Gln Leu Gly Pro Val Ala 85 90 95 Glu Glu
Thr Arg Ala Arg Leu Gly Lys Glu Val Gln Ala Ala Gln Ala 100 105 110
Arg Leu Gly Ala Asp Met Glu Asp Leu Arg Asn Arg Leu Gly Gln Tyr 115
120 125 Arg Asn Glu Val His Thr Met Leu Gly Gln Ser Thr Glu Glu Ile
Arg 130 135 140 Ala Arg Leu Ser Thr His Leu Arg Lys Met Arg Lys Arg
Leu Met Arg 145 150 155 160 Asp Ala Asp Asp Leu Gln Lys Arg Leu Ala
Val Tyr Lys Ala Gly Ala 165 170 175 Arg Glu Gly Ala Glu Arg Gly Val
Ser Ala Ile Arg Glu Arg Leu Gly 180 185 190 Pro Leu Val Glu Gln Gly
Arg Gln Arg Thr Ala Asn Leu Gly Ala Gly 195 200 205 Ala Ala Gln Pro
Leu Arg Asp Arg Ala Gln Ala Phe Gly Asp Arg Ile 210 215 220 Arg Gly
Arg Leu Glu Glu Val Gly Asn Gln Ala Arg Asp Arg Leu Glu 225 230 235
240 Glu Val Arg Glu His Met Glu Glu Val Arg Ser Lys Met Glu Glu Gln
245 250 255 Thr Gln Gln Ile Arg Leu Gln Ala Glu Ile Phe Gln Ala Arg
Leu Lys 260 265 270 Gly Trp Phe Glu Pro Ile Val Glu Asp Met His Arg
Gln Trp Ala Asn 275 280 285 Leu Met Glu Lys Ile Gln Ala Ser Val Ala
Thr Asn Pro Ile Ile Thr 290 295 300 Pro Val Ala Gln Glu Asn Gln 305
310 95 31 PRT Mus musculus 95 Thr Glu Val Lys Ala Tyr Lys Lys Glu
Leu Glu Glu Gln Leu Gly Pro 1 5 10 15 Val Ala Glu Glu Thr Arg Ala
Arg Leu Gly Lys Glu Glu Gln Gly 20 25 30 96 695 PRT Mus musculus 96
Met Leu Pro Ser Leu Ala Leu Leu Leu Leu Ala Ala Trp Thr Val Arg 1 5
10 15 Ala Leu Glu Val Pro Thr Asp Gly Asn Ala Gly Leu Leu Ala Glu
Pro 20 25 30 Gln Ile Ala Met Phe Cys Gly Lys Leu Asn Met His Met
Asn Val Gln 35 40 45 Asn Gly Lys Trp Glu Ser Asp Pro Ser Gly Thr
Lys Thr Cys Ile Gly 50 55 60 Thr Lys Glu Gly Ile Leu Gln Tyr Cys
Gln Glu Val Tyr Pro Glu Leu 65 70 75 80 Gln Ile Thr Asn Val Val Glu
Ala Asn Gln Pro Val Thr Ile Gln Asn 85 90 95 Trp Cys Lys Arg Gly
Arg Lys Gln Cys Lys Thr His Thr His Ile Val 100 105 110 Ile Pro Tyr
Arg Cys Leu Val Gly Glu Phe Val Ser Asp Ala Leu Leu 115 120 125 Val
Pro Asp Lys Cys Lys Phe Leu His Gln Glu Arg Met Asp Val Cys 130 135
140 Glu Thr His Leu His Trp His Thr Val Ala Lys Glu Thr Cys Ser Glu
145 150 155 160 Lys Ser Thr Asn Leu His Asp Tyr Gly Met Leu Leu Pro
Cys Gly Ile 165 170 175 Asp Lys Phe Arg Gly Val Glu Phe Val Cys Cys
Pro Leu Ala Glu Glu 180 185 190 Ser Asp Ser Val Asp Ser Ala Asp Ala
Glu Glu Asp Asp Ser Asp Val 195 200 205 Trp Trp Val Gly Ala Asp Thr
Asp Tyr Ala Asp Gly Gly Glu Asp Lys 210 215 220 Val Val Glu Val Ala
Glu Glu Glu Glu Val Ala Asp Val Glu Glu Glu 225 230 235 240 Glu Ala
Asp Asp Asp Glu Asp Val Glu Asp Gly Asp Glu Val Glu Glu 245 250 255
Glu Ala Glu Glu Pro Tyr Glu Glu Ala Thr Glu Arg Thr Thr Ser Thr 260
265 270 Ala Thr Thr Thr Thr Thr Thr Thr Glu Ser Val Glu Glu Val Val
Arg 275 280 285 Val Pro Thr Thr Ala Ala Ser Thr Pro Asp Ala Val Asp
Lys Tyr Leu 290 295 300 Glu Thr Pro Gly Asp Glu Asn Glu His Ala His
Phe Gln Lys Ala Lys 305 310 315 320 Glu Arg Leu Glu Ala Lys His Arg
Glu Arg Met Ser Gln Val Met Arg 325 330 335 Glu Trp Glu Glu Ala Glu
Arg Gln Ala Lys Asn Leu Pro Lys Ala Asp 340 345 350 Lys Lys Ala Val
Ile Gln His Phe Gln Glu Lys Val Glu Ser Leu Glu 355 360 365 Gln Glu
Ala Ala Asn Glu Arg Gln Gln Leu Val Glu Thr His Met Ala 370 375 380
Arg Val Glu Ala Met Leu Asn Asp Arg Arg Arg Leu Asp Leu Glu Asn 385
390 395 400 Tyr Ile Ile Ala Leu Gln Ala Val Pro Pro Arg Pro His His
Val Phe 405 410 415 Asn Met Leu Lys Lys Tyr Val Arg Ala Glu Gln Lys
Asp Arg Gln His 420 425 430 Thr Leu Lys His Phe Glu His Val Arg Met
Val Asp Pro Lys Lys Ala 435 440 445 Thr Gln Ile Arg Ser Gln Val Met
Thr His Leu Arg Val Ile Tyr Glu 450 455 460 Arg Met Asn Gln Ser Leu
Ser Leu Leu Tyr Asn Val Pro Ala Val Ala 465 470 475 480 Glu Glu Ile
Gln Asp Glu Val Asp Glu Leu Leu Gln Lys Glu Gln Asn 485 490 495 Tyr
Ser Asp Asp Val Leu Ala Asn Met Ile Ser Glu Pro Arg Ile Ser 500 505
510 Tyr Gly Asn Asp Ala Leu Met Pro Ser Leu Thr Glu Thr Lys Thr Thr
515 520 525 Val Glu Leu Leu Pro Val Asn Gly Glu Phe Ser Leu Asp Asp
Leu Gln 530 535 540 Pro Trp His Pro Phe Gly Val Asp Ser Val Pro Ala
Asn Thr Glu Asn 545 550 555 560 Glu Val Glu Pro Val Asp Ala Arg Pro
Ala Ala Asp Arg Gly Leu Thr 565 570 575 Thr Arg Pro Gly Ser Gly Leu
Thr Asn Ile Lys Thr Glu Glu Ile Ser 580 585 590 Glu Val Lys Met Asp
Ala Glu Phe Gly His Asp Ser Gly Phe Glu Val 595 600 605 Arg His Gln
Lys Leu Val Phe Phe Ala Glu Asp Val Gly Ser Asn Lys 610 615 620 Gly
Ala Ile Ile Gly Leu Met Val Gly Gly Val Val Ile Ala Thr Val 625 630
635 640 Ile Val Ile Thr Leu Val Met Leu Lys Lys Lys Gln Tyr Thr Ser
Ile 645 650 655 His His Gly Val Val Glu Val Asp Ala Ala Val Thr Pro
Glu Glu Arg 660 665 670 His Leu Ser Lys Met Gln Gln Asn Gly Tyr Glu
Asn Pro Thr Tyr Lys 675 680 685 Phe Phe Glu Gln Met Gln Asn 690 695
97 68 PRT Mus musculus 97 Ser Glu Pro Arg Ile Ser Tyr Gly Asn Asp
Ala Leu Met Pro Ser Leu 1 5 10 15 Thr Glu Thr Lys Thr Thr Val Glu
Leu Leu Pro Val Asn Gly Glu Phe 20 25 30 Ser Leu Asp Asp Leu Gln
Pro Trp His Pro Ser Gly Val Asp Ser Val 35 40 45 Pro Ala Asn Thr
Glu Asn Glu Val Glu Pro Val Asp Ala Arg Pro Ala 50 55 60 Ala Asp
Arg Gly 65 98 983 PRT Mus musculus 98 Met Lys Ala Gln Gln Ala Met
Asp Lys Tyr Glu Gly Asp Ser Lys Ala 1 5 10 15 Arg Glu Thr Arg Ser
Thr Ala Ala Met Val Gly Trp Arg Ser Asp Arg 20 25 30 Gly Leu Val
Thr Cys Thr Arg Leu Arg Met Gln Asn Gly Ser Ser Leu 35 40 45 Lys
Ala Phe Arg Ser Arg Val Gly Lys Trp Gly Glu Pro Ser Ser Arg 50 55
60 Ser His Lys Val Leu Lys Thr Ser Glu Thr Ser Gln Asp Ile Gln Lys
65 70 75 80 Val Ser Arg Glu Glu Ser Pro Ser Gln Leu Thr Ser Ala Val
Pro Ala 85 90 95 Gln Arg Asn Cys Gln Pro Gly Ser Ala Ala Val Ile
Asn Met Leu Arg 100 105 110 Gly Gly Gly Gly Val Arg Ser Pro Trp Thr
Asp His His Ile Arg Gln 115 120 125 Arg Thr Asp His His Ile Arg Gln
Pro Leu Phe Pro Ser Arg Arg Ser 130 135 140 Pro Gln Glu Asn Glu Asp
Asp Asp Asp Asp Tyr Gln Met Phe Val Pro 145 150 155 160 Ser Phe Ser
Ser Ser Asp Leu Asn Ser Thr Arg Leu Cys Glu Glu Asn 165 170 175 Ala
Ser Ser Arg Pro Cys Ser Trp His Leu Gly Leu Ile Glu Pro Thr 180 185
190 Glu Ile Ser Ser Ser Gly His Arg Ile Val Arg Arg Ala Ser Ser Ala
195 200 205 Gly Glu Ser Asn Ala Cys Pro Pro Glu Val Arg Ile Arg Asp
Cys Asp 210 215 220 Asp Ser Gln Tyr Cys Pro Gly Arg Gln Leu Gln Asn
Ser Pro Arg Pro 225 230 235 240 Gly Gly Glu Arg Gly Met Thr Pro Tyr
Gly Ser Ser Val Glu Leu Thr 245 250 255 Ile Asp Asp Ile Asp His Val
Tyr Asp Asn Ile Ser Phe Glu Asp Leu 260 265 270 Lys Leu Met Val Ala
Lys Arg Asp Glu Thr Glu Cys Ser Phe Ser Lys 275 280 285 Pro Ser Arg
Asp Ser Val Arg Pro Lys Ser Thr Pro Glu Leu Ala Phe 290 295 300 Ser
Lys Arg Gln Val Ser His Ser Thr Ser Ser Leu His Ser Arg Lys 305 310
315 320 Glu Ala Gly Leu Gly Gly Gln Glu Ala Ser Thr Gln Ser Val His
Glu 325 330 335 His Gln Glu Val Glu Glu Asn Ile Tyr Asp Thr Ile Gly
Leu Pro Asp 340 345 350 Pro Pro Ser Met Asn Leu Asn His Ser Ser Leu
His Gln Pro Lys Arg 355 360 365 Ser Thr Phe Leu Gly Leu Glu Ala Asp
Phe Ala Cys Cys Asp Ser Leu 370 375 380 Arg Pro Phe Val Ser Gln Asp
Ser Leu Gln Phe Ser Glu Asp Asp Ile 385 390 395 400 Ser Tyr His Gln
Gly Pro Ser Asp Thr Glu Tyr Leu Ser Leu Leu Tyr 405 410 415 Asp Ser
Pro Arg Cys Asn Leu Pro Ile Ala Asp Lys Ala Leu Ser Asp 420 425 430
Lys Leu Ser Glu Glu Val Asp Glu Ile Trp Asn Asp Leu Glu Asn Tyr 435
440 445 Ile Lys Lys Asn Glu Asp Lys Ser Arg Asp Arg Leu Leu Ala Ala
Phe 450 455 460 Pro Val Ser Lys Asp Asp Ala Pro Glu Arg Leu Tyr Val
Asp Ser Thr 465 470 475 480 His Glu Leu Gly Arg Asp Thr Gly His Ala
Thr Ser Met Leu Ala Leu 485 490 495 Pro Thr Ser Gln Thr Phe Leu Leu
Pro Gly Lys Ser Arg Val Val Arg 500 505 510 Ala Ser Arg Ala Asn Cys
Ser Leu Asp Asn Asp Ile Ile Ser Thr Glu 515 520 525 Gly Ser Phe Leu
Ser Leu Asn Gln Leu Ser Leu Ala Ser Asp Gly Pro 530 535 540 Pro Val
Asp Asn Pro Tyr Asp Leu Ala Asn Cys Ser Leu Pro Gln Thr 545 550 555
560 Asp Pro Glu Asn Pro Asp Pro Gly Met Glu Val Thr Asp Lys Thr Lys
565 570 575 Ser Arg Val Phe Met Met Ala Arg Gln Tyr Ser Gln Lys Ile
Lys Lys 580 585 590 Val Asn Gln Ile Leu Lys Val Lys Ser Pro Glu Leu
Glu Gln Pro Pro 595 600 605 Ser Ser Gln His Arg Pro Ser His Lys Asp
Leu Val Ala Ile Leu Glu 610 615 620 Glu Lys Arg Gln Gly Gly Pro Ala
Ile Gly Ala Arg Ile Ala Glu Tyr 625 630 635 640 Ser Gln Leu Tyr Asp
Gln Ile Val Phe Arg Glu Thr Pro Leu Lys Ala 645 650 655 Gln Lys Asp
Gly Trp Ala Ser Pro Gln Gly Pro Thr Leu His Arg Pro 660 665 670 Val
Ser Pro Pro Gln Ala Gln Gly Ala Gly Glu Asp Trp Leu Trp His 675 680
685 Ser Pro Tyr Ser Asn Gly Glu Leu Ala Asp Phe Ser Pro Gln Thr Glu
690 695 700 Gln Asp Ser Lys Ser Lys Tyr Pro Ile Thr Leu Glu Ser Thr
Thr Lys 705 710 715 720 Ile Arg Pro Arg Gln Leu Ser Gly Ala Cys Ser
Val Pro Ser Leu Gln
725 730 735 Val Ser Asp Pro Leu Leu Gly Ser Val Gln Gln Arg Cys Ser
Val Val 740 745 750 Val Ser Gln Pro His Lys Glu Asn Ser Gly Gln Ser
Pro Leu Tyr Asn 755 760 765 Ser Leu Gly Arg Lys Ala Ile Ser Ala Lys
Pro Gln Pro Tyr Ser Arg 770 775 780 Pro Gln Ser Ser Ser Ser Ile Leu
Ile Asn Lys Ser Leu Asp Ser Ile 785 790 795 800 Asn Tyr Pro Ser Glu
Thr Glu Thr Lys Gln Leu Leu Ser Ser Gln Lys 805 810 815 Ser Pro Arg
Gly Ala Ser Gln Gln Asp Leu Pro Ser Gly Leu Ala Asn 820 825 830 Ser
Cys Gln Gln Asp Arg Gly Lys Arg Ser Asp Leu Thr Leu Gln Asp 835 840
845 Ser Gln Lys Val Leu Val Val Asn Arg Asn Leu Pro Leu Ser Ala Gln
850 855 860 Ile Ala Thr Gln Asn Tyr Phe Cys Asn Phe Lys Asp Pro Glu
Gly Asp 865 870 875 880 Glu Asp Asp Tyr Val Glu Ile Lys Ser Glu Glu
Asp Glu Val Arg Leu 885 890 895 Asp Leu Ser Pro Arg Arg Gly Arg Lys
Ser Asp Pro Gln Thr Pro Asp 900 905 910 Pro Asp Cys Ser Asp Ser Ile
Cys Ser His Ser Thr Pro Tyr Ser Leu 915 920 925 Lys Glu Pro Val Ser
Gly Arg Leu Gly Leu Pro Pro Tyr Leu Thr Ala 930 935 940 Cys Lys Asp
Ser Asp Lys Leu Asn Asp Tyr Leu Trp Arg Gly Pro Ser 945 950 955 960
Pro Asn Gln Gln Asn Ile Val Gln Ser Leu Arg Glu Lys Phe Gln Cys 965
970 975 Leu Ser Ser Ser Ser Phe Ala 980 99 46 PRT Mus musculus 99
Phe Met Met Ala Arg Gln Tyr Ser Gln Lys Ile Lys Lys Val Asn Gln 1 5
10 15 Ile Leu Lys Val Lys Ser Pro Glu Leu Glu Gln Pro Pro Ser Ser
Gln 20 25 30 His Arg Pro Ser His Lys Asp Leu Ala Ala Ile Leu Glu
Lys 35 40 45 100 412 PRT Mus musculus 100 Met Ala Asn Val Ala Asp
Thr Lys Leu Tyr Asp Ile Leu Gly Val Pro 1 5 10 15 Pro Gly Ala Ser
Glu Asn Glu Leu Lys Lys Ala Tyr Arg Lys Leu Ala 20 25 30 Lys Glu
Tyr His Pro Asp Lys Asn Pro Asn Ala Gly Asp Lys Phe Lys 35 40 45
Glu Ile Ser Phe Ala Tyr Glu Val Leu Ser Asn Pro Glu Lys Arg Glu 50
55 60 Leu Tyr Asp Arg Tyr Gly Glu Gln Gly Leu Arg Glu Gly Ser Gly
Gly 65 70 75 80 Gly Gly Gly Met Asp Asp Ile Phe Ser His Ile Phe Gly
Gly Gly Leu 85 90 95 Phe Gly Phe Met Gly Asn Gln Ser Arg Ser Arg
Asn Gly Arg Arg Arg 100 105 110 Gly Glu Asp Met Met His Pro Leu Lys
Val Ser Leu Glu Asp Leu Tyr 115 120 125 Asn Gly Lys Thr Thr Lys Leu
Gln Leu Ser Lys Asn Val Leu Cys Ser 130 135 140 Ala Cys Ser Gly Gln
Gly Gly Lys Ser Gly Ala Val Gln Lys Cys Ser 145 150 155 160 Ala Cys
Arg Gly Arg Gly Val Arg Ile Met Ile Arg Gln Leu Ala Pro 165 170 175
Gly Met Val Gln Gln Met Gln Ser Val Cys Ser Asp Cys Asn Gly Glu 180
185 190 Gly Glu Val Ile Asn Glu Lys Asp Arg Cys Lys Lys Cys Glu Gly
Lys 195 200 205 Lys Val Ile Lys Glu Val Lys Ile Leu Glu Val His Val
Asp Lys Gly 210 215 220 Met Lys His Gly Gln Arg Ile Thr Phe Thr Gly
Glu Ala Asp Gln Ala 225 230 235 240 Pro Gly Val Glu Pro Gly Asp Ile
Val Leu Leu Leu Gln Glu Lys Glu 245 250 255 His Glu Val Phe Gln Arg
Asp Gly Asn Asp Leu His Met Thr Tyr Lys 260 265 270 Ile Gly Leu Val
Glu Ala Leu Cys Gly Phe Gln Phe Thr Phe Lys His 275 280 285 Leu Asp
Ala Arg Gln Ile Val Val Lys Tyr Pro Pro Gly Lys Val Ile 290 295 300
Glu Pro Gly Cys Val Arg Val Val Arg Gly Glu Gly Met Pro Gln Tyr 305
310 315 320 Arg Asn Pro Phe Glu Lys Gly Asp Leu Tyr Ile Lys Phe Asp
Val Gln 325 330 335 Phe Pro Glu Asn Asn Trp Ile Asn Pro Asp Lys Leu
Ser Glu Leu Glu 340 345 350 Asp Leu Leu Pro Ser Arg Pro Glu Val Pro
Asn Val Ile Gly Glu Thr 355 360 365 Glu Glu Val Glu Leu Gln Glu Phe
Asp Ser Thr Arg Gly Ser Gly Gly 370 375 380 Gly Gln Arg Arg Glu Ala
Tyr Asn Asp Ser Ser Asp Glu Glu Ser Ser 385 390 395 400 Ser His His
Gly Pro Gly Val Gln Cys Ala His Gln 405 410 101 25 PRT Mus musculus
101 Leu Ser Asn Pro Glu Lys Arg Glu Leu Tyr Asp Arg Tyr Gly Glu Gln
1 5 10 15 Gly Leu Arg Glu Gly Ser Gly Gly Gly 20 25 102 29 PRT Mus
musculus 102 Ala His Ser Phe Ser Val Phe Arg Leu Pro Ser Trp Trp
Ile Val Gly 1 5 10 15 Trp Trp Ser Lys Gly Gly Val Gly Ser Asp Leu
Glu Met 20 25 103 35 PRT Mus musculus 103 Pro Asp Ile Lys His Pro
Gly Asn Leu Glu His Tyr Ile Lys Arg Val 1 5 10 15 Asn Leu Arg Ile
Ile Ala Ile Glu Glu Gly Glu Lys Ser Gln Leu Lys 20 25 30 Gly Pro
Lys 35 104 573 DNA Mus musculus 104 atgccattga ggcatctagc
agacagattg gggcatctgg cagacagact gaggcatcta 60 acagacagat
tgaggcatct agcagacaga ctgaggcatc taacagacag actgaggcat 120
ctagcagaca gactgaagca tctagcagac agactgaaac atctaacaga cagattgggg
180 catctaacag acagatcatg gcatctaaca gacagattgg ggcatctaac
agacagattg 240 aggcatctaa cagacagatt ggggcatcta acagacagac
agaggtatct agcagacaga 300 ttgaggcatc taacagacag attggggcat
ctaacagaca aactgaggca tctaacagac 360 agattggggc atctaacaga
cagactgagg catctaacag acagattggg gcatctaaca 420 gacagactga
tgcatctaac agacagactg atgcatctaa cagacagact gaggcatcta 480
gcagacagac agaggcatct agcagacaga cagaggcatc tagcagacag actgaggcat
540 ctagcagaca aattgaggca tcagctgcag ctg 573 105 213 DNA Mus
musculus 105 attgaggcat ctaacagaca gattggggca tctaacagac aaactgaggc
atctaacaga 60 cagattgggc atctaacaga cagactgagg catctaacag
acagattggg gcatctaaca 120 gacagactga tgcatctaac agacagactg
atgcatctaa cagacagact gaggcatcta 180 gcagacagac agaggcatct
agcagacaga ctg 213 106 165 DNA Mus musculus 106 gacaaactga
ggcatctaac agacagattg gggcatctaa cagacagact gaggcatcta 60
acagacagat tggggcatct aacagacaga ctgatgcatc taacagacag actgatgcat
120 ctaacagaca gactgaggca tctagcagac agacagaggc acgac 165 107 135
DNA Mus musculus 107 gacagattgg ggcatctaac agacagactg aggcatctaa
cagacagatt ggggcatcta 60 acagacagac tgatgcatct aacagacaga
ctgatgcatc taacagacag actgaggcat 120 ctagcagtca gacag 135 108 135
DNA Mus musculus 108 gacagattgg ggcatctaac agacagactg aggcatctaa
cagacagatt gggacatcta 60 acagacagac tgatgcatct aacagacaga
ctgatgcatc taacagacag actgaggcat 120 ctagcagtca gacag 135 109 135
DNA Mus musculus 109 gacagattgg ggcatctaac agacagactg aggcatctaa
cagacagatt ggggcatcta 60 acagacagac tgatgcatct aacagacaga
ctgatgcatc taacagatag actgaggcat 120 ctagcagtca gacag 135 110 135
DNA Mus musculus 110 gacagattgg ggcatctaac agacagactg aggcatctaa
cagacagatt ggggcatcta 60 acggacagac tgatgcatct aacagacaga
ctgatgcatc taacagacag actgaggcat 120 ctagcagtca gacag 135 111 135
DNA Mus musculus 111 gacagattgg ggcatctaac agacagactg aggcatctaa
cagacagatt ggggcatcta 60 acagacagac tgatgcatct aacagacaga
ctgatgcatc taacagacag actgagacat 120 ctagcagtca gacag 135 112 135
DNA Mus musculus 112 gacagattgg ggcgtctaac agacagactg aggcatctaa
cagacagatt ggggcatcta 60 acagacagac tgatgcatct aacagacaga
ctgatgcatc taacagacag actgaggcat 120 ctagcagtca gacag 135 113 135
DNA Mus musculus 113 gacagattgg ggcgtctaac agacagactg aggcatctaa
cagacagatt ggggcatcta 60 acagacaggc tgatgcatct aacagacaga
ctgatgcatc taacagacag actgaggcat 120 ctagcagtca gacag 135 114 135
DNA Mus musculus 114 gacagattgg ggcatctaac agacagactg aggtatctaa
cagacagatt ggggcatcta 60 acagacagac tgatgcatct aacagacaga
ctgatgcatc taacagacag actgaggcat 120 ctagcagtca gacag 135 115 135
DNA Mus musculus 115 gacagattgg ggcatctaac agacagactg aggcatctaa
cagacagatt ggggcatcta 60 acagacagac tgatgcatct aacagacaga
ctgacgcatc taacagacag actgaggcat 120 ctagcagtca gacag 135 116 135
DNA Mus musculus 116 gacagattgg ggcatctaac agacagactg aagcatctaa
cagacagatt ggggcatcta 60 acagacagac tgatccatct aacagacaga
ctgatgcatc taacagacag actgaggcat 120 ctagcagtca gacag 135 117 135
DNA Mus musculus 117 ggcagattgg ggcatctaac agacagactg aggcatctaa
cagacagatt ggggcatcta 60 acagacagac tgatgcatct aacagacaga
ctgatgcatc taacagacag actgaggcat 120 ctagcggtca gacag 135 118 114
DNA Mus musculus 118 gacagattga ggcatctagc agacagactg aggcatctaa
ccgacagact gaggcatcta 60 gcagacagac tgaagcatct agcagacaga
ctgaaacatc taacaaacag aaag 114 119 552 DNA Mus musculus 119
atgccattga ggcatctagc agacagattg gggcatctgg cagacagact gaggcatcta
60 acagacagat tgaggcatct agcagacaga ctgaggcatc taacagacag
actgaggcat 120 ctagcagaca gactgaggca tctagcagac agactgaaac
atctaacaga cagattgggg 180 catctaacag acagatcatg gcatctaaca
gacagattgg ggcatctaac agacagattg 240 aggcatctaa cagacagatt
ggggcatcta acagacagac agaggtatct agcagacaga 300 ttgaggcatc
taacagacag attggggcat ctaacagaca gactgaggca tctaacagac 360
agattggggc atctaacaga cagactgagg catctaacag acagattggg gcatctaaca
420 gacagactga tgcatctaac agacagactg aggcatctag cagacagaca
gaggcatcta 480 gcagacagac agaggcatct agcagacaga ctgaggcatc
tagcagacaa attgaggcat 540 cagctgcagc tg 552 120 237 DNA Mus
musculus 120 gaaaaagtga aaaccttgaa agcgcaaaac tccgagctgg catccacggc
caacacgctc 60 agggaacagg tggcactgct taagcagaaa gtcatgaacc
acgttaacag tgggtgccaa 120 ctcatgctaa cgcagcagtt gcaaacgttt
tgggaacaga ctgtcagggc tgaggggcaa 180 tggaagaaaa aaaataacag
agacaaactt gagaacttga ctggttgcga cagagaa 237 121 228 DNA Mus
musculus 121 cggatcaagg cagagaggaa gcgcatgagg aaccgcattg ccgcctccga
gtgccggaaa 60 aggaagctgg agcggatcgc tcggctagag gaaaaagtga
aaaccttgaa agcgcaaaac 120 tccgagctgg catccacggc caacatgctc
agggaacagg tggcacagct taagcagaaa 180 gtcatgaacc acgttaacag
tgggtgccaa ctcatgctaa cacagcag 228 122 149 DNA Mus musculus 122
gggcatctgg cagacagact gaggcatcta acagacagat tgaggcatct agcagacaga
60 ctgaggcatc taacagacag actgaggcat ctagcagaca gactgaggca
tctagcagac 120 agactgaaac atctaacaga cagatatgg 149 123 168 DNA Mus
musculus 123 catctaacag acagactgag gcatctaaca gacagattgg ggcatctaac
agacagactg 60 aggcatctaa cagacagatt gaggcatcta acagacagac
tgatgcatct aacagacaga 120 ctggggcatc tagcagacag acagaggcat
ctagcagaca gacagaga 168 124 132 DNA Mus musculus 124 gcagacagat
tggggcatct ggcagacaga ctgaggcatc taacagacag attgaggcat 60
ctagcagaca gactgaggca tctaacagac agactgaggc atttagcaga cagactgagg
120 catctagcag ac 132 125 132 DNA Mus musculus 125 gcagacagat
tggggcatct ggcagacaga ctgaggcatc taacagacag attgaggcat 60
ctagcagaca gactgagaca tctaacagac agactgaggc atttagcaga cagactgagg
120 catctagcag ac 132 126 81 DNA Mus musculus 126 gcagacagac
tgaggcatct aacagacaga ttgaggcatc tagcagacag actgaggcat 60
ctaacagaca gactgaggca c 81 127 159 DNA Mus musculus 127 acagacagat
tggggcatct aacagacaga ctgaggcatc taacagacag attggggcat 60
ctaacagaca gactgaggca tctaacagac agattggggc atctaacaga cagactgatg
120 catctaacag acagactgag gcatctagca gacagaccg 159 128 138 DNA Mus
musculus 128 agacagaggc atctaacaga cagactgagg catctaacag acagattggg
gcatctaaca 60 gacagactga ggcatctaac agacagattg gggcatctaa
cagacagact gatgcatcta 120 acagacagac tgaggccc 138 129 117 DNA Mus
musculus 129 gacagattga gccatctagc agacagactg aggcatctaa cagacagact
gaggcatcta 60 acagacagat tggggcatct aacagacaga ctgaggcatc
tagcagacag acagagg 117 130 117 DNA Mus musculus 130 gacagattga
ggcatctagc agacagactg aggcatctaa cagacagact gaggcatcta 60
acagacagat tggggcatct aacagacaga ctgaggcatc tagcagacag acagagg 117
131 117 DNA Mus musculus 131 ggcagattga ggcatctagc agacagactg
aggcatctaa cagacagact gaggcatcta 60 acagacagat tggggcatct
aacagacaga ctgaggcatc tagcagacag acagagg 117 132 129 DNA Mus
musculus 132 tatctagcag acagattgag gcatctaaca gacagattgg ggcatctgac
agacaggctg 60 aggcatctaa cagacagatt ggggcatcta acagacagac
tgaggcatct aacagacaga 120 ttggggcaa 129 133 129 DNA Mus musculus
133 tatctagcag acagattgag gcatctaaca gacagattga ggcatctaac
agacaggctg 60 aggcatctaa cagacagatt ggggcatcta acagacagac
tgaggcatct aacagacaga 120 ttggggcaa 129 134 228 DNA Mus musculus
134 cggatcaagg cagagaggaa gcgcatgagg aaccgcattg ccgcctccaa
gtgccggaaa 60 aggaagctgg agcggatcgc tcggctagag gaaaaagtga
aaaccttgaa agcgcaaaac 120 tccgagctgg catccacggc caacatgctc
agggaacagg tggcacagct taagcagaaa 180 gtcatgaacc acgttaacag
tgggtgccaa ctcatgctaa cacagcag 228 135 132 DNA Mus musculus 135
ttggggcatc taacagacag actgaggcat ctaacagaca gattggggca tctaacagac
60 agactgatgc atctaacaga cagactgagg catctagcag acagacagag
gcatctagca 120 gacagacaga aa 132 136 108 DNA Mus musculus 136
ttggggcatc taacagacag actgaggcat ctaacagaca gattggggca tctaacagac
60 agactgatgc atctaacaga cagactgagg catctagcag acacccag 108 137 132
DNA Mus musculus 137 gggcatctaa cagacagact gaggcatcta acggacagat
tggggcatct aacagacaga 60 ctgaggcatc taacagacag attggggcat
ctaacagaca gactgatgca tctaacagac 120 agactgaggc ac 132 138 132 DNA
Mus musculus 138 gggcatctaa cagacagact gaggcatcta acagacagat
tggggcatct aacagacaga 60 ctgaggcatc taacagacag attggggcat
ctaacagaca gactgatgca tctaacagac 120 agactgaggc ac 132 139 153 DNA
Mus musculus 139 aggcatctaa cagacagatt ggggcatcta acagacagac
tgaggcatct aacagacaga 60 ttggggcatc taacagacag actgatgcat
ctaacagaca gactgaggca tctagcagac 120 agacagaggc atctagcaga
caggcagagg cac 153 140 153 DNA Mus musculus 140 aggcatctaa
cagacagatt ggggcatcta acagacagac tgaggcatct aacagacaga 60
ttggggcgtc taacagacag actgatgcat ctaacagaca gactgaggca tctagcagac
120 agacagaggc atctagcaga cagacagagg cac 153 141 1752 DNA Mus
musculus 141 atgtcccatc aacctctgag ctgcctgact gagaaggggg acagcccttg
tgagacccca 60 ggaaatggac cctccaatat ggttcacccc agcctggaca
cattcacccc tgaggagctg 120 ctgcagcaaa tgaaggaact cctggttgag
aaccaccagc tgaaagaagc catgaagcta 180 aataatcaag ctatgaaagg
gcgatttgag gagctgtccg cctggacaga gaagcagaag 240 gaagagcgcc
tgttgtttga gatgcaaagc aaagaggtta aggagcgcct taaggccctg 300
actcatgaaa atgagaggct gaaggaagag cttggaaaat tcaaagagaa atcagaaaag
360 ccattggaag acctcacagg tggctacagg tatcccagag ccttggagga
ggaagtggag 420 aagctgaaga cccaggtgga gcaggaagtg gagcatctga
agatccaggt gatgcgcctt 480 cgggctgaaa aggcagacct gctgggcatc
gtctcagaac tgcagctcaa actcaactcc 540 ggcggctcct cggaagactc
cttcgttgag atcaggatga ccgaaggaga gactgaaggg 600 gcaatgaagg
agatgaagaa ctgccctaca cccacaagaa cagaccccat cagcttgagc 660
aactgtacag aggatgccag gagttgtgcg gagtttgaag aactgactgt gagccagctt
720 ctgctttgcc taagggaagg aaaccaaaag gtggagagac ttgaagtcgc
cctcagagaa 780 gccaaagaaa gaatttcaga ttttgaaaag aaagcaaatg
gccattcttc tactgagaag 840 cagacagcga ggagagcaga cagagagaag
gaggacaaag gccaagagag tgttggaagc 900 gaagtggaaa cactgagcat
tcaagtgacc tctctgttta aggagcttca agaggcacac 960 acaaaactca
gtgaggctga gctgatgaag aagagacttc aagaaaagtg tcaggctctg 1020
gagaggaaga actctgcaac accatcagag ctgaatgaaa agcaagagct cgtttacagt
1080 aacaagaagt tagagctgca ggtggagagc atgcgctccg aaatcaagat
ggagcaggcc 1140 aagacagagg aggagaagtc caggttagcc actctgcagg
caactcacaa caagctcctt 1200 caagaacata ataaggcact gaaaacaatt
gaagaactaa ccaagcaaca ggcagaaaag 1260 gtggacaaga tgttgctgca
ggagctcagc gagaagctgg agctggcaga gcaggctctg 1320 gcatccaaac
agctccagat ggatgagatg aagcagacgc tcgctaagca ggaggaagac 1380
ctggagacca tggccgtcct cagggctcag atggaggtgt actgctcaga ttttcacgct
1440 gagagagcag caagagagaa gattcatgaa gaaaaggagc agctggcctt
gcagctcgcg 1500 attttgctga aagagaacaa tgacattgaa gagggaggca
gtagacagtc
cctgatggaa 1560 atgcagtgcc gacacggggc aagaaccagt gactctgacc
agcagactta cctgtttcaa 1620 agaggagccg aggacaggag ctggcagcac
gggcagcagc ctcgcagtat tccgattcac 1680 tcctgcccca agtgcgggga
ggtcctgccg gacatcgaca cgcttcagat ccatgtgatg 1740 gactgcatca tt 1752
142 324 DNA Mus musculus 142 ctgaagaccc aggtggagca ggaagtggag
catctgaaga tccaggtgat gcgccttcgg 60 gctgaaaagg cagacctgct
gggcatcgtc tcagaactgc agctcaaact caactccggc 120 ggctcctcgg
aagactcctt cgttgagatc aggatgaccg aaggagagac tgaaggggca 180
atgaaggaga tgaagagctg ccctacaccc acaagaacag accccatcag cttgagcaac
240 tgtacagagg atgccaggag ttgtgcggag tttgaagaac tgactgtgag
ccagcttctg 300 ctttgcctaa gggaaggaaa ccaa 324 143 186 DNA Mus
musculus 143 catctgaaga tccaggtgat gcgccttcgg gctgaaaagg cagacctgct
gggcatcgtc 60 tcagaactgc agctcaaact caactccggc ggctcctcgg
aagactcctt cgttgagatc 120 aggatgaccg aaggagagac tgaaggggca
atgaaggaga tgaagaactg ccctacaccc 180 acaaga 186 144 186 DNA Mus
musculus 144 catctgaaga tccaggtgat gcgccttcgg gctgaaaagg cagacctgct
gggcatcgtc 60 tcagaactgc agctcaaact caactccggc ggctcctcgg
aagactcctt cgttgagatc 120 aggatgaccg aaggagagac tgaaggggca
atgaaggaga tgaagaactg ccctgcaccc 180 acaaga 186 145 186 DNA Mus
musculus 145 catctgaaga tccaggtgat gcgccttcgg gctgaaaagg cagacctgct
gggcatcgtc 60 tcagaactgc ggctcaaact caactccggc ggctcctcgg
aagactcctt cgttgagatc 120 aggatgaccg aaggagagac tgaaggggca
atgaaggaga tgaagaactg ccctacaccc 180 acaaga 186 146 306 DNA Mus
musculus 146 atgttgagtc gactgcagga gctccgcaag gaggaggaaa ccctgctgcg
tctaaaggcg 60 gctctacacg accaactgaa ccgcctcaag gttgaagaat
tagcccttca atccatgata 120 aattctcgag gaaggaccga gacactgtct
tctcagcctg cacctgaaca gttatgtgat 180 atgtccctac atgtagacaa
cgaagtgaca ataaatcaga ctacactgaa gctgagcaca 240 aggagcccta
tggaagaaga ggaggaggaa gaggaggagg aagaggagga ggaagaatct 300 gattcg
306 147 249 DNA Mus musculus 147 gccctaagga agtggaaggg gatgttgagt
cgactgcagg agctccgcaa ggaggtggaa 60 accctgctgc gtctaaaggc
ggctctacac gaccaactga accgcctcaa ggttgaagaa 120 ttagcccttc
aatccatgat aaattctcga ggaaggaccg agacactgtc ttctcagcct 180
gcacctgaac agttatgtga tatgtcccta catgtagaca acgaagtgac aataaatcag
240 actaggccg 249 148 237 DNA Mus musculus 148 aggaagtgga
aggggacgtt gagtcgactg caggagctcc gcaaggaggt ggaaaccccg 60
ctgcgtctaa aggcggctct acacgaccaa ctgaaccgcc tcaaggttga agaattagcc
120 cttcaatcca tgataaattc tcgaggaagg accgagacac tgtcttctca
gcctgcacct 180 gaacagttat gtgatatgtc cctacatgta gacaacgaag
tgacaataaa tcagact 237 149 1239 DNA Mus musculus 149 atgggagacg
acagaccgtt tgtgtgcagt gccccgggct gtggacagag atttacaaat 60
gaggaccacc tggcagttca taaacataag catgagatga cactgaaatt tggcccagcc
120 cgaacggact cagtcatcat tgcagatcaa acgcctactc caactagatt
cctgaagaac 180 tgtgaggaag tggggctctt caatgaacta gctagctcct
ttgaacatga atttaagaaa 240 gcttctgatg acgatgagaa aaagggtgct
gctgggcctc ttgacatgtc tctgccttct 300 acaccagaca tcaaaatcaa
ggaagaagag ccagtggaag tagactcatc gccccctgac 360 agtcctgctt
ctagcccctg ttccccacca ctgaaggaga aggaagttac cacaaaaccg 420
gttgtgatct ctacccctac acctaccatt gtacgtcctg gctccctgcc tctccactta
480 ggttatgatc cacttcatcc aactcttcct tccccaacct ctgtcatcac
acaggctcca 540 ccatccaaca ggcaaatagg atctcctact ggctccctcc
ctctcgtcat gcatcttgct 600 aatggacaga ccatgcctat gttgccaggg
cctccagtac agatgccttc tgttatttcg 660 ctggccagac ctgtgtccat
ggtgcccaac attcctggta tacctggccc accggttaac 720 aacagtggct
ccatttctcc ctctggccac cctatgccgt cagaagccaa aatgagacta 780
aaagccacgc tgacccatca agtttcttca atcaatggag gttgtggaat ggtggtgggt
840 actgcaagca ccatggtgac tgcccgtcca gagcaaaacc agatcctcat
ccagcaccca 900 gatgccccat cccctgccca gccacaggtc tctccagctc
agcccacccc tagcactggg 960 ggacggcgac ggcgtacagt ggatgaagat
ccagatgagc ggcggcagcg gtttttagag 1020 cgaaacagag ctgcagcctc
tcgatgccgg caaaagcgga aactgtgggt gtcctccctg 1080 gaaaagaagg
cagaagaact tacttctcag aacattcagc tgagtaatga agtcacatta 1140
ctacgcaatg aggtggctca gctgaagcag ctactgttag ctcataaaga ttgcccagtc
1200 actgcactac agaaaaagac tcaaggctac ctaggtaag 1239 150 168 DNA
Mus musculus 150 cgatgccggc aaaagcggaa actgtgggtg tcctccctgg
aaaagaaggc agaagaactt 60 acttctcaga acattcagct gagtaatgaa
gtcacattac tacgcaatga ggtggctcag 120 ctgaagcagc tactgttagc
tcataaagat tgtccagtca ccgcacaa 168 151 3093 DNA Mus musculus 151
atgacaaacc caaaaggaaa gaggagaggt actcagtcta tgttctctag gccttttagg
60 aaacatggag ttgtttcttt ggccacatac atgcgaatct acaagaagcg
tgatattgta 120 gacatcaagg gaatgggcac tgttcaaaaa ggaatgccct
gtaagtgtta ccacggcaaa 180 accggaagag tctacaatgt cacccagcat
gccatgggca tcattgtaaa caagcaagtt 240 aaaggcaaga ttctggccaa
gaggatcaat gtgcagattg agcacatcaa gcactcgaag 300 agcagagacg
gcttcctgaa gcagggagag gccgcccatt tcgaatacct gctgtaccca 360
cttcactcag cgtccatcac gggcctggcc acctgcatcc gaaaacccct cattgccacc
420 tgctccctgg atcgctccgt tcgcatctgg aattacgagt cgaattcctc
ctgctgcaag 480 gctctcagag aagacctttg gctcctcctc cttttccaca
tcactgcccc ggccaccctt 540 agctccccac cagtgatatt cttctgcaca
ctggagctat ataaggaata ccaagaagag 600 gcctacacgg tcagccttca
cccctccgga cactacattg tggtggggtt tgctgacaaa 660 cttcgcctta
tgaacctgct cattgatgac atccgttctt tcaaagaata ttctgtcaga 720
ggatgcaaag agtgtgcctt tagcaatgga ggtcacctgt ttgctgccgt caatggtaat
780 gtgattcaca tcttcaccac cacgaacctg gagaatatca acaacctgaa
aggccacaca 840 gggaagaggg agacagagtg tgtactcaag gtctgtagtt
acaactcggt cactatctcc 900 cctgacggca aagttatctt cgctgttgga
tcagaccaga ctcttaagga gatcgccgat 960 tctttgatcc ttcgagagat
accagcattt gatgtcgtct acacggccat caccatctca 1020 cattccggac
gcatgatatt cgtgggcact tcagtgggga ctatccgtgc catgaagtac 1080
ccgctgcctc tgcagagaga attcaatgag taccaggctc acgctggccc cgtcacgaag
1140 atactgctca ccttcgatga ccagttcctg ctgacggtct ctgaggatgg
ctgcctgttc 1200 acctggaaag tctttgataa ggagggtcgg ggaatcaaac
gagagaggga ggtgggcttt 1260 gctgaagagg tactcgtgac taagacagac
atggaggaga agatactcca caggaactta 1320 gcaacggaat tcagaaggcc
aatgagcaag caccttgagt gtcccacatc ggaaactggg 1380 ccactcacaa
caataaatat ctccccggtc cagcccaggc cttggggcca tgtactcacc 1440
tgcagaacac ccgtcagcac tgacagtgct gttgcgtcta caagaggctc tgtggacagc
1500 gcagtgaagc cagataggtc aactccaacc caggaagtcc gcatcccacc
aaagccagcc 1560 tcgggagtcc acaccaggtg ccagttagga gtacagaaac
agatggaaca cgtttctgtt 1620 gtcatggagg tacgagaaac aaaccggcag
agacagggtg ggggtgcgcg gaatgtaatc 1680 aaggctcaga tcatgctgga
gctgaagacg cgtgtagagg aactgaaaat ggagaacgag 1740 tatcagctcc
ggctgaagga catgaactac tcagagaaga tcaaggagct gacagacaag 1800
ttcatccagg agatggagtc cttgaagacg aagaaccagg ttttaaaaac agagaaagaa
1860 aaacaggaca tcagtcaccg agagcactta gaagacctca tagaaagaca
gagccgggag 1920 ctgcaagacc tggaatgttg taacaaccag aagctgctcc
ttgaatacga gaagtaccag 1980 gagctgcagc tcaagtccca gaggatgcag
gaggagtacg aaaaacagct ccgagacaat 2040 gatgagacca agagccaagc
actggaggag ctgaccgagt tctacgaggc caaactccag 2100 gagaaaaccg
gccttctgga agaggccctc agcacagcag cctcaccacc ccttccctca 2160
gcacacgttc tctctccctt ccccactctg agccaggcac aggaagatgt ccgacagcag
2220 ctccgggaat tcgaggaaac caagaagcag attgaagaag atgaggacag
ggagatccaa 2280 gacatcaaaa ccaagtatga gagaaagctt cgagatgaaa
aggagtccaa ccttcggctt 2340 aagggagaaa caggaatcat gaggaagaag
ttcagcagcc tgcagaagga gatcgaagag 2400 cgcaccaatg acatcgagct
cctcaaaacg gagcaggtga agctgcaggg ggtcatcagg 2460 tccctagaga
aggacatcca aggactcaag agagagatcc aggagaggga tgagaccatt 2520
caagacaagg agaagcgaat ttatgatctg aagaagaaga accaagagtt agagaaattc
2580 aaatttgtcc ttgactacaa aataaaggaa ctgaagaagc aaatagaacc
aagggagaac 2640 gagatcaaag tgatgaagga gcagatccag gagaaccctg
tcaatcactg gctcagaagc 2700 agggagagag aatgtgtcac acagccaagg
catctgcggc ttccagctcc ccagaacaag 2760 ttagatggga atttagcttg
tggaccggta agaggtcggt tgtgccactc agacgcgacc 2820 tcaggggccc
tgaatgttca gggcatcctt tgtctcttcc acctgccatt tccctgtgat 2880
aggacgccat ctttcttccc cggagaagct tgtctcctgg ttttctctct tctgatagat
2940 gttctatgta gacccacctc tgacgtacca gtcgctgctg gcgattttct
tccgtgtggc 3000 ggacctctgc acttgcctcc agagctgcac caccttacag
tcatccggac caatgccagc 3060 ccacagaaat gctacccacc caccagtcct ctg
3093 152 210 DNA Mus musculus 152 aagaagttca gcagcctgca gaaggagatc
gaagagcgca ccaacgacat cgagctcctc 60 aagtcggagc ggatgaagct
gcagggcatc atcagatccc tggagaaaga catccaaggg 120 ctcaagagag
agatccagga gagggacgag accattcaag acatggagaa gcttgactac 180
aaggacgatt ataattcaaa cctagagatc 210 153 168 DNA Mus musculus 153
aagaagttca gcagcctgca gaaggagatc gaagagcgca ccaacgacat cgagctcctc
60 aagtcggagc ggatgaagct gcagggcatc atcagatccc tggagaaaga
catccaaggg 120 ctcaagagag agatccagga gagggacgag accattcaag acatggag
168 154 651 DNA Mus musculus 154 atggaagtag aaaacgaagc ccactgctgc
cctggcagct catcaggcgg gtccagagag 60 tacaaggtgg taatgctggg
cgcagggggc gttggtaaaa gcgcagtcac aatgcagttt 120 ataagccacc
agttcccgga ctatcacgac cccacaatcg aagatgctta taaaacccag 180
gtgaggattg ataatgagcc tgcttactta gacatcttgg acactgctgg tcaggcagag
240 ttcacggcca tgcgggagca gtacatgcgt gggggagagg gcttcatcat
ctgctattct 300 gtcactgacc gccagtcatt ccaggaggct gccaagttca
aggagcttat tttccaggtc 360 cgtcacacct atgaaattcc ccttgtgcta
gtgggtaaca aaattgactt ggagcagttc 420 cgtcaggtat ctacagaaga
aggcatgaat cttgctcgag actacaactg tgccttcttt 480 gagacatctg
cagccctgcg attcggtatc gatgatgctt ttcaaggctt agtgagagaa 540
attcgcagga aggaatccat gctgtccttg gtggaaagga aattgaagag gaaggacagc
600 ctgtggaaga agataaaagc ctccctgaag aagaagagag aaaacatgtt g 651
155 150 DNA Mus musculus 155 gcagccctgc gattcggtat cgatgatgct
cttcaaggct tagtgagaga aattcgcagg 60 aaggaatcca tgctgccctt
ggtggaaagg aaattgaaga ggaaggacag cctgtggaag 120 aagataaaag
cctccctgaa gaagaagagg 150 156 420 DNA Mus musculus 156 ggtgccacag
ttattacaaa cctcctatca gccatcccat atattggaac aaccctagtc 60
gaatgaattt gagggggctt ctcagtagac aaagccacct tgacccgatt cttcgctttc
120 cacttcatct taccatttat tatcgcggcc ctagcaatcg ttcacctcct
cttcctccac 180 gaaacaggat caaacaaccc aacaggatta aactcagatg
cagataaaat tccatttcac 240 ccctactata caatcaaaga tatcctaggt
atcctaatca tattcttaat tctcataacc 300 ctagtattat ttttcccaga
catactagga gacccagaca actacatacc agctaatcca 360 ctaaacaccc
caccccatat taaacccgaa tgatatttcc tatttgcata cgccattcta 420 157 123
DNA Mus musculus 157 tcagatgcag ataaaattcc atttcacccc tactatacaa
tcaaaaatat cctaggtatc 60 ctaatcatat tcttaattct cataacccta
gtattatttt tcccagacat actaggagac 120 cca 123 158 933 DNA Mus
musculus 158 atgaaggctc tgtgggccgt gctgttggtc acattgctga caggatgcct
agccgaggga 60 gagccggagg tgacagatca gctcgagtgg caaagcaacc
aaccctggga gcaggccctg 120 aaccgcttct gggattacct gcgctgggtg
cagacgctgt ctgaccaggt ccaggaagag 180 ctgcagagct cccaagtcac
acaagaactg acggcactga tggaggacac tatgacggaa 240 gtaaaggctt
acaaaaagga gctggaggaa cagctgggtc cagtggcgga ggagacacgg 300
gccaggctgg gcaaagaggt gcaggcggca caggcccgac tcggagccga catggaggat
360 ctacgcaacc gactcgggca gtaccgcaac gaggtgcaca ccatgctggg
ccagagcaca 420 gaggagatac gggcgcggct ctccacacac ctgcgcaaga
tgcgcaagcg cttgatgcgg 480 gatgccgatg atctgcagaa gcgcctagct
gtgtacaagg caggggcacg cgagggcgcc 540 gagcgcggtg tgagtgccat
ccgtgagcgc ctggggcctc tggtggagca aggtcgccag 600 cgcactgcca
acctaggcgc tggggccgcc cagcctctgc gcgatcgcgc ccaggctttt 660
ggtgaccgca tccgagggcg gctggaggaa gtgggcaacc aggcccgtga ccgcctagag
720 gaggtgcgtg agcacatgga ggaggtgcgc tccaagatgg aggaacagac
ccagcaaata 780 cgcctgcagg cggagatctt ccaggcccgc ctcaagggct
ggttcgagcc aatagtggaa 840 gacatgcatc gccagtgggc aaacctgatg
gagaagatac aggcctctgt ggctaccaac 900 cccatcatca ccccagtggc
ccaggagaat caa 933 159 90 DNA Mus musculus 159 acggaagtaa
aggcttacaa aaaggagctg gaggaacagc tgggtccagt ggcggaggag 60
acacgggcca ggctgggcaa agaggagcag 90 160 2085 DNA Mus musculus 160
atgctgccca gcttggcact gctcctgctg gccgcctgga cggttcgggc tctggaggta
60 cccactgatg gcaacgccgg gctgctggca gaaccccaga tcgccatgtt
ctgtggtaaa 120 ctcaacatgc acatgaatgt gcagaatgga aagtgggagt
cagacccgtc agggaccaaa 180 acctgcattg gcaccaagga gggcatcttg
cagtactgcc aagaggtcta ccctgaactg 240 cagatcacaa acgtggtgga
agccaaccag ccagtgacca tccagaactg gtgcaagcgg 300 ggccgcaagc
agtgcaagac acacacccac atcgtgattc cttaccgttg cctagttggt 360
gagtttgtga gcgacgccct tctcgtgccc gacaagtgca agttcctaca ccaggagcgg
420 atggatgttt gtgagaccca tcttcactgg cacaccgtcg ccaaagagac
atgcagcgag 480 aagagcacta acttgcacga ctatggcatg ctgctgccct
gcggcatcga caagttccga 540 ggggtagagt ttgtatgctg cccgttggcc
gaggaaagcg acagcgtgga ttctgcggat 600 gcagaggagg atgactctga
tgtctggtgg gttggagcgg acacagacta cgctgatggc 660 ggtgaagaca
aagtagtaga agtcgccgaa gaggaggaag tggctgatgt tgaggaagag 720
gaagctgatg atgatgagga tgtggaggat ggggacgagg tggaggagga ggccgaggag
780 ccctacgaag aggccaccga gagaacaacc agcactgcca ccaccaccac
aaccaccact 840 gagtccgtgg aggaggtggt ccgagttccc acgacagcag
ccagcacccc cgacgccgtc 900 gacaagtacc tggagacacc cggggacgag
aacgagcatg cccatttcca gaaagccaaa 960 gagaggctgg aagccaagca
ccgagagaga atgtcccagg tcatgagaga atgggaagag 1020 gcagagcgtc
aagccaagaa cttgcccaaa gctgacaaga aggccgttat ccagcatttc 1080
caggagaaag tggaatctct ggaacaggaa gcagccaatg agagacagca gcttgtagag
1140 acacacatgg ccagagttga agccatgctc aatgaccgcc gccgcctgga
cctcgagaat 1200 tacatcatcg cactgcaggc ggtgccccca aggcctcatc
atgtgttcaa catgctgaag 1260 aagtacgtcc gtgcggagca gaaagacaga
cagcacaccc taaagcattt tgaacatgtg 1320 cgcatggtgg accccaagaa
agctactcag atccggtccc aggttatgac acacctccgt 1380 gtgatctacg
agcgcatgaa ccagtctctg tccctgctct acaatgtccc tgcggtggct 1440
gaggagattc aagatgaagt cgatgagctg cttcagaagg agcagaacta ctccgacgat
1500 gtcttggcca acatgatcag tgagcccaga atcagctacg gaaacgacgc
tctcatgcct 1560 tcgctgacgg aaaccaagac caccgtggag ctccttcccg
tgaatgggga attcagcctg 1620 gatgacctcc agccgtggca cccttttggg
gtggactctg tgccagccaa taccgaaaat 1680 gaagtcgagc ctgttgacgc
ccgccccgct gctgaccgag gactgaccac tcgaccaggt 1740 tctgggctga
caaacatcaa gacggaagag atctcggaag tgaagatgga tgcagaattc 1800
ggacatgatt caggatttga agtccgccat caaaaactgg tgttctttgc tgaagatgtg
1860 ggttcgaaca aaggcgccat catcggactc atggtgggcg gcgttgtcat
agcaaccgtg 1920 attgtcatca ccctggtgat gttgaagaag aaacagtaca
catccatcca tcatggcgtg 1980 gtggaggtcg acgccgccgt gaccccagag
gagcgccatc tctccaagat gcagcagaac 2040 ggatatgaga atccaactta
caagttcttt gagcaaatgc agaac 2085 161 201 DNA Mus musculus 161
agtgagccca gaatcagcta cggaaacgac gctctcatgc cttcgctgac ggaaaccaag
60 accaccgtgg agctccttcc cgtgaatggg gaattcagcc tggatgacct
ccagccgtgg 120 cacccttctg gggtggactc tgtgccagcc aataccgaaa
atgaggtcga gcctgttgac 180 gcccgccccg ctgctgaccg a 201 162 1236 DNA
Mus musculus 162 atggcgaacg tggccgacac gaagctgtac gacatcctgg
gcgtccctcc cggcgctagc 60 gagaacgagc tgaagaaggc ataccgaaag
ttagccaaag aataccaccc tgataagaat 120 ccaaatgctg gagacaaatt
taaagaaata agttttgcat atgaagtatt gtcaaatcca 180 gagaagcgag
agctgtatga cagatatgga gaacaaggcc tacgggaagg cagcggcgga 240
ggcggtggaa tggatgatat cttctcacat atttttggtg gaggattgtt tggctttatg
300 ggcaatcaga gtagaagtcg aaatggcaga agaagaggcg aggacatgat
gcatccacta 360 aaagtatctt tagaagacct gtacaatggc aagacaacca
aactacaact tagcaagaat 420 gtgctctgta gtgcatgcag tggccaaggt
gggaagtctg gagctgttca gaaatgcagc 480 gcttgtcggg gtcgaggtgt
gcgcattatg atcagacagc tggctccagg aatggtgcag 540 cagatgcagt
ccgtgtgctc cgactgtaat ggagaagggg aggtcatcaa tgaaaaagac 600
cgctgtaaaa aatgtgaagg gaagaaggta atcaaagaag tcaagattct ggaagtccat
660 gtagacaaag gcatgaaaca tggacagagg attacgttca ctggggaagc
agaccaggct 720 ccaggagtgg aacctggaga tattgttctt ttgctacagg
aaaaagaaca tgaggtgttc 780 cagagagatg ggaatgattt gcatatgaca
tataagatag gacttgttga agctttatgt 840 ggatttcagt tcacatttaa
acatcttgat gctcgtcaga ttgtggtgaa atacccccct 900 ggcaaagtaa
ttgaaccagg atgtgttcgt gttgttcgag gtgaaggaat gccacagtat 960
cgtaatccct ttgaaaaggg tgatctttac ataaagtttg atgtacagtt tcctgagaat
1020 aactggatca acccagacaa actttctgaa ttagaagatc tcctgccatc
tagaccagaa 1080 gttcctaatg ttattggaga gacagaagaa gtggagcttc
aggaatttga tagcactcga 1140 ggctctggcg gtggtcagag acgtgaagcc
tataatgata gctctgatga agaaagtagc 1200 agccatcatg gacctggagt
gcagtgtgcc catcag 1236 163 75 DNA Mus musculus 163 ttgtcaaatc
cagagaagcg agagctgtat gacagatatg gagaacaagg cctacgggaa 60
ggcagcggag gaggc 75 164 2949 DNA Mus musculus 164 atgaaggctc
agcaggccat ggacaaatat gaaggagata gcaaggcgag ggagacccgg 60
agcacagcgg ccatggtagg ctggagaagt gacagaggcc tggtgacttg caccaggctt
120 aggatgcaga atggtagctc actaaaagct tttcggagca gggtggggaa
gtggggagaa 180 ccttcctcga gatcacacaa agtgttgaag accagcgaaa
cttcacaaga tatccagaag 240 gtttctagag aggaaagccc ttcccagctg
acttctgccg tgcctgccca gaggaactgc 300 cagcccggca gcgctgccgt
tatcaatatg cttcgcgggg gaggcggtgt tagaagccca 360 tggacagatc
accacatcag gcagcggacg gatcaccaca tccggcagcc cttgtttcca 420
agccgccggt ctccacagga gaatgaggac gatgacgacg attaccagat gttcgtcccg
480 tccttttctt cctcggatct caactccacc aggttgtgtg aagagaacgc
ttcaagtcgc 540 ccttgcagct ggcatctggg cctcatcgaa cccacagaga
tctccagctc tggtcacagg 600 atagtccgac gggccagtag tgcaggtgaa
agcaacgcat gccctcctga agtaagaatt 660 agagactgtg atgactctca
gtactgtccc gggagacagc tgcagaacag tcctcgacca 720 ggaggagaga
gggggatgac tccctatggg tcgtctgtag agttgaccat tgatgatata 780
gaccatgtct atgataacat aagttttgaa gacttaaagt taatggttgc taagcgggat
840 gaaactgaat gttctttctc caaaccatcc agagactctg ttcgccccaa
gagtacccca 900 gagttagcct tctcaaagag
acaggttagc cacagtacaa gctctctcca ctcaaggaaa 960 gaggcaggcc
tcggtggtca agaggcatcc acccaaagcg tacatgaaca ccaggaagtg 1020
gaagaaaaca tctatgacac catagggctt ccagacccac catcgatgaa cttgaaccac
1080 agcagccttc atcagcccaa aaggagcacc ttcctgggtc tggaagctga
ttttgcatgc 1140 tgtgacagcc tgagaccatt tgtctcccag gatagcctcc
agttcagtga ggatgacata 1200 tcttaccacc agggaccctc cgatactgaa
tatttgagtt tgttatatga ctctccccgc 1260 tgtaatctgc ctattgcaga
taaggctctg tccgacaaat tgtctgaaga agtagatgaa 1320 atctggaatg
atctggaaaa ttacatcaag aaaaatgaag acaaatcaag agaccgcctc 1380
cttgcggcct ttcctgtgag caaagacgat gcaccagaga ggctatatgt tgacagcacc
1440 catgagctgg gcagggatac aggacatgcc acatctatgc tggcccttcc
tacaagccag 1500 actttcctcc tgcctgggaa gagcagagtt gtcagagcta
gcagggccaa ctgctccctg 1560 gataatgaca ttatttcaac agaaggttcc
ttcctgagtc ttaaccaact ctctctggcc 1620 agtgatgggc ctcctgtgga
caatccctat gacctggcca actgcagcct gccccagaca 1680 gacccagaaa
accctgaccc cgggatggag gtcacagaca agactaagag cagggtcttt 1740
atgatggcca ggcagtatag tcaaaagatc aagaaggtaa atcagatttt gaaagtgaaa
1800 agcccagagt tggagcaacc accgtccagt cagcataggc ccagtcacaa
agacctggtg 1860 gccatcttgg aagagaagag gcaaggaggg cccgccattg
gtgccaggat cgctgaatat 1920 tcccaactgt atgaccagat tgtgttcaga
gagacacccc ttaaagctca gaaggatggc 1980 tgggccagcc cccaaggacc
caccctccac aggcctgtgt cacctcccca ggcccagggc 2040 gctggtgagg
actggctctg gcattcgccc tacagtaacg gagagttggc agatttctct 2100
ccccagacag aacaagactc aaaatcaaaa taccccatca cattagagag caccacaaaa
2160 attaggccca ggcagttgtc gggtgcttgt tcggtgccgt ctctccaagt
gtcggaccct 2220 ctgctgggct ctgtgcagca gagatgcagc gtggtggtca
gccagcccca caaagagaac 2280 tcaggtcaga gtcctcttta caactcgctg
ggtcgcaaag caatcagcgc taaaccccag 2340 ccttatagca ggcctcagtc
atcttcctca atcttgataa acaaatctct ggactctatc 2400 aactacccca
gtgagacaga gacgaagcaa ctactctctt cacagaaaag tcccagaggc 2460
gcgagccagc aggatttgcc gtcagggcta gcaaactcat gtcaacagga caggggcaaa
2520 cggtccgatc tcacgctcca agactcgcag aaggttctcg tggtaaatag
aaatttaccc 2580 ttaagcgctc aaatagcgac gcagaactac ttttgtaatt
tcaaagatcc cgagggagat 2640 gaagatgact atgtggaaat caagtcagaa
gaggacgaag tgcgtctgga tctctctcca 2700 aggcggggca ggaagtctga
cccacagacc ccggaccctg actgttcgga tagcatctgt 2760 agccacagca
caccttactc tttgaaggag ccagtgagtg gcaggcttgg gcttcctcct 2820
tacctgacag catgtaagga ctctgataaa ctgaatgatt atctgtggag ggggccctca
2880 cccaatcagc aaaacattgt ccaatctctg agggagaaat ttcagtgtct
tagctcaagc 2940 agctttgcc 2949 165 138 DNA Mus musculus 165
tttatgatgg ccaggcagta tagtcaaaag atcaagaagg taaatcagat tttgaaagtg
60 aaaagcccag agttggagca accaccgtcc agtcagcata ggcccagtca
caaagacctg 120 gcggccatct tggaaaag 138 166 87 DNA Mus musculus 166
gcacactcat tctccgtttt cagactcccg agttggtgga ttgtgggctg gtggagcaaa
60 ggtggagtag gctctgattt agaaatg 87 167 105 DNA Mus musculus 167
cctgatataa aacatccagg aaatctggaa cactatataa aaagagtaaa cctaagaata
60 atagcaatag aagaaggaga aaaatcccag ctcaaaggcc cgaaa 105 168 5172
DNA Artificial Sequence vector CMV-FosCBPzz 168 atgcattagt
tattaatagt aatcaattac ggggtcatta gttcatagcc catatatgga 60
gttccgcgtt acataactta cggtaaatgg cccgcctggc tgaccgccca acgacccccg
120 cccattgacg tcaataatga cgtatgttcc catagtaacg ccaataggga
ctttccattg 180 acgtcaatgg gtggagtatt tacggtaaac tgcccacttg
gcagtacatc aagtgtatca 240 tatgccaagt acgcccccta ttgacgtcaa
tgacggtaaa tggcccgcct ggcattatgc 300 ccagtacatg accttatggg
actttcctac ttggcagtac atctacgtat tagtcatcgc 360 tattaccatg
gtgatgcggt tttggcagta catcaatggg cgtggatagc ggtttgactc 420
acggggattt ccaagtctcc accccattga cgtcaatggg agtttgtttt ggcaccaaaa
480 tcaacgggac tttccaaaat gtcgtaacaa ctccgcccca ttgacgcaaa
tgggcggtag 540 gcgtgtacgg tgggaggtct atataagcag agctggttta
gtgaaccgtc agatccgcta 600 gcgattacgc caagctcgaa attaaccctc
actaaaggga acaaaagctg gagctccacc 660 gcggtggcgg ccgctctagc
ccgggcggat cacgatcccg cgaaattaat acgactcact 720 ataggggaat
tgtgagcgga taacaattcc cctctagaaa taattttgtt taactttaag 780
aaggagatat accatggcta gcatgactgg tggacagcaa atgggtcgcg gatccggcag
840 agcgcagagc atcggcagaa ggggcaaagt agagcagcta tctcctgaag
aggaagagaa 900 acggagaatc cgaagggaac ggaataagat ggctgcagcc
aagtgccgga atcggaggag 960 ggagctgaca gatacactcc aagcggagac
agatcaactt gaagatgaga agtctgcgtt 1020 gcagactgag attgccaatc
tgctgaaaga gaaggaaaaa ctggagttta ttttggcagc 1080 ccaccgacct
gcctgcaaga tccccgatga ccttggcctc gagctcaaga gaagatggaa 1140
aaagaatttc atagccgtct cagcagccaa ccgctttaag aaaatctcat cctccggggc
1200 acttggatca gattatgata ttccaactac tgctagcgag aatttgtatt
ttcagggtgg 1260 taccaaaacc gcggctcttg cgcaacacga tgaagccgta
gacaacaaat tcaacaaaga 1320 acaacaaaac gcgttctatg agatcttaca
tttacctaac ttaaacgaag aacaacgaaa 1380 cgccttcatc caaagtttaa
aagatgaccc aagccaaagc gctaaccttt tagcagaagc 1440 taaaaagcta
aatgatgctc aggcgccgaa agtagacaac aaattcaaca aagaacaaca 1500
aaacgcgttc tatgagatct tacatttacc taacttaaac gaagaacaac gaaacgcctt
1560 catccaaagt ttaaaagatg acccaagcca aagcgctaac cttttagcag
aagctaaaaa 1620 gctaaatgat gctcaggcgc cgaaagtaga cgcgaattct
agctctgtac cccatcacca 1680 tcaccatcac taagtcgact tcgatcgccc
ttcccaacag ttgcgcagcc tgaatggcga 1740 atggagatcc aatttttaag
tgtataatgt gttaaactac tgattctaat tgtttgtgta 1800 ttttagattc
acagtcccaa ggctcatttc aggcccctca gtcctcacag tctgttcatg 1860
atcataatca gccataccac atttgtagag gttttacttg ctttaaaaaa cctcccacac
1920 ctccccctga acctgaaaca taaaatgaat gcaattgttg ttgttaactt
gtttattgca 1980 gcttataatg gttacaaata aagcaatagc atcacaaatt
tcacaaataa agcatttttt 2040 tcactgcatt ctagttgtgg tttgtccaaa
ctcatcaatg tatcttaacg cgtaaattgt 2100 aagcgttaat attttgttaa
aattcgcgtt aaatttttgt taaatcagct cattttttaa 2160 ccaataggcc
gaaatcggca aaatccctta taaatcaaaa gaatagaccg agatagggtt 2220
gagtgttgtt ccagtttgga acaagagtcc actattaaag aacgtggact ccaacgtcaa
2280 agggcgaaaa accgtctatc agggcgatgg cccactacgt gaaccatcac
cctaatcaag 2340 ttttttgggg tcgaggtgcc gtaaagcact aaatcggaac
cctaaaggga gcccccgatt 2400 tagagcttga cggggaaagc cggcgaacgt
ggcgagaaag gaagggaaga aagcgaaagg 2460 agcgggcgct agggcgctgg
caagtgtagc ggtcacgctg cgcgtaacca ccacacccgc 2520 cgcgcttaat
gcgccgctac agggcgcgtc aggtggcact tttcggggaa atgtgcgcgg 2580
aacccctatt tgtttatttt tctaaataca ttcaaatatg tatccgctca tgagacaata
2640 accctgataa atgcttcaat aatattgaaa aaggaagaat cctgaggcgg
aaagaaccag 2700 ctgtggaatg tgtgtcagtt agggtgtgga aagtccccag
gctccccagc aggcagaagt 2760 atgcaaagca tgcatctcaa ttagtcagca
accaggtgtg gaaagtcccc aggctcccca 2820 gcaggcagaa gtatgcaaag
catgcatctc aattagtcag caaccatagt cccgccccta 2880 actccgccca
tcccgcccct aactccgccc agttccgccc attctccgcc ccatggctga 2940
ctaatttttt ttatttatgc agaggccgag gccgcctcgg cctctgagct attccagaag
3000 tagtgaggag gcttttttgg aggcctaggc ttttgcaaag atcgatcaag
agacaggatg 3060 aggatcgttt cgcatgattg aacaagatgg attgcacgca
ggttctccgg ccgcttgggt 3120 ggagaggcta ttcggctatg actgggcaca
acagacaatc ggctgctctg atgccgccgt 3180 gttccggctg tcagcgcagg
ggcgcccggt tctttttgtc aagaccgacc tgtccggtgc 3240 cctgaatgaa
ctgcaagacg aggcagcgcg gctatcgtgg ctggccacga cgggcgttcc 3300
ttgcgcagct gtgctcgacg ttgtcactga agcgggaagg gactggctgc tattgggcga
3360 agtgccgggg caggatctcc tgtcatctca ccttgctcct gccgagaaag
tatccatcat 3420 ggctgatgca atgcggcggc tgcatacgct tgatccggct
acctgcccat tcgaccacca 3480 agcgaaacat cgcatcgagc gagcacgtac
tcggatggaa gccggtcttg tcgatcagga 3540 tgatctggac gaagaacatc
aggggctcgc gccagccgaa ctgttcgcca ggctcaaggc 3600 gagcatgccc
gacggcgagg atctcgtcgt gacccatggc gatgcctgct tgccgaatat 3660
catggtggaa aatggccgct tttctggatt catcgactgt ggccggctgg gtgtggcgga
3720 ccgctatcag gacatagcgt tggctacccg tgatattgct gaagaacttg
gcggcgaatg 3780 ggctgaccgc ttcctcgtgc tttacggtat cgccgctccc
gattcgcagc gcatcgcctt 3840 ctatcgcctt cttgacgagt tcttctgagc
gggactctgg ggttcgaaat gaccgaccaa 3900 gcgacgccca acctgccatc
acgagatttc gattccaccg ccgccttcta tgaaaggttg 3960 ggcttcggaa
tcgttttccg ggacgccggc tggatgatcc tccagcgcgg ggatctcatg 4020
ctggagttct tcgcccaccc tagggggagg ctaactgaaa cacggaagga gacaataccg
4080 gaaggaaccc gcgctatgac ggcaataaaa agacagaata aaacgcacgg
tgttgggtcg 4140 tttgttcata aacgcggggt tcggtcccag ggctggcact
ctgtcgatac cccaccgaga 4200 ccccattggg gccaatacgc ccgcgtttct
tccttttccc caccccaccc cccaagttcg 4260 ggtgaaggcc cagggctcgc
agccaacgtc ggggcggcag gccctgccat agcctcaggt 4320 tactcatata
tactttagat tgatttaaaa cttcattttt aatttaaaag gatctaggtg 4380
aagatccttt ttgataatct catgaccaaa atcccttaac gtgagttttc gttccactga
4440 gcgtcagacc ccgtagaaaa gatcaaagga tcttcttgag atcctttttt
tctgcgcgta 4500 atctgctgct tgcaaacaaa aaaaccaccg ctaccagcgg
tggtttgttt gccggatcaa 4560 gagctaccaa ctctttttcc gaaggtaact
ggcttcagca gagcgcagat accaaatact 4620 gtccttctag tgtagccgta
gttaggccac cacttcaaga actctgtagc accgcctaca 4680 tacctcgctc
tgctaatcct gttaccagtg gctgctgcca gtggcgataa gtcgtgtctt 4740
accgggttgg actcaagacg atagttaccg gataaggcgc agcggtcggg ctgaacgggg
4800 ggttcgtgca cacagcccag cttggagcga acgacctaca ccgaactgag
atacctacag 4860 cgtgagctat gagaaagcgc cacgcttccc gaagggagaa
aggcggacag gtatccggta 4920 agcggcaggg tcggaacagg agagcgcacg
agggagcttc cagggggaaa cgcctggtat 4980 ctttatagtc ctgtcgggtt
tcgccacctc tgacttgagc gtcgattttt gtgatgctcg 5040 tcaggggggc
ggagcctatg gaaaaacgcc agcaacgcgg cctttttacg gttcctggcc 5100
ttttgctggc cttttgctca catgttcttt cctgcgttat cccctgattc tgtggataac
5160 cgtattaccg cc 5172 169 70 DNA Artificial Sequence PCR primer
5'SP6(O29)T7-FosCBPzz 169 gaatttaggt gacactatag aacaacaaca
acaacaaaca acaacaaaat ggctagcatg 60 actggtggac 70 170 20 DNA
Artificial Sequence PCR primer 3'FosCBPzz 170 ggatctccat tcgccattca
20 171 89 DNA Artificial Sequence DNA beit template DNA-Fos/Jun 171
cgactctgac ggcagtttac gtgactcatg agtcatgact catgagtcat gactcatgag
60 tcacgttaga acgcggctac aattaatac 89 172 21 DNA Artificial
Sequence PCR primer 5'DNA 172 cgactctgac ggcagtttac g 21 173 26 DNA
Artificial Sequence PCR primer 3'DNA 173 gtattaattg tagccgcgtt
ctaacg 26 174 67 DNA Artificial Sequence main chain of adaptor
(O29) 174 gaacaacaac aacaacaaac aacaacaaaa tgactggtgg acagcaaatg
ggtgcggccg 60 cgaattc 67 175 68 DNA Artificial Sequence main chain
of adaptor (O29-2) 175 gaacaacaac aacaacaaac aacaacaaaa tggctagcat
gactggtgga cagcaaatgg 60 cgaattcc 68 176 32 DNA Artificial Sequence
random primer for reverse transcription 176 tcatcgtcct tgtagtcaag
cttnnnnnnn nn 32 177 58 DNA Artificial Sequence PCR 5' primer (O29)
177 ggaagatcta tttaggtgac actatagaac aacaacaaca acaaacaaca acaaaatg
58 178 36 DNA Artificial Sequence PCR 3' primer 178 ttttttttct
tgtcgtcatc gtccttgtag tcaagc 36 179 3851 DNA Artificial Sequence
pDrive vector 179 gcgcccaata cgcaaaccgc ctctccccgc gcgttggccg
attcattaat gcagctggca 60 cgacaggttt cccgactgga aagcgggcag
tgagcgcaac gcaattaatg tgagttagct 120 cactcattag gcaccccagg
ctttacactt tatgcttccg gctcgtatgt tgtgtggaat 180 tgtgagcgga
taacaatttc acacaggaaa cagctatgac catgattacg ccaagctcta 240
atacgactca ctatagggaa agctcggtac cacgcatgct gcagacgcgt tacgtatcgg
300 atccagaatt cgtgatatct gaattcgtcg acaagcttct cgagcctagg
ctagctctag 360 accacacgtg tgggggcccg agctcgcggc cgctgtattc
tatagtgtca cctaaatggc 420 cgcacaattc actggccgtc gttttacaac
gtcgtgactg ggaaaaccct ggcgttaccc 480 aacttaatcg ccttgcagca
catccccctt tcgccagctg gcgtaatagc gaagaggccc 540 gcaccgatcg
cccttcccaa cagttgcgca gcctgaatgg cgaatggaaa ttgtaagcgt 600
taatattttg ttaaaattcg cgttaaattt ttgttaaatc agctcatttt ttaaccaata
660 ggccgaaatc ggcaaaatcc cttataaatc aaaagaatag accgagatag
ggttgagtgt 720 tgttccagtt tggaacaaga gtccactatt aaagaacgtg
gactccaacg tcaaagggcg 780 aaaaaccgtc tatcagggcg atggcccact
acgtgaacca tcaccctaat caagtttttt 840 ggggtcgagg tgccgtaaag
cactaaatcg gaaccctaaa gggagccccc gatttagagc 900 ttgacgggga
aagccggcga acgtggcgag aaaggaaggg aagaaagcga aaggagcggg 960
cgctagggcg ctggcaagtg tagcggtcac gctgcgcgta accaccacac ccgccgcgct
1020 taatgcgccg ctacagggcg cgtcaggtgg cacttttcgg ggaaatgtgc
gcggaacccc 1080 tatttgttta tttttctaaa tacattcaaa tatgtatccg
ctcatgagac aataaccctg 1140 ataaatgctt caataatatt gaaaaaggaa
gagtatgagt attcaacatt tccgtgtcgc 1200 ccttattccc ttttttgcgg
cattttgcct tcctgttttt gctcacccag aaacgctggt 1260 gaaagtaaaa
gatgctgaag atcagttggg tgcacgagtg ggttacatcg aactggatct 1320
caacagcggt aagatccttg agagttttcg ccccgaagaa cgttttccaa tgatgagcac
1380 ttttaaagtt ctgctatgtg gcgcggtatt atcccgtatt gacgccgggc
aagagcaact 1440 cggtcgccgc atacactatt ctcagaatga cttggttgag
tactcaccag tcacagaaaa 1500 gcatcttacg gatggcatga cagtaagaga
attatgcagt gctgccataa ccatgagtga 1560 taacactgcg gccaacttac
ttctgacaac gatcggagga ccgaaggagc taaccgcttt 1620 tttgcacaac
atgggggatc atgtaactcg ccttgatcgt tgggaaccgg agctgaatga 1680
agccatacca aacgacgagc gtgacaccac gatgcctgta gcaatggcaa caacgttgcg
1740 caaactatta actggcgaac tacttactct agcttcccgg caacaattaa
tagactggat 1800 ggaggcggat aaagttgcag gaccacttct gcgctcggcc
cttccggctg gctggtttat 1860 tgctgataaa tctggagccg gtgagcgtgg
gtctcgcggt atcattgcag cactggggcc 1920 agatggtaag ccctcccgta
tcgtagttat ctacacgacg gggagtcagg caactatgga 1980 tgaacgaaat
agacagatcg ctgagatagg tgcctcactg attaagcatt ggtaactgtc 2040
agaccaagtt tactcatata tactttagat tgatttaaaa cttcattttt aatttaaaag
2100 gatctaggtg aagatccttt ttgataatct catgaacaat aaaactgtct
gcttacataa 2160 acagtaatac aaggggtgtt atgagccata ttcaacggga
aacgtcttgc tctaggccgc 2220 gattaaattc caacatggat gctgatttat
atgggtataa atgggctcgc gataatgtcg 2280 ggcaatcagg tgcgacaatc
tatcgattgt atgggaagcc cgatgcgcca gagttgtttc 2340 tgaaacatgg
caaaggtagc gttgccaatg atgttacaga tgagatggtc agactaaact 2400
ggctgacgga atttatgcct cttccgacca tcaagcattt tatccgtact cctgatgatg
2460 catggttact caccactgcg atccccggga aaacagcatt ccaggtatta
gaagaatatc 2520 ctgattcagg tgaaaatatt gttgatgcgc tggcagtgtt
cctgcgccgg ttgcattcga 2580 ttcctgtttg taattgtcct tttaacagcg
atcgcgtatt tcgtctcgct caggcgcaat 2640 cacgaatgaa taacggtttg
gttgatgcga gtgattttga tgacgagcgt aatggctggc 2700 ctgttgaaca
agtctggaaa gaaatgcata aacttttgcc attctcaccg gattcagtcg 2760
tcactcatgg tgatttctca cttgataacc ttatttttga cgaggggaaa ttaataggtt
2820 gtattgatgt tggacgagtc ggaatcgcag accgatacca ggatcttgcc
atcctatgga 2880 actgcctcgg tgagttttct ccttcattac agaaacggct
ttttcaaaaa tatggtattg 2940 ataatcctga tatgaataaa ttgcagtttc
atttgatgct cgatgagttt ttctaagaat 3000 taattcatga ccaaaatccc
ttaacgtgag ttttcgttcc actgagcgtc agaccccgta 3060 gaaaagatca
aaggatcttc ttgagatcct ttttttctgc gcgtaatctg ctgcttgcaa 3120
acaaaaaaac caccgctacc agcggtggtt tgtttgccgg atcaagagct accaactctt
3180 tttccgaagg taactggctt cagcagagcg cagataccaa atactgtcct
tctagtgtag 3240 ccgtagttag gccaccactt caagaactct gtagcaccgc
ctacatacct cgctctgcta 3300 atcctgttac cagtggctgc tgccagtggc
gataagtcgt gtcttaccgg gttggactca 3360 agacgatagt taccggataa
ggcgcagcgg tcgggctgaa cggggggttc gtgcacacag 3420 cccagcttgg
agcgaacgac ctacaccgaa ctgagatacc tacagcgtga gctatgagaa 3480
agcgccacgc ttcccgaagg gagaaaggcg gacaggtatc cggtaagcgg cagggtcgga
3540 acaggagagc gcacgaggga gcttccaggg ggaaacgcct ggtatcttta
tagtcctgtc 3600 gggtttcgcc acctctgact tgagcgtcga tttttgtgat
gctcgtcagg ggggcggagc 3660 ctatggaaaa acgccagcaa cgcggccttt
ttacggttcc tggccttttg ctggcctttt 3720 gctcacatgt tctttcctgc
gttatcccct gattctgtgg ataaccgtat taccgccttt 3780 gagtgagctg
ataccgctcg ccgcagccga acgaccgagc gcagcgagtc agtgagcgag 3840
gaagcggaag a 3851 180 58 DNA Artificial Sequence primer 5'F3 180
ggaagatcta tttaggtgac actatagaac aacaacaaca acaaacaaca acaaaatg 58
181 27 DNA Artificial Sequence primer 3'R3 181 ttttttttct
cgagcttgtc gtcatcg 27 182 18 DNA Artificial Sequence primer Optn_F
182 tgggcatcgt ctcagaac 18 183 18 DNA Artificial Sequence primer
Optn_R 183 tgtgggtgta gggcagtt 18 184 18 DNA Artificial Sequence
primer SNAP19_F 184 aaaccctgct gcgtctaa 18 185 19 DNA Artificial
Sequence primer SNAP19_R 185 atcatggatt gaagggcta 19 186 17 DNA
Artificial Sequence primer C130020M04RIK_F 186 ggtgtcctcc ctggaaa
17 187 20 DNA Artificial Sequence primer C130020M04RIK_R 187
tgggcaatct ttatgagcta 20 188 18 DNA Artificial Sequence primer
Rattus FLJ32000_F 188 aagagcgcac caatgaca 18 189 20 DNA Artificial
Sequence primer Rattus FLJ32000_R 189 tcttgaatgg tctcatccct 20 190
21 DNA Artificial Sequence primer 5'M13_F 190 gttttcccag tcacgacgtt
g 21 191 24 DNA Artificial Sequence primer 3'M13_R 191 gaaacagcta
tgaccatgat tacg 24
* * * * *