U.S. patent application number 10/507780 was filed with the patent office on 2007-02-08 for repressor of skeletal muscle differentiation, nucleic acid coding therefor and the use thereof in diagnosis and therapy.
This patent application is currently assigned to Universitaetsklinikum Freiburg. Invention is credited to Philip Hublitz, Roland Schuele.
Application Number | 20070031395 10/507780 |
Document ID | / |
Family ID | 27815811 |
Filed Date | 2007-02-08 |
United States Patent
Application |
20070031395 |
Kind Code |
A1 |
Schuele; Roland ; et
al. |
February 8, 2007 |
Repressor of skeletal muscle differentiation, nucleic acid coding
therefor and the use thereof in diagnosis and therapy
Abstract
Polypeptide sequences which play a part in the regulation of
skeletal muscle differentiation, and nucleic acids coding therefor,
and the use thereof in diagnosis and therapy are disclosed.
Possible uses are also indicated for antibodies which are directed
against corresponding epitopes of the GRIM1 polypeptide.
Inventors: |
Schuele; Roland; (Weisweil,
DE) ; Hublitz; Philip; (Freiburg, DE) |
Correspondence
Address: |
HELLER EHRMAN WHITE & MCAULIFFE LLP
1717 RHODE ISLAND AVE, NW
WASHINGTON
DC
20036-3001
US
|
Assignee: |
Universitaetsklinikum
Freiburg
Hugstetter Strasse 49
Freiburg
DE
D-79106
|
Family ID: |
27815811 |
Appl. No.: |
10/507780 |
Filed: |
March 13, 2003 |
PCT Filed: |
March 13, 2003 |
PCT NO: |
PCT/EP03/02638 |
371 Date: |
November 2, 2004 |
Current U.S.
Class: |
424/94.2 ;
435/184; 435/320.1; 435/325; 435/456; 435/69.1; 536/23.2 |
Current CPC
Class: |
C07K 14/4703 20130101;
C07K 16/18 20130101; C07K 14/4716 20130101; A61K 38/00
20130101 |
Class at
Publication: |
424/094.2 ;
435/069.1; 435/184; 435/320.1; 435/325; 536/023.2; 435/456 |
International
Class: |
A61K 38/54 20060101
A61K038/54; C07H 21/04 20060101 C07H021/04; C12P 21/06 20060101
C12P021/06; C12N 9/99 20060101 C12N009/99; C12N 15/86 20060101
C12N015/86 |
Foreign Application Data
Date |
Code |
Application Number |
Mar 20, 2002 |
DE |
102 12 397.7 |
Claims
1. A polypeptide comprising at least 20 consecutive amino acids
from the sequence of hsGRIM1 having the sequence ID No. 3.
2. The polypeptide as claimed in claim 1, comprising at least 40
consecutive amino acids.
3. The polypeptide as claimed in claim 1, comprising at least 80
consecutive amino acids.
4. The polypeptide as claimed in claim 1, comprising at least 120
consecutive amino acids.
5. The polypeptide as claimed in claim 1, comprising at least 200
consecutive amino acids.
6. A polypeptide comprising at least 20 consecutive amino acids
from the mmGRIM sequence having the sequence ID No. 4.
7. The polypeptide comprising at least 40 consecutive amino
acids.
8. The polypeptide as claimed in claim 6, comprising at least 120
consecutive amino acids.
9. The polypeptide as claimed in claim 6, comprising at least 200
consecutive amino acids.
10. An antibody that binds specifically to an epitope of a
polypeptide as claimed in any of claims 1 to 9.
11. A medicament comprising a polypeptide as claimed in any of
claims 1 to 9, and a pharmaceutically acceptable excipient.
12. The medicament as claimed in claim 11, that is employed for the
treatment of disturbances of skeletal muscle differentiation.
13. The medicament as claimed in claim 11, characterized that is
employed for the treatment of disturbances of fat cell
differentiation.
14. A method for identifying substances which influence the
biological function of a polypeptide as claimed in any of claims 1
to 9, comprising contacting the substance to be identified with
said polypeptide in a test system.
15. A cDNA which codes for hsGRIM1, comprising at least 1200
consecutive nucleotides from the sequence ID No.1.
16. A cDNA which codes for mmGRIM, comprising at least 1200
consecutive nucleotides from the sequence ID No.2.
17. A transfection vector comprising a cDNA as claimed in claim 15
or 16.
18. The transfection vector as claimed in claim 17, which is an
adenoviral vector.
19. A cell transfected with a vector as claimed in claim 17.
20. The cell of claim 19, wherein said vector is an
adenovector.
21. A method of treating a disturbance of skeletal muscle
differentiation comprising administering to a subject in need
thereof an effective amount of the medicament of claim 12.
22. A method of treating disturbances of fat cell differentiation
comprising administering to a subject in need thereof an effective
amount of the medicament of claim 13.
Description
[0001] The present invention relates to a novel corepressor whose
function has been substantially elucidated. Various possible uses
of this corepressor are disclosed.
[0002] The designation GRIM1 which has been chosen for the
corepressor of the invention represents the abbreviation of the
designation "Global Repressor Involved in Myogenic
Differentiation". This name was chosen because the cloned factor
plays an important part in particular during skeletal muscle
differentiation.
[0003] In recent years a large number of molecules which prove to
be responsible for direct transcriptional regulation has been
found. Besides the DNA-binding factors (in the narrower sense
referred to as transcription factors), there are regulatory
molecules which, as coactivators, facilitate gene expression or
which, as corepressors, actively bring about transcriptional
repression. Cofactors are promiscuous and can be recruited in
various combinations by various DNA-binding partners. The
alternation between associated coactivator and corepressor
complexes is an important regulatory step within cellular
differentiation processes.
[0004] For the example of skeletal muscle differentiation, over the
course of years a cascade of transcription factors which are
necessary for myogenesis has been found. Besides other
transcription factors, the bHLH proteins MyoD, myf5, MRF4 and
myogenin bring about execution, in a proliferating myoblast
precursor cell, of the genetic program which results in a
terminally differentiated functional skeletal muscle cell. During
the course of this phylogenesis, the cell passes through a cell
cycle arrest, and it fuses with other committed muscle cells to
give multinuclear myoblasts and expresses skeletal muscle-specific
structural and metabolic enzymes.
[0005] The associated cofactors acting within this process have
been investigated only in recent years. Positively regulated
expression of myogenically specific genes takes place for example
through a functional association of MyoD with the coactivator
protein p300. Acetylated MyoD has far greater transcriptional
activity than unmodified protein.
[0006] On the other hand, there are certain corepressors which
actively prevent targeted repression of the expression of
muscle-specific genes in proliferating cells. Thus, for example, a
functional association between MyoD and the corepressor N-CoR has
been described, and in this way revealed a new field of activity
for the cofactors which had previously been assigned only to
nuclear hormone receptors. In addition, members of the histone
deacetylase (HDAC) family have been described as being involved in
skeletal muscle differentiation (C2C12 cell culture model) and
showing a subcellular relocalization during the first steps of
differentiation.
[0007] Association of HDACs with active repression is regarded as
one of the basic requirements for negative transcriptional
regulation, because the reaction catalyzed by the HDACs is
transmitted directly to the chromatin and provides a mechanistic
explanation for the observed effects.
[0008] The data shown in this application describe GRIM1 as a novel
"bona fide" transcriptional repressor with a novel mechanism of
repression. GRIM1 is not associated with an HDAC activity, and the
GRIM1-mediated repression cannot be influenced by specific HDAC
inhibitors. GRIM1 is able to inhibit directly, via acidic domains
in the N and C termini, the acetylation of histone N termini (one
of the preconditions for directed transcriptional activation).
Repression domains which have not to date been characterized in
detail within GRIM1 likewise have high potential for repression of
transcription activity. GRIM1 has the potential in transient
transfections for dose-dependent repression of both complex and
synthetic minimal promoters. Based on these data, the newly cloned
factor has been designated GRIM1 (Global Repressor Involved in
Myogenic differentiation).
[0009] GRIM1 shows relocalization during skeletal muscle
differentiation, although at a distinctly later time than
previously described for the HDACs 4, 5 and 7 in connection with
MEF2-specific transcription. The available data suggest that the
repression potential of GRIM1 must not be switched off until later
times during skeletal muscle differentiation so that terminal
differentiation of the myotubes is possible. In addition, GRIM1
likewise shows a subcellular change of localization during
preadipocyte differentiation.
[0010] One function of GRIM1 is involvement as repressor in
regulating the differentiation of skeletal muscle cells. The data
show that GRIM1 shows a distinctly different role than the involved
factors previously described. Elucidation of the in vivo role of
GRIM1 takes place via GRIM1 knockout animals and transgenic mice.
In the transgenic animals, GRIM1 mutants will express under
skeletal muscle-specific promoters which can no longer reach the
nucleus or which can no longer be transported out of the nucleus.
The phenotype of the mice which results from the generated atypical
localization of GRIM1 will contribute further to understanding the
function of GRIM1 within differentiation processes.
[0011] Application of the present invention is particularly in the
area of degenerative muscle disorders or in controlling muscle
atrophy in the aging man. The therapeutic modification of the
function of GRIM1 may in such cases counteract premature muscle
degeneration or actively intervene in processes which build up
muscle.
[0012] The present invention therefore relates firstly to
polypeptides having a sequence of at least 20 consecutive amino
acids from the sequence of hsGRIM1 (hs=homo sapiens) having the
sequence ID No. 3.
[0013] The polypeptides of the invention preferably have a sequence
of at least 40 consecutive amino acids. The polypeptides more
preferably have a sequence of at least 80 consecutive amino acids,
even more preferably a sequence of at least 120 consecutive amino
acids and most preferably the polypeptides of the invention have a
sequence of at least 200 consecutive amino acids from Seq. ID No.
3.
[0014] The invention further relates to polypeptides which have a
sequence of at least 20 consecutive amino acids from the mmGRIM
sequence (mm=mus musculus) having the Seq. ID No. 4.
[0015] These polypeptides more preferably have a sequence of at
least 40 consecutive amino acids, even more preferably a sequence
of at least 120 consecutive amino acids and very particularly
preferably a sequence of at least 200 consecutive amino acids of
Seq. ID No. 4.
[0016] The present invention further relates to antibodies which
bind specifically to an epitope of a polypeptide of the invention.
The antibodies of the invention can be prepared by customary
standard methods. Either the polypeptides coding for GRIM1 can be
used to immunize suitable laboratory animals such as, for example,
rabbits or goats, and it is possible in this way to prepare
suitable polyclonal antibodies. The antibodies of the invention may
be directed against epitopes such as conformational epitopes, but
also against peptides of the invention.
[0017] As an alternative to this it is possible to prepare suitable
monoclonal antibodies by methods which have been well known since
the publication by Kohler and Milstein in 1975 and belong to the
standard repertoire of an average molecular biologist.
[0018] Antibodies of this type can be employed in therapy. However,
to do this, it is usually necessary to prepare humanized
antibodies, because antibodies having constituents derived from
animals may cause unwanted side effects. If the binding regions of
a suitable antibody have been sequenced, these sequences can be
incorporated into a human basic antibody structure, and an antibody
of this type can be used in therapy.
[0019] An alternative area of use of antibodies of this type is in
diagnosis. It is perfectly possible for simple polyclonal
antibodies, with which qualitative and/or quantitative detection of
the presence of GRIM1 polypeptides in the cells or cell
compartments of interest is possible, to suffice for this purpose.
The average skilled worker can carry out suitable diagnostic
methods with antibodies of this type. The present invention further
relates to medicaments which comprise a polypeptide of the
invention.
[0020] These medicaments are preferably employed for the treatment
of disturbances of skeletal muscle differentiation and for the
treatment of disturbances of fat cell differentiation.
[0021] The medicaments of the invention may comprise a polypeptide
having the complete GRIM1 sequence or a suitable part of the
sequence. The medicaments are administered in a suitable form to
the patients to be treated. Oral or preferably parenteral
pharmaceutical formulations are suitable in this connection.
[0022] The mechanism of action of the medicaments of the invention
relies on modulation of the function of GRIM1. During skeletal
muscle differentiation, GRIM1 is transported from the nucleus into
the cytoplasm. The medicaments of the invention can intervene in
this differentiation process by either specifically blocking
importation or else blocking exportation. This can be achieved by
incorporating the suitable segments of the polypeptides into
appropriate formulations so that the target site of interest can be
reached.
[0023] It has also been found within the scope of the present
invention that GRIM1 is involved in further differentiation
processes. GRIM1 shows a subnuclear relocalization during adipocyte
differentiation. Involvement of GRIM1 in various other regulatory
processes emerges from the findings of the present invention, in
particular from the experimental data which show that GRIM1 shows a
repression of a wide variety of complexes and synthetic promoters,
and expression of GRIM1 is ubiquitous.
[0024] The present invention further relates to a method for
identifying substances which influence the biological function of a
polypeptide of the invention. This entails the substance to be
identified being brought into contact with the polypeptide in a
test system. It is possible with this test system to identify
substances which interact with the GRIM1 polypeptide and inhibit or
enhance the biological function of GRIM1.
[0025] The systems used according to the invention are preferably
cellular differentiation systems which make use in particular of
myogenic C2C12 cells or rapidly and inducibly differentiating
10T.sub.1/2 MyoD-ER cells. Detection of GRIM1 or the GRIM1
localization is possible for example immunohistologically. Suitable
methods are, inter alia, the use of fluorescence-coupled or
enzyme-coupled antibodies in a direct coloring step.
[0026] It is also possible to provide suitable cell lines which
comprise an appropriate genetic construct with a GRIM1 gene. The
substances to be investigated can then be brought into contact with
the cells, and the effect of the substance to be investigated can
be determined by appropriate comparative tests.
[0027] It is possible in a further embodiment to introduce suitable
genetic constructs into laboratory animals and to use a transgenic
animal for the test methods.
[0028] It is possible in a further embodiment of the invention to
incubate recombinantly produced GRIM1 fragments in cell-free test
systems with substances to be tested, and to obtain the desired
result of the test by use of a suitable reporter molecule.
[0029] A further aspect of the invention is represented by the cDNA
which codes for hsGRIM1 (homo sapiens) and which has a sequence of
at least 1200 consecutive nucleotides from Seq. ID No. 1.
[0030] Another aspect of the invention is represented by the cDNA
which codes for mmGRIM1 (mus musculus) and which has a sequence of
at least 1200 consecutive nucleotides from Seq. ID No. 2.
[0031] The cDNAs of the invention can be employed in transfection
vectors which have a cDNA of the invention. A preferred possibility
in this connection is an adenoviral vector.
[0032] These transfection vectors are used to produce cells which
have been transfected with such a vector.
[0033] In the examples which are described in detail below, besides
customary standard methods, the following methods were preferably
used, and the underlined sections of the methods are explained in
more detail hereinafter:
[0034] Method 1 (Transient and Stable Transfection of Cell Culture
Cells):
[0035] The cells were cultured in the media recommended by the ATCC
on 15 cm cell culture dishes under sterile conditions. Cells are
subcultured in a ratio of 1:5 to 1:20 every two to three days and
kept at a maximum of 60% confluence. Old medium is removed, and the
cells are washed with PBS buffer and treated with trypsin/EDTA
solution (0.25% trypsin, 0.04% EDTA in PBS buffer) until they are
detached from the culture dish. The cells are isolated in fresh
medium and subcultured on new subculture dishes. TABLE-US-00001 PBS
buffer: 137 mM NaCl; 2.7 mM KCl; 8 mM Na.sub.2HPO.sub.4; 1.8 mM
KH.sub.2PO.sub.4 Media used: DMEM (Gibco BRL), 100 IU penicillin
G/ml, 100 .mu.g/ml streptomycin, 2 mM glutamine, 10% fetal calf
serum. As differentiation medium for C2C12 and L6 cells, 10% FCS
was replaced by 2% horse serum. 10T1/2 MyoD-ER cells were cultured
in hormone-free serum, and 1 .mu.M estradiol, 10 .mu.g/ml insulin
and 10 .mu.g/ml transferrin were supplemented as differentiation
medium.
[0036] To determine the transcriptional properties of hsGRIM1
proteins in eukaryotic cells, pCMX-hsGRIM1 expression plasmids were
cotransfected with luciferase (LUC) reporter plasmids. Expression
of the particular hsGRIM1 protein in a pCMX expression plasmid is
under the control of the cytomegalovirus promoter. Luciferase
expression in the LUC reporter plasmids used is controlled either
by the thymidine kinase (TK) promoter, an E1b minimal promoter
(TATA box), or by the respectively indicated complex promoters.
[0037] Transfection takes place by the calcium phosphate method or
by the DOTAP liposomes method (Roche Diagnostics, product
information). Unless otherwise indicated, 10.sup.5 cells/well were
seeded in 1 ml of culture medium in 12-well dishes. In the
CaPO.sub.4 method, the plasmid DNA to be transfected is
coprecipitated with calcium phosphate, and the precipitate is
distributed to the cells capable of division. The calcium phosphate
precipitate is prepared by preparing the mixture indicated below
for duplicates in 12-well dishes: TABLE-US-00002 Reporter plasmid 1
.mu.g hsGRIM1 expression plasmid 20-400 ng Carrier DNA (pUC18) ad 8
.mu.g CaCl.sub.2 solution (2.5 M) 20 .mu.l ddH.sub.2O ad 80
.mu.l
[0038] While agitating (vortexing), 80 .mu.l of 2.times. BES buffer
are slowly and uniformly added dropwise to this mixture. The
precipitate is incubated at RT for 10 min and distributed uniformly
to the cells (80 .mu.l per well) and, after 8 hours, removed by
washing with PBS. The cells are incubated in fresh culture medium
for a further 18 to 24 h. TABLE-US-00003 BES buffer (2x): 50 mM
N,N-bis[2hydroxyethyl]-2- aminoethanesulfonic acid; 280 mM NaCl;
1.5 mM Na.sub.2HPO.sub.4.2H.sub.2O; pH 6.95
[0039] Stable transfection of cell culture cells is carried out
with the pcDNA6-His system supplied by Invitrogen. Cells are
transfected as described above, and stable integration of the
plasmids used is detected by the selection antibiotic blasticidin
S. Individual clones are picked, expanded and tested by Western
blotting and immunodetection for stable expression.
[0040] Method 2 (Protein-Protein Interactions):
[0041] Yeast two-hybrid interaction assays, mammalian THS,
pull-down analyses, immunoprecipitation of cellular proteins
followed by immunodetection or in combination with in vitro
translated .sup.35S-methionine-labeled interactants, Phast gel
system.
[0042] Glutathione S-transferase Pulldown
[0043] A protein is expressed as GST fusion protein in E. coli,
immobilized on a GST-binding carrier material, and incubated with
the potential interactant which is usually radiolabeled. In the
subsequent washing steps, proteins which interact only weakly or
not at all are washed off the carrier material or the carrier
material-bound GST fusion protein. The amount of associated
interactant can be determined by fractionation of the proteins by
SDS-PAGE and subsequent autoradiography. To check the specificity
of interaction between potential interactant and GST fusion
protein, the interaction with GST alone is examined.
[0044] Aliquots of GST-GRIM1 fragments or GST-containing E. coli
crude lysate are mixed in each case with 30 .mu.l of
glutathione(GSH)-Sepharose and incubated on a bohemian wheel at
4.degree. C. for 1 h. The GSH-Sepharose is pelleted by
centrifugation (1' at 2000 rpm) and washed twice in pulldown
buffer. The pellet is then resuspended in 500 .mu.l of pulldown
buffer and incubated with the in vitro translated,
.sup.35S-methionine-labeled potential interactant at 4.degree. C.
on the bohemian wheel for 1 h. After washing three times with
pulldown buffer, the GSH-Sepharose pellet is boiled in 30 .mu.l of
SDS sample buffer, the proteins are fractionated by SDS-PAGE, and
the gel is dried and then subjected to autoradiography.
TABLE-US-00004 Pulldown buffer: 20 mM HEPES (pH 7.7); 150 mM KCl;
0.1 mM EDTA; 25 mM MgCl.sub.2; 10 mM DTT; 0.15% (v/v) NP40.
[0045] Immunoprecipitation
[0046] If two proteins interact in solution, the stable complex of
the two proteins can be precipitated with an antibody which is
directed against only one of the two interactants. In this case,
the protein-antibody complex is isolated from the solution with the
aid of an antibody-binding carrier material (.gamma.-bind G
Sepharose, Pharmacia). To investigate interactions between GRIM1
and putative interactants, either native cell extracts or
GRIM1-containing cell extracts are mixed with in vitro translated,
.sup.35S-methionine-labeled target proteins in 500 .mu.l of IP
buffer. 5 .mu.g portions of anti-GRIM1 or nonspecific antibodies
(rabbit IgG.sub.1) are mixed with 30 .mu.l of preequilibrated
.gamma.-bind G Sepharose, and the mixture is incubated on a
bohemian wheel at 4.degree. C. for 60 min. The G-Sepharose is then
pelleted at 2000 rpm for 1' and washed three times with 300 .mu.l
each time of ice-cold IP buffer and finally the pellet is boiled in
30 .mu.l of SDS sample buffer. The samples were finally separated
by SDS-PAGE, and interaction was detected either by Western
blotting or, in the case of the proteins translated in vitro in the
presence of .sup.35S-methionine, by autoradiography. TABLE-US-00005
IP buffer: 20 mM TrisHCl (pH 8.0), 300 mM NaCl, 5 mM EDTA, 0.3%
(v/v) NP40, 0.5 mM Pefablock Cell lysis buffer: 50 mM TrisHCl (pH
8.0), 170 mM NaCl, 0.1% NP40, 50 mM NaF, 2 mM NaO.sub.3V.sub.4, 0.2
mM DTT, 0.1 mM Pefablock, 1 .mu.g/ml aprotinin, 10% glycerol
[0047] Method 3 (Detection of Intracellular Localization):
[0048] Indirect immunofluorescence, cell fractionation followed by
immunodetection.
[0049] Indirect Immunofluorescence (IIF)
[0050] Cells are seeded in the desired cell density on autoclaved
slides and fixed on the support with 5% paraformaldehyde for 10'.
Cell membranes are made permeable for antibodies by incubation in
0.2% TritonX100 for 10'. Before the incubation with the primary
antibodies, the cells are blocked in 0.2% gelatin (in PBS) at RT
for 1 h. Anti-hsGRIM1 antibodies are diluted 1:500 in 0.2% strength
gelatin solution and incubated at RT for 1 h. After washing three
times with PBS for 5' each time, the respective
fluorochrome-coupled secondary antibody is added 1:2000 in 0.2%
gelatin solution and incubated in the dark at RT for 30'. After the
washing steps, the nucleus is counterstained with DAPI (1 .mu.g/ml
PBS, Roche Diagnostics) and the cells are finally washed twice in
0.1% TritonX100 in order to minimize autofluorescences. The slides
are preserved in Fluoromount M (Southern Biotechnology Associates)
and analyzed using a fluorescence microscope.
[0051] Cell Fractionation
[0052] Intact nuclei can be isolated and lyzed separately by
hypotonic lysis of the cell membrane. Cells are harvested in
ice-cold PBS, pelleted and swollen in hypotonic lysis buffer (LB).
The cell membrane is permeated by adding NP40 ad 0.1%, and the
cytoplasmic constituents are released by gentle shaking and
incubation on ice for 15'. Intact nuclei are pelleted by
centrifugation at 14 000 rpm for 15', the CP fraction is removed
quantitatively, and the protein concentration is determined. Nuclei
are lyzed by incubation in nuclear lysis buffer (NLB) while
vortexing for 15'. Debris is pelleted and the soluble nuclear
preparation is removed. The quality of the preparation is examined
by Western blotting and immunodetection with nucleus-specific (e.g.
TIF2) and cytoplasmic (e.g. RasGAP) markers. TABLE-US-00006 LB: 10
mM HEPES (pH 7.9), 1.5 mM MgCl.sub.2, 10 mM KCl, 0.5 mM DTT, 0.5 mM
Pefablock NLB: 20 mM HEPES (pH 7.9), 420 mM NaCl, 1.5 mM
MgCl.sub.2, 0.2 mM EDTA, 0.5 mM DTT, 0.5 mM Pefablock, 10%
glycerol
[0053] Method 4 (Detection of Enzymatic Activities):
[0054] HDAC assay, INHAT assay, luciferase assays.
[0055] HDAC Assay
[0056] The histone deacetylation reactions which were carried out
were carried out within the framework of a kit obtained from
Upstate Biotechnology (.sup.3H-labeled histone H3 N-terminus as
substrate). Alternative substrates used were histones
.sup.14C-acetyl-labeled with p300 or in vivo labeled histones
(kindly provided by Dr Martin Gottlicher, Kernforschungszentrum
Karlsruhe).
[0057] INHAT Assays
[0058] INHAT assays were carried out as described by Eckner et al.
and by Seo et al. (Eckner et al., 1996, Genes and Development 10
(19), pp 2478-2490, Seo et al., 2001, Cell 104 (1), pp 119-130).
Histones or histone mixtures are preincubated with the appropriate
peptides in a total of 50 .mu.l of HAT buffer, then mixed with
bacterially expressed p300 HAT domains and 0.5 .mu.Ci of
.sup.14C-acetyl-coenzyme A, and incubated at 30.degree. C. and 1000
rpm for 2 h. Reactions are stopped by adding SDS Laemmli buffer.
Proteins are fractionated in denaturing SDS-PAGE, and the gels are
incubated in Amplify (NEN) for 1 h. Acetylations are detected in
autoradiographies.
[0059] Luciferase Assays
[0060] Transiently transfected cells are washed once with PBS and
lyzed by adding 50 .mu.l of reporter lysis buffer (Promega) and
further disrupted by freezing at -80.degree. C. Cell lysates are
transferred into Eppendorf reaction vessels, and cell detritus is
pelleted at 14 000 rpm for 2'. 10 .mu.l portions of the lysates are
pipetted into 96-well microtiter plates to determine the luciferase
activity. The luciferase assay took place in an EG&G Berthold
Microluminomat. 40 .mu.l of luciferase assay buffer are injected
into the 10 .mu.l of cell lysate, and the resulting
chemiluminescence is determined in a time integral of 10''. The
measured values for the luciferase activity are related to the
amount of protein present in the cell lysate. For this purpose, 3
.mu.l portions of the lysates are mixed in 96-well microtiter
plates with 100 .mu.l of Bradford assay reagent (BioRad) diluted
1:5 with H.sub.2O, and the absorption of the solution is determined
at 595 nm. Relative luciferase activities are found from the ratio
of the luciferase activity to the protein absorption.
[0061] Luciferase assay buffer: 200 mM tricine; 1.07 mM
(MgCO.sub.3).4Mg(OH) .sub.2; 0.1 M MgSO.sub.4, 10 mM EDTA (pH 8.0);
33 mM DTT; 0.5 M ATP; 270 mM acetyl-coenzyme A; 470 mM glowworm
luciferin.
[0062] The invention is explained in more detail by the following
examples:
EXAMPLE 1
[0063] The cloned human GRIM1 gene includes 2250 base pairs and
codes for 749 amino acids. The cloned human GRIM1 sequence obtained
by sequencing was deposited for the first time as unknown cDNA in
1999 under the accession number AL050019 (protein DKFZp564C186) as
part of a "Random EST sequencing" project of the DKFZ in
Heidelberg. A sequence which had been revised further was deposited
in March 2001 (Wiemann et al., 2001). The function of the protein
DKFZp564c186 has not previously been described. The cDNA of the
invention is depicted in FIG. 1. The sequence has Seq. ID No.
1.
EXAMPLE 2
[0064] It was possible to calculate from mouse EST databases by
comparative analysis with the hsGRIM1 aa sequence from independent
ESTs a full-length protein which has a total of 750 amino acids and
a total of 60% identical and more than 85% homologous residues to
the known hsGRIM1 sequence. Analysis of mouse cDNA libraries and
mouse genomic libraries (Celera) have confirmed the calculated and
expected mmGRIM1 aa sequence. It was possible to confirm the
putative cDNA by RT-PCR analyses. The full-length cDNA of mmGRIM1
has not yet been described or deposited. The sequence having Seq.
ID No. 2 is depicted in FIG. 2.
[0065] Seq. ID No. 2 was obtained by screening mmEST dbs with the
hsGRIM1 aa sequence. The resulting data were compared with mm cDNA
dbs and mouse genomic dbs and the virtual mmGRIM1 cDNA was
constructed. The sequence was confirmed by RT-PCR and genomic
sequencing, and by a direct sequence comparison with the mouse
genomic database of Celera Inc.
EXAMPLE 3
[0066] Demonstration of homology between hs and mmGRIM1 at the cDNA
and at the protein level is shown in FIG. 3. FIG. 3 depicts a
comparative demonstration of the homology of the mmGRIM1 and
hsGRIM1 amino acid sequences including conserved sequence
motifs.
[0067] The amino acid sequence of mmGRIM1 (mouse) is Seq. ID No. 4
and the amino acid sequence of hsGRIM1 (human) is Seq. ID No.
3.
[0068] All the functional and potentially interesting sequence
motifs described in detail below are sequence-identically conserved
in the mouse and the human protein (nuclear localization sequence
NLS, nuclear export signal NES, so-called CoRNR box with which
corepressors with their associated transcription factors are able
to interact). Further motifs are HMG box-like domains within the C
terminus and N terminus of GRIM1 of both species, which may
possibly bring about contact with DNA and/or histones. In addition,
in total 5 so-called LXXLL or related motifs are present within the
primary hsGRIM1 aa sequence, an alpha-helical structure which has
been reported to be responsible solely for contact of cofactors
with their associated transcription factors. Within the mmGRIM1
primary sequence, only 2 of these motifs are conserved vis-a-vis
the human sequence. No functional data are available on these
motifs as yet. According to calculations, hsGRIM1 mainly has an
alpha-helical structure (Columbia University PHD predict program)
and is non-globular (GLOBE of the Columbia University
PredictProtein server). GRIM1 is localized in the nuclei of
proliferating cells, while GRIM1 is relocated from the nucleus to
the cytoplasm within cellular differentiation systems (e.g. in
C2C12, L6, 10T1/2 and 3T3-L1 cells).
EXAMPLE 4
[0069] For the purposes of the present invention, only hsGRIM1 was
used on the basis of cDNA. However, it is assumed that
corresponding results will be obtained on use of mouse cDNA.
Various constructs were produced by deletion- and sequence-specific
point mutagenesis from the human GRIM1 for detailed functional
characterization and are compiled in the following tables.
TABLE-US-00007 TABLE 1 hsGRIM1 point mutants used Name aa positions
Constructs used .DELTA.NES I L309K, L312A -. FLAG, His, Xpress, GFP
.DELTA.NESII L309R, L312A, L315Q, -. FLAG, His, Xpress, GFP L317A
.DELTA.NLS I K649A, R650L, R651E, -. FLAG, His, Xpress, GFP K652A
.DELTA.NLS II K649A, R650L, R651E, -. FLAG, His, Xpress, GFP K652A,
R661D, K662I, K665E .DELTA.CoRNR I439A, I440S, I443A -. FLAG
[0070] TABLE-US-00008 TABLE 2 hsGRIM1 deletion mutants used Name aa
positions Constructs used GRIM1 1-749 jcdc5 mut A 3-609 Gal4-DBD,
FLAG, GFP, GST, His mut B 3-487 Gal4-DBD, GST, His mut C 3-423
Gal4-DBD, GST, His mut D 3-245 Gal4-DBD, GST, His mut E 3-147
Gal4-DBD, GST, His mut F 145-749 Gal4-DBD, GST, His mut G 246-749
Gal4-DBD, GST, His mut H 485-749 Gal4-DBD, GST, His mut I 609-749
Gal4-DBD, FLAG, GFP, GST, His mut J 145-609 Gal4-DBD, His mut M
468-749 GST mut N 246-609 Gal4-DBD mut O 246-485 Gal4-DBD mut P
485-609 Gal4-DBD mut Q 246-423 Gal4-DBD mut R 423-485 Gal4-DBD, GST
mut S 145-245 Gal4-DBD 246-334 246-334 FLAG, GFP .DELTA.246-334
3-245 FLAG, GFP 335-749
[0071] Many of the mutants used carry specific tags such as the
Gal4 DNA-binding domain of the yeast GAL4 transcription factor
[Gal4-DBD], green-fluorescent protein [GFP], herpes simplex virus
transactivator protein 16 [VP16], the antibody-specific signal
sequences FLAG tag [FLAG], Xpress tag [XPRESS], c-myc tag [MYC],
nickel-binding epitope His tag [His], maltose-binding epitopes
[MBP], glutathione-binding portions of glutathione S-transferase
[GST]. The use of proteins modified in this way is identified
through use of the epitope name in the GRIM1 construct employed.
The function of the individual regions of the polypeptides of the
invention was investigated by these experiments.
EXAMPLE 5
[0072] Database analyses have shown that hsGRIM1-homologous
proteins are to be found in a large number of species. Homologous
sequences are to be found in Shizosaccharomyces pombe,
Saccharomyces cerevisiae, Rattus spec., Bos bovis, Mus musculus,
Drosophila melanogaster, Caenorhabditis elegans, Danio rerio and
also in Arabidopsis thaliana. A summarizing overview is obtained
from the databases UNIGENE (National Center for Biotechnological
Information, NCBI) and of the PredictProtein server of Columbia
University and is shown in Table 3 below:
[0073] Selected Model Organisms in a Protein-Similarity
Representation
[0074] Organism, protein, percent identity and length of the
compared region. TABLE-US-00009 H. sapiens: sp: Q9Y3T9 - YU20_HUMAN
HYPOTHETICAL 84.9 KDA PROTEIN DKFZP564C186 100% over 753 amino
acids (according to the invention) M. musculus: sp: Q9WV70 - YU20
MOUSE HYPOTHETICAL PROTEIN 73% over 267 Aa R. norvegicus: sp:- -
|Segment 1 of 2| COLLAGEN ALPHA 1 (I) CHAIN 29% over 328 AA A.
thaliana: sp: Q9ZPV5 - YU20 ARATH HYPOTHETICAL PROTEIN T30D6.27 IN
CHROMOSOME II 27% over 722 AA C. elegans: sp: O17580 - YTB2_CAEEL
HYPOTHETICAL 82.0 KDA PROTEIN C07E3.2 IN CHROMOSOME II 27% over 656
AA D. melanogaster: sp: Q9VIF0 - YU20 DROME HYPOTHETICAL PROTEIN
CG9246 37% over 681 AA S. cerevisiae: sp. P39744 - YO26_YEAST
HYPOTHETICAL 81.6 KDA PROTEIN IN DEDI-RETI INTERGENIC REGION 29%
over 674 AA
[0075] Table 3: Representation of the UNIGENE maximally homologous
proteins from mouse, rat, Arabidopsis, threadworm, fruit fly and
baker's yeast. The Swissprot accession numbers, UNIGENE accession
numbers, investigated region and maximum homology are indicated
(mouse Q9WV70 is not the full-length clone; compare with the
described mouse clone described under Seq. ID No. 4).
[0076] FIG. 4 shows a comparison of the amino acids and
establishment of a consensus sequence.
EXAMPLE 6
[0077] Genomic Organization
[0078] The characteristics of the genomic sequences of mmGRIM1 and
hsGRIM1 are compared in Table 4 below. The genomic sequence of both
species, including the detailed exon-intron boundaries, is
indicated in FIGS. 5 and 6. TABLE-US-00010 TABLE 4 Genomic
organization comparison of mm (mouse) versus hsGRIM1 (human).
hsGRIM1 mmGRIM1 Number of exons 19 19 Location Chr. 1p33.36 Chr. 4p
Pseudogenes Chr. 2 NT_022127.3 not known Chr. 2 NT_022140.3 Chr. 11
STS WI-15347 not available Accession # GenBank AF276983.1 Accession
to the revised version under complete sequence is NCBI not
available. NT_021903 Individual exons are UniGene Hs.134200 present
in EST dbs. Swissprot YU_20 human ENSG00000077716 LokusLink 26155
GenBank AL050019.1 OMIM entries No muscle- or fat-related No
muscle- or fat- phenotype. related phenotype. 1p33 is a highly
mutated locus and is associated with breast, prostate and brain
tumors Spliced isoforms nothing known nothing known fragmentary
ESTs present in various dbs. Northern analyses show only one
isoform (exception: heart, 3 isoforms). Size of the 14.24 kb
(Ensembl) 12.36 kb (Celera) locus Quality of the Chromosome 1 build
27 no Ordered up to 5.5 kb contigs longer has any gaps; 5'- and
3'-flanking, ordered structure only a small gap between exons 2 and
3 (see annex 2).
[0079] FIG. 5 depicts the genomic sequence of hsGRIM1 (NCBI Human
Genome Server, Chromosome 1 map view, build 27, December 2001).
Exons have a shaded background, and the putative poly-A site is
underlined. The sequence corresponds to Seq. ID No. 5.
[0080] FIG. 6 depicts the genomic sequence of mmGRIM1 including 5.5
kb flanking sequences (established from the public NCBI mouse
genome db and the commercial Celera mouse genome gb). The exons
have a shaded background, and intronic sequences are depicted in
pale. Exon/intron junctions are depicted in bold. The putative
poly-A site and the putative transcription start are underlined. As
yet incompletely assembled sequence fragments are depicted by "N".
The genomic mouse sequence has the Seq. ID No. 6.
EXAMPLE 7
[0081] Expression of Recombinant GRIM1 Protein
[0082] Many different strategies and expression systems were tested
for expression of recombinant hsGRIM1 protein, but only expression
of short protein fragments either in the HiS.sub.6.times. or in the
GST context was successful. The following lists briefly summarize
the results and the methods used.
[0083] a) Expression Systems and Brief Description of the
Characteristics for Expression of Full-Length hsGRIM1
[0084] Bacterial systems: all the systems used employ Escherichia
coli safety strains which were purchased from Stratagene.
[0085] BL21 (DE3)lysGOLD His-fused: Expression in the pRSET system
or in a modified pRSET vector with unique hexahistidine tag,
purification on a nickel affinity matrix (TALON, Clontech).
Coexpression with GroESL and hspHJK chaperones.
[0086] BL21(DE3)lysGOLD GST-fused: Expression in the pGEX4 or 6
systems from Pharmacia, purification on GSH affinity columns,
expression inducible by IPTG. Coexpression with GroESL and hspHJK
chaperones.
[0087] BL21(DE3)lysGOLD NusA-fused: Increase in solubility of the
target protein via the NusA content, purification by GPC
(Stratagene, Novagen)
[0088] BL21(DE3)lysGOLD MBP-fused: periplasmic expression, and thus
reduced cytotoxicity (Stratagene, New England Biolabs pMAL
system)
[0089] BL21(DE3)lysRIL: Codon-optimized BL21 strain (Stratagene).
Both hexahistidine- and GST-fused protein expression.
[0090] ABLE C: Safety strain which downregulates endogenous plasmid
copy numbers by a factor of 4.times. (Stratagene). Both
hexahistidine- and GST-fused protein expression.
[0091] ABLE K: Safety strain which downregulates endogenous plasmid
copy numbers by a factor of 10.times. (Stratagene). Both
hexahistidine- and GST-fused protein expression.
[0092] Plant Systems:
[0093] Physcomitrella patens moss: stable integration of a
CaMV-driven expression plasmid for GST-hsGRIM1
(pRT99/35S-GST-hsGRIM1) via non-homologous recombination, triple
selection via two different selection antibiotics and multiple
seedings (in cooperation with Prof. Ralf Reski, Plant Biotechnology
Institute of Freiburg University).
[0094] Cellular Systems:
[0095] Expression of GST fusion protein in eukaryotic cell culture
cells under the control of a constitutively active CMV promoter.
[0096] b) Expression of hsGRIM1 peptide Fragments
[0097] Short hsGRIM1 protein fragments were obtained both as
hexahistidine- and as GST-fusion proteins only in BL21(DE3)lysGOLD
without coexpression of chaperones. The following table summarizes
the expression results obtained; unlisted fragments (compare table
of the hsGRIM1 constructs used) could not be expressed:
TABLE-US-00011 TABLE 5 Bacterially expressed hsGRIM1 fragments
His-E (aa 3-147) +++ His-I (aa 609-749) +++ His-H (aa 485-749) ++
GST-E (aa 3-147) +++ GST-I (aa 609-749) +++ GST-H (aa 485-749) ++
GST-M (aa 468-749) + GST-R (aa 423-485) ++ +++ very soluble,
readily inducible ++ readily inducible, low yield + solubility poor
in some cases, low yield
EXAMPLE 8
[0098] Generation and Description of the .alpha.-GRIM1 Antibodies
Used
[0099] To produce hsGRIM1-specific antibodies, two
hexahistidine-fused hsGRIM1 protein fragments were expressed in E.
coli, purified and used to immunize rabbits. Amino acids 3-147 and
609-749 of hsGRIM1 served as epitopes for the antibody.
TABLE-US-00012 TABLE 6 N-terminal His.sub.6x-hsGRIM1 peptide (aa
3-147) which was used to immuniz rabbits, peptide "E" (S q. ID No.
7). The antibodies 2719 and 2720 resulted therefrom. N terminus
MHHHHHHGMASEFGSAGSRKRRLAFLTVDEFLASGFDSESESESENSPQAETREAREAAPEPDKPGGS
PSASRRKGRASEHKDQLSRLKDRDPEFYKFLQENDQSLLNFSDSDSSEEEEGPFHSLFDVLSEASSEE
DGAEEGEDGDRVPRGLKGDDNSVPSTI* C-terminus
[0100] TABLE-US-00013 TABLE 7 C-terminal His.sub.6x-hsGRIM1 peptide
(aa 609-749) which was used to immunize rabbits, peptide "I" (Seq.
ID No. 8). The antibodies 2910 and 2911 resulted therefrom. N
terminus
MRGSHHHHHHGMASMTGGQQMGRDLYDDDDKDRWGSEEGTPLTLYYSHWRKLRDREIQEEISGKERLE
(Seq.
DLNFFEIKRRKMADRKDEDRKQFKDLFDLNSSEEDDTEGFSERGILRPLSTRHGVEDDREDEEEGEED
(Seq. SSNSEDGDPDAEAGLAFGELQQLAQGPEDELEDLQLSEDD* C-terminus
[0101] These regions correspond to the putative INHAT domains of
hsGRIM1 and are distinguished in particular by acidic (polyE/polyD
clusters). The corresponding regions of the hsGRIM1 cDNA sequence
were generated by restriction digestion with endogenous EcoRI
(aa147) and XmaI (aa609) cleavage sites, and the products were
cloned into the vector pRSET-B (Invitrogen) (aa 609-749) or
pRSET-N+/H based on the PRSET-B with the difference that further
tags have been deleted and the protein is expressed only with
hexahistidine tag) (aa 3-147). The hexahistidine-fused hsGRIM1
protein fragments (aa 3-147 "E", aa 609-749 "I ") were
overexpressed in E. coli BL21(DE3)lysGOLD and, after disruption of
the bacteria, purified by TALON affinity chromatography. After
isolation and purification from E. coli, both polypeptides had an
estimated purity of about 80-90%. Rabbits were immunized with the
polypeptides, and the raised antiserum was purified. The purified
monospecific antibodies against aa 3-147 (2719, 2720) and against
aa 609-749 (2910 and 2911) were functionally characterized.
[0102] The antibodies obtained in this way were employed in various
immunological test methods that made it possible to detect
expression of GRIM1. The two antibodies directed against the N
terminus show exclusively nuclear localization of GRIM1 in
subnuclear compartments which are generally referred to as
"speckles". The number and size of the nuclear speckles varies from
cell line to cell line, only the exclusively nuclear localization
of GRIM1 being common to proliferating cells. The two antibodies
directed against the C terminus recognize exclusively nuclear GRIM1
in proliferating cells, but not in speckles, rather in a
pan-nuclear localization in the entire nucleoplasm.
EXAMPLE 9
[0103] Expression of GRIM1 mRNA and Protein
[0104] cDNA/mRNA Expression of hsGRIM1 and mmGRIM1
[0105] hsGRIM1 cDNA expression is ubiquitous according to the
report of the Unigene database (UniGene Cluster Hs. 134200, Homo
sapiens DKFZP564C186/DKFZP564C186 protein with the cDNA library
2334BT0407 used). Expression was detected:
[0106] Cervix, Pancreas, amnion_normal, bone,bone marrow, brain,
breast, cervix, colon, colon_est, colon_ins, denis_drash, ear,
epid_tumor, esophagus, cyc, genitourinary tract, germ cell,
head_neck, heart, kidney, kidney_tumor, liver, lung, lung_normal,
lung_tumor, lymph, muscle, nervous_normal, ovary, pancreas,
placenta, pool, pooled, pooled brain, lung, testis, pooled colon,
kidney, stomach, pooled lung and spleen, prostate, prostate_tumor,
salivary gland, skin, small intestine, stomach, testis, testis-cell
line, thymus pooled tonsil, uterus, uterus_tumor, whole embryo
[0107] The expression pattern of hsGRIM1 was visualized by Northern
blot analysis (CLONTECH hsMTN I and hsMTN II, probe: both
full-length hGRIM1 cDNA and N- and C-terminal fragments). The
hsGRIM1 transcript size is about 3.3 kb (consistent with the
deposited full-length mRNA under Acc. # AL 050019):
[0108] heart (1), brain (2), placenta (3), lung (4), liver (5),
skeletal muscle (6), kidney (7), pancreas (8), spleen (9), thymus
(10), prostate (11), testis (12, ovaries (13), small bowel (14),
large bowel (15), peripheral lympocytes (15
[0109] The expression of mmGRIM1 in various tissues of the adult
mouse (4-6 weeks) was determined by RT-PCR. The following tissues
and organs were classified as mmGRIM1-positive:
[0110] breast, breast of suckling animals, placenta, testis,
ovaries, fat, skin, bone, cartilage, spleen, lung, adrenergic
gland, kidney, liver, small bowel, stomach, pituitary, thymus,
tongue, skeletal muscle, heart, eye, spinal cord, cerebellum,
medulla, hypothalamus, cerebral cortex, whole brain
[0111] Expression of mmGRIM1 mRNA in embryonic development was
determined via RT-PCR with complete embryo mRNA. A distinct mmGRIM1
signal is detected at all times, and no large change in mmGRIM1
mRNA expression during development is observable.
[0112] Table 8 depicts a UNIGENE database analysis on EST
expression selected in homology to mm/hsGRIM1 protein. The
accession numbers of the IMAGE clone or of the sequences which are
unknown in some cases, and the tissue type from which the cDNAs
were isolated, are indicated. GRIM1 appears to be expressed
ubiquitously also at the protein level. In total, 308 ESTs were
found, and those which show an unambiguous tissue assignment are
listed below: TABLE-US-00014 TABLE 8 BI070012 cervix R50545 breast
BI870713 liver BG335512 placenta H26449 breast BE253493 eye
BE903516 placenta AI674272 uterus BF345524 brain BG699598 brain
AL382825 esopitagus BE070314 breast BI828140 brain R10573 pool
AW375280 colon BG163998 skin R10574 pool BG388020 prostate BG760638
skin AW058011 thymus pooled BG179766 prostate BE894669 skin
AI660736 thymus pooled BE583040 brain BG767001 skin AI765077 kidney
BG764320 skin BF207343 brain AI677761 pancreas BE263417 lung
BG337689 uterus AI266034 lung BE266643 lung BG394462 eye AI948457
kidney BE273581 kidney BF212645 bone marrow AA281338 tonsil
BG674251 skin BE278111 placenta AA213719 tonsil BE514600 ovary
BF984755 small intestine AI93564 lung BE778203 eye BG332620 lung
AI85822 lung BI258705 placenta BG767665 skin AI660558 colon
BI006483 kidney_tumor BG764689 skin AI681255 lung AW675721 cervix
BG390277 prostate AI272873 colon BE874785 colon BI489732 pooled
brain AI935918 pancreas BE545263 placenta lung testis AI799833
prostate BE746375 ovary BG776759 lung AI655977 germ cell AI989734
colon BG029437 breast AI633340 germ cell AW373221 colon BG699275
brain AI971531 germ cell AW365265 head_neck BE883535 uterus
AA026277 heart BE746427 ovary BI459847 testis cell line AA026278
heart AW575244 colon BI461880 testis cell line AI400930 prostate
BI113120 muscle BG830390 pancreas AI420991 prostate AI471616 kidney
BI760633 pooled colon AI682274 prostate AL529521 brain kidney
AI744703 colon BE779713 eye stomach U810701 prostate BI356811
placenta BE732072 placenta BE869684 colon AW237492 kidney AI254710
colon BE900176 placenta AW179094 stomach AI611186 uterus AW572690
lymph AW512856 uterus AI990470 thymus pooled BF868966 lung_tumor
BE793414 lung AW028378 stomach BF346952 brain AW366003 head_neck
R73178 breast AW673372 cervix AW104543 pool R93124 breast BI085040
salivary gland BE257381 eye AI678640 stomach BE515327 uterus
BI222365 placenta R50640 breast BE410145 placenta BE591125 uterus
BE746352 ovary AW365268 head_neck AW083553 colon AW366001 head_neck
BG328934 kidney BE783940 eye BF874087 lung_tumor BE892015 skin
BG327104 kidney AA427964 whole embryo BE543377 placenta BE754506
normal BE740821 ovary AW674628 placenta AW071075 breast AL526148
brain AW551591 colon T16893 brain BF341363 brain BF883105
lung_tumor BF773461 epid tumor BE262764 brain AW151790 stomach
BC284070 prostate BH15932 lung BE391861 uterus AW366039 head_neck
BE391378 uterus BI254811 placenta BG436850 lung BG576365 breast
BE301196 kidney BE26490 lung BI259365 placenta BE296203 muscle
BG829981 pancreas BG330302 lung BE278652 placenta AA315693 ATCC
111476 BI772656 pooled lung BF797114 lymph T19249 testis and spleen
BI857618 breast AW364797 denis_drash BG761400 skin BG256028
prostate BF223807 germ cell BG681166 skin AL526094 brain AA355948
ATCC 158121 AW365999 head_neck AW364751 denis_drash BI007523
kidney_tumor BE841836 stomach BF765941 colon_est AW351978 head_neck
BE379093 uterus AW364764 denis drash AW582985 Pancreas BF768551
epid_tumor BE070313 breast BE727721 skin BE898791 ovary BF026345
skin AH366053 head_neck BE760929 brain AW364775 denis drash
BF509301 muscle BI857073 breast AW673544 cervix BF830322 kidney
tumor BF883094 lung_tumor AW375245 colon BG110646 bone BE882355
uterus AW375242 colon BI084623 salivary gland AW473561 uterus
BF907757 uterus_tumor BI033036 nervous BG421457 kidney BG762371
skin normal BE901462 placenta AI632285 germ cell A356032 head neck
AW351639 colon AA427857 whole embryo BE593548 germ cell BI859530
breast AW351647 colon BI860984 breast BG91171 brain BE332104 breast
AW366056 head_neck AW375257 colon AW351638 colon AW564784 denis
drash AA502771 colon AA448947 whole embryo BG25338 liver BE742897
ovary BE795961 lung BE858431 prostate tumor AW020647 ear T16894
brain BF955772 nervous AI672826 colon AW366050 head_neck normal
BG330696 lung BE383442 brain AW366040 head neck BG421512 kidney
BF817292 colon_mus BG423930 kidney BE536984 cervix AI346533 colon
AA449675 whole embryo AW375211 colon BF000005 colon BF913827
uterus_tumor BF732410 ovary BI160700 pancreas AW074084 BE273057
kidney BG177975 prostate AW365264 head_neck BI857814 breast
AW528624 Cervix AW365167 head_neck BI033534 nervous BG030810 breast
AW375277 colon normal BG110635 bone AW575314 colon AW366007
head_neck AL563470 brain AW594248 germ cell AW375312 colon AW365161
head neck BE389202 uterus BI762663 pooled colon BI007446
kidney_tumor U636573 germ cell kidney BF338161 brain BE746451 ovary
stomach AW387399 stomach BF868930 lung_tumor AW272743 colon
AW601835 breast BE729701 skin AW675196 placenta AW601811 breast
AW806654 stomach AW179097 stomach BF846203 lung_normal BG281880
skin BF338242 brain BG254482 prostate BI094540 cervix BE896763
skin
EXAMPLE 10
[0113] Protein Expression
[0114] Since various experiments cannot be carried out directly on
humans, cell culture and animal experiments which, however, also
have validity for the human situation were carried out.
[0115] a) The expression of mmGRIM1 protein in primary cells and
tissues is indicated below (all the listed cells are classified as
GRIM1-positive in a Western blot (86 kD), and for cells the
expression was also checked in IIF). Signals are obtained with both
antibodies (N- and C-terminal epitope):
[0116] Mouse embryonic fibroblasts
[0117] Human cardiac tissue (DCM patients and normal heart)
[0118] Mouse smooth muscle cells (differentiated, undifferentiated
and proliferating) [0119] Mouse organs: testis, ovaries, spleen,
thymus and in a small amount also in prostate and stomach (depicted
in the following figure), brain, lung, liver.
[0120] b) It was possible to detect expression of mmGRIM1 protein
by in situ immunohistochemistry on sagittal sections of E10.5 mouse
embryos. mmGRIM1 is expressed in epithelial structures, in the
dermomyotome of the rear extremities, in the inner epithelial
regions of the neural tube and in other cell populations which are
not determined in detail. Concentrated expression is detectable
only in the dermomyotome; only a few mmGRIM1-positive cells are
detectable in the remaining areas. Ubiquitous expression of GRIM1
was found in mRNA and EST analyses. It was possible to show by in
situ immunohistochemical analyses of E10.5 mouse embryos that
although there is mmGRIM1 expression in every tissue type, it is
far from being in every cell, but rather in specialized cell
populations. mmGRIM1 is not expressed in every cell type but
distributed over the entire organism in specialized cell types.
[0121] c) The expression of rnGRIM1 (rn=rat) in the rat brain was
detected by in situ immunohistochemistry in sagittal sections
through the brain of a Sprague-Dawley rat (Rattus norvegicus
ssp).
[0122] rnGRIM1 is expressed in Purkinje cells of the cerebellum
(nuclear and in some cases also cytoplasmic; strong
immunoreactivity in 1-2 subnuclear structures), in bipolar cells of
the mitral layer of the olfactory lobe (nuclear and in some cases
also cytoplasmic; strong immunoreactivity in 1-2 subnuclear
structures), in all cells of the choroid plexus (nuclear and in
some cases also cytoplasmic; strong immunoreactivity in 2
subnuclear structures in most cases), in cells which have not yet
been determined in detail in the whole region of the cortex
(nuclear and in some cases also cytoplasmic; strong
immunoreactivity in 1-2 subnuclear structures). rnGRIM1 is not
expressed in every cell type, but distributed over the entire brain
area in specialized cell types.
[0123] d) Expression of GRIM1 protein in cell culture cells. All
the stated lines are classified as GRIM1-positive in Western blot
analyses (species-inclusive size of about 86 kD) and by indirect
immunofluorescence (IIF). Both assays were carried out with all the
available .alpha.-GRIM1 antibodies. In the proliferative state,
GRIM1 is detectable in every cell line in so-called speckle
structures (as also described in the literature for many
corepressors): TABLE-US-00015 Mouse: NIH3T3 fibroblasts, NIH3T3-L1
preadipocytes, NIH3T3-L1 adipocytes, C2C12 undifferentiated
skeletal muscle cells, C2C12 differentiated skeletal muscle cells,
N2a neural cells, P19 teratocarcinoma cells, 10 T.sub.1/2
fibroblasts, Monc-1 neural crest cells Human: HL1 myocardial cells,
293 embryonic kidney cells, LnCaP prostate carcinoma cells, PC3
prostate carcinoma cells Monkey: COS-7 and CV1 kidney cell lines
Rat: L6 skeletal muscle cells Hamster: BHK fetal kidney cells
[0124] e) Drosophila melanogaster GRIM1
[0125] Expression of an hsGRIM1-like protein in Drosophila
melanogaster was investigated by Western blotting and whole-fly in
situ hybridizations.
[0126] No specific signal was detected with the
anti-hsGRIM1-specific antibodies (resolution from 36 to 210 kD)
either in whole-larva lysate or in whole-cell extracts from fly
heads. Likewise, no specific dmGRIM1 (dm=Drosophila melanogaster)
signal was detectable with the available GRIM1 antibodies in in
situ hybridizations with various larval stages.
EXAMPLE 11
[0127] hGRIM1 Interactants
[0128] In the protein-protein interaction studies carried out to
date, the following potential mm/hsGRIM1 interactants were
investigated for direct or indirect interaction with the following
methods: TABLE-US-00016 METHOD for detecting protein-protein
interaction Symbol Co-IP from cell extracts of transfected cells
tfIP Co-IP from cell extracts of untransfected cells utfIP Co-IP
with in vitro translated proteins ivIP Yeast THS yTHS Mammalian THS
mTHS Pulldown analyses (direct interaction) PD Transcriptional
studies (transient transfections) tT
[0129] The underlined factors showed an interaction with
hs/mm/maGRIM1 and are described separately at the end of the list
(ma=mesocricetus aureus, BHK cells).
[0130] Nuclear Hormone Receptors
[0131] hsAR (PD, utfIP, tfIP, YTHS, tT), hsER.alpha. (utfIP, tfIP),
hsER.beta., rnGR (tT, tfIP), hsMR (tT), hsPR (tT), PPAR.gamma.
(tT), hsRAR.alpha. (tfIP), rnTR.alpha. (tT), hsTR.beta.-2 (tT),
mmERR1 (tfIP, tT), mmGCNF (PD, ivIP, tfIP, tT), hsRevErbA (tT),
mmRVR (tT, tfIP), mmSF1 (tT).
[0132] Transcription Factors
[0133] Myogenin (ivIP, tT), MEF2A (ivIP), MEF2B (ivIP), MEF2C
(ivIP, tT), eHand (ivIP), dHand (ivIP), E47 (ivIP), E12 (ivIP),
GATA4 (ivIP), MyoD (ivIP, tT), SEF (ivIP, tT), MRF4 (ivIP, tT),
Myf5 (ivIP), Sp1 (tT), CBF (tT).
[0134] Transcriptional Corepressors.
[0135] N-COR (ivIP, tfIP, utfIP, PD), Sin3A (ivIP, tfIP, utfIP),
Sin3B (tfIP, utfIP), SuN-CoR (ivIP, tT), SMRT (ivIP, tfIP, utfIP),
HDAC1 (tfIP, utfIp), HDAC2 (tfIP, utfIP), RIP140 (ivIP).
[0136] Transcriptional Coactivators:
[0137] P300 (tT), FHL2 (ivIP, PD), FHL3 (PD), TIF2 (ivIP), RIP140
(ivIP)
[0138] Further factors: Histones H3, H4, H2A, H2B (PD)
[0139] It has been possible to date to show a strong direct
interaction between hsGRIM1/mmGRIM1 only with the androgen receptor
(hsAR) (detected by tfIP in both directions, utflP in both
directions, YTHS, PD and tT). The interaction with the androgen
receptor is ligand-independent and cannot be influenced by the
presence of antagonists (Casodex, hydroxyflutamide, cyproterone
acetate). GRIM1 interacts both with the "normal" AR having 17
glutamines and with AR isoforms which are associated with Kennedy's
disease and comprise polyglutamine-rich insertions (17, 44 and 77
Q), likewise ligand-independently. The PD analyses were carried out
with the GRIM1 fragments GST-I/E/H/M, and all the tested fragments
apart from GRIM1 "E" (aa 3-147) interacted with the AR, in the
sequence M.gtoreq.H>>I. The physiological relevance of the
AR-GRIM1 interaction is described in detail in a separate
section.
[0140] Weak interactions were detected between hsGRIM1 and the
nuclear orphan receptors Germ Cell Nuclear Factor (mmGCNF) and
Estrogen receptor Related Receptor 1 (mmERR1) (only ivIP, tfIP and
PD), and a weak interaction with the estrogen receptor .alpha.
(hsER.alpha.; interaction was detected only in tfIP and proved to
be ligand-independent both in the presence of agonists (estradiol),
of antagonists (raloxifene, tamoxifen, ICI) and in the absence of a
ligand). The physiological relevance, in vivo colocalization or in
vivo interaction is very substantially unclear. It was not possible
to show any direct association between GRIM1 and mmGCNF, mmERR1 or
hsER.alpha. in transient transfections.
[0141] GRIM1 shows a strong interaction with the core histones H4,
H2B, H2A and H3 in GST pulldown analyses. Recombinantly expressed
GST-GRIM1 fusion proteins (GST-aa 3-147 and GST-aa 609-749) show a
marked interaction with all four core histones, providing further
support for the observed INHAT activity of the N- and C-terminal
GRIM1 domains.
[0142] Potential mm/hsGRIM1 interactants can be identified by
conventional methods. These methods include yeast two-hybrid screen
with various cDNA libraries and direct protein biochemical methods
such as Co-IP of GRIM1 and associated proteins, fractionation by
2-dimensional gel electrophoresis, mass spectrometric analysis and
microspray sequencing of the potentially interacting proteins from
various cell culture cell lines (C2C12 differentiated, C2C12
undifferentiated, 3T3L1 differentiated, 3T3L1 undifferentiated, BHK
and many more).
EXAMPLE 12
[0143] Physiological Function of GRIML: Analysis of Repression
Potential of hsGRIM1 in Transient Transfections
[0144] All the described data were obtained with the hsGRIM1 clone.
GRIM1 is a potent transcriptional repressor when used as Gal4-DBD
fusion protein in transient transfections. Gal-GRIM1 represses
expression of reporter genes both of Gal4-RE-controlled complex and
synthetic minimal reporter constructs. The repression potential of
GRIM1 is distinctly greater for example compared with the
corepressor Gal-SuN-CoR. In these experiments, the appropriate
corepressors are transiently expressed as heterologous Ga14-DBD
fusion proteins in cells. The repression capacity on various
synthetic promoter-reporter constructs compared with equal
quantities of transfected wild-type Gal4-DBD is determined by
luciferase reporter gene expression.
[0145] Table 9: Gal-GRIM1 is a stronger repressor than Gal-SuN-COR.
293 cells were cotransfected with G5ElbTATA-LUC reporter and the
stated amounts of corepressors. The same effect can be observed on
testing complex promoters (Gal4 .sub.3.times.-tk-LUC).
[0146] Gal-GRIM1 is equally strong as a repressor as a large number
of described corepressors such as, for example, Gal-HDAC1-10,
Gal-CBF1, Gal-Alien and represses only insignificantly weaker than
Gal-N-CoR-RD or Gal-SMRT-RD. The repression potential of Gal-GRIM1
was investigated in conjunction with the complex
Gal4.sub.3.times.-tk-LUC and Gal-MMTV-LUC promoter/reporter
constructs and in conjunction with the minimal promoter/reporter
constructs Gal.sub.5.times.-E1b-TATA-box and
Gal.sub.3.times.-.beta.globin-TATA-LUC. In addition, the repression
capacity of Gal-GRIM1 was investigated on an autonomous VP16
transactivation domain bound to DNA by means of LexA-DBD, via a
LexA.sub.8.times.-Gal.sub.5.times.-LUC reporter (provided by Dr S.
Khochbhin). The data obtained on Gal-GRIM1 has been tested
experimentally in the following cell lines: A7r5, C2C12, CV1, BHK,
293, N2a, COS-7. The following figure shows representative results
for BHK and C2C12 cells.
[0147] The Gal-GRIM1 repression is not impaired by addition of the
specific histone deacetylase (HDAC) inhibitors trichostatin A (TSA,
300 nM) and of sodium butyrate (Na-But, 5 mM), suggesting that the
mechanism of repression is fundamentally different from that
described for other corepressors. Comparable cases have been
described for other repressors, although no explanation of the
method is described. In addition, the described factors show no
involvement in skeletal muscle differentiation or adipogenesis.
Data on the independence of HDACs in transient transfections are
depicted in the following figure for the example of BHK cells. The
transcriptional activity of Gal-GRIM1 is uninfluenced by TSA in any
cell line already tested.
[0148] To determine the GRIM1 domains involved in repression, a
total of 11 deletion mutants of GRIM1 were constructed and all were
tested in the Gal4-DBD context for their repression potential in
all the reporter systems already described. All the GRIM1 domains
used show transcriptional repression in the range from 2.times.
(Gal-E and Gal-I) to 15-20.times. (full-length Gal-GRIM1). All the
tested deletion mutants proved to be insensitive to HDAC
inhibitors.
[0149] In summary, it was possible to show that GRIM1 has at least
5 independent repression domains, with the 3 main repression
domains being located in the hydrophobic core region of the protein
(mutQ, mutR, mutp). However, every analyzed deletion mutant shows a
repression capacity which is significantly above the background in
the Gal4-DBD context. TABLE-US-00017 TABLE 12 Summarizing list of
all hsGRIM1 deletion mutants used as Ga14-DBD fusion in transient
transfections to investigate their repression potential. C-terminal
N-terminal Intermediate deletion mutants deletion mutants deletion
mutants aa aa aa mut A 3-609 mut F 145-749 mut J 145-609 mut B
3-487 mut G 246-749 mut N 246-609 mut C 3-423 mut H 485-749 mut O
246-485 mut D 3-245 mut I 609-749 mut P 485-609 mut E 3-147 mut Q
246-423 mut R 423-485 mut S 145-245
[0150] The GRIM1 subdomains "E" and "I" described in the section
"INHAT activity" show the smallest repressive effect as Gal4-DBD
fusions in the transcriptional assays, but are involved in
preventing histone acetylation by p300. No further information on
the Q, P and R domains is available, and it was not possible to
prove a direct involvement in the recruitment of additional
corepressors by the interaction studies cited above. At the moment
there is only speculation about the mechanism of repression used by
GRIM1, although involvement of HDACs can be ruled out, and it
appears that at least two different mechanisms are used (E and I in
contrast to Q, R and P).
[0151] Results with the LexA.sub.8.times.Gal.sub.5.times. system
confirm these results but show a different relative repression of
the individual fragments investigated (data not shown).
[0152] Full-length GRIM1 is able to repress both complex naturally
occurring and synthetic minimal promoters. The following systems
have been tested in transient transfections: GRIM1 and complex
promoters (p21 (-4542)LUC, SKA-LUC, calponin-LUC, myogenin-LUC,
myogenin proximal promoter-LUC, thymidine kinase-LUC, probasin-LUC,
MMTV-LUC and many more).
[0153] Cotransfections of GRIM1 and E1b-TATA-box-containing minimal
promoters with synthetic TF binding sites (AR, Gal4 and SEF binding
sites) or with the complex MMTV promoter leads to repression of
basal transcriptional activity (all transfections shown were
carried out in BHK cells).
[0154] Repression of GRIM1 with minimal promoters without synthetic
TF binding sites cannot be demonstrated more accurately owing to
the low basal value of the constructs used.
[0155] The repression potential of GRIM1 on
promoter-luciferase-reporter luciferase-reporter constructs which
have been stably integrated into the genome of the particular cell
line (HRL+N with RARE.sub.3.times.-LUC, NIH3T3 with cycA-LUC, 293
with NFkB-RE.sub.4.times.-LUC) is summarized in the next figure.
Transfection of the cells with GRIM1 shows no effect on reporter
gene transcription in the induced and in the basal state.
[0156] In addition, direct effects of GRIM1 on various
transcription factors were investigated in transient transfections
(compare also Example 11). Besides the AR, in particular mmGCNF,
mmERR1 and mmRVR were analyzed experimentally as full-length
proteins with the corresponding reporter constructs for a
functional interaction with GRIM1. No significant effect was
observable in any of the transfections.
[0157] Interactions of GRIM1 with further transcription factors or
cofactors was investigated in the Gal4-DBD-based context, with the
respective Gal4-DBD-fused factor being cotransfected in constant
amount with an increasing amount of wild-type GRIM1. Factors
investigated in this system are: Gal4-p300, Gal4-DBD, Gal-Sp1-4,
Gal4-CBF). Once again, no significant effect was observed in any of
the transfections.
EXAMPLE 13
[0158] Mechanism of GRIM1 Repression
[0159] a) HDAC Activity
[0160] The mechanism of repression by recruitment of histone
deacetylases (HDACs), which is the most widespread to date, appears
not to be involved with GRIM1. The repression potential of
Gal4-DBD-GRIM1 in transient transfections is not impaired by
application of specific HDAC inhibitors in a concentration of up to
5-fold above the KD (final concentration of trichostatin A 300 nM,
of sodium butyrate 5 mM). Immunoaffinity-purified and
high-stringency-washed GRIM1 shows no endogenous HDAC activity. Nor
do associated complexes which have been coimmunoprecipitated with
GRIM1 by mild washing conditions show any HDAC activity. In
addition, direct involvement in HDAC/N-CoR/SMRT complexes have been
ruled out by Co-IP studies.
[0161] b) INHAT Activity
[0162] A further possibility for transcriptional repression
consists of direct masking of the histone N termini to prevent
acetylation and thus loss of cohesion of the chromatin in the
affected region. It was found by database analysis that the N
terminus (aa 25-135) and the absolute C terminus (aa 638-749) of
hsGRIM1 has acidic domains which display homology with a described
INHAT domain of the Set protein. INHAT is an acronym for protein
domains/proteins able to bind lysines and thus prevent histone
acetylation.
[0163] It was possible to show in GST pulldown analyses that the N-
and C-terminal fragments of hsGRIM1 can associate directly with
histones in vitro. In addition, it can be shown in in vitro
acetylation assays that recombinantly expressed GRIM1 aa 3-147 and
aa 609-749 efficiently blocks acetylation of all four core histones
(H3, H2A, H2B and H4) by p300. It was possible thereby to show that
a possible mechanism of the GRIM1-mediated repression might be
ensured by histone masking.
[0164] There are differences in the efficiency of inhibition of
acetylation of individual histones by GRIM1. As yet, only blockage
of acetylation of H2A and H2B have been unambiguously
demonstrable.
[0165] c) Further Mechanisms of Repression
[0166] Lysin Methylation of Histones
[0167] Methylation of specific lysines in the free histone N
termini (H3K9, H3K4) ensure compaction of the chromatin structure
by preventing acetylation of the lysines which are important for
regulation and thus preventing a loss of cohesion of the chromatin.
In addition, methylated lysines represent a high-affinity binding
site for the HP1 protein (heterochromatic protein 1). HP1 may
subsequently recruit DNA methylases, MeCPs and also HDACs and lead
to a targeted formation of transient or constitutive
heterochromatin and thus repress transcriptional processes.
Examples of HMTases described to date are Suv39h1, Suv39h2 and
PRMT1.
[0168] Homology analyses show that no conserved SET domain
responsible for histone methyltransferase (HMTase) activity is
present in the GRIM1 sequence. It is therefore assumed that GRIM1
has no endogenous HMTase activity.
[0169] Recruitment of Chromatin-Reorganizing Complexes
[0170] A further possibility for specific repression would be
targeted recruitment of chromatin-reorganizing complexes (e.g. BRG,
SWI/SNF, CHRAC, ARC, RSC etc.). Promoter regions of the genes to be
repressed can be wrapped up in compact chromatin by active
chromatin reorganization in such a way that transcriptional
initiation is distinctly impeded.
[0171] It is not possible to rule out a functional interaction of
GRIM1 with one of the components of one of the remodeling
complexes.
[0172] Blocking PIC Formation/Impeding th Initiation
Reaction/Impeding Elongation
[0173] A further form of repression is impeding or preventing the
formation of the preinitiation complex (PIC) at the start of
transcription. Successful initiation is possible only if the TATA
box-binding portion of TFIID can recruit the polymerase holocomplex
at the starting point of transcription and if all the basal
transcription factors involved are available in sufficient
quantity. A further prerequisite for successful transcription is
the need for the C terminus of RNAPII to be phosphorylated so that
leaving of the initiation site is possible.
[0174] It is possible in an experiment familiar to the skilled
worker to find whether GRIM1 directly interacts with one of the
basal transcription factors or elongation factors and thus blocks
efficient transcription (yeast two-hybrid system, pulldown
analyses). No phosphatase domain can be found within the GRIM1
structure by homology analyses, and it is yet to be proven
experimentally that GRIM1 is able to dephosphorylate RNAPII.
[0175] Titration out of Coactivators by Binding to GRIM1
[0176] The potential of GRIM1 to inactivate basal or general
coactivators by binding and thus to repress the initiation or
elongation process is currently under investigation by searching
for potential hsGRIM1 interactants.
[0177] A total of at least 5 independent repressor domains (RDs)
have been found within the GRIM1 protein in transient
transfections. The fragments E and I of GRIM1 showed the smallest
repression potential in transient transfections, but their function
in repression by GRIM1 was demonstrated in the INHAT analyses.
[0178] However, the deletion mutants P, Q, R and also S show a far
stronger repression function in Gal-based transfections, suggesting
further mechanisms beyond the known facts. It is possible to find
the contribution to repression made by the other regions of GRIM1
by conventional experiments. These were intended to identify
interacting GRIM1 partner proteins in order to find whether there
is an association with non-HDAC components of transcriptional
corepressor complexes.
EXAMPLE 14
[0179] Analysis of Functional Motifs in hsGRIM1
[0180] Functional motifs within the hsGRIM1 structure were found by
database homology analyses. A summary of the motifs found, their
localization within the complete protein and the experimental
strategy used is shown in the following table. TABLE-US-00018 TABLE
19 Summary of the functional motifs in hsGRIM1. Name Sequence aa
position Detection LxxLL I.sub.RDLI 212-216 Significance not yet
confirmed L.sub.QNLL 219-223 experimentally L.sub.AFLV 312-316
L.sub.TELL 362-366 II.sub.GCI 339-443 L.sub.FIL 476-480 IL.sub.RPL
686-690 NES L.sub.RVL.sub.AL.sub.VL 309-317 Detected by GFP
relocation experiments and directed point mutagenesis NLS
KRRK.sub.MADRK.sub.DEDRK.sub.QEK 649-666 Detected by directed point
mutagenesis and GFP-GRIMI localization CoRNR Box
.sub.PL.sub.AQVIIGCI.sub.KLI.sub.P 435-446 Not yet confirmed
experimentally. Has no influence on AR-dependent interactions HMG
Box poly E/poly D 25-51 INHAT activity of the investigated like
68-134 fragments. Direct DNA and/or 658-683 histone binding not yet
confirmed 697-749 experimentally. LxxLL: interaction motif by which
coactivators interact with NHRs. NES: nuclear export signal. NLS:
nuclear localization signal. CoRNR box: interation motif by which
corepressors interact with NHRs. HMG box: contact of HMG proteins
to DNA and to histones.
[0181] Detection of the NES within hsGRIM1 For direct analysis of
the motif, all four relevant lysines were inactivated by directed
point mutagenesis so that the .alpha.-helical structure of the NES
was destroyed. The inactivating mutations are depicted in the
following figure. A directed inactivation of the two 3'-located
lysines on its own proved to be insufficient to destroy the
functionality of the NES. TABLE-US-00019 TABLE 20 Directed
mutagenesis to inactivate the four lysines involved in the NES.
##STR1##
[0182] To demonstrate the functionality of the export signal,
hsGRIM1 protein fragments were fused to GFP (green fluorescent
protein) (depicted in the following figure) and expressed in cells
by transient transfection. The localization of the respective
fragments was detected by fluorescence microscopy. TABLE-US-00020
TABLE 21 GFP fusion proteins used to demonstrate the functionality
of the nuclear export signal (NES). .DELTA.NES characterizes the
point mutations described above. GFP*hsGRIM1 ##STR2## GFP*
hsGRIM1.DELTA.NES ##STR3## GFP*hsGRIM1 .LAMBDA. aa 246-334 ##STR4##
GFP*hsGRIM1 aa 246-334 ##STR5##
[0183] Only GFP showed a pan-cellular localization. Full-length
GFP-hsGRIM1 was located exclusively in the nucleus through the
dominance of the NLS, as were the NES deletion mutant and the
fragment from which a region flanking the NES had been deleted
(hsGRIM1 .DELTA.aa246-334). On fusion of the hsGRIM1 fragment which
contains the NES to GFP (hsGRIM1 aa246-334), the resulting protein
was located exclusively in the cytoplasm. On coupling of the
point-mutated fragment of aa 246-334 to GFP, the pan-cellular
localization was restored so that it was possible to demonstrate
the functionality of the motif.
[0184] Detection of the NLS within hsGRIM1
[0185] To analyze the motif, besides the deletion mutants detailed
hereinafter, all seven relevant basic residues were inactivated by
directed point mutagenesis. Inactivation of the 3'-located 3 basic
residues alone proved to be insufficient to inactivate the NLS.
TABLE-US-00021 TABLE 22 Directed mutagenesis for inactivation of
the seven basic residues involved in the NLS. ##STR6##
[0186] To demonstrate the functionality of the import signal,
hsGRIM1 protein fragments were fused to GFP (depicted in the
following figure) and expressed in cells by transient transfection.
The localization of the respective fragments was detected by
fluorescence microscopy. TABLE-US-00022 TABLE 23 GFP fusion
proteins used to demonstrate the functionality of the nuclear
localization signal (NLS). GFP*hsGRIM1 ##STR7## GFP* hsGRIM1
.DELTA.NLS ##STR8## GFP*hsGRIM1 aa 1-609 ##STR9## GFP*hsGRIM1 aa
609-749 ##STR10## and .DELTA.NLSII
[0187] Only GFP showed a pan-cellular localization. Full-length
GFP-hsGRIM1 was located exclusively in the nucleus. On deletion of
the last 140 amino acids (hsGRIM1 aa1-609), the GFP fusion protein
showed both a cytoplasmic and a weak nuclear localization, while
the corresponding counterpart fused to GFP (GFP-hsGRIM1 aa609-749)
shows an exclusively nuclear localization. The full-length hsGRIM1
with the seven-fold point mutation within the NLS showed the same
distribution pattern as the hsGRIM1 aa1-609 deletion mutant and
confirmed the functionality of the NLS. On coupling of the
point-mutated fragment of aa609-749 to GFP, the fusion protein was
likewise located in the cytoplasm.
[0188] Detection of the CoRNR Box within hsGRIM1
[0189] To analyze the CoRNR box, all three highly conserved and
necessary isoleucines in the hydrophobic core of the helix were
replaced by alanines or by a serine: I436A, I437S, I444A. Further
functional characterization of the point mutations made have
revealed to date only that the repression potential of hsGRIM1 is
not impaired, and that the interaction with the AR is not impaired
by destroying the CoRNR box.
[0190] Intracellular Relocalization of mmGRIM1 in Cell
Culture-Based Differentiation Models
[0191] GRIM1 shows a change in its subcellular localization during
specific differentiation processes. In cell culture systems, GRIM1
is transported from the nucleus into the surrounding cytoplasm
during skeletal muscle differentiation of mouse C2C12 cells. The
timeframe for the observed translocation of GRIM1 is in the range
for the expression of the myogenic bHLH transcription factor
myogenin (My14) and of the structural protein skeletal muscle
myosin heavy chain (MHC), i.e. at a more advanced time during
muscle differentiation than previously described factors such as
HDAC4, HDAC5 and HDAC7. C2C12 cells differentiate under low-serum
conditions within 6-8 days from proliferating myoblasts to fused,
multinuclear myotubes. During this, the cells undergo a G.sub.0
arrest, the CDK inhibitor p21 is upregulated, and an ordered
cascade of myogenic transcription factors such as Myf5, MyoD and
myogenin provide the transcriptional precondition for myogenesis.
In differentiating C2C12 skeletal muscle cells, GRIM1 leaves the
nuclei of fused myotubes after about 6 to 8 days, whereas GRIM1 is
always located in the nucleus in mononuclear myoblasts. Expression
of the structural protein of muscle MHC takes place at the time
when the first GRIM1-free nuclei are to be observed, and was used
in IIFs as temporal marker for GRIM relocalization.
[0192] The data determined for C2C12 also apply to other cellular
skeletal muscle differentiation systems. The same effect was also
detectable in rat L6 skeletal muscle cells. The cells have
different differentiation kinetics and do not fuse until day 10-12
in differentiation medium (DM). GRIM1 also relocalizes in L6 cells,
and GRIM1-free nuclei are detectable only from day 16 onwards in
DM. In a further differentiation system, mouse 10T1/2 fibroblasts
were stably transfected with an estrogen-inducible MyoD expression
cassette (cells kindly made available by Dr. D. Bergstrom).
Addition of 1 .mu.M estradiol enables skeletal muscle
differentiation via MyoD expression. After only about 3-4 days in
DM, many cells can be detected as multinuclear myotubes.
[0193] During skeletal muscle differentiation, the GRIM1 protein is
not degraded and/or broken down but is present to the full extent
also in the cytoplasm. GRIM1 immunoreactivity can be detected
unchanged in Western blot analyses with whole cell extracts of
C2C12 cell extracts which have been differentiated to myotubes in
DM for 8 days (on day 8 after addition of DM, .gtoreq.60% of all
the cells present were myotubes). The results can also be further
confirmed by cell fractionations. In addition, RT-PCR analyses have
confirmed that the amount of mmGRIM1-mRNA is not changed during
C2C12 differentiation.
[0194] As a further check, it was investigated whether GRIM1 also
experiences subcellular relocalization during smooth muscle
differentiation. For this purpose, mouse smooth muscle cells were
obtained ex vivo and dedifferentiated by growth factor addition.
Cells were additionally differentiated again further to smooth
muscle cells via differentiation medium in vitro. The results show
that GRIM1 does not leave the nucleus during smooth muscle
differentiation. In addition, a cell culture smooth muscle
differentiation system (Monc-1 cells) was used in order to obtain
in vitro data about possible GRIM1 relocalization. As already shown
for the ex vivo cells, no GRIM1 relocalization in smooth muscle
cells was observable during the investigated differentiation
times.
[0195] A further in vitro differentiation model used for GRIM1
relocalization analyses is differentiation of 3T3-L1 preadipocytes
to mature fat cells. In the total of three differentiation
protocols carried out, GRIM1 twice showed a detectable
relocalization from the core into the cytoplasm, while in one case
no change in the localization of the GRIM1 immunoreactivity was
detectable. Western blot analyses have shown that no decrease in
GRIM1 immunoreactivity in whole-cell extracts was detectable in any
of the three protocols.
EXAMPLE 15
[0196] Regulation of GRIM1 Function
[0197] Complete experimental results are not yet available on the
regulation of GRIM1 function and on signal pathways which might be
involved in the subcellular relocalization of GRIM1. No signal
pathway modulating the transcriptional activity of GRIM1 has yet
been unambiguously identifiable. Nor is there any experimental
evidence of involvement of a signal cascade intervening in the
GRIM1 relocalization process. The data available on this topic can
be briefly summarized below.
[0198] Transcriptional Activity:
[0199] In the direct vicinity of the NES within the GRIM1 sequence
there is a consensus PKC phosphorylation site (serine 308). The
transcriptional activity of Gal-GRIM1 was investigated in transient
transfections in various cell lines. Neither the treatment of the
cells with the specific PKC inhibitor Go6850 (Parke-Davis, formerly
Godecke AG) nor a stimulation with the PKC-activating agent TPA
brought about a change in the transcriptional activity.
[0200] The sole experimental evidence for modification of GRIM1 is
the detection of direct acetylation of GRIM1 (aa3-147 and of
full-length GRIM1) by the HAT domain of p300.
[0201] Cotransfection of full-length p300 and Gal-GRIM1 shows no
change in the repression pattern, and addition of trichostatin A
(TSA) enhances general transcription but relative repression by
Gal-GRIM1 remains unchanged.
[0202] Cotransfection of Gal-GRIM1 in combination with a
constitutively active version of Rho GTPase (RhoAV14) likewise
shows no change in the activity.
[0203] Relocalization:
[0204] Nuclear export of GFP-GRIM1 or of the mutant GFP-GRIM1
aa234-336 is not influenced by treatment of the cells with the
bacterial toxin leptomycin B (LMB, 20 nM for 6 h before cell
harvesting and analysis by IIF). LMB is a specific inhibitor of the
export factor exportin1/Crm1, so that it can be concluded from the
experiment that there must be an exportin-independent transport
mechanism.
[0205] 14-3-3 proteins are responsible for active removal of the
HDACs 4, 5 and 7 from the nucleus and retention in the cytoplasm
during muscle differentiation. It was possible by Co-IP studies to
rule out an interaction between GRIM1 and 14-3-3 proteins (family
members zeta, beta, tau).
[0206] Treatment of differentiating C2C12 skeletal muscle cells
with various inhibitors of central signal transduction pathways
have no effect on relocalization or the number of nuclear speckles
of GRIM1 within the investigated differentiation from day 0 to day
6 after addition of the differentiation medium. Tested inhibitors:
pan-HDAC inhibitors TSA and sodium butyrate, protein kinase
inhibitors Go6850 (specific PKC inhibitor, 1 .mu.M), LY294002
(specific P13kinase inhibitor, 10 .mu.M), Ro3l-8220 (PKC
broad-spectrum inhibitor, 0.5 .mu.M), PD98059 (MEK1 inhibitor, 10
.mu.M), fasudil (Rock kinase inhibitor, 10 .mu.M), SB203580
(p38.alpha./.beta. inhibitor). Overacetylation and blockade of
central signal cascades had drastic morphologically visible
consequences for some of the cells, but there was no effect on
relocalization or the number of detectable GRIM1 speckles.
EXAMPLE 16
[0207] Interaction of hsGRIM1 and the Androgen Receptor (AR) and
Functional Consequences Resulting therefrom
[0208] It was possible to demonstrate in a yeast two-hybrid
interaction assay that GRIM1 interacts with the nuclear hormone
receptor androgen receptor (AR). The GRIM1 interaction domain in
the yeast-based assay is located in the C terminus between aa 606
to 749. Further functional characterization of the interaction
domains within GRIM1 was carried out by pulldown analyses with the
expressible fragments of GRIM1. The interaction of GRIM1 with the
AR is independent of the type of ligand used (agonists, antagonists
or no ligands). The available data are summarized in the following
picture: TABLE-US-00023 TABLE 24 The GRIM1 AR interaction domain is
located between aa460-609. ##STR11## Regions of the protein
depicted in white have not yet been expressed recombinantly.
[0209] The C terminus of GRIM1 shows, in contrast to the data
obtained in yeast, no in vitro interaction with the AR, but the
fragment from aa 460 to aa 609 shows a strong interaction
independent of the ligand used.
[0210] The interaction between the AR and GRIM1 was likewise
detectable in in vivo interaction assays. In
co-immunoprecipitations, the interaction between GRIM1 and the AR
was likewise ligand-independent and could not be significantly
blocked even in washing buffer with 400 mM NaCl. It was possible to
show the interaction with IP studies using antibodies both against
the AR and against GRIM1.
[0211] The AR is a member of the superfamily of ligand-activatable
nuclear hormone receptors. After binding of the natural AR ligand
5.alpha.-dihydrotestosterone, the AR is transported from the
cytoplasm into the nucleus and brings about transcriptional
activation of its target genes. Besides the activating ligands,
each receptor has specific antagonists which likewise bring about
translocation into the nucleus, but bring about a repressive
conformation of the receptor ligand binding domain.
Antagonist-bound receptor is able to recruit corepressor complexes
and actively repress transcription. However, in some cell lines or
in certain pathological situations, antagonists may become partial
agonists and unnaturally bring about a transactivation by the
receptor. It is known in the case of the AR that substances
employed therapeutically for prostate carcinomas, such as
cyproterone acetate, Casodex or hydroxyflutamide, may have partial
agonistic effects in hormone-refractory tumors.
[0212] It was possible to show in the cell culture system that
GRIM1 is able specifically to repress an AR transactivation brought
about by CPA, Casodex or OH-flutamide. Non-liganded or DHT-loaded
AR is not transcriptionally affected by GRIM1. The observed effect
varies in the range between 2-3-fold repression of the CPA-induced
AR transactivation. TABLE-US-00024 TABLE 25 GRIM1 selectively
represses antagonist- activated AR. ##STR12## Transient
transfections were carried out in BHK cells in which the
antagonists used act as partial agonists. A minimal promoter with
synthetic AR binding sites was used as reporter
(ARE.sub.2x-(tk)TATA-LUC). The same effect was obtained on use of 1
.mu.M Casodex or OH-flutamide instead of 1 .mu.M CPA.
[0213] A further indication of the functionality of the effect is
that the superactivation of the CPA-loaded AR by the coactivator
TIF2 in BHK cells can be repressed or at least markedly restricted
by cotransfection of GRIM1. TABLE-US-00025 TABLE 26 GRIM1 represses
CPA-loaded and TIF2 superactivated androgen receptor. ##STR13##
Transient transfections were carried out in BHK cells in which CPA
(1 .mu.M) acts as partial agonist. ARE.sub.2x-(tk)TATA-LUC was used
as reporter.
[0214]
Sequence CWU 1
1
8 1 2250 DNA Homo sapiens 1 atggcagctg cggggagccg caagaggcgc
ctggcggagc tgacggtgga cgagttccta 60 gcttcgggct ttgactccga
gtccgaatcc gagtccgaaa attctccaca agcggagaca 120 cgggaagcac
gcgaggctgc ccggagtccg gataagccgg gcgggagccc ctcggccagc 180
cggcgtaaag gccgtgcctc tgagcacaaa gaccagctct ctcggctgaa ggacagagac
240 cccgagttct acaagttcct gcaggagaat gaccagagcc tgctaaactt
cagcgactcg 300 gacagctctg aggaggaaga ggggccgttc cactccctgc
cagatgtgct ggaggaagcc 360 agtgaggagg aggatggagc ggaggaagga
gaagatgggg acagagtccc cagagggctg 420 aaggggaaga agaattctgt
tcctgtgacc gtcgccatgg ttgagagatg gaagcaggca 480 gcaaagcaac
gcctcactcc aaagctgttc catgaagtgg tacaggcgtt ccgagcagct 540
gtggccacca cccgagggga ccaggaaagt gctgaggcca acaaattcca ggtcacggac
600 agtgctgcat tcaatgctct ggttaccttc tgcatcagag acctcattgg
ctgtctccag 660 aagctgctgt ttggaaaggt ggcaaaggat agcagcagga
tgctgcagcc gtccagcagc 720 ccgctctggg ggaagcttcg tgtggacatc
aaggcttacc tgggctcggc catacagctg 780 gtgtcctgtc tgtcggagac
gacggtgttg gcggccgtgc tgcggcacat cagcgtgctg 840 gtgccctgct
tcctgacctt ccccaagcag tgccgcatgc tgctcaagag aatggtggtc 900
gtatggagca ctggggagga gtctctgcgg gtgctggctt tcctggtcct cagcagagtc
960 tgccggcaca agaaggacac tttccttggc cccgtcctca agcaaatgta
catcacgtat 1020 gtgaggaact gcaagttcac ctcgcctggt gccctcccct
tcatcagttt catgcagtgg 1080 accttgacgg agctgctggc cctggagccg
ggtgtggcct accagcacgc cttcctctac 1140 atccgccagc tcgccataca
cctgcgcaac gccatgacca cccgcaagaa ggaaacatac 1200 cagtctgtgt
acaactggca gtatgtgcac tgcctcttcc tgtggtgccg ggtcctgagc 1260
actgcgggcc ccagcgaagc cctccagccc ttggtctacc cccttgccca agtcatcatt
1320 ggctgtatca agctcatccc cactgcccgc ttctacccgc tgcgaatgca
ctgcatccgt 1380 gccctgacgc tgctctcggg gagctcgggg gccttcatcc
cggtgctgcc tttcatcctg 1440 gagatgttcc agcaggtcga cttcaacagg
aagccagggc gcatgagctc caagcccatc 1500 aacttctccg tgatcctgaa
gctgtccaat gtcaacctgc aggagaaggc gtaccgggac 1560 ggcctggtgg
agcagctgta cgacctcacc ctggagtacc tgcacagcca ggcacactgc 1620
atcggcttcc cggagctggt gctgcctgtg gtcctgcagc tgaagtcgtt cctccgggag
1680 tgcaaggtgg ccaactactg ccggcaggtg cagcagctgc ttgggaaggt
tcaggagaac 1740 tcggcataca tctgcagccg ccgccagagg gtttccttcg
gcgtctctga gcagcaggca 1800 gtggaagcct gggagaagct gacccgggaa
gaggggacac ccttgacctt gtactacagc 1860 cactggcgca agctgcgtga
ccgggagatc cagctggaga tcagtggcaa agagcggctg 1920 gaagacctga
acttccctga gatcaaacga aggaagatgg ctgacaggaa ggatgaggac 1980
aggaagcaat ttaaagacct ctttgacctg aacagctctg aagaggacga caccgaggga
2040 ttctcggaga gagggatact gaggcccctg agcactcggc atggggtgga
agacgatgaa 2100 gaggacgagg aggagggcga ggaggacagc agcaactcgg
aggatggaga cccagacgca 2160 gaggcggggc tggcccctgg ggagctgcag
cagctggccc aggggccgga ggacgagctg 2220 gaggatctgc agctctcaga
ggacgactga 2250 2 2253 DNA Mus musculus 2 atggctgcgt ctcgcgctcc
ccgcaggcgc ctggaggacc tcagtgtgga cgagttcctg 60 gcttccggct
tcgagtccgg atccgagtcg gagctggagg gcgccgcgga ggcggcggcg 120
gaggagcgca gggcgcgagg agccgcgtgg aaccgggagc ggcggggcgc gcgcacctcc
180 ccgggccccg caggacgccc gcgtaagggc cgcgcctctg agcacaaaga
ccagctctct 240 cggctgaagg acagagaccc cgagttctac aagttcctgc
aggagaatga ccggagccta 300 ctggacttca gtgactcgga cagctctgcg
gaagaagaag agccattcca ctccctgcca 360 gacacgctgg aggaagcgag
cgaaacagag gaagacggag gagaggacag tgacgcgttg 420 cccagagggc
tgaggagcaa gaagaatgag cctgtacccg tgaccctcgc catggtggaa 480
aggtggaggc agggctccag gcaccacctt agtcccaggc tgttccatga agttgtacag
540 gcgttccgag cagctgtagc caccacccaa ggagagcagg aagctgctga
gacttgcagg 600 ttccaggttg cagatagtgc tgtgttcaat gctctggtta
ctttctgcat tcgagacctc 660 tgtggttgcc ttcagaagct gctgtttgga
aagacaccaa aggatagcaa taggctgctg 720 ccatccagta gcccactgtg
ggggaagctc cgtgtggatg tcaagacata cctaagtgcg 780 gtkgtgcagc
tggtagcctg tctagcggaa gccacagtgt ctgcagctgt cctgcagcat 840
atcagcagct tggttcctta cttcctgact ttcccgaagc agtgccgaat gctgctcaag
900 aggatggtgg ttctgtggag cacgggtgaa gagtctctgc gggtcctggc
cttcctggta 960 ctcatcagag tctgtcggca caagaaggaa gccttccttg
gtcccattct gaagcaaatg 1020 tacatcatgt atgtgagaaa ctgcaagttc
acctccccca gtaccctccc cctcataagc 1080 ttcatgcagc ggacactgac
tgaaatgctt gccttggacc ccagcgtctc ctatcagcac 1140 gccttcctct
acatccgcca gcttgccgtc cacctgcgga atgctatgac cacaggcaag 1200
aaggagacac accagtctgt gtacaactgg cagtatgtgc actgcctcta cctgtggtgt
1260 cgwgtcctga gtacccttgg ttccagtgag atcctgcagc cgctactcta
ccctctctca 1320 cagatcatca ttggctgtat caagttgttg cccactgctc
gattttatcc attgcgcatg 1380 cattgtgtac gtgccctgac actgctgtcc
cagaccatcg gcaccttcat acctgtcctg 1440 cccttcattc tygagatttt
ccagcaggtg gacttcaata ggcggccagg tcgcatgagc 1500 tccaagccca
tcaacttctc tgtgatcttg aagctgtcca gcaccaacct gcaggagaag 1560
gcgtaccggg acgggctgct ggaacagctg tgtgacctta ctctggaata cctgcacagc
1620 caggcccaca gcatcgcttt cccagagttg gtgttgccta ctgttctaca
gctgaaatct 1680 tttctccggg agtgcaaagt ggctaactac tgccggcagg
tgcgccactm gctggagaaa 1740 gtgcaagaga atgcacaaca tatccaaagt
cttcgacaga gcgcgacctt cagcgtgtct 1800 gaccggacgg cagtggatgc
gtgggagaag caggttygtg aagaggggac cccactcacc 1860 agatactacg
gccactggaa gaagctgagg gaccgtgaaa tccagctgga aatcagtggc 1920
aaagagcggc tagaagacct gaacttccca gagatcaaaa ggcggaaggt ggaagacagg
1980 aaggatgaag acaggaaaga attaaaggac ctgtttgagt tggacagttc
tgagggcgag 2040 gacagcaccg acttctttga gagaggagta cctaggctcc
cagaagctca ccaaggactg 2100 aaagaagatc aggaagaaga agataaagaa
gaaggtgaca gcgattcaga ggatggagac 2160 acagacacgg gagtggatct
gagcgaactg tggcagctgg ctcagggacc acaagatgag 2220 ctggaggatc
ttcagctctc agaagaggac tga 2253 3 749 PRT Homo sapiens 3 Met Ala Ala
Ala Gly Ser Arg Lys Arg Arg Leu Ala Glu Leu Thr Val 1 5 10 15 Asp
Glu Phe Leu Ala Ser Gly Phe Asp Ser Glu Ser Glu Ser Glu Ser 20 25
30 Glu Asn Ser Pro Gln Ala Glu Thr Arg Glu Ala Arg Glu Ala Ala Arg
35 40 45 Ser Pro Asp Lys Pro Gly Gly Ser Pro Ser Ala Ser Arg Arg
Lys Gly 50 55 60 Arg Ala Ser Glu His Lys Asp Gln Leu Ser Arg Leu
Lys Asp Arg Asp 65 70 75 80 Pro Glu Phe Tyr Lys Phe Leu Gln Glu Asn
Asp Gln Ser Leu Leu Asn 85 90 95 Phe Ser Asp Ser Asp Ser Ser Glu
Glu Glu Glu Gly Pro Phe His Ser 100 105 110 Leu Pro Asp Val Leu Glu
Glu Ala Ser Glu Glu Glu Asp Gly Ala Glu 115 120 125 Glu Gly Glu Asp
Gly Asp Arg Val Pro Arg Gly Leu Lys Gly Lys Lys 130 135 140 Asn Ser
Val Pro Val Thr Val Ala Met Val Glu Arg Trp Lys Gln Ala 145 150 155
160 Ala Lys Gln Arg Leu Thr Pro Lys Leu Phe His Glu Val Val Gln Ala
165 170 175 Phe Arg Ala Ala Val Ala Thr Thr Arg Gly Asp Gln Glu Ser
Ala Glu 180 185 190 Ala Asn Lys Phe Gln Val Thr Asp Ser Ala Ala Phe
Asn Ala Leu Val 195 200 205 Thr Phe Cys Ile Arg Asp Leu Ile Gly Cys
Leu Gln Lys Leu Leu Phe 210 215 220 Gly Lys Val Ala Lys Asp Ser Ser
Arg Met Leu Gln Pro Ser Ser Ser 225 230 235 240 Pro Leu Trp Gly Lys
Leu Arg Val Asp Ile Lys Ala Tyr Leu Gly Ser 245 250 255 Ala Ile Gln
Leu Val Ser Cys Leu Ser Glu Thr Thr Val Leu Ala Ala 260 265 270 Val
Leu Arg His Ile Ser Val Leu Val Pro Cys Phe Leu Thr Phe Pro 275 280
285 Lys Gln Cys Arg Met Leu Leu Lys Arg Met Val Val Val Trp Ser Thr
290 295 300 Gly Glu Glu Ser Leu Arg Val Leu Ala Phe Leu Val Leu Ser
Arg Val 305 310 315 320 Cys Arg His Lys Lys Asp Thr Phe Leu Gly Pro
Val Leu Lys Gln Met 325 330 335 Tyr Ile Thr Tyr Val Arg Asn Cys Lys
Phe Thr Ser Pro Gly Ala Leu 340 345 350 Pro Phe Ile Ser Phe Met Gln
Trp Thr Leu Thr Glu Leu Leu Ala Leu 355 360 365 Glu Pro Gly Val Ala
Tyr Gln His Ala Phe Leu Tyr Ile Arg Gln Leu 370 375 380 Ala Ile His
Leu Arg Asn Ala Met Thr Thr Arg Lys Lys Glu Thr Tyr 385 390 395 400
Gln Ser Val Tyr Asn Trp Gln Tyr Val His Cys Leu Phe Leu Trp Cys 405
410 415 Arg Val Leu Ser Thr Ala Gly Pro Ser Glu Ala Leu Gln Pro Leu
Val 420 425 430 Tyr Pro Leu Ala Gln Val Ile Ile Gly Cys Ile Lys Leu
Ile Pro Thr 435 440 445 Ala Arg Phe Tyr Pro Leu Arg Met His Cys Ile
Arg Ala Leu Thr Leu 450 455 460 Leu Ser Gly Ser Ser Gly Ala Phe Ile
Pro Val Leu Pro Phe Ile Leu 465 470 475 480 Glu Met Phe Gln Gln Val
Asp Phe Asn Arg Lys Pro Gly Arg Met Ser 485 490 495 Ser Lys Pro Ile
Asn Phe Ser Val Ile Leu Lys Leu Ser Asn Val Asn 500 505 510 Leu Gln
Glu Lys Ala Tyr Arg Asp Gly Leu Val Glu Gln Leu Tyr Asp 515 520 525
Leu Thr Leu Glu Tyr Leu His Ser Gln Ala His Cys Ile Gly Phe Pro 530
535 540 Glu Leu Val Leu Pro Val Val Leu Gln Leu Lys Ser Phe Leu Arg
Glu 545 550 555 560 Cys Lys Val Ala Asn Tyr Cys Arg Gln Val Gln Gln
Leu Leu Gly Lys 565 570 575 Val Gln Glu Asn Ser Ala Tyr Ile Cys Ser
Arg Arg Gln Arg Val Ser 580 585 590 Phe Gly Val Ser Glu Gln Gln Ala
Val Glu Ala Trp Glu Lys Leu Thr 595 600 605 Arg Glu Glu Gly Thr Pro
Leu Thr Leu Tyr Tyr Ser His Trp Arg Lys 610 615 620 Leu Arg Asp Arg
Glu Ile Gln Leu Glu Ile Ser Gly Lys Glu Arg Leu 625 630 635 640 Glu
Asp Leu Asn Phe Pro Glu Ile Lys Arg Arg Lys Met Ala Asp Arg 645 650
655 Lys Asp Glu Asp Arg Lys Gln Phe Lys Asp Leu Phe Asp Leu Asn Ser
660 665 670 Ser Glu Glu Asp Asp Thr Glu Gly Phe Ser Glu Arg Gly Ile
Leu Arg 675 680 685 Pro Leu Ser Thr Arg His Gly Val Glu Asp Asp Glu
Glu Asp Glu Glu 690 695 700 Glu Gly Glu Glu Asp Ser Ser Asn Ser Glu
Asp Gly Asp Pro Asp Ala 705 710 715 720 Glu Ala Gly Leu Ala Pro Gly
Glu Leu Gln Gln Leu Ala Gln Gly Pro 725 730 735 Glu Asp Glu Leu Glu
Asp Leu Gln Leu Ser Glu Asp Asp 740 745 4 750 PRT Mus musculus
MISC_FEATURE (577)..(577) The 'Xaa' at location 577 stands for Ser.
MISC_FEATURE (613)..(613) The 'Xaa' at location 613 stands for Arg,
or Cys. 4 Met Ala Ala Ser Arg Ala Pro Arg Arg Arg Leu Glu Asp Leu
Ser Val 1 5 10 15 Asp Glu Phe Leu Ala Ser Gly Phe Glu Ser Gly Ser
Glu Ser Glu Leu 20 25 30 Glu Gly Ala Ala Glu Ala Ala Ala Glu Glu
Arg Arg Ala Arg Gly Ala 35 40 45 Ala Trp Asn Arg Glu Arg Arg Gly
Ala Arg Thr Ser Pro Gly Pro Ala 50 55 60 Gly Arg Pro Arg Lys Gly
Arg Ala Ser Glu His Lys Asp Gln Leu Ser 65 70 75 80 Arg Leu Lys Asp
Arg Asp Pro Glu Phe Tyr Lys Phe Leu Gln Glu Asn 85 90 95 Asp Arg
Ser Leu Leu Asp Phe Ser Asp Ser Asp Ser Ser Ala Glu Glu 100 105 110
Glu Glu Pro Phe His Ser Leu Pro Asp Thr Leu Glu Glu Ala Ser Glu 115
120 125 Thr Glu Glu Asp Gly Gly Glu Asp Ser Asp Ala Leu Pro Arg Gly
Leu 130 135 140 Arg Ser Lys Lys Asn Glu Pro Val Pro Val Thr Leu Ala
Met Val Glu 145 150 155 160 Arg Trp Arg Gln Gly Ser Arg His His Leu
Ser Pro Arg Leu Phe His 165 170 175 Glu Val Val Gln Ala Phe Arg Ala
Ala Val Ala Thr Thr Gln Gly Glu 180 185 190 Gln Glu Ala Ala Glu Thr
Cys Arg Phe Gln Val Ala Asp Ser Ala Val 195 200 205 Phe Asn Ala Leu
Val Thr Phe Cys Ile Arg Asp Leu Cys Gly Cys Leu 210 215 220 Gln Lys
Leu Leu Phe Gly Lys Thr Pro Lys Asp Ser Asn Arg Leu Leu 225 230 235
240 Pro Ser Ser Ser Pro Leu Trp Gly Lys Leu Arg Val Asp Val Lys Thr
245 250 255 Tyr Leu Ser Ala Val Val Gln Leu Val Ala Cys Leu Ala Glu
Ala Thr 260 265 270 Val Ser Ala Ala Val Leu Gln His Ile Ser Ser Leu
Val Pro Tyr Phe 275 280 285 Leu Thr Phe Pro Lys Gln Cys Arg Met Leu
Leu Lys Arg Met Val Val 290 295 300 Leu Trp Ser Thr Gly Glu Glu Ser
Leu Arg Val Leu Ala Phe Leu Val 305 310 315 320 Leu Ile Arg Val Cys
Arg His Lys Lys Glu Ala Phe Leu Gly Pro Ile 325 330 335 Leu Lys Gln
Met Tyr Ile Met Tyr Val Arg Asn Cys Lys Phe Thr Ser 340 345 350 Pro
Ser Thr Leu Pro Leu Ile Ser Phe Met Gln Arg Thr Leu Thr Glu 355 360
365 Met Leu Ala Leu Asp Pro Ser Val Ser Tyr Gln His Ala Phe Leu Tyr
370 375 380 Ile Arg Gln Leu Ala Val His Leu Arg Asn Ala Met Thr Thr
Gly Lys 385 390 395 400 Lys Glu Thr His Gln Ser Val Tyr Asn Trp Gln
Tyr Val His Cys Leu 405 410 415 Tyr Leu Trp Cys Arg Val Leu Ser Thr
Leu Gly Ser Ser Glu Ile Leu 420 425 430 Gln Pro Leu Leu Tyr Pro Leu
Ser Gln Ile Ile Ile Gly Cys Ile Lys 435 440 445 Leu Leu Pro Thr Ala
Arg Phe Tyr Pro Leu Arg Met His Cys Val Arg 450 455 460 Ala Leu Thr
Leu Leu Ser Gln Thr Ile Gly Thr Phe Ile Pro Val Leu 465 470 475 480
Pro Phe Ile Leu Glu Ile Phe Gln Gln Val Asp Phe Asn Arg Arg Pro 485
490 495 Gly Arg Met Ser Ser Lys Pro Ile Asn Phe Ser Val Ile Leu Lys
Leu 500 505 510 Ser Ser Thr Asn Leu Gln Glu Lys Ala Tyr Arg Asp Gly
Leu Leu Glu 515 520 525 Gln Leu Cys Asp Leu Thr Leu Glu Tyr Leu His
Ser Gln Ala His Ser 530 535 540 Ile Ala Phe Pro Glu Leu Val Leu Pro
Thr Val Leu Gln Leu Lys Ser 545 550 555 560 Phe Leu Arg Glu Cys Lys
Val Ala Asn Tyr Cys Arg Gln Val Arg His 565 570 575 Xaa Leu Glu Lys
Val Gln Glu Asn Ala Gln His Ile Gln Ser Leu Arg 580 585 590 Gln Ser
Ala Thr Phe Ser Val Ser Asp Arg Thr Ala Val Asp Ala Trp 595 600 605
Glu Lys Gln Val Xaa Glu Glu Gly Thr Pro Leu Thr Arg Tyr Tyr Gly 610
615 620 His Trp Lys Lys Leu Arg Asp Arg Glu Ile Gln Leu Glu Ile Ser
Gly 625 630 635 640 Lys Glu Arg Leu Glu Asp Leu Asn Phe Pro Glu Ile
Lys Arg Arg Lys 645 650 655 Val Glu Asp Arg Lys Asp Glu Asp Arg Lys
Glu Leu Lys Asp Leu Phe 660 665 670 Glu Leu Asp Ser Ser Glu Gly Glu
Asp Ser Thr Asp Phe Phe Glu Arg 675 680 685 Gly Val Pro Arg Leu Pro
Glu Ala His Gln Gly Leu Lys Glu Asp Gln 690 695 700 Glu Glu Glu Asp
Lys Glu Glu Gly Asp Ser Asp Ser Glu Asp Gly Asp 705 710 715 720 Thr
Asp Thr Gly Val Asp Leu Ser Glu Leu Trp Gln Leu Ala Gln Gly 725 730
735 Pro Gln Asp Glu Leu Glu Asp Leu Gln Leu Ser Glu Glu Asp 740 745
750 5 15105 DNA Homo sapiens 5 ggcggcggaa gtgcgcagcc gcgcggcatt
ctggggccgg aagtggggtg cacgcttcgg 60 gttggtgtca tggcagctgc
ggggagccgc aagaggtaag ccgcgggtcc gagggccgat 120 ttggcctccc
ggtgggtgtc tgtatccaag ggggcttttc ttgctcctct caacggggct 180
tgggccaact tggccttccg gccactttga cttcttcctt aaccgcaggc gcctggcgga
240 gctgacggtg gacgagttcc tagcttcggg ctttgactcc gagtccgaat
ccgagtccga 300 aaattctcca caagcggaga cacgggaagc acgcgaggct
gcccggagtc cggataagcc 360 gggcgggagc ccctcggcca ggttagtggg
gacatgcgtg agcagaacct cttgctcgtc 420 ctgtcccacc ccggttgggt
tgctgacctc gcctggggtc tcagggccaa ggcctaagga 480 tcgccgacct
ctgcccccca gtcctttcag cttaacttac tctttcatcc gatgtttact 540
gagctcctgt tccgtgccaa actctgtcct aggcagggag cgtatgttgg tgagttgtgc
600 aaaaatctct tcccgatcta ggagaactct cacgttcttg ggtgaaatca
ccgcctgcag 660 ttcggtagaa gagagataag tgctggggag ggaaaggaag
gacagagggg ctgaagagag 720 agttgcaggg gagccgagtt gctctgttag
aaggagcgga ggctgtgctc ggtggctcac 780 gcctgtaatg ccagcacttt
gggaggccga ggcgggcaga ttacctgagg tcaggaattc 840 gagaccagcc
tgaccaacac ggaaaaatca tgtctctact aaaaatataa aaattagccg 900
ggcgtggtgg gcacctgtaa tcccaactac tcgggaggct gaggcaggag aatcgcttga
960 acttgggagg cggaggttgc agtgagccga gatggcacca ttgcattcca
gcctgggtga 1020 gagagtgaga
ctccatctca aaaagaaaaa gaaagaaatc aaaagataca aaaattagcc 1080
tggcgtggtg acacacacct gtcgtcccag ctactcggga ggctgaggca ggagaatcgc
1140 ttgaacccgg gaggcggagg ttgcggtgag tggagattgc gccactgcac
tccagcctgg 1200 gcgacagagt gagattccgc ctcaggagaa aaaaaaaaaa
aaaaaaaagc ggaggtggtt 1260 gtttgagaag atgacgtctg aataaagacc
tcgaacaagg aggccacaca gacagtaggg 1320 gaaacgtgtc ttaggatttc
aggcagcaga aaacagcaaa agccttgcag tgggagcatt 1380 tggggagagt
acagaggtaa acgtggctac ggcaggagtg agggggtgaa taggggtaga 1440
tgagcccaga agggtgatga agagccaggt gccgtagacc gtcataagat tgcctcgtaa
1500 aacagggagt tacttctggg tttgagcggt ggagtgcctt gtaggtgaca
agggtggaaa 1560 cagagagacc tttaggttaa ctgagaattg aactatgtgg
tggtttgaga ggaggcacga 1620 acggttttga aggtgttgct gatgagatct
gatgacaagt tgcgtgctgg gtattagaga 1680 cagagatttg cctttagtct
gagcctctgg aagggtgaag ccaccatcag gcaaaagtag 1740 cgcagggcac
aggcacagca agtccaggga agaccgcggc tgggggcctg tgtagcgcag 1800
ggcgcaggca cagcaagtcc agggaagacc gcggctgggg ggcctgtgta gtgtgtttga
1860 gggtatgttg tagaggttgt tgggtaggag acgtgagtct gccaaaaacg
tacctcctgg 1920 caatcctgtc ttgctagttg gtaacccttg tgaagggagt
agactgaccc tgtaggccac 1980 tccccgcccc ctctacttcc acttcccact
ggggtcgctg acttctgcct cctcagccgg 2040 cgtaaaggcc gtgcctctga
gcacaaagac cagctctctc ggctgaagga cagagacccc 2100 gagttctaca
agttcctgca ggagaatgac cagagcctgc taaacttcag cgactcggac 2160
agctctgagg aggaagaggg gccgttccac tccctgccag atgtgctgga ggtgagggcg
2220 tgggccaaac cagaaggggg gcacttgtct ctacactctc cttcagctca
gcctttctgt 2280 gcaggaagcc agtgaggagg aggatggagc ggaggaagga
gaagatgggg acagagtccc 2340 cagagggctg aaggggaaga agaattctgt
tcctgtgacc gtcgccatgg ttgagagatg 2400 gaagcaggca gcaaaggtga
gcagcagcca ggggcgggca gctgggtgcc caggcagaaa 2460 tctggccttg
cctcacctga ggtgagcagc agcgaggggc gggcagctgg gtgcccaggc 2520
agagatctgg ccttgcctca cctgaggcgc ctgaggctgt gctggtggga ggggctgttc
2580 tccacgcagg ggacgcttgg agccttcttc agcgcaagtg acaacacgtt
gctcctcctg 2640 cttgcttggt tgccccagga ctgtgtgttc acttttgggt
agattccagg ggctctgttg 2700 gcgccccact ccccaaagtc agtcccgctg
tgggtgggac ggactgtgcc tttgttggtt 2760 ggtggagctg gggtccttct
gagcctctca cagtgttttt ttccagggac aaggatacgg 2820 agagctccag
ataccacctg gaggtggcca tagtccagga tctggaactc ccagtccttt 2880
tcctggggct cctgagccag actccctcct cttcccagag aattctagac tttgtttcca
2940 tttttgttat cagtatgggg ctctgtgcct ccccccaacc tctgccctat
gtctgagggt 3000 gagggtgagg gtgcctttct ctggggctgc cgtttccact
ctgccaagtg cagtctcagc 3060 tcccctgacg cccctggtac tcttgctcct
tcagcaacgc ctcactccaa agctgttcca 3120 tgaagtggta caggcgttcc
gagcagctgt ggccaccacc cgaggggacc aggaaagtgc 3180 tgaggccaac
aaattccagg tcacggacag tgctggtgag cttgggggga gcctggcatc 3240
caggctgtct gttgcgttct ctgtcccgtg agtacatcca ggccttttcg ttgcagcatt
3300 caatgctctg gttaccttct gcatcagaga cctcattggc tgtctccaga
agctgctgtt 3360 tggaaaggtg gcaaaggata gcagcaggta agaggggagg
gggtgaaggg gggtagggtg 3420 gaaggtgggt cgggaccaca cacaggagaa
gccagaggcc ttgtggctag gacagagaca 3480 tggcaacaga gccagcgtct
tcttggggac cctgagaagg cagcggggca cgagggaccg 3540 ttgggagagg
ctgggcactt ggggccagtt ggagcggccc caccttcttt ccttcctgaa 3600
aagggaagtt gtctgctcag agtctcagaa ggtgtggcgt gtgcagctgc ctttgctctc
3660 agattttccc aggttagggt tcttgctcta tttccctttg taagaacttg
taagtcctgt 3720 ccatactcct tgatctctgc actatgactg gcctgtttcc
tggagctgtg tgtgtctgta 3780 gacaggggcc tcatcaatgc tgagaacctg
agggcctcaa cttccccttt agagaaagat 3840 tgggcagaga aggccgtgga
aggttttcag agaagagaga cgcacggcca agcatttggt 3900 agagcatgga
gcacgttggg gttccggggc tgtgcagcca tgtgaccttg aggcagtgga 3960
ggcctagagt tggaactgcc ccaggacacc ttcaaaggaa gaaaggagac ccacaggaga
4020 ggtctcgccg acagcggctc aggagacagg ggcactgttg ctggtggtag
tggtcggggc 4080 tggacggata gaacagagcc atgtccacat ggaggcagtg
ctgtccgaag gggacaagga 4140 agctggcaga tgggggtgga gggacaggca
gcgattggga ggcagcacgc actgcagcct 4200 gagccaccag gtcagggacc
cctgtcgagg cagcagctgt accggaggga gtcaggccgt 4260 ggggagcaga
gctgcctcct gggtctgggt gggcagctgc aggtgcccag cagccagtgt 4320
cagcacaggt gtgaggaggg tcgaggacgg ggggcctttc tggggccctg ccctttgtcc
4380 ttcccatgct tcagcagatt ttagtgagcg cctactgcgt gggctctggt
aaatgttttg 4440 ttttgttttg ttttgtttgt ttggtttttg gttttttttt
ttccttacga gataggatct 4500 tgctctgtcg cccaggctgg aatgcagtga
tcatagctca ctgcgacacg taccttctgg 4560 attcaagtga ttctctcacc
ttagcctccc aagtaactgg gactccgcca ccaggcctgg 4620 ataatatttt
gtatattttt gtagagacgg ggtctcactg tgttgttcag gctggtctca 4680
aactcctggg cttggagttt gatcctcctg ccttggcctc ccaaagtgca gtgagtactg
4740 gcatgagtct ccacacctgc cctctgattt tcttttcttt gtgcagcaaa
tgtgggccaa 4800 aggaaaccag tctgcggcag ttggtggtgc ctgggcgacc
ccagaggtgc tgagtaccag 4860 ctgctgggct tagggacctc gtgtggtctc
acggtgggga tgtgatcaag gggacctaga 4920 aaaggtttat gtctgagagg
gagttggagg ctgggacttc cggggactct tagggcggtg 4980 gccaaccctg
ggcagggcag acaggagttc aaggccactg aaggaggagc taatgcactt 5040
gagagggtcc tcctaagccc ctgtgtctgt ccagctgtaa ggggccctga gcttttttga
5100 gtggagagac ggggggtctc tgcagagtcg tagaggctat gctggccagg
gcacgcgcca 5160 acatgctgag cagcctcgcg tccgagccgt ggggcctccc
agccaggggt gggtggtctc 5220 tctgcaggat gctgcagccg tccagcagcc
cgctctgggg gaagcttcgt gtggacatca 5280 aggcttacct gggctcggcc
atacaggtgc tattctggtg gggagggcac gggggcctgg 5340 ctgtatctgg
tggtcggtcc ttttttgtat cccagaatac atgggtttgg ggcttcactg 5400
tcccctcctg cccccagctg gtgtcctgtc tgtcggagac gacggtgttg gcggccgtgc
5460 tgcggcacat cagcgtgctg gtgccctgct tcctgacctt ccccaagcag
tgccgcatgc 5520 tgctcaaggt tcgtggccca gtcccctccc tgtgtctgtc
atggggtcgg ggggccacac 5580 ggctaccccc accacatccc actcctggcc
agggcacagg tggtgcccac actccactgg 5640 ctccgcttgg ctagagaggc
cacagaagca cctggccccc acccccacct ggggtttctg 5700 tctcaggcag
tccctgcctg cccgggcagc gcggctcagt ccgtgagcaa agccacggtg 5760
aggtatcttc ttcccggtgt gatctcacat acgtcggcgt gtctgacgtg gtaaactttt
5820 gtggtcctct tctcacaagg agcaacacac ctgttctgtt cacttctgta
actgagcacc 5880 taactcacgg ctcccccgag gtgctttggg aagagctggt
ggccatggga gcctttgcct 5940 ggctggggag ggctgtgctg gccctggggc
atccctgctg aggaggctgg ggggccacca 6000 gtgacgtctg accttctgca
gagaatggtg atcgtatgga gcactgggga agagtctctg 6060 cgggtgctgg
ctttcctggt cctcagcaga gtctgccggc acaagaagga cactttcctt 6120
ggccccgtcc tcaaggtagt ggtgggccct gcgtctgtgt ccctcagcat ctgcattgga
6180 aatctcggcc taagggcagg gcgggctgcc tttgtggttg gtgcccctca
ctggaccctc 6240 actgcagctc tgagcagacc tgggcccttg gatcacgaat
gtctcacaga gcacctgggg 6300 gtggtgggca gggaagagag ccctcggcct
tctcagggcc ccacctgacc ctgcttcaca 6360 cagcttcccc aggggagggc
ctctctggct ggaggaggac actgggtgtt gggacctgag 6420 gccatggcca
gggtacagtc ctactgcccg tccccaagcc atgggtggcc tgcatgtggg 6480
gaccctcagc ccccagaggg gcccagtgtg cagcaggagc ttctgcccca gcttctccca
6540 ggcctgaggc tggtgggcac ttgggggtgg gggctgtgca aggggctacg
gctcttcctc 6600 gaggcccagc tctgagggaa aggcccaggt gttcacaggg
gccctggagt gggcggtgga 6660 ggtgcatggc cctgatccca ggtggctctg
acccgggtct ctccgcagca aatgtacatc 6720 acgtatgtga ggaactgcaa
gttcacctcg cctggtgccc tccccttcat cagtttcatg 6780 cagtggacct
tgacggagct gctggccctg gagccgggtg tggcctacca gcacgccttc 6840
ctctacatcc gccagctcgc catacacctg cgcaacgcca tgaccactcg caagaaggtg
6900 tgtggtgggg cccttccagg ctcatgctgg gcatggggtg gggcagccca
ggtgcccgac 6960 ccaaggcagg gcctggggcc tccccgaagc ccctgtctgg
agacagccca gcaccctggt 7020 gcagtcggtc cttgcaggtg ggggaagggt
ggatgggttg agaccccgtg tgcaagatga 7080 ggaaatgatt cctgtgccgg
cccaggagga acgtgcatca gcctgacttg tcagcctggc 7140 cagtagctga
cgtggttctc tctgaccagg aaacatacca gtctgtgtac aactggcagt 7200
atgtgcactg cctcttcctg tggtgccggg tcctgagcac tgcgggcccc agcgaagccc
7260 tccagccctt ggtctacccc cttgcccaag tcatcattgg ctgtatcaag
tgagttgtgg 7320 ggtgggcagg gttgtgcggg agggtcagga gaagcaaatt
tgctgggact gtgtgggcgg 7380 tgcctctggc gttcagggcc ttggggcctg
agtctgtgtc ctggccgtcc ctgaggaagg 7440 gctggggtcc ctgtacctgc
tacggggaga tgctctggat tctggagagc tagggctggt 7500 gggcacctgt
gacatgagct cctccaacag cggtttagcc gcctcgggtg ccacccagcg 7560
tgtgttctgg gggcttgtgt gcagtttgca gtgagtttgg ttcattacgt ggggttcttg
7620 ggtggagcac atctgatgca gtgaactgca ttttgggtgt gagcgcttag
gagggtccag 7680 gcacggtagg ggctgcagca ggaaaaggtg ggagcagtac
tgtggcctct ttggccagga 7740 gggggcatct gtctggctga gccttagaaa
ctcaaggctg ggaagggagg tgggaagtcc 7800 aagggaagaa gtaggaaagg
cgggaacagg gaggagagca ggcaggaaca gatgacccct 7860 gccaccgtgt
tgaaataaag ctgaaagctg ggtaaggtac ctgcagcccc atagctgggc 7920
aaggggtgca ggtccctgcg gttcagacgt gccttgtcct gctttagggg tctgatgtcg
7980 gtgagtgggg gaggaggtcc aagacagcag ggggaggggc aggggctgcc
agagccgggg 8040 cctctgctca ctcggccttc ccacccccag gctcatcccc
actgcccgct tctacccgct 8100 gcgaatgcac tgcatccgtg ccctgacgct
gctctcgggg agctcggggg ccttcatccc 8160 ggtgctgcct ttcatcctgg
aggtgagtga ggctgtggtg ggcgtgtggc acctctgcct 8220 gctcctgtag
ggagcatctg ctgctccggg cgtctgtgct gagttgtccg gcgacttccc 8280
ggagccctgg ccgcctcctt gtcacgggtg tcacgaggac agtgctcctc cgtgtgctgg
8340 gggcagtgtt ggggaacgtg ggaccactgg ggatgaaggc ggctgctgct
agggctgtgc 8400 ttgaggatgc cgggacctga ccctgtaggt gcttgcccaa
gactggagta ggcaggaggc 8460 gagatgggcc aaaacccaag tctgattgct
gaactgtaca ctgaacagtg ccctgccctg 8520 acagttgtgt gcatgtgtat
gcatttgtat ggaggatgtg tgtacatctg ggtgggtgcg 8580 tttgtctgtg
catgcatgcg tgtgtgtgca ccatctgggt gcatgggtct gtaaatctgt 8640
gtgtgtgtgc atatcagttc atgtgtgcat ctgtatgtgt gtatgcacgt gtatccatga
8700 atgcctgtgt gcctgcaggt gtgtgcatct gtgcgtgtgt acacctgtgt
gtatgcatgt 8760 gtgtaccttt gcgtgtacct ttgcgtgtgt gcacctgtgc
atgtgtcttt gtataccagt 8820 gtgtacctgt gtgtacctgt atgcatgcac
atgcgtgtgt acctgtgtgc acctgtctgc 8880 atgtgtgtac ctgtgcgtgt
gtgcacctgt gtgcatgcat ttgcgtctgc atgtgtctac 8940 ctgtgcatgc
atgaaccttt gcatgtgtgc atctgtgtgc atgcatgtga tgtgtgtctg 9000
cgtgcatcta tgtacctgtg tgcacctatg tgcatacaca cgtgtctgtg tgcacttgtg
9060 tgcatgcatt tgcatctgca tgtgtgtact gtgcgtgtgt acctgtgcgt
gtacctgtac 9120 acttgggtgc atgcatgcac gtctgcttgt gtgtgcctgt
gcgtgtgtgc acctgtgtgc 9180 atgtacgtgc gtctacgtgt gtgcctgtgc
atgtgtgcgc ctgtgtgtac ctgtgtgtat 9240 gcatctttgc acgtgcacat
gccactcagg tagggaaaga ttggagtccg aagtttcaac 9300 tttcagtggg
tgggttaggc cccaccccgc tgtaaattta ggaattcacg atctccaccc 9360
tgtttatcta atgagttctc agcctcatga aggcccagag tcgtgtcaca gctgtccttg
9420 gggctgggtc ccaggttgct gggtccagaa ggtatggaag ccccaggcac
gttctgattc 9480 cccttccact gaggcaggga tgctgaacat ctttaggaag
ccatgttcac tcccatggca 9540 gccagcagtg gtctctgatt gcccagccct
tggcctggcc cctgtgtctg tgggcctcca 9600 gctctgctgc ccagctccag
gcatgctttg tgtctgtttc ccttgtccaa tctccttggc 9660 tacgtgcttt
cttactctct tgcagtgtct gtttcttcac ttgtgcactg ccctggttca 9720
ctgcagccgc accctgtagg cccctctcac gcagggatgc aggcctctcc tctccggaaa
9780 aagcaaaccc taaaagctaa aacaaagccc tcagctgtag gccgtgcctg
cccttccccg 9840 gtgcctggac aggaagccag tcgcctgccc atacttttgg
cccaggctag agaagggcag 9900 tgtcctccca gaggttcatc agtaccaggg
cgttttccca tctggacctg agctcagctg 9960 tctggcagcc acccctgctg
agtggggtgt cttgctgggg cctccaccct tgggcccccc 10020 ataatctgct
tctgtcctct ggtgccccag catgtaccct ggatctctct ggttcacagc 10080
ctgagggctc ctagtggttg gggaggggtc acaagactga gaggccaggc tgactctttc
10140 tctgctcctc ctggcatgtc ctacggaggt gcatggcctg tggcttctgt
ggagggtgtg 10200 ggaggggccc cccaggcctc ccgtgacctc catctgtccc
gtcctgtgtc tggcactctt 10260 tgctgttgct gctgcgtctt ctggttgctc
gggacggagc cccatgtggc attgctgtgc 10320 tgagggccag gatgggcctc
agtgccatgt tgtcaggaat gggggctgtc ctggtactct 10380 gtgtggcagg
gacctctagg tctccagacg tgggtcctta gtgcttccca ggattttggg 10440
agagggcccg tgttcctgat ccttccctgc tgatcagagc cccactcggg gacacgccag
10500 gctgtgtggg gccatggggc tgggaccgtg cctagctgct tatctcttgt
ttcgggttgg 10560 gtctcctcgt gctgaagcct gaggaccagg gtgaccaggg
tgcagccagg tgcagggcca 10620 aagggaccag ggggaccagg gtgcagccag
ggtgaagcca gggtgaccag gcatggggcc 10680 gagagagcct gacactggcc
cttggggcag atgttccagc aggtcgactt caacaggaag 10740 ccagggcgca
tgagctccaa gcccatcaac ttctccgtga tcctgaagct gtccaatgtc 10800
aacctgcagg agaaggcgta ccgggtgagg ctggctcctg gggagggcct gggcagttcc
10860 cagggtgggg ttgggggtgc tgaggtggat gggagggggc tggcatcctc
caagttcaag 10920 catggacctt catggtctcc cagggctggg gcatggagcc
ccttccttgc agtccgtgcc 10980 ctggagatgg cgctgccctg acagcctgag
gggaaggggc ctgagtgcag ccccagcctc 11040 tccctgatgc actcggccct
ctctcctgct cttcaggacg gcctggtgga gcagctgtac 11100 gacctcaccc
tggagtacct gcacagccag gcacactgca tcggcttccc ggagctggtg 11160
ctgcctgtgg tcctgcaggt gtgtgtcctg cccacaccgg ctcgtggccg actcagactg
11220 tcttataacg ggcttggctg ccccaggctg gtaggaggtg cccttgtccc
gggtacctgg 11280 gactggggtg gaacctgact gctggcaacg cgccccgtga
ggcccccttg gaggggctga 11340 tgaggggttc taagccattc aggaggtttg
ggggcaggcc tgggcagtgg gctgaggacc 11400 ctgggagcac aacagcctcc
tccccaggtg ggaaccacaa ggctgatctt tgcttcgggg 11460 ttgggactga
gcatgccgac cctggctcag gctggtacgc tgatcgcaca cttccccagg 11520
ccatcccggg tgtggggagt gggtggagtg gcttctcagg tggcaggaag gcctggcctg
11580 ccccgccaaa taccccacat cagcctcata ggaaggccca gcctgccccg
ctaaataccc 11640 cacgtcagcc tcatcttagg caggagtggg gtggaggagg
gggttctcct tatcctcaga 11700 aggtccttct gggcccccac gggaggtctg
tttgctctca gccgtgatgt ttccagcctc 11760 agggagcgct gtgctagtgg
aaggggtggg ggcctgctgt ctggccctgg tgctggaatc 11820 agatgtgccg
actataggtc tgtgcagtgt ggggaggagg gatctgctca catgagccac 11880
aattggtcag aggcttatcc agatagaggt gtgtgcatgt gtgtgtactc acacacggcc
11940 acacatgtca catgcacaga gccgggaccc cctttctggg gcactcacag
catgggccac 12000 agcctctgtt cctgcccatg tcctacttgg gtggtgatac
ctggcattgg ggcatctctg 12060 cttctggact caagggccca gggtcgggtt
ctggggtagg ggtcagaaaa tgtttttggt 12120 gaggggccaa atggtaaacg
tggctttcta cggatgttca gctgcacagg caactgcaga 12180 accatgcaat
gcacacgtgt ggctgcacgt cagcgagact gtattttatt agtagtagta 12240
gtattgtttg agatggagtc tcgctgtgtt gcccaggctg gagtgcaatg gtgcgatctc
12300 ggctcactgc aatctccatc tcctgggctc aagcgattct ccagcctcag
cctcccaagg 12360 agctgggttt acaggcgccc accaccacac ccagctaatt
ttggtatttt agtagagacg 12420 gggttttgcc acgttggcca ggctggtctt
gaactcctga cctcaggtga tccacccgcc 12480 tcagcctccc aaagtgctgg
gattacaggc gagagccacc acgcctggca taaaacttta 12540 tttccagagg
tgagtagtgg gcagggttgg cccagagggt gggctgattc tgcctctgcc 12600
atcccagctg caacagctat gcacttgagc cctgagatgg gatccatgtc ccctcctggg
12660 gtatccccgt ggccaccacg cgtggttttg cacaggacct gggccagctg
tgcacatgga 12720 gcggtcctgg gcttcagtgg ctgacccctc ccttccgcag
ctgaagtcgt tcctccggga 12780 gtgcaaggtg gccaactact gccggcaggt
gcagcagctg cttgggaagg ttcaggagaa 12840 ctcggcatac atctgcagcc
gccgccagag ggtttccttc ggcgtctctg agcagcaggc 12900 agtggttagt
gggccctggg ggtagtgcca cctgagggca cctgccaggg tatagcccca 12960
gctacatgtg ggggtttgcc cagggtgagg catgaccctg aactccccca accccccagg
13020 aagcctggga gaagctgacc cgggaagagg ggacacccct gaccttgtac
tacagccact 13080 ggcgcaagct gcgtgaccgg gagatccagc tggagatcag
tggcaaagag cgggtgcggc 13140 tcggcgaggg gacctggggg tgtgttgtga
cttcctgggt ttcagatcta gcgcactatg 13200 acttgagacc agggcgaggg
tttggaaaca gtgccaggcg gccagggccg tgcccggatg 13260 attcgacttg
gagagggggt aggtgttgga gaactggcca gaaccaggcg tttccaggga 13320
ggggaagccc caggctgcac taggttgggg aggccatgcc ccctcaggcc tgatgggctg
13380 gaggctccgg gcaggtggag tggctggact gacctcgtca cccaggccag
tatgtgggca 13440 ccaggggccc gtgaggagaa gcaggaaggg ctctgccttt
gaccttggac atgggatgga 13500 caacttggag gatggctttg tgatttggga
acagagggga ctagaaattg gccacatggg 13560 gccctggtgg tgggtctggc
gatgcctggc cctgctgtgg ccgccagccc ctgccctctc 13620 tcacctgagc
ccctggttct ttggccttcc agctggaaga cctgaacttc cctgagatca 13680
aacgaaggaa gatggctgac aggaaggatg aggacaggaa gcaatttaaa gacctctttg
13740 acctgaacag ctctgaagag gacgacaccg agggattctc ggagagaggt
ggggcctgcg 13800 tggtgctccc aggggaaggg tgggcctgga gggctctgct
ggacttccca gagccacgag 13860 ggccacctgt acccatcctg cagggggctc
accagtctct ggcccagctg gggccaacct 13920 cagtgttgcc aggcttctgg
tgccagcgcc ttccctcctt gaagtgaagg cctactggga 13980 ttggtaactc
tgtccccagg cctgtgacct cccagttcct ccccagggct cctctccacc 14040
tgctggaagt cagcggaggg aagggtgttg ggagcctggc caccctcctg cccccactgt
14100 gactttgctg gtggaccctg tgggtgggag tcatatggac tctgcttctt
gttcctcagg 14160 gatactgagg cccctgagca ctcggcatgg ggtggaagac
gatgaagagg acgaggagga 14220 gggcgaggag gacagcagca actcggaggg
tgaatggtct tggggtgaga gggtgtggcc 14280 ctgtgagccc atctggcggg
agggcagagc cacgtgggcg gggggcgtgg ggctctgggc 14340 caggcttttc
cctccctggg aaggccaggc caaatgctct gttctctggc agccagcaac 14400
agggataaat taattagtgc cgtgattaat tagtgatgag taacctctaa ggctggcttc
14460 ttcctgataa agcaaaattt atgtagcctc catctctccc cgcagatgga
gacccagacg 14520 cagaggcggg gctggcccct ggggagctgc agcagctggc
ccaggggccg gaggacgagc 14580 tggaggatct gcagctctca gaggacgact
gaggcagccc atctgggggg cctgtagggg 14640 ctgccgggct ggtggccagt
gtttccacct ccctggcagt caggcctaga ggctggcgtc 14700 tgtgcagttg
ggggaggcag tagacacggg acaggcttta ttatttattt ttcagcatga 14760
aagaccaaac gtatcgagag ctgggctggg ctgggctggt gtggctgctg aagccccaca
14820 actgtgggct gctgaagtca actccgcggg ggaaacttac ccttgacgtc
agcagaccga 14880 gaccagttcc cagttccagg ggaaggcttc aaggcccctg
gcccttccac ccacctttgc 14940 cctcagtatg cagacctcgt ccatttgcac
caggttctgc cttcactcca ccaagtcttt 15000 gaaatttgtt tcctttcctt
tgaagtcaca ttttctttta aaattttttg ttttgcatcc 15060 gaaaccgaaa
gaaataaagc ggggggaggc agggccattg tgttg 15105 6 23806 DNA Mus
musculus misc_feature (4293)..(4312) n is a, c, g, or t
misc_feature (6219)..(7065) n is a, c, g, or t 6 aacatgcagg
atgccgccac ccacttgttg gtcctcgggt tgtacttctc tatggagttg 60
aggctggagc tgccgtcatt gccccccacg gcgtagagcc agccgtccat ggccaccagg
120 tcatgtgtgc tcctggggag ttggataggc aagagtaagc ccaaggggca
agccaagaga 180 caggccaaat gagtggggca cagagggagt aaccataggg
acagggcaac gaaaggagcc 240 ccctatgccc ggggggaacc caaggacctg
ccgtactgtt gaggtgctgt gtgcacaacc 300 cctggggcta cactcctgga
ataaaccaga tgtagtggca gaaccactca ccacccgccc 360 ctggccttat
acacctgcgg atattcatgg gtgccacact ctcccaggca ccagccttgg 420
tgctatatct ctccactgag ttgaggcagc ttgtgccatc attgccccct gcgacataca
480 gggcaccctc cagcactgct actcctgctg agctgcgccg gctcagcatg
gaagctacag 540 gtgtccatga attcacctgt agagacaata ggacaggctg
gaggcagacc caccaagggc 600 cccccaaccc tgccaccaca tagactctac
tcttggttgg gtatctgagg gtcaggggtg 660 ggatatagaa agcagacccc
ttgggtggca cagatggcca gaatctggga ttatgcccca 720 tcccccccag
cactttagag gaggacattg tgatggggtg cgctatgcac ctggggctca 780
tacttctcca cggtggccag gtgtgacgag ctgtcataac cacccacggc gtacaggttc
840 ccatctgtgg gaaggcaata tacccacgaa
gctcctgcag ggtgtaggag ttttgggggt 900 gggtgggaag gggagaggca
ggaaagcggg taccacggga agcctgtcac ccaccaagtg 960 tggccacacg
cacatatcgc ctccgggtgc tcatggcagc aatggatgtc catgttcccg 1020
tcagtgggtc atatcgttca gcactgtgga caccagaatg atgccgacgt tcagttacaa
1080 cagttaagtc atccttgccc aggctctgtc ccctcactcc ccccaaagct
acggctgaaa 1140 tcccgggcat cacagactct atgaatagag agcaaggtgt
gctgggcgga tactggctgc 1200 tggctgcttg gactggaggg agggcccctc
caggaggaga atggtgccaa ggccctgcgg 1260 atgctgtgcc ctgaggtaag
gaagccttga ggacccagta gcctggatta ggccctcatt 1320 ctcttagcca
gatgtgactt catagttaag ccagccttag ctgctgctgc tgagccgacc 1380
ctgcaaagcc actggggccc taacccaaac tactacctgt tgaggcagga agccccatcg
1440 tagccaccag ctgcatacag gagcccatgc agagcagcta cacctaggca
gctccgcctt 1500 gtgcccatgg agacctcggg ctgccatgtg tttgtcacag
ggtcatagga ctccaccgta 1560 gccaggtctg aggttccatc gtagctagga
gaagagtccg ctactgacca aggatggccc 1620 ggcaggcttg tgacaaacct
ttcagcctac atggccagct tctatgctta cccgcccaca 1680 gcatacagtc
ggttcccgac ggcagcaact cctacacgag cccgacgtgt agacatggag 1740
gccaccacat gccaacggtc agtccgcgtg tcatacgctt cacagtctcc gtggatcgcg
1800 aacaggctcc caccacctag agatgggtaa ctataaataa aggggctcaa
ggctgggcag 1860 aggactctag tgtgacccaa gtacaaagac atgaaggctc
caggggtggg aagagataga 1920 gactgggaca cactagtggg ccccatccta
tagggtctct gagacacaca ggtctaagta 1980 taatgggttg ggtttatagt
ctgagttggg gtggatggcc tagggcaaaa tgggaaggag 2040 aaggggttta
gcataccgac agcaaagagc acagggccag cgccctcaca tcgacggggt 2100
ctcgtgcggc tggttcccag aacacctctc tgctcaggca gcaggtggaa cttaagggcc
2160 tcgatgagca gatccttgca gtcagggtga tgcctcacaa ggctttcggc
atccacgtgg 2220 cccaggagga agtctcggct cagcagaggc agccgtacac
acttcatcaa ctagggcagg 2280 aagcgccatg taaggcaaaa ccctgggcgt
acttggccca gggctaagac ccactcaggt 2340 gcctatagca cctgtgagat
tgggtcttga gacaggtggc tcaaccactc caaaggagtc 2400 cctggatgga
agatgctaag gacttgaaga agtaagctta taccgcagcc acctgcattc 2460
ccttctgaca gcacgggagc tccaggacct ggcctcaccc ttggtacatg ctgcctccga
2520 gtgtctacgt catgtttgac ccagctcagg actgcacggt acacatcttc
ctctgagggc 2580 acgttcaggc tatcactaga gaccaattcc agcacctggg
ggcgagaggt cctcagacct 2640 gacatctggg taggggaggg cccagagaac
cctgaacgtg tctgagagta tcagacagag 2700 ggcatgtgcg tgtccatgca
caaacacgca cacttaacgt tcttagagtg aagaaccaag 2760 cagagtcaag
attcacagat tttgttccca gtagaggggt gaggggtgtc caaggtccta 2820
agtagtctcg ggtacataga atcagagact gggaaatcat ggccctagaa tgcggtaaag
2880 acaggcatct aagacaaatg attgggtgac atgagaagat ggaaacaggg
ttgggaggag 2940 atggccaggg aatgaatcat agggtgaaac tcgcaccaga
aacttgaaat gaggattgca 3000 gaacactggc tgcaagggtc tgggcttggg
caatacatgg tgggggctgg gcccaaactg 3060 gcatgttacc tgcttcagtg
gttacctgct tcagtggcaa cagcatgaac tcctcagtct 3120 tggccacatc
cacgaagtgc tgcagcacgt acctgtgtgc tgccttgagc aggtcactgc 3180
atgagtgtgt gtctgcaaat cctcgaatgc ccaggcagtt ggaagggtcc agctggctca
3240 ggaggaactt gcagcaggca tctcgaacac cattcaactg tagaaggctg
gcagctggga 3300 gcagagtctg gggcatgaag atgggtgagg caggagggtt
ccagaagagc acagagtgac 3360 tggatattgg ggagaggcat gggggggggg
cagggcacac agatctgggc tggagatctc 3420 aggggacgtg ggtatgggaa
tggtaggcct cacctgaacg ttgccctcgc ccaccacgat 3480 ctcagctgtg
tatgcaaact gcaccagctg gtctaaggct tgcgggtcaa tgtcatgcag 3540
tgtcacatgt gtctggcggc tctcactcat ctcatctgtg aaaacagtgg attaggcggc
3600 ctacccgggg agatctgcgg agattccccc tagggctgcc cagactgttg
tatgtggtac 3660 ttgcttgtga acatggcgtg gaagtagggg ctacaggagg
ccaacaccac cttatgagcc 3720 cggatctcct tggcagccac atgcaggacg
atgtcgcaca ggaggccacg ctgccgcatg 3780 cggctcatag ccacaaaagc
atcatggtag tgccgctttg agttgtgtgc cacactgtga 3840 ccctctctgc
tcaacagctg catcgcacct tccatcgggg ctgcaggccg ggcctgccgg 3900
ggccgagctc gctctgcctc ggggctgcag gccgcaggac taggtcaggg tttgtaccct
3960 gtagagggcc actcacccgc ctttgagaac tcccagcaca gccaaggcag
ctgcggtcta 4020 tgcccaggag ttcgttcagt catcaggcac caaagcccat
ggaggaccag cctgtcactc 4080 actgccactt tccagggctc tagatctctc
tccggtccag gaacctcgct ggagccggag 4140 cagattcttt cataaatgta
agcctgctct acctgtcccc cttcctctgc tgactacctc 4200 tcctggggct
ccgagcccct catctagtgc ccaggccact catggggagg aagttggcaa 4260
acgttgtcct ctgaatcccc agatcccagt cannnnnnnn nnnnnnnnnn nncccgccct
4320 cccgcccgcc ctcccactca cgccggcggc tggggcggcg gtggcgcctc
aggcccgggc 4380 cccgggctgc tgtgctcggg actctgcgtc ctgcccgccg
gccgctcgcc acggggctgc 4440 ataactccag ccgccaaagc cgcgaccctg
atggacggat tgggacgcgg gtgcccgccc 4500 ggccgctcgc tccactcggg
cgctcgcccg tcgccgcaga ccctgtttct cggcgccgcc 4560 cgccctgccg
cacgcaaggt tgcaggagcc gctgccaccc gcgtccccgc tctcctccca 4620
ggagcgcctt ccttcgtgtg gcgcctaggt tagcttcaca tcccagcgcc ccggttctca
4680 cctgctagag cttgctgtga ctgacaaagc tccatagcac gggggtagag
cgggacactt 4740 gcaagtgtgt gtcccccccc ttaccctcag gcctcctgtg
cacaagcgcg tgcaaaggtt 4800 cagcgcccct atacatgcat atttgagcat
gaggcggacg ggggctgcta gcgtctgaga 4860 ggtgcaatta tctattgaca
tggacgcagc attcttgtca cacccaggct cgccaaggaa 4920 ttgggtgtcc
tttgacttaa aaaaaaaaat atatatatat gtttatatat atatattata 4980
tatatatata cccgtacaca taatatatat atatatatat atacatacac acatatatat
5040 atatatatat tttttttatg tgtacgggtg ttttgcctac ttgtgtacta
tataaattac 5100 gtgtgtgcag tgcctacgga aaccagaaga gagcatcaga
cctcctggga caacattggt 5160 gatccacttc gtagaccctg ggaattggac
cccagtcctc tggaagagca gcttagtgct 5220 tttaagtccc ggccatctct
atgcagccct tgcttgctcg gtcggtcggt ctgctgcctg 5280 cctacaattt
atcatctatc aatccatcca tctatccctt tatctgcttt tttcctggaa 5340
tttgctctgt atggactgat ctcagagaca gcacagagcg agctcgcttg tctctgagcg
5400 cggagataaa aggcgtgcgg tcgggacagc cctttacttt ttaaaaaaaa
aaaagatatt 5460 tttccccctc caccctccgg gcgtggtagg caggctgttc
agatttccag ccttgtttcc 5520 cgggtgcgag cggtcgttcc aggcggggct
ttgagctgtg agagcctgag ccgagctcgc 5580 cgccctcccg cccggggacc
gcggccgcgg cggctcgaga acccgaaagc cagcagcagg 5640 ggtggcatgg
gcggggctcg cgggaggaag tcggagggcg caaggcattc gtgggcccgg 5700
aagtgcggcg cacgcggctg ggcgcgccat ggctgcgtct cgcgctcccc gcaggtgagt
5760 ggcctgggcg ggcgcgcgga cccgggcggg cgcctggtcg cggggtccgt
tccgatcctg 5820 acgcggtgtc actggcaggc gcctggagga cctcagtgtg
gacgagttcc tggcttccgg 5880 cttcgagtcc ggatccgagt cggagctgga
gggcgccgcg gaggcggcgg cggaggagcg 5940 cagggcgcga ggagccgcgt
ggaaccggga gcggcggggc gcgcgcacct ccccgggccc 6000 cgcaggacgg
tacgcggggc tccagaaagt ctttttgcaa aacgggcagc tgtcgtgggc 6060
tggagcagcg ggttgagctt aaggaacaca cgacgggggc ctcggggtcc gagcgatggg
6120 aagggttgca aggttggaaa tggaggagag cttggagggg gaagcgcacg
cgtggttggg 6180 agggggaagc gcacgcgtgg ttgggagggg gaagcgcann
nnnnnnnnnn nnnnnnnnnn 6240 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6300 nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6360 nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6420
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
6480 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 6540 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 6600 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6660 nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6720 nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 6780
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
6840 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn 6900 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn 6960 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 7020 nnnnnnnnnn nnnnnnnnnn
nnnnnnnnnn nnnnnnnnnn nnnnnagggg gaagcgcacg 7080 cgtggttggg
agggggaagc gcacgcgtgg cttggagggg gaagtgcacg cgtggttggg 7140
agggggaagc gcacgcgtgg ttgggagggg gaagtccacg cgtggttggg agggggaagc
7200 gcacgcgtgg ttgggagggg gaagcgcacg cgtggcttgg agggggaagt
ccacgcgtgg 7260 ttgggagggg gaagcgcacg cgtggcttgg agggggaagt
gcacgcgtgg tttggaggtg 7320 ctgttggagg agagctgagg acggtctgtt
ttacagcaat gaggcgagca tggtgggtgg 7380 ggtcacagct gctctggggc
cgggagggcg ctcagcgggt gctgagtttg agatggttgt 7440 gttggaacaa
tgtcagaggt gctgtgcgtg tgacgatgga ggagaggcaa gcccgatggt 7500
catcgatccc acagtgaagg gaataaacta acatggcctc acggaggcga caaccagatt
7560 ggctgtgtag ggtcgaaaaa ctccgtcccc cgacccgaca cccgttttct
cagcccgcgt 7620 aagggccgcg cctctgagca caaagaccag ctctctcggc
tgaaggacag agaccccgag 7680 ttctacaagt tcctgcagga gaatgaccgg
agcctactgg acttcagtga ctcggacagc 7740 tctgcggaag aagaagagcc
attccactcc ctgccagaca cgctggaggt gaaacccggc 7800 agacctgggc
gtgggggcac ctgtggtttc tatacaccgt cccagccttg tccttctgta 7860
tctggtccag gaagctagcg aaacagagga agacggagga gaggacagtg acgcgttgcc
7920 cagagggctg aggagcaaga agaatgagcc tgtacccgtg accctcgcca
tggtggaaag 7980 gtggaggcag ggctccaggg taagaagcag tataagctgg
gtggctgcat ctccacaggc 8040 gggagactct ggcctcgcct cgtttgaggc
gtctgagaca gcaggtgaaa agactgtgtt 8100 tctgggaagc gcacaatgga
ggtctttcca agtgccaggg tcctccagct acgtctcctt 8160 atttgtagtg
catagtcctg cctgcttcct cacactctct atcagggttc tgacccttgt 8220
ggggtggttg tagagtttgt ttcatttctg ttaacaggat accacagacg gggcaatttg
8280 ttttttgttg ttgacttact cacctgcata taggttggtg taccgtatgc
aagttcatag 8340 ttaaccgatg gttgtgagcc actgtgtggg tgctgggagt
caaacctgga tcgtctggaa 8400 gaacagtcag tactcttaac taccaagcca
tctttccagc tccagactag ttaatttgga 8460 aagaaattta actggctcac
attgctggag gtaggcaggt caaaaagtac ggtgatgtct 8520 ttgttaaagg
ccttgtgttg tgatccttgt tcagttacat cttttgaatt gttagaaatc 8580
aagcccagga cttgtacatt gtaggtaatc acactgtcct gagccctgag ccttttcgtt
8640 tggtttagtt tggtttggtt tttgagacag ggtttctctg tgcagccctg
gctgtcctgg 8700 aactcactct gtagaccagg ctggcctcga actcagaaat
ctgcctgcct ctgccttcca 8760 agtgctggga ttaaaggcat gtgccaccac
tgcccggatg tttcatgtga tcttcattac 8820 tttccaaagg ctttatttcc
aaataccacc aaataaactt ggatattaaa tttctactac 8880 ataagccttt
gggctatagt caggcttagc attatttctt cactggggag tgggaaatga 8940
agtcagtgtc ctttgaataa tttcacactt aacttagagg ttttgggatt tttttttttt
9000 tttttttaca gttggagatc actatgtaaa tcaccctggt ggctaacttt
gaactctatg 9060 tagagtagtt tggcctcaaa acttgctgta gtctccttgc
ctttgctttc ccagtgctgg 9120 cgtagccatc agctatcatg cttggctttc
actttggttt ctttagggaa gggagtgaaa 9180 agggagattc ccatacactt
tctggagcaa gtttgtaatg cagtattctc tcccctttct 9240 tctgggactc
ttgacacagg ctctgcctgt ccaggtgtgc gcaataccta ttttgttgtt 9300
ggctcatggt ccagtgatct tagccccaca gttgctctgg ccctgagatt gtgggttaga
9360 ttagcacctt ttcaacttgg agcaggttcc tagagccagg tactgtgaac
tactctttta 9420 gtgatcttga tctcttgccc cttcagcacc accttagtcc
caggctgttc catgaagttg 9480 tacaggcgtt ccgagcagct gtagccacca
cccaaggaga gcaggaagct gctgagactt 9540 gcaggttcca ggttgcagat
agtgctggtg agctggggac tggggacctg tgccaggctt 9600 cagtgtgaca
cctgtgtgcc ctatataggt gaatgacact ttggttcctt gcagtgttca 9660
atgctctggt tactttctgc attcgagacc tctgtggttg ccttcagaag ctgctgtttg
9720 gaaagacacc aaaggatagc aataggtaag gtatggtcag gggttttgtc
attaaggtag 9780 aggaggtcac aaagtcttaa ggaagcacag aaagcagcta
ataaatacag atgagtcatg 9840 gtggggctgg gagtagacat agccatgtgt
cttgagatag cagatgcaga agagcaaccc 9900 taaccctaac ctgaaggaga
agatggtggc agttggttga tgcactgaca ggatgagtac 9960 atctagaggc
agcagagagt gaggagagca agaaaaccca acaggtggag agggcagaaa 10020
ataagccagg tgatatgaca gaagcctaag tgaatggcta acacacccag aatctgggga
10080 gcagttgtca tgaagggaca agtgagattt tagaaacagg gttgccctct
gaattgcata 10140 caatgcttga ggtgtcacac agatgagata gtgttggtct
aggtatctgg aaggttagac 10200 agtggggtgc cttaaatggg ctttggtttt
tgttcttggt acagttcaat agatcttggt 10260 aaacacctac tatgttgcca
ggtgttcatg gttatacggc ttgtgctcgt gcaggcttgt 10320 gtctgaattt
cggaatgggg tgggtgtatt ccctgcaggc tgctgccatc cagtagccca 10380
ctgtggggga agctccgtgt ggatgtcaag acatacctaa gtgcggtggt gcaggtacag
10440 aatcccagga gacctggtca tgtctgatag tctggtgctc tcttacaccc
cagattccag 10500 tgactttctc ttcttcttta gctggtagcc tgtctagcgg
aagccacggt gtctgcagct 10560 gtcctgcagc atatcagcag cttggttcct
tactacctga cattccctaa gcagtgccga 10620 atgttgctca aggtacgtta
gcttgtcctt gtcctactcc tggctagaat agaggcagca 10680 tttgtgccta
tacttgctct gctcctactg tgtcctctac tcagcaaggg tccccagtct 10740
cagatggtcc cccgtccttt caggacagcg tggcttagta atgtgagact aggctccact
10800 gaggcatgcg tgctagccat ggagggttct aggccaggaa aggcagcgat
gtatcttttt 10860 tcagaggatg gtggttctgt ggagcacggg tgaagaatct
ctgcgggtcc tggccttcct 10920 ggtactcatc agagtctgcc ggcacaagaa
ggaagccttc cttggtccca ttctgaaggt 10980 agcgtgagtc ttctatttgt
tcattaaggt ctgtgttggg aattgtggtt gatggtcctt 11040 acgaggactt
gacaccaaac tctgatcata gttttcttac caagaaaatt gggtgatggt 11100
gggaacagga acccatggtc ttgtgggtat tggtggtgat ggtgggaggg gctcttttcc
11160 tctgggtacg agaacgttat acctagaaaa taggaacgtg tctgaggcca
ccaccaagat 11220 cagatcctgc tgtgtagctc caggcctgga gtggcatcac
ctggactcct gctgtacaga 11280 cctagctcga agtagagcgt ctctacttcc
ttcctcggcc ttcggtccag tccctgccta 11340 gtcctcatct tacccttctc
ttgtacacct tcctctcttc tgcatatctt agattatttg 11400 tggtgggtac
aagtgtgtag gctgagaaac atcagagttc ttgagtgctg ggaaccattc 11460
aggaattgag tctgcagctc tttcagacgg cagttctaat ggaaaaccct aggtgtgggc
11520 aggtggcttg gaactgaaca gtcgttgctc aggtggctct taattttggt
ttgtcagcaa 11580 atgtacatca tgtatgtgag aaactgcaag ttcacctccc
ccagtaccct ccccctcata 11640 agcttcatgc aacggacact gactgaaatg
cttgccttgg accccagcgt ctcctatcag 11700 cacgccttcc tctacatccg
acagcttgcc gtccacctgc ggaatgctat gaccgcaggc 11760 aagaaggtag
gaggaggtca gacccttcga agccagtagc cccacccggg ccttagggtg 11820
gctagaagcc actctttaat gggggaggct gctcagcgct gagggaggct ttgggtggtt
11880 ctggcaggga agaacgggcc taggatgtgg actcactctc cccaggagac
acaccagtct 11940 gtgtacaact ggcagtatgt gcactgcctc tacctgtggt
gtcgagtcct gagtaccctt 12000 ggttccagtg agatcctgca gccgctactc
taccctctct cacagatcat cattggctgt 12060 atcaagtgag ttcagggaca
gggctatgtt agattagcct aggggttggg ggcagcaagt 12120 tcctcttggt
atagggcaga agttcctgat ttgctagctt ctgtgaatta cattatacaa 12180
agaattttct cgggtggcca caccttaaat cttagctttt gtggctatag tctctgagga
12240 agggcctgag aagatgaact ggatgtggat ttcaggcagc tgggcttcgc
agtcagcctc 12300 agcttaggta aataggttag gtgtccacag tgtagcacct
atatggaccc ggctcgggcc 12360 cttgtgctat tggagggcaa tcagatgtag
ttctaagtgc ttttgatagt ctgcgaatgt 12420 gctgtgccta cctctttggc
taggaggggc ctctgtgtcc tctcgagctt tctcctgaca 12480 ctggagactg
agctaggaag gggaagaggc aggtggtaca agaaaacaag gcagcaggtg 12540
gtgaggaaag gaggcaggag tgaatgaact aagctaaagt tggatgagcg cctttcagtc
12600 tcgcagctga gcagtggggc accaaggcca agcattttgg ttgtccctgt
tggttttgct 12660 gactgtttaa attactattc cccatgggtg ctatgagtac
ccaccctgta ggagtggcca 12720 gggtctcctg gagttggagc ttgtgaacgg
ggaggagtca gaggcagcgg gcagttaact 12780 gatgtttggg cctggctacc
tgactgggct ccgctcctct tttctccagg ttgttgccca 12840 ctgctcgatt
ttatccattg cgcatgcatt gtgtacgtgc cctgacactg ctgtcccaga 12900
ccatcggcac cttcatacct gtcctgccct tcattctcga ggtgactgtg ctgggcacaa
12960 ctcgtgttgg acttagaaat gtctttagga aaccacacct ccgtctctgc
gtctgccagt 13020 gctaacttct aacaatgacc ccgtgacccc cctctctttg
cagtccttac agatctgcat 13080 tgagtctctc caagtctcac agttactatg
acacctttct cgttttcctt attgtacctc 13140 taccttgagt cgctgtccct
gccctcgggc ttcctatcag gtgcagattc cttgagaaag 13200 tgaagcacaa
gggtgaatcc aagcccctgc taacaccgac accgcccttt ctttagtcta 13260
tggcgaggat aaaggtggat attgataacc agctctatat attagttagc atcctttgct
13320 cagagaaagg actttcccta cctccaattt attaatcact gtagctggtt
caccttctag 13380 caacatgtta cgtaagttct cttatcaggg cttccctata
gaacctcccc tctctatgct 13440 tctgtgtgta ccttgtgcct ccttggttta
cctgttctgt gtggggcatc ctcggaccaa 13500 gaggttaggc taggctccga
tttccctctt aggaaggcca aagtgtatat agagcctgca 13560 aggtttgttg
ggtgtggaaa acacagtctc ccttctgtct ttctttattg gtggttttgt 13620
ccatagtgcc tgggatgcgt gaggcccaga tccgtatcta aaggcggtta agtgtatgca
13680 ggcgtcctta tggttatctg tcaaggggct cagggctctg gaaggcctgg
tcttggggga 13740 tctctttggc tcgggcttac actctgtggg tagattttcc
agcaggtgga cttcaatagg 13800 cggccaggtc gcatgagctc caagcccatc
aacttctctg tgatcttgaa gctgtccagc 13860 accaacctgc aggagaaggc
gtaccgggtg aggctgaact caaggccttc tgggaaaggc 13920 ctggggctat
gttttagttg gtttacaagg acttaataag gccgttgata tacaagggag 13980
ggattcccat tgtggaccat ttgtggtttc ttttgagaca agatctctct attatattgc
14040 tcttgctgtc ctggaacttg ctatattgat caggctagcc ttgaactcac
agagttccac 14100 ttttcaaatg ctaagattga ggtttttgta gtcatggagc
aatgactcct taatctcagt 14160 tcccgtgaag ctgggtgtgg cttggggggg
tctgtgcgta ctgttttata gccctggtct 14220 tggagattga gcagagttgg
ggtgaaatgt aagaaaatgg gctactgggc ttcatcatct 14280 gatgaggttg
ggcctgcccc ccaggacggg ctgctggaac agctgtgtga ccttactctg 14340
gaatacctgc acagccaggc ccacagcatc gctttcccag agttggtgtt gcctactgtt
14400 ctacaggtat gtactcatca tctgacttcc tttccctggc cagaccttgg
cgtcctcctt 14460 gtggctatct tggacaatct tttagtaggt ttgtcgtccc
cttatgttct agtcctggac 14520 cagtagcaac accgtgttta tccagttacc
aatgggctgc cagatccatt ggcgccagtg 14580 cagggagaga gcgatcattt
gccttgatag cttaaggaac tttctctagg gactgtggaa 14640 ttaacttgta
aaggattatc ggcctctgct gaagttgggc tggggctggg taacttaatc 14700
aggatctcag gaatttgcag acccttcccc agtctgactg gagcctacca tcaactatgg
14760 aagtctaaag cctgggttgg tctggtgccc gttttgtttt tccgtgtggt
catggtctct 14820 ctgtgcccct tctgagaatc tgtttgctct cagctcagct
attttccatc taggtccttc 14880 tcataggagg gggctggtgc tgtctctttt
ctggtactgg taggggatta gatccattgc 14940 tcacagactt gtcgtggtgt
tgctcatacc tgagctatgc ctgggtcagg ctcctatggg 15000 cacaggtgta
tgtgtgacgt tacatgtata gctgtgcaca cggctgtgtc gcggttggtt 15060
gcataatgtg gcattggctt ccccctgtgt cctcttggct gatgatgtca ggtagtcagg
15120 atacctttag ttttaggctc agcattttag gtgtatggat gtgtggatct
agagtagaag 15180 ttagcagatg tgtttcagtt aaaataaaag tagcaaaaaa
aaaaaaagaa acttttaagg 15240 tttatgtctt gtcacagctc ttcatctgct
tacctctcgg gctgcgggga caagcatggt 15300 cttatgcctg tgagacctaa
gagattgaca gcaggtcctt gtgattgtca catggctttt 15360 cactgcttgg
acccctcgag ggtgaggaat ggaactgtca ctggagctgt actgagtctt 15420
tcctttctgc agctgaaatc ttttctccgg gagtgcaaag tggctaacta ctgccggcag
15480 gtgcgccagc tgctggagaa agtgcaagag aatgcacgac acatccaaag
tcttcgacag 15540 agcgcgacct tcagcgtgtc tgaccggacg gcagtggtgg
gttggacttg gcatctgcac 15600 tggaatggtg ctgcctgggg tcacctgtta
gggtgtggtg gatgtttaag ggttttccag 15660 aatgatcatg gtcctgaatt
tacttcaatc cttaggatgc gtgggagaag caggttcgtg 15720 aagaggggac
cccactcacc agatactacg gccactggaa gaagctgagg gaccgtgaga 15780
tccagctgga aatcagtggc aaagagcggg tatggctggg gtctgtgaag atggcctagt
15840 agaaagtggc caagcaggtt atttgtaaag gggttggggc tgagggggag
aagttggaaa 15900 aggaaacgaa atcaggaggg aagaaagaaa
ggtgagtttg agtgaaccct gtaagtttct 15960 gtttacctgc tgggcagctg
gtccccggta ttgatataga tggttcccga ggcgcagctg 16020 ggccagttct
ctcggcctct ccctaccctc acttgttgtt ctgtttcttg gtccttgaag 16080
ctagaagacc tgaacttccc agagatcaaa aggcggaagg tggaagacag gaaggatgaa
16140 gacaggaaag aattaaagga cctgtttgag ttggacagtt ctgagggcga
ggacagcacc 16200 gacttctttg agagaggtga ggcctgagct ggagtggttg
agagctgtgt gtgtctgggg 16260 cattcatgtc tggctagtgg gcagacctgg
tttgttgttt ttaattttta tgtctgtgtg 16320 ttttggctac atgtctgtct
gtatccagat cccctgagac tggagttaga gacacttgtg 16380 agcctcagtg
tgggcactgg gatcaaaccc atgtccttga accactgagc catctctcta 16440
gcctccagat ctgttttttt tttttttttt taaagattta tttatttatt atatgtaagt
16500 acactgtagc tgtcttcaga cactccagaa gagggcgtca gatcttgtta
cagatggttg 16560 tgacccacca tgtggttgct gggatttgaa ctcaggacct
ttggaagagc agtcgggtgc 16620 tcttacccac tgagccatct caccagccct
agatctgttt cttataccac tcattggccc 16680 cttatgtttg agctggggac
tcagtattgg taattgtcac catgctgggg cctctatagg 16740 tggcagagag
ggctagtggc cctgttgcta ctgctgagac ctggggccct gttgcatgtc 16800
cagagccagc cctcctgtgt agctactcat tgctttgtta tttcacccgg tgtgtgtgtg
16860 tgtctgtgtg tgtgtctgtg tgtgtctgtg tgtgtgtgtg acgaccgttc
ttcatttgca 16920 ggagtaccta ggctcccgga agctcaccaa ggactgaaag
aagatcagga agaagaagat 16980 aaagaagaag gtgacagcga ttcagagggt
aagtggtttc tatatgaaat agattggccc 17040 tggagtctgt ctggaggcca
gaccatggga agacagagct ctacgccagg cttctgcccc 17100 tccctggggt
gccaggccag ctgctctgta ctctttagca accagctata gcatgctgat 17160
aaatttaatt aatgctgcaa ttaattagtg atgagtaact tctaaagctg gctttttcat
17220 gatatagcag aatttattta tgtggcctct tccttctctg tagatggaga
cacagacacg 17280 ggagtggatc tgagcgaact gtggcagctg gctcagggac
cacaagatga gctggaggat 17340 cttcagctct cagaagagga ctgagggcct
cctgctgcct cctggaggcc tcgacagcaa 17400 ctgtgcggcg agaggacagc
agagcgtaaa caggctttta ttgttacagc acagcagaac 17460 aagtgtacta
ggagccattt tgaccctgaa agggagggaa ctgtttctgc agagccttct 17520
gctgccggaa gccgaaggct gaccctgatg ttgtcccagt tcagagtggg gagggcggct
17580 ccatggcctc tcctctccac tttgtctttg gtttgtctgc agagcactgc
agttacctct 17640 agaagataaa cccattactt ttgtttgttt gtttgtttgt
ttgtttgttt gtttgttttt 17700 tcgagacagg gtttctctgt gtagccctgg
ctgtcctgga actcactctg tagaccaggc 17760 tggcctcgaa ctcagaaatc
cgcctgcctc tgcctcccga gtgctgggat taaaggtgtg 17820 cgccacctct
gcccagctca ttgcttttct tcttaagcct tatgttttcc ttttaaaaat 17880
actctcggca tttttgcatc taaaaagggg aaataccctg ggggcggagt gaggttagca
17940 atttgactgg gaatcacgga gtatgacggt tgcctgccgg atcattggag
aagctgggaa 18000 gggtctgggg cacctgaaag cagagctatg gtccctgacc
cattctcctg cttgggtgag 18060 gcttgtcctg tgggcaggtg ggtcccttca
tacggggagg tggtggttgc tggcgacaga 18120 ggttgatgtc ctgggctgag
ctcaggtgct tgcaggcttg gtggctgcag cggcagggcc 18180 acagggaagc
tggccatgta gaagacacgg cccaggcgct tggccaccta caaagagacg 18240
aagctgggtg aatacctgct aggtaaaggc gcctagggaa acaactccag gtacagagaa
18300 tcctaaaacc ctctctagag tggtgaaaac actggacaaa cttacttgtg
ccctgatctt 18360 gagggcaggc cctagcttca gtcccatggt gttcagaagg
tgctcttctg tcagtagagg 18420 caaagtctct ccgtctattc cttgttctct
gaatacctgg ggacaaggtg ggcccataga 18480 gtagagttgt agagcttgca
tggaaaaata acaaaaggaa agttcttttt tgtgggagac 18540 cgggacattg
tccagaactg ggccagaacc ctagggacct gccactcccc tcaccctggc 18600
atactcgcca cagccagaaa ggccccccac gaagttgcag acatcatcta cagtccactt
18660 gttgacatcc tcaagggttg tggtctcctc atcagtgaag agccccccca
tggtacctgg 18720 agatagaaag gatgtcacta gggcttgggc tttcccattt
ctttgcctcc ttccccactg 18780 gctgaccttt ctctgaattg gtaaagtctt
tggcttaatg acaaatgatt tttctcccca 18840 cgggatcccc tatcagtggg
tgactaaggc tgcccacctg tgtggaagta ggggctcact 18900 gccccgcaag
ggaatcccag gggcagtgga gggggcagtg tggaccctga tagaagcccc 18960
cttccttctg ctctggtccc tcctgctggg acttggctgg gagtggaggt cccttccctg
19020 gcagctgctg tctcggggtc ctctccatct gagtccttgg atggctcctc
agagacttct 19080 tgagcccaga ggccagcccc tgtcgtctcc ttgggttggc
tgggctggac accaaggctc 19140 cccttttcgg accggcttcg agcagactcc
ttgggtggga tagggggtcc tgggcctggg 19200 ggtccctggg gtggcagggc
cagtagtggt gctgagctgt gtttcaaaac cagcatggag 19260 ccacgtcgct
gcagttcgtc gggaccctcg gggatatgca aggccacctc tggtgccggt 19320
agcagtggcc ggtgaacact gcccagttct ttctgcctaa gcaactctga catttccagc
19380 ctggaccaaa aggacagacg agtcagcctg ggcacgctta gaactgctcc
cgctggcctt 19440 gcagtcgctt accgagccag gctctgcttc cgtagaagtt
cctgctgccg agctaacatc 19500 tcagcctggg cggagggtag gaagctgaaa
cctgggatag aagaaaaggc tgctgaagcc 19560 ggatgctctt cactactttt
atctatctgc ctaaagagag cctcatagga acacaaacgg 19620 cttacatacc
cgactttctc acctggggtc tgacacacag ctgtgggcat ccccagaaaa 19680
ggaggcggga gatgggagct cattgtgatg tgggcggcat tctggggtga cagcaaaggg
19740 ggtggctgtg acatttccct gtggacagca gaatagtgag cacatgccta
ggtccgtctg 19800 tgcacacccc ctctttcctg tcatcacctc tgggtgcacc
cctgctaacc cttctagttg 19860 ccgacccctc aagtggccgt tggtttcctg
attatggctt gcttgtgtac acacacacac 19920 acacacacac accccgcgcg
caagtctcct tgtcagcaca gaggcaatgg ccccatgggc 19980 cacactgcgt
cagtaattac cggggtgggg gatacctctc atcctgagca gtgaggggag 20040
gggtgcagga cagggcaatt agccccaatt tgtagatgtg gcagatgctg cagtctgcca
20100 ggatcagggt gagataggat gtgtttggga aacccgggga ccctgcctaa
tgctctcctt 20160 ggcttaactc gtctgccgcg gagactactg tgagcatgat
gtctctggag tttatagcag 20220 ggtagggatg gagggggcgt atgtccttca
ctctgaatcc tctgtggttg ctttctcctt 20280 gccttcctca acatcccacc
accaccacct gtgaggggag ggcctctgct ctcggggcag 20340 ctctactagg
ctggaatagt cagagttgct tgacaagaaa attgtatttg gcattcttgc 20400
cagggagtgg gcatttctgc atattttgcc ccccaaattg agtctttgca gtagttttag
20460 gagcttcctg cagtggacct tgaacgtggt gaatgcatta tgcatatatg
tgatggaaga 20520 ctcaagggaa agaaagaatt ggccttctca agtaacaagc
atagcatgtg gggccttgct 20580 gaagcacgtc ttaatacctt tcagagaaag
atgtggcagc agctggaacc gccccttccc 20640 ggcgctgaac taggccttgc
ttccgtcgat ggtctgctgt agacgagggc aggtgaactt 20700 ccaagccact
gggacccctc agtgttgcag ttgctacttc ctggcgtacc ctaagcagat 20760
ctgagaaggg aggtgatagc gggcccccct gataagctcc cgtaggattt cccctttcca
20820 tcttggtgag cagggcattg gcccggtgaa ggtatgaaca gagtgggagg
aggagactct 20880 cacatgtcct gagcctgctt ttgtgtcaca ggctagagga
tcagcaggga aggcactttg 20940 ctttaacacc tacatgctgt agccaggcaa
gtgaacttga aaaggaatgt gaataagtag 21000 gtcatcgaaa ctggagaaga
aaggctccaa catgtagagt attagtgcta atttggtttg 21060 aatattcaca
aggtcagagg tggaagggca tagggctgct ccaagtaggt gagtgggaca 21120
agagtctatg gattaggagg ggacctcaaa cagaagcctg cccaggtaac tcctttctag
21180 acaggagaca agaacaggaa ggacttgccg tccctataaa gcgattcata
gcatacatgg 21240 ttataatagc agataccact gactagcagc aacacacaat
cacagacctg catggttgtg 21300 cccccattgc atttaggaag agcggaggct
ccgagcccct ttaatccacc tgctcaagct 21360 ccacttctct accgctctgc
ctgtttataa atcacgtagc tgaaaaaggg acgtgggcct 21420 agctagggct
tgaagccaaa tcccagaccc taataagccc aggccacccc agtcatgcgc 21480
cacgcaatgg tgattgacag ccagcttgct tgcacagtct cctgcacagg ccagcagagc
21540 tgggtaagca attaaaaaga aagattttta aattattaac atttagtgaa
ttattcaggc 21600 tctcggtgcc tgagctgctg ccctagggaa cagcgccttc
tgcaagaatc aggaatcctt 21660 ttcccttctc tgagcccccg tccacttctg
ccagaaaaga caagatttgt ccggtgctgt 21720 agcagcacag ggtgtctcac
aagtcctcgg tagcaaggac ggtgcctcaa ggcttggctt 21780 ccctcaagca
acctggggtg catcctgact gtgacgcagg ttcctatttt tgccatgccc 21840
tcctccctgt gtatcaacaa aacctgaaaa gtctgacaca gcatcaacac gaccacagat
21900 tcacgtattg cactgaccac tctgggagaa actagcacag gccagaggcc
acatctcttc 21960 aacaaagaac aggtgaacaa actcttctgt cccaccctca
cctgggctcc tcggggacgc 22020 acagtagagg actgaggcca gtgaggctgc
atcactaccc ggaagccctt ctatgtattc 22080 aatcctggtc ttggtgttga
cctccaagca tccagatctt tttcttgggg ttggcttagt 22140 ggggtggaaa
ggtaagagag atttggctca cttgaagtct gagcgccggc cgtgggggtg 22200
gcacataccg tggctgggga tacccaagtg gtagagacgt tgagtttcct caaaggaact
22260 ggcctcactc agggccgaca taagctgatg gtagtggtcc tcaggggcca
ttgtagactc 22320 tagcttggga agcagcgaag tcgctgttgg acagagcaca
tggtgaggac cacggggaaa 22380 gtcaaggctt cttatggttt ccacccaagg
tctggggagc ccttcaggac ccagttgctc 22440 ccccccttcc ctgtgacctg
catccttttc cagagtcggg ctttcacctt gggggtcctt 22500 gcttctggcc
ttcttctctg aagagcagtg actggtgatg caggggcagc tcagcctctt 22560
gcccaacagg tcgcctgtag aggatagagt gagccccacc cagtgtctca tggcttcgag
22620 agaaagccat ctctgcctta gctcagtctg gagccccaat caccactatg
ttctccctgg 22680 cagttaaaga ccttgtccga tctggtgtcc caggctagag
agcacgatga caggtagatg 22740 gagagtgcag taaagacaaa gcgatcctga
cgctggcctt gggcacgtaa gtaacagtga 22800 gagctctgca tcagtgtttc
cccagtcagt ctcctgtaag ctccttgcct ctcccctcgg 22860 tgcatttgcc
catctgcata tccaggtctg ggttcttaat gggactccta agcatttcac 22920
tccatagtgc acagtctata tgtgctatgc ttttatgagg ccaggccaga cactgcttac
22980 aatatcccca gagataatac atgccgaatc tcaggcatta attgacccaa
ttccagactg 23040 gcctgaacac tgctgcctaa cttcctcctg agctcctgcc
ctgaggctag ggcctttgct 23100 gaacagccca tgacatcgcc atcatggctt
caagctaagg ttcctgtctc tgccctcctt 23160 atcctcacca gctaggggag
tgtggaggaa caattccgtg ggtgattcaa aatctccata 23220 atgcccaata
aggggctgct tctacttgtg tgcccaccct ccctcaaagc tggtggtgtc 23280
agcgccagga ctgaattcca tttaaaaaaa attccttaat ttgtcatttt tatcagattg
23340 gggaagtgtg gcagcaaaaa cgctgatcaa tctccacgtt atggctgaca
gaatggtgct 23400 aataacttat tgatctcaac tacctcactg ctgttagtct
agcaggccat ctcagcctcc 23460 gccccagaga aagagacagc gcctactgtc
tggacaaggg ctccacccat aatatctcaa 23520 ggcagatggg tccagcagac
agataatggg tatctggctt cacttggtgc caccaatagt 23580 ttatcagaga
ccagtaggca gaagaggcag cagctggagt agacaggaag gagggtccct 23640
caggtctgtg tggcctggtc tcactcactt tgtacctgtt attcttgtcc caacccacag
23700 atccccttta aggactaggc aagggctctt ttcacagcag tctattctca
ctttaaaagt 23760 ggctaccccc aaaggagagg gggagggaga gagggagaaa aggagg
23806 7 163 PRT Homo sapiens 7 Met His His His His His His Gly Met
Ala Ser Glu Phe Gly Ser Ala 1 5 10 15 Gly Ser Arg Lys Arg Arg Leu
Ala Glu Leu Thr Val Asp Glu Phe Leu 20 25 30 Ala Ser Gly Phe Asp
Ser Glu Ser Glu Ser Glu Ser Glu Asn Ser Pro 35 40 45 Gln Ala Glu
Thr Arg Glu Ala Arg Glu Ala Ala Arg Ser Pro Asp Lys 50 55 60 Pro
Gly Gly Ser Pro Ser Ala Ser Arg Arg Lys Gly Arg Ala Ser Glu 65 70
75 80 His Lys Asp Gln Leu Ser Arg Leu Lys Asp Arg Asp Pro Glu Phe
Tyr 85 90 95 Lys Phe Leu Gln Glu Asn Asp Gln Ser Leu Leu Asn Phe
Ser Asp Ser 100 105 110 Asp Ser Ser Glu Glu Glu Glu Gly Pro Phe His
Ser Leu Pro Asp Val 115 120 125 Leu Glu Glu Ala Ser Glu Glu Glu Asp
Gly Ala Glu Glu Gly Glu Asp 130 135 140 Gly Asp Arg Val Pro Arg Gly
Leu Lys Gly Lys Lys Asn Ser Val Pro 145 150 155 160 Ser Thr Ile 8
176 PRT Homo sapiens 8 Met Arg Gly Ser His His His His His His Gly
Met Ala Ser Met Thr 1 5 10 15 Gly Gly Gln Gln Met Gly Arg Asp Leu
Tyr Asp Asp Asp Asp Lys Asp 20 25 30 Arg Trp Gly Ser Glu Glu Gly
Thr Pro Leu Thr Leu Tyr Tyr Ser His 35 40 45 Trp Arg Lys Leu Arg
Asp Arg Glu Ile Gln Leu Glu Ile Ser Gly Lys 50 55 60 Glu Arg Leu
Glu Asp Leu Asn Phe Pro Glu Ile Lys Arg Arg Lys Met 65 70 75 80 Ala
Asp Arg Lys Asp Glu Asp Arg Lys Gln Phe Lys Asp Leu Phe Asp 85 90
95 Leu Asn Ser Ser Glu Glu Asp Asp Thr Glu Gly Phe Ser Glu Arg Gly
100 105 110 Ile Leu Arg Pro Leu Ser Thr Arg His Gly Val Glu Asp Asp
Glu Glu 115 120 125 Asp Glu Glu Glu Gly Glu Glu Asp Ser Ser Asn Ser
Glu Asp Gly Asp 130 135 140 Pro Asp Ala Glu Ala Gly Leu Ala Pro Gly
Glu Leu Gln Gln Leu Ala 145 150 155 160 Gln Gly Pro Glu Asp Glu Leu
Glu Asp Leu Gln Leu Ser Glu Asp Asp 165 170 175
* * * * *