U.S. patent application number 10/997454 was filed with the patent office on 2008-05-22 for modified truncated complement system regulators.
This patent application is currently assigned to Washington University. Invention is credited to John P. Atkinson, Dennis Hourcade, Malgorzata Krych.
Application Number | 20080119400 10/997454 |
Document ID | / |
Family ID | 24793320 |
Filed Date | 2008-05-22 |
United States Patent
Application |
20080119400 |
Kind Code |
A1 |
Atkinson; John P. ; et
al. |
May 22, 2008 |
Modified truncated complement system regulators
Abstract
Analogs of regulators of complement activation (RCA) proteins
which have altered specificities and affinities for the targets C3b
and/or C4b are described. These analogs are obtained by
substituting amino acids which affect the complement inhibitory
activities of these proteins, substituting, rearranging or adding
SCRs (short consensus repeats) or SCR regions to the proteins,
deleting amino acid sequences, and combinations thereof.
Inventors: |
Atkinson; John P.; (St.
Louis, MO) ; Hourcade; Dennis; (Creve Coeur, MO)
; Krych; Malgorzata; (St. Louis, MO) |
Correspondence
Address: |
HAMILTON, BROOK, SMITH & REYNOLDS, P.C.
530 VIRGINIA ROAD, P.O. BOX 9133
CONCORD
MA
01742-9133
US
|
Assignee: |
Washington University
St. Louis
MO
|
Family ID: |
24793320 |
Appl. No.: |
10/997454 |
Filed: |
November 23, 2004 |
Related U.S. Patent Documents
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
|
|
08126505 |
Sep 24, 1993 |
6897290 |
|
|
10997454 |
|
|
|
|
07695514 |
May 3, 1991 |
|
|
|
08126505 |
|
|
|
|
Current U.S.
Class: |
424/184.1 ;
435/320.1; 435/325; 435/69.1; 514/21.2; 530/350; 536/23.1;
800/13 |
Current CPC
Class: |
A61P 9/10 20180101; A01K
2217/05 20130101; C07K 14/70596 20130101; A01K 67/0275 20130101;
A61P 43/00 20180101; C07K 2319/00 20130101; A01K 2267/03 20130101;
A61K 38/00 20130101; A61P 9/00 20180101; A61P 37/00 20180101; C07K
14/472 20130101 |
Class at
Publication: |
514/12 ; 530/350;
536/23.1; 435/320.1; 435/325; 800/13; 435/69.1 |
International
Class: |
A61K 38/00 20060101
A61K038/00; C07K 14/00 20060101 C07K014/00; C12N 15/11 20060101
C12N015/11; C12N 15/00 20060101 C12N015/00; A61P 43/00 20060101
A61P043/00; C12N 5/06 20060101 C12N005/06; A01K 67/00 20060101
A01K067/00; C12P 21/04 20060101 C12P021/04 |
Claims
1-34. (canceled)
35. A modified form of a regulator of complement activation protein
(RCA protein), wherein said RCA protein is selected from the group
consisting of complement receptor 1, decay accelerating factor,
membrane cofactor protein, C4 binding protein, factor H and any one
of the foregoing RCA proteins wherein the carboxy terminus is
removed to allow the RCA protein to be secreted; wherein said
modified form is selected from the group consisting of: a) a hybrid
RCA protein comprising short consensus repeats (SCRs) of said RCA
protein and SCRs from a different RCA protein selected from the
group consisting of complement receptor 1, decay accelerating
factor, membrane cofactor protein, C4 binding protein and factor H;
b) a recombined RCA protein in which the SCRs of said RCA protein
are rearranged; c) a truncated RCA protein consisting of three SCRs
of said RCA protein; and d) a modified RCA protein comprising any
combination of a), b) and c), wherein the modified form has
complement regulatory activity.
36. The modified form of a RCA protein of claim 35, wherein the
complement regulatory activity is one or more activities selected
from the group consisting of C3b binding activity, C4b binding
activity, C3b and C4b binding activity, C3b cofactor activity, C4b
cofactor activity, and decay accelerating activity.
37. The modified form of a RCA protein of claim 35, wherein said
modified form is a recombined RCA protein in which the SCRs of said
RCA protein are rearranged.
38. The modified form of a RCA protein of claim 35, wherein said
modified form is a truncated RCA protein consisting of three SCRs
of said RCA protein.
39. A hybrid regulator of complement activation protein (RCA
protein), wherein said hybrid RCA protein comprises: a) short
consensus repeats (SCRS) of a first RCA protein selected from the
group consisting of complement receptor 1, decay accelerating
factor, membrane cofactor protein, C4 binding protein and factor H;
and b) SCRs from a different RCA protein selected from the group
consisting of complement receptor 1, decay accelerating factor,
membrane cofactor protein, C4 binding protein and factor H, wherein
said hybrid RCA protein has complement regulatory activity.
40. The hybrid RCA protein of claim 39, wherein said first RCA
protein is decay accelerating factor, and said different RCA
protein is selected from the group consisting of complement
receptor 1, membrane cofactor protein, C4 binding protein and
factor H.
41. The hybrid RCA protein of claim 40, wherein said different RCA
protein is membrane cofactor protein.
42. The hybrid RCA protein of claim 41, wherein said hybrid RCA
protein comprises SCRs 2-3 of decay accelerating factor, and SCRs
1, 2, 3 and 4 of membrane cofactor protein.
43. The hybrid RCA protein of claim 42, wherein said hybrid RCA
protein has decay accelerating activity, and one or more additional
complement regulatory activities selected from the group consisting
of C3b binding activity, C4b binding activity, C3b and C4b binding
activity, C3b cofactor activity, and C4b cofactor activity.
44. The hybrid RCA protein of claim 39, wherein said hybrid RCA
protein is soluble.
45. An isolated DNA molecule that encodes the modified form of a
RCA protein of claim 35.
46. An expression vector comprising a DNA molecule that encodes the
modified form of a RCA protein of claim 35 operably linked to a
control sequence compatible with a host cell.
47. A recombinant host cell comprising the expression vector of
claim 46.
48. A nonhuman transgenic animal having a DNA molecule that encodes
the modified form of a RCA protein of claim 35 stably integrated
into its genome.
49. A method for making the modified form of a RCA protein any one
of claims 35, 39 and 40, comprising expressing a DNA encoding said
modified form of a RCA protein in a host cell.
50. A pharmaceutical composition comprising the modified form of an
RCA protein of claim 35 and a pharmaceutically acceptable
carrier.
51. An isolated DNA molecule that encodes the hybrid RCA protein of
claim 39.
52. An expression vector comprising a DNA molecule that encodes the
hybrid RCA protein of claim 39 operably linked to a control
sequence compatible with a host cell.
53. A recombinant host cell comprising the expression vector of
claim 52.
54. A nonhuman transgenic animal having a DNA molecule that encodes
the hybrid RCA protein of claim 39 stably integrated into its
genome.
55. A method for making the hybrid RCA protein of claim 39,
comprising expressing a DNA encoding said hybrid RCA protein in a
host cell.
56. A pharmaceutical composition comprising the hybrid RCA protein
of claim 39 and a pharmaceutically acceptable carrier.
57. A hybrid regulator of complement activation protein (RCA
protein), wherein said hybrid RCA protein comprises: a) short
consensus repeats (SCRs) 2, 3 and 4 of decay accelerating factor;
and b) SCRs of a RCA protein selected from the group consisting of
complement receptor 1, complement receptor 2, membrane cofactor
protein, C4 binding protein and factor H, wherein said hybrid RCA
protein has decay accelerating activity and one or more activities
selected from the group consisting of C3b binding activity, C4b
binding activity, C3b and C4b binding activity, C3b cofactor
activity, and C4b cofactor activity.
58. The hybrid RCA protein of claim 57, wherein said hybrid RCA
protein comprises SCRs of membrane cofactor protein.
59. The hybrid RCA protein of claim 58, wherein said hybrid RCA
protein comprises SCRs 3 and 4 of membrane cofactor protein.
60. The hybrid RCA protein of claim 59, wherein said hybrid RCA
protein comprises SCRs 2, 3 and 4 of membrane cofactor protein.
61. The hybrid RCA protein of claim 60, wherein said hybrid RCA
protein comprises SCRs 1, 2, 3 and 4 of membrane cofactor
protein.
62. The hybrid RCA protein of claim 57, wherein the hybrid RCA
protein binds C3b and C4b and has C3b cofactor activity, C4b
cofactor activity, and decay accelerating activity.
63. The hybrid RCA protein of claim 57, wherein said hybrid RCA
protein is soluble.
64. An isolated DNA molecule that encodes the hybrid RCA protein of
claim 57.
65. An expression vector comprising a DNA molecule that encodes the
hybrid RCA protein of claim 57 operably linked to a control
sequence compatible with a host cell.
66. A recombinant host cell comprising the expression vector of
claim 65.
67. A method for making the hybrid RCA protein of claim 57,
comprising expressing a DNA encoding said hybrid RCA protein in a
host cell.
68. A pharmaceutical composition comprising the hybrid RCA protein
of claim 57 and a pharmaceutically acceptable carrier.
Description
RELATED APPLICATIONS
[0001] This application is a continuation of U.S. application Ser.
No. 08/126,505, filed Sep. 24, 1993 (Now U.S. Pat. No. 6,897,290
B1), which is a continuation-in-part of U.S. application Ser. No.
07/695,514, filed May 3, 1991 (Now abandoned). The entire teachings
of U.S. application Ser. No. 08/126,505 (Now U.S. Pat. No.
6,897,290 B1) are incorporated herein by reference.
BACKGROUND OF THE INVENTION
[0002] The invention relates to modified and/or shortened forms of
complement regulators derived from regulatory proteins of
complement activation (RCA), especially CR1.
[0003] The complement system serves to aid in the removal of
foreign substances and of immune complexes from animal hosts. This
system and its regulation is reviewed by Hourcade, D., et al.,
Advances in Immunol (1989) 45: 381-416. Briefly, the complement
system generates, either by a "classical pathway" or an
"alternative pathway," C3b which binds to target immune complexes
or foreign substances and marks them for destruction or clearance.
C3b is generated from its precursor C3 by the proteolytic enzymes
collectively designated "C3 convertase." One form of C3 convertase
is generated in the classical pathway by the association of the
proteins C4b and C2a. The other form is generated in the
alternative pathway by association of C3b and Bb. Both C3
convertases can associate with an additional C3b subunit to form
the C5 convertases, C3bBbC3b and C4bC2aC3b, both of which are
active in the production of the C5-C9 membrane attack complex which
can cause cell lysis, and the production of C5a, a major
proinflammatory agent.
[0004] Both C3b, and less directly, C4b, are agonists in the
complement system. This is shown in the diagram in FIG. 1.
[0005] The complement system is regulated via a number of
interrelated mechanisms. There are two general mechanisms for
inhibition of the destructive components of the complement system.
The first mechanism is generally reversible, facilitating the
dissociation of the C3 convertases--i.e., C3b from Bb and C4b from
C2a. Facilitation of dissociation is sometimes known as decay
acceleration. The dissociation may also involve reversible binding
of the antagonist proteins to C3b or C4b components, thus
preventing their reassociation. The other mechanism, which is an
irreversible inactivation process, results from proteolytic
cleavage of the C3 convertase components C3b or C4b by the serine
protease factor I. This proteolytic cleavage occurs only in the
presence of a cofactor. Both general regulatory mechanisms, the
facilitation of dissociation of C3b and C4b and the inactivation of
C3B and C4b through cleavage by factor I, also apply to the
inhibition of the alternative pathway C5 convertase (C3bBbC3b) and
the classical pathway C5 convertase (C4bC2aC3b).
[0006] The proteins encoded by a region of the genome which is
designated the "regulators of complement activation" (RCA) gene
cluster are involved in both of the foregoing mechanisms.
Currently, it is known that at least six complement proteins are
encoded by this region. These are summarized in Table 1.
TABLE-US-00001 TABLE 1 RCA Proteins: Functional Profile Primary
Decay Acceleration Cofactor Name Ligand(s) (Dissociation) Activity
CR1 C3b/C4b + + MCP C3b/C4b - + DAF C3b/C4b + - C3 Convertases C4bp
C4b + + Factor H C3b + + CR2 C3dg - ND
[0007] These proteins share certain structural similarities which
are further described below.
[0008] The reversible binding to C4b or C3b to dissociate the C3
convertases is effected by two plasma proteins designated C4
binding protein (C4bp) and factor H, and by two membrane proteins
designated decay acceleration factor (DAF) and complement receptor
1 (CR1). Reversible binding to C4b is effected by C4bp, DAF and CR1
while reversible binding to C3b is effected by factor H, DAF and
CR1.
[0009] The irreversible inactivation of the C3 convertases
resulting from proteolytic cleavage of convertase components C3b or
C4b by the enzyme factor I can occur by virtue of cofactor activity
effected by the above-mentioned factor H and C4bp in the plasma and
by CR1 and membrane cofactor protein (MCP) at the cell surface.
Cofactor activity for cleavage of C3b is effected by factor H, CR1
and MCP while cofactor activity for cleavage of C4b is effected by
C4bp, CR1 and MCP. It is also possible that the sixth protein,
complement receptor 2 (CR2), has this cofactor activity at the cell
surface.
[0010] In summary, of the six proteins encoded by the RCA gene
cluster, factor H, C4bp, and CR1 have both reversible dissociation
activity and irreversible cofactor activity; DAF has only
reversible dissociation activity, and MCP and possibly CR2 have
only irreversible cofactor activities. CR1, DAF and MCP interact
with both C3b and C4b; C4bp interacts primarily with C4b, and
factor H interacts primarily with C3b.
[0011] The cDNAs corresponding to CR1, CR2, DAF, MCP, C4bp, and
factor H have all been obtained and sequenced. Evaluation of these
comparative sequences has lead to the alignment set forth in FIG.
2A which shows the organization of the RCA proteins into short
consensus repeat ("SCR") containing and non-SCR-containing regions
with the N-terminal ends at the left. In this figure, TM refers to
transmembrane domain, C to cytoplasmic domain, 0 to 0-linked
glycosylation domain, G to glycolipid anchor, U to domain with
unknown significance and D to a disulfide bridge-containing
domain.
[0012] There is considerable uniformity among the RCA family of
proteins. All of them are composed of 60-70 amino acid repeating
units commonly designated "short consensus repeats" (SCRs). Each
SCR shares a number of invariant or highly conserved amino acid
residues with other SCRs in the same protein or SCRs in other
family members. Those members of the family which are membrane
bound also have at their C termini either transmembrane regions and
intracellular regions or a glycolipid anchor.
[0013] The SCRs form the extracellular portions of those members of
the family which are membrane-bound and almost all of the protein
structure in the secreted members. Two covalently-crosslinked
cysteine pairs establish two loops within each SCR. The smallest
family members are DAF and MCP; each contains four SCRs followed by
an 0-linked glycosylation region. DAF is terminated with a
glycolipid anchor while MCP ends with an extracytoplasmic segment
of unknown significance, a transmembrane region and an
intracellular domain. Of the secreted members of the family, factor
H contains twenty SCRs, while the native form of C4bp is an
association of seven subunits of eight SCRs (the C4bp alpha chains)
and one subunit of three SCRs (the C4bp beta chain). Both C4bp
chains conclude with non-SCR domains that interconnect the chains
through disulfide linkages. CR2 contains sixteen SCRs, a
transmembrane region and an intracellular domain. The most common
polymorphic form of CR1 contains four repeating units of seven
similar SCRs (long homologous repeats or LHRS) numbered 1-28,
followed by an additional two SCRs designated 29 and 30, a
transmembrane region and an intracellular region.
[0014] Klickstein, L. B., et al., J. Exy. Med. (1988)
168:1699-1717, described the identification of distinct C3b and C4b
recognition sites in CR1 using deletion mutagenesis. They concluded
that a singled primary C4b binding site is located in SCR 1-2 (SEQ
ID NOS: 1 and 3), while two major C3b binding sites are located in
SCR 8-9 (SEQ ID NOS: 2 and 4), and SCR 15-16. C3b cofactor activity
was localized to SCR 8-9 and SCR 15-16. More recently it has been
shown the CR1 active site containing SCR 8-9 extends to SCR 10, and
by analogy, the active site that contains SCR 15-16 (which is only
one amino acid different than SCR 8-9) must extend to SCR 17.
(Kalli, et al., J. Exp. Med. 174, 1451-1460 (1991); Makrides, et
al., J. Biol. Chem. 267, 24754-24761 (1992)). The CR1 active site
containing SCR 1-2 extends to SCR 3 and/or 4, as reported by
Makrides, et al., (1992).
[0015] The murine C4bp binding site, and presumably the C4b
cofactor and C4bC2a decay acceleration active sites, was reported
to extend from SCRs 1-3 in the alpha chain by Ogata, et al., J.
Immunology 150, 2273-2280 (1993).
[0016] The factor H binding site, and probably the C3b cofactor and
C3bBb decay acceleration active sites, lies within the first five
SCRs. The CR2 binding site for C3b proteolytic products extends
through the first two SCRs, Kalli, et al., J. Immunology 147:
509-594 (1991); Carel, et al., J. Biol. Chem. 265: 12293-12299
(1990).
[0017] The MCP active sites extend through all four SCRs: SCRs 2-4
are required for C3b and C4b cofactor activity. SCR 1 appears
unnecessary for C3b cofactor activity and binding but appears
necessary for efficient C4b cofactor activity and binding, as
reported by Adams, et al., J. Immunology 147:3005-3011 (1991). The
DAF active sites extend through SCRs 2-4, as reported by Coyne, et
al., J. Immunology 149: 2906-2913 (1992).
[0018] Hourcade, D., et al., J. Exp. Med. 168: 1255-1270 (1988),
described a cDNA clone designated CR1-4 that encodes the first
eight and one-half amino terminal SCRs of CR1. This cDNA was
transfected into COS cells which resulted in the synthesis of a
secreted truncated form of CR1 with a molecular weight of 78 kd
(Krych, N. et al., Proc. Natl. Acad. Sci. USA 88:4353-4357 (1991).
This shortened form of the protein, as shown herein below, binds
mainly C4b. This shortened form has now been determined to have C4b
cofactor activity, as described herein.
[0019] The multiple binding sites of CR1 can cooperate in their
interactions with C3b-containing targets. In vitro, CR1 binds
C3-C3b dimers much more tightly than C3b monomers because binding
to dimers can occur simultaneously at two sites in the same CR1
molecule, as reported by Wong and Farrell, J. Immunol. (1991)
146:656; Ross and Medof Adv. Immunol. (1985) 37:217). Deletion of
one of the two primary C3b binding sites can reduce the binding of
CR1 to C3-C3b by a factor of ten, as reported by Wong and Farrell,
J. Immunol. (1991) 146:656. It is likely that the primary C4b
binding site also cooperates with the primary C3b binding sites in
interactions with targets that contain both C3b and C4b. These
effects have an important consequence in vivo: CR1 has a higher
affinity for targets densely coated with C3b and with targets
densely coated with C3b plus C4b.
[0020] The C5 convertases, which are important in the stimulation
of inflammation and in lysis of some target cells, are composed of
multiple CR1 ligands: The classical C5 convertase contains C3b and
C4b (C4bC3bC2a) while the alternative pathway C5 convertase
contains two C3b proteins (C3bC3bBb). Inactivation of the C5
convertases by CR1 can also involve cooperation between more than
one CR1 binding site. Wong and Farrell. J. Immunol. (1991) 146:656
showed that more than one CR1 C3b binding site may be essential for
effective inhibition of alternative pathway C3 and C5
convertases.
[0021] The proteins encoded by the RCA gene cluster can be prepared
recombinantly and used in diagnosis and therapy for the regulation
of the complement system. The problems of transplantation of
xenografts are reviewed by Platt, J. L., et al., in Immunology
Today (1990) 11:450-457. Evidence has accumulated that the
immediate hyperacute rejection of discordant xenografts is caused
by recipient complement activity. Transgenic animals expressing
human complement regulators (such as DAF or MCP) on cell surfaces
could be an abundant source of organs that would be protected from
hyperacute rejection in human recipients. A soluble complement
inhibitor could also play a role in protecting xenografts from
complement-mediated rejection.
[0022] The ability of a recombinant soluble form of CR1 to inhibit
inflammation in the reversed passive Arthus reaction in rats was
described by Yeh, C. G., et al., J. Immunol (1991) 146:250-256.
This soluble CR1 was obtained from Chinese hamster ovary (CHO)
cells expressing a CR1 genetic construct which has been mutated to
remove the transmembrane and cytoplasmic domains. The ability of a
similar soluble CR1, produced recombinantly in CHO cells, to
inhibit post-ischemic myocardial inflammation and necrosis in rats
was reported by Weissman, H. F., et al., Science (1990) 249:
146-151.
[0023] Proteins related to the RCA proteins have also been shown to
be produced by viruses, presumably as a mechanism whereby infection
by the virus can be facilitated, as reported by Kotwaal, J., et
al., Nature (1988) 335:176-178; McNearney, T. A., J Exp Med (1987)
16:1525-1535.
[0024] Complete inhibition of the complement system on a long-term
basis is not likely to be desirable in most individuals. In some
cases of autoimmune disease, inhibition of the classical pathway
alone may be sufficient. In the case of the xenograft transplants,
however, stringent inhibition of both pathways may be important.
Similar stringency may be required for other applications.
Accordingly, alternative modulators of the complement system with
regulatable binding activities would be desirable.
[0025] It is therefore an object of the present invention to
provide modified complement regulators which can be administered in
soluble form for treatment of inflammatory disorders or to reduce
an individuals ability to reject foreign materials.
[0026] It is a further object of the present invention to provide
modified complement regulators which are shorter and more easily
and economically produced than the more complex naturally occurring
proteins.
[0027] It is another object of the present invention to provide
complement regulators which combine the activities of different
complement regulators to provide enhanced capability of inhibiting
complement proteins specifically and systemically, both in the
classical and alternative pathways.
SUMMARY OF THE INVENTION
[0028] Analogs of RCA-encoded proteins which are modified
full-length or truncated forms of these regulatory proteins are
described and demonstrated to inhibit complement. These analogs
include RCA proteins having site-specific mutations, RCA protein
hybrids which contain one or more SCRs from more than one RCA
protein, modifications of RCA recombinants in which SCRs are
arranged in different orders, and truncated versions of RCA
proteins containing as few three SCRs. The modified proteins retain
complement inhibitory activity, which may be altered in
specificity, affinity, or mechanism.
[0029] The modified proteins are demonstrated by the .DELTA.SCR11,
subst1 mutation of CR1-4 (8,9) which carries only three complete
SCRs (amino acids 1-194, SCR 1-3) and has C3b activity; the 1ar,
5r, 8br mutation of CR1-4 (8,9) which has accentuated C3b and C4b
activities; a three SCR form composed of the first 194 amino acids
of 1ar, 5r, 8br; mutants 10, 10, 10, 11, 11c, 8a, 6b, 14, and 15,
which are mutations of CR1-4 with enhanced activities; and CR1-4
stop7, CR1-4 (8,9) stop7, and 1ar, 5r, 8br stop 7, which are all
shorter (7 SCR) forms of their respective parent proteins (CR1-4,
CR1-4 (8,9) and 1ar, 5r, 8br, respectively).
[0030] These analogs are useful in controlling the complement
system, and thus may be useful in the treatment of autoimmune
diseases, the suppression of rejection of transplants, in diagnosis
and the reduction in tissue damage associated with myocardial
infarctions and cerebral vascular accidents. They may also play a
role in the diagnosis of conditions associated with complement
activation and immune complex formation.
BRIEF DESCRIPTION OF THE DRAWINGS
[0031] FIG. 1 is a schematic of the classical and alternate
complement pathways.
[0032] FIGS. 2A and 2B compare the amino acid sequences of SCR-1
and SCR-8 (FIG. 2A) and SCR-2 and SCR-9 (FIG. 2B) of full-length
CR1, indicating the substitutions made in SCR-1 and SCR-2 based on
the amino acid residue present in corresponding SCR, as described
in Examples 2 and 3 and Table 2.
[0033] FIGS. 3A and 3B compare the amino acid sequences of SCR-1
and SCR-8 (SEQ ID NOS: 1 and 2, respectively) (FIG. 3A) and SCR-2
and SCR-9 (SEQ ID NOS: 3 and 4 respectively) (FIG. 3B) of
full-length CR1. The amino acid sequences of SCR 15-16 are
identical to those of SCR 8-9, with the exception of a T at
position 110. The enumeration is consistent with that of Hourcade
et al., J. Exp. Med. (1988) 168:1255-1270, with the first amino
acid (Q) of the mature receptor designated as amino acid 1. The
corresponding position in CR1 SCR-8 is similarly designated.
DETAILED DESCRIPTION OF THE INVENTION
[0034] Described herein is a family of RCA protein analogs which
can be used to modulate the complement system. The analogues can be
used to alter the binding specificity of the members of this
protein family in both membrane-bound and soluble forms.
[0035] As used herein, "RCA proteins" refers to proteins encoded by
the gene cluster at the genetic region1q32, which encodes six known
proteins effective in complement regulation: factor H, C4bp, CR1,
CR2, DAF and MCP. In addition, apparent coding regions similar to
the amino terminal coding region of the CR1 gene and the MCP gene
have been located in this cluster, although it is unclear whether
these sequences are expressed (Hourcade, D., et al., J. Biol. Chem.
265:974-980 (1990); Hourcade, D., et al., Genomics 12:289-300
(1992)). The term "RCA proteins" also refers to a group of proteins
which include short consensus repeats (SCR) which are homologous to
SCR in CR1, CR2, MCP, DAF, factor H, and C4bp.
[0036] The modified proteins described herein are collectively
referred to as "modified RCA proteins". These include truncated,
hybrid and recombined forms of the RCA proteins. "Truncated"
proteins are shorter versions of the RCA proteins, typically
modified so as to remove the C-terminal regions which effect
membrane binding or secretion and sometimes modified further by
deletion of one or more SCRs. "Hybrid" proteins are RCA proteins
that are composed of portions, i.e., the SCRs, of one RCA protein
combined with SCRs of one or more other RCA proteins. "Recombined"
forms are those wherein the SCRs of an RCA protein are rearranged
in a new order. Also included are proteins having site specific
substitutions and deletions of one or more amino acids. "Modified
RCA proteins" include proteins which result from combinations of
these changes.
[0037] CR1 contains 30 SCR, followed by transmembrane and
cytoplasmic regions. CR2 contains 16 SCR, followed by transmembrane
and cytoplasmic regions. DAF and MCP each contain four SCR,
followed by glycolipid anchor regions. The C4 binding protein is a
more complex protein, having seven alpha subunits and one beta
subunit. The alpha subunit consists of eight SCR and the beta
subunit consists of three SCR. Factor H consists of twenty SCR.
[0038] In some embodiments, modifications are made using
corresponding SCRs of the protein as sites for alteration. By
"corresponding SCR" is meant the most highly homologous SCR as
judged by the amino acid sequences of the protein. Exon structure
can in some cases facilitate this assignment. As noted above, SCRs
1-3 of CR1 correspond to SCRs 2-4 of DAF. SCRs 1-3 of factor H,
CR1, C4bp and MCP are corresponding sequences among these proteins.
CR1 is organized into a series of long homologous repeats (LHRS)
containing 7 SCRs so that CR1 SCRs 1-7 correspond to CR1 SCRs 8-14;
15-21; and 22-28. CR2 is organized into a series of long homologous
repeats of 4 SCRs in length. SCRs 1-2 of CR1 correspond to SCRs
3-4, SCRs 7-8, SCRs 11-12 and SCRs 15-16 of CR2.
[0039] These proteins are characterized by C4b-binding activity,
C3b-binding activity, C4b cofactor activity, and C3b cofactor
activity. In general, it takes two to three SCRs for each activity.
Activities which are biologically important include decay
acceleration or dissociation, C3b cofactor activity and C4b
cofactor activity. Cofactor activity requires binding but binding
alone may not be sufficient for cofactor activity.
[0040] It is generally accepted that CR1 C4b binding and cofactor
activity requires SCRs 1, 2 and 3, 8, 9, and 10, or 15, 16, and 17,
which are corresponding regions of the protein. C3b binding and
cofactor activity requires SCRs 8, 9, and 10, or 15, 16, and 17,
which are corresponding regions of the protein. MCP C4b binding and
cofactor activity requires SCRs 1, 2, 3, and 4. MCP C3b binding
requires SCRs 3 and 4; cofactor activity requires SCRs 2, 3, and 4.
C4bp C4b binding and cofactor activity requires SCRs 1, 2, and 3.
DAF decay accelerating activity requires SCRs 2, 3, and 4. Factor H
C3b binding activity is mediated by the first five SCRs.
[0041] Based on these discoveries, it is possible to design a more
potent soluble complement inhibitor by modifying corresponding
regions to increase affinity for C4b and C3b or to design soluble
complement inhibitors that specifically inhibit one part of the
complement system. These modifications can be in the form of
specific substitutions of amino acids that alter C3b or C4b binding
within corresponding SCRs of CR1 or other RCA proteins, or
substitution of SCRs from one protein into another.
Substitution of SCR Regions from One Regulatory Protein into a
Second Regulatory Protein
[0042] The identification of the amino acid sequences essential (or
refractory) to binding to C4b and C3b and C4b and C3b cofactor
activity permits transposition of similar sequences into
corresponding regions of the same protein or corresponding regions
of other family members or alteration of sequences which bind C3b
and C4b so as to alter their affinities. Corresponding regions have
been identified by degree of amino acid sequence homology.
[0043] In the case of CR1, four corresponding regions of interest
are SCRs 1-3, SCRs 8-10, SCRs 15-17 and SCRs 22-24. The SCR
portions 2-4 for DAF correspond to 1-3, 8-10, 15-17 and 22-24 for
CR1. Substitution of portions of DAF with homologous CR1 sequences
provides forms of DAF with cofactor activity and/or binding
activity, such as is exhibited by CR1. Similarly, substitutions of
portions of MCP with homologous sequences provides forms of MCP
with increased binding affinity and cofactor activity and/or
increased dissociation activity.
[0044] Specific Amino Acid Substitutions
[0045] Addition of Binding Sites
[0046] Specific amino acids are selected for substitution based on
studies that elucidate their roles in complement regulation in
specific active sites. Substitution can be employed in order to
alter the activity of additional RCA active sites in the same or
other proteins. In this manner, binding and cofactor sites can be
added to SCRs not normally contributing directly to binding
capacity.
[0047] The standard one letter abbreviations for amino acids are
used herein.
[0048] For example, the C3b and C4b binding and f 4 cofactor
sequence in CR1, N-A-A-H-W-S-T-K-P-P-I-C-Q (amino acids 49-61 of
SEQ ID NO: 4) can be transferred to corresponding locations or to
locations referenced to conserved amino acids in alternative SCRs
to confer C3b binding. Conversely, the homologous sequence in SCR-2
of CR1, i.e., D-T-V-W-D-N-E-T-P-I-C-D (amino acids 49-61 of SEQ ID
NO: 3) can be transferred by substitution to other locations in
C3b-binding SCRs in order to decrease C3b and C4b binding and
cofactor activity. The C4b binding regions are shown to be
associated with three separate critical locations in SCR-1 and
SCR-2 of CR1 in the proximity of amino acids 35, 64-65, and 92-94.
Alterations in amino acid sequences of the corresponding SCRs in
CR1 or in additional RCA family members or their truncated, hybrid,
or recombined forms in these positions alter C4b binding and
cofactor activities and in at least one case alter C3b binding and
cofactor activities.
[0049] Substitution of Similar Amino Acids
[0050] Structurally similar amino acids can be substituted in such
transfers for some of the specified amino acids. Structurally
similar amino acids include: (I, L, V); (F, Y); (K, R); (Q, N); (D,
E); and (G, A).
[0051] Deletion of Amino Acids
[0052] It also may be advantageous to delete amino acids from
specific active sites in order to alter or enhance complement
regulatory activity.
[0053] Construction of Truncated Forms
[0054] In some embodiments, it will be advantageous to delete a
specific activity by deletion of a region known to have a
particular activity. It may also be desirable to delete the region
of the protein which anchors the naturally occurring protein to the
cell surface, for example, the transmembrane and cytoplasmic
regions or the glycolipid anchor region.
[0055] Modifications which Enhance Cofactor Activity
[0056] In general, either C3b or C4b cofactor activity can be
enhanced by substitutions which increase the binding activity of
the other factor, i.e., to increase C4b cofactor activity, amino
acids are substituted into the modified protein which increase C3b
binding and vice versa. This is demonstrated in the examples,
specifically by the mutants shown in Table 2. An example of a
single amino acid substitution that enhances both C4b cofactor
activity and C3b cofactor activity is CR1 mutant 6b: changing the G
at 79 to D increases C4b cofactor, as well as C3b binding and
cofactor activity.
[0057] Preparation of the Analogs
[0058] The modified proteins described herein are most conveniently
prepared using recombinant techniques, although in some cases they
can be prepared by enzymatic cleavage, for example, to yield
truncated or soluble forms of the naturally occurring proteins. The
genes encoding the various members of the RCA protein family are of
known sequence and are published.
[0059] cDNA encoding CR1 has been described by Klickstein, L. B.,
et al., J. Exp. Med. (1987) 165: 1095, Klickstein, L. B., et al.,
J. Exp. Med. (1988) 168:1699; Hourcade, D., et al., J. Exp. Med.
(1988) 168: 1255, the teachings of which are incorporated by
reference. SEQ ID NO: 12 is the nucleotide sequence and SEQ ID NO:
13 is the amino acid sequence for CR1. Sequence ID No 14 is the
nucleotide sequence and SEQ ID NO: 15 is the amino acid sequence
for CR2. SEQ ID NO: 16 is the nucleotide sequence and SEQ ID NO: 17
is the amino acid sequence for DAF. SEQ ID NO: 18 is the nucleotide
sequence and SEQ ID NO: 9 is the amino acid sequence for MCP.
[0060] The cDNA encoding CR2 has been described by Moore, M. D., et
al., Proc. Natl. Acad. Sci. USA (1987) 84:9194, and by Weiss, J.
J., et al., J. Exp. Med. (1988) 167:1047, the teachings of which
are incorporated by reference.
[0061] The cDNA encoding DAF has been described by Caras, I. W., et
al., Nature (1987) 25:545, and by Medof, M. E., et al., Proc. Natl.
Acad. Sci. USA (1987) 84:2007, the teachings of which are
incorporated by reference.
[0062] The cDNA encoding MCP has been described by Lublin, D. M.,
et al., J. Exp. Med. (1988) 168:181, the teachings of which are
incorporated by reference.
[0063] The cDNA encoding the C4bp alpha chain has been described by
Chung, L. P., et al., Biochem. J. (1985) 230:133, and the cDNA
encoding the C4bp beta chain has been described by Hillarp, A., and
Dahlback, B., Proc. Natl. Acad. Sci. USA (1990) 87:1183, the
teachings of which are incorporated by reference.
[0064] The cDNA encoding factor H has been described by Ripoche,
J., et al., Biochem. J. (1988) 249:593, the teachings of which are
incorporated by reference.
[0065] Since the cDNAs encoding these proteins are known and the
amino acid sequences have been deduced, comparison of corresponding
regions of the various proteins of the member families has been
possible. In addition, the availability of the cDNA sequence makes
possible the preparation of genetic constructs encoding truncated
forms and other modified forms of the proteins using standard
site-directed mutagenesis techniques, such as those described by
Kunkel, T. A., et al., Methods Enzymol (1987) 154: 367-382.
[0066] Accordingly, the first step in the preparation of the
analogs requires identification of the corresponding region of the
target protein through sequence homology and site-directed
mutagenesis in this region of the gene to alter C4b or C3b binding
properties.
[0067] After the gene encoding the analog is prepared, the modified
gene is expressed using standard recombinant techniques. The gene
sequence is ligated into a suitable expression vector under the
control of sequences known to be appropriate to the desired host.
Production of recombinant proteins in microbial systems such as E.
coli, B. subtilis, various strains of yeasts, and other fungi, such
as Aspergillus, is well known. It may be advantageous to produce
the desired analogs in cells of higher organisms as well, such as
the standard BPV/C127 system, the Baculovirus/insect cell system,
CHO cells, COS cells, and other mammalian cells, or in transgenic
animals. Standard expression systems in various cell lines are well
known and standard in the art.
[0068] Transgenic animals can be constructed for several species.
The gene is placed under the control of a suitable promoter, for
example, the metallothionine promoter or a tissue specific
promoter, and the gene microinjected into an embryo, which is then
implanted into a surrogate mother. Production in transgenic animals
is important in the context of preparing transplants for use in
other species.
[0069] Construction of Transgenic Animals.
[0070] Animal Sources
[0071] Animals suitable for transgenic experiments can be obtained
from standard commercial sources. These include animals such as
mice and rats for testing of genetic manipulation procedures, as
well as larger animals such as pigs, cows, sheep, goats, and other
animals that have been genetically engineered using techniques
known to those skilled in the art. These techniques are briefly
summarized below based principally on manipulation of mice and
rats.
[0072] Microinjection Procedures
[0073] The procedures for manipulation of the embryo and for
microinjection of DNA are described in detail in Hogan et al.
Manipulating the mouse embryo, Cold Spring Harbor Laboratory, Cold
Spring Harbor, N.Y. (1986), the teachings of which are incorporated
herein. These techniques are readily applicable to embryos of other
animal species, and, although the success rate is lower, it is
considered to be a routine practice to those skilled in this
art.
[0074] Transgenic Animals
[0075] Female animals are induced to superovulate using methodology
adapted from the standard techniques used with mice, that is, with
an injection of pregnant mare serum gonadotrophin (PMSG; Sigma)
followed 48 hours later by an injection of human chorionic
gonadotrophin (hCG; Sigma). Females are placed with males
immediately after hCG injection. Approximately one day after hCG,
the mated females are sacrificed and embryos are recovered from
excised oviducts and placed in Dulbecco's phosphate buffered saline
with 0.5% bovine serum albumin (BSA; Sigma). Surrounding cumulus
cells are removed with hyaluronidase (1 mg/ml). Pronuclear embryos
are then washed and placed in Earle's balanced salt solution
containing 0.5% BSA (EBSS) in a 37.5.degree. C. incubator with a
humidified atmosphere at 5% CO.sub.2, 95% air until the time of
injection.
[0076] Randomly cycling adult females are mated with vasectomized
males to induce a false pregnancy, at the same time as donor
females. At the time of embryo transfer, the recipient females are
anesthetized and the oviducts are exposed by an incision through
the body wall directly over the oviduct. The ovarian bursa is
opened and the embryos to be transferred are inserted into the
infundibulum. After the transfer, the incision is closed by
suturing.
[0077] Embryonic Stem (ES) Cell Methods
[0078] Introduction of cDNA into ES Cells:
[0079] Methods for the culturing of ES cells and the, subsequent
production of transgenic animals, the introduction of DNA into ES
cells by a variety of methods such as electroporation, calcium
phosphate/DNA precipitation, and direct injection are described in
detail in Teratocarcinomas and embryonic stem cells, a practical
approach, ed. E. J. Robertson, (IRL Press 1987), the teachings of
which are incorporated herein. Selection of the desired clone of
transgene-containing ES cells is accomplished through one of
several means. In cases involving sequence specific gene
integration, a nucleic acid sequence for recombination with the
gene of interest or sequences for controlling expression thereof is
co-precipitated with a gene encoding a marker such as neomycin
resistance. Transfection is carried out by one of several methods
described in detail in Lovell-Badge, in Teratocarcinomas and
embryonic stem cells, a practical approach, ed. E. J. Robertson,
(IRL Press 1987) or in Potter et al Proc. Natl. Acad. Sci. USA 81,
7161 (1984). Calcium phosphate/DNA precipitation, direct injection,
and electroporation are the preferred methods. In these procedures,
a number of ES cells, for example, 0.5.times.10.sup.6, are plated
into tissue culture dishes and transfected with a mixture of the
linearized nucleic acid sequence and 1 mg of pSV2neo DNA (Southern
and Berg, J. Mol. Appl. Gen. 1:327-341 (1982)) precipitated in the
presence of 50 mg lipofectin in a final volume of 100 .mu.l. The
cells are fed with selection medium containing 10% fetal bovine
serum in DMEM supplemented with an antibiotic such as G418 (between
200 and 500 .mu.g/ml). Colonies of cells resistant to G418 are
isolated using cloning rings and expanded. DNa is extracted from
drug resistant clones and Southern blotting experiments using the
nucleic acid sequence as a probe are used to identify those clones
carrying the desired nucleic acid sequences. In some experiments,
PCR methods are used to identify the clones of interest.
[0080] DNA molecules introduced into ES cells can also be
integrated into the chromosome through the process of homologous
recombination, described by Capecchi, (1989). Direct injection
results in a high efficiency of integration. Desired clones are
identified through PCR of DNA prepared from pools of injected ES
cells. Positive cells within the pools are identified by PCR
subsequent to cell cloning (Zimmer and Gruss, Nature 338, 150-153
(1989)). DNA introduction by electroporation is less efficient and
requires a selection step. Methods for positive selection of the
recombination event (i.e., neo resistance) and dual
positive-negative selection (i.e., neo resistance and ganciclovir
resistance) and the subsequent identification of the desired clones
by PCR have been described by Joyner et al., Nature 338, 153-156
(1989) and Capecchi, (1989), the teachings of which are
incorporated herein.
[0081] Embryo Recovery and ES Cell Injection
[0082] Naturally cycling or superovulated females mated with males
are used to harvest embryos for the injection of ES cells. Embryos
of the appropriate age are recovered after successful mating.
Embryos are flushed from the uterine horns of mated females and
placed in Dulbecco's modified essential medium plus 10% calf serum
for injection with ES cells. Approximately 10-20 ES cells are
injected into blastocysts using a glass microneedle with an
internal diameter of approximately 20 .mu.m.
[0083] Transfer of Embryos to Pseudolpregnant Females
[0084] Randomly cycling adult females are paired with vasectomized
males. Recipient females are mated such that they will be at 2.5 to
3.5 days post-mating (for mice, or later for larger animals) when
required for implantation with blastocysts containing ES cells. At
the time of embryo transfer, the recipient females are
anesthetized. The ovaries are exposed by making an incision in the
body wall directly over the oviduct and the ovary and uterus are
externalized. A hole is made in the uterine horn with a needle
through which the blastocysts are transferred. After the transfer,
the ovary and uterus are pushed back into the body and the incision
is closed by suturing. This procedure is repeated on the opposite
side if additional transfers are to be made.
[0085] Identification of Transgenic Animals
[0086] Samples (1-2 cm of mouse tails) are removed from young
animals. For larger animals, blood or other tissue can be used. To
test for chimeras in the homologous recombination experiments,
i.e., to look for contribution of the targeted ES cells to the
animals, coat color has been used in mice, although blood could be
examined in larger animals. DNA is prepared and analyzed by both
Southern blot and PCR to detect transgenic founder (F.sub.0)
animals and their progeny (F.sub.1 and F.sub.2).
[0087] Once the transgenic animals are identified, lines are
established by conventional breeding and used as the donors for
tissue removal and implantation using standard techniques for
implantation into humans.
Purification of Analogs
[0088] The analogs recombinantly produced in culture or animals can
be purified from the cell culture using standard purification
techniques such as chromatography, for example, immunoaffinity or
ion-exchange chromatography, and electrophoresis, generally using
the same procedures as have been published for use in purifying the
naturally occurring material.
[0089] While recombinant production of the analogs is the most
convenient and practical method of preparing the proteins, it may
also be desirable to synthesize the analogs using protein synthesis
techniques, such as standard solid-phase peptide synthesis
technology. This approach may be suitable in particular in the case
of truncated forms of the RCA protein family having modified amino
acid sequences, especially in view of the discovery that proteins
containing as few as three SCRs have useful biological
activity.
[0090] Assay Systems
[0091] The analogs are tested for the desired biological activities
among those characteristic of the RCA family using in vitro or in
vivo assays. In vitro systems such as those described by Wong and
Farrell (J. Immunol. (1991) 146:656) can be used to measure effects
on the complement pathways. In vivo and general biological effects
can be assessed as described by Weisman, et al. (Science (1990)
249:146); or Yeh, et al. (J. Immunol. (1991) 146:250), the
teachings of which are incorporated by reference.
[0092] Binding Assays.
[0093] Affinity chromatography columns were prepared as described
by Krych, et al., Proc. Natl. Acad. Sci. USA 88, 4353-4357. Binding
assays were performed using 100 ml of iC3-Sepharose (iC3-S) or
C4b-Sepharose (C4b-S) and 1.8 ml of medium containing of the mutant
protein. Media were diluted to desired concentrations of NaCl.
After 1 hr on a rotator at room temperature, samples were
centrifuged, media removed and bound protein eluted from the
Sepharose using 400 mM NaCl with 1% NP-40. Eluted proteins were
quantitated by ELISA, using two monoclonal anti-CR1 antibodies, 3D9
and E11 (Hogg, et al., Eur. J. Immunol. 14, 236-240 (1984), O'Shea,
et al., J. Immunol. 134, 2580-2587 (1985)). Since neither
monoclonal antibody reacts with CR1 SCRs-1, 2, 8 or 9, all of the
mutants derived from either CR1-4 or CR1-4 (8, 9) could be
quantitated: using this assay. For each mutant protein at least
three binding assays from different transfections were
performed.
[0094] Assays for Cofactor Activity
[0095] C3 and C4 were purified according to the method of Dykman,
et al., Proc. Natl. Acad. Sci. USA 80, 1698-1702 (1983), Dykman.,
et al., J. Exp. Med. 157, 2160-2165 (1983) or purchased (Quidel,
San Diego, Calif.), converted to C3b and C4b and labeled with
.sup.125I using Iodogen coated beads (Pierce). Cofactor assays were
performed using 200 ng of labeled C3b or C4b, 60 ng of factor I
(Quidel) and media with mutant proteins. Amounts of the cofactor
proteins were estimated in ELISA assay based on a standard curve of
secreted CR1 (sCR1, Weisman, et al., Science 249, 146-151 (1990)).
To test for cleavage of C3b, samples containing approximately 6 pg
of the mutant proteins were incubated for 1 hr at 37.degree. C. To
test for cleavage of C4b, samples were incubated for up to 16 hours
at 37.degree. C. After incubation, samples were reduced by boiling
in the buffer containing 2% SDS and 5% beta-mercaptoethanol in
0.25% TRIS, pH 6.8 and electrophoresed on a 4-20% SDS-PAG
(Integrated Separations) or on a 10% self-made gel. After drying
the gels were autoradiographed at -70.degree. C. using an
intensifying screen.
[0096] Preparation and Administration of Pharmaceutical
Compositions
[0097] The most potent analogs based on the in vitro assays are
tested in vivo. In general, the in vitro assays are accepted as
highly correlated with the corresponding in vivo activity. The
appropriate dosages are determined by comparing the in vitro
activity of the naturally occurring protein with that of the
analog, comparing the in vitro activity of the naturally occurring
protein with the in vivo activity of the naturally occurring
protein, then calculating the expected in vivo activity of the
analog, adjusting for any measured differences in half-life.
[0098] Complement activation can account for substantial tissue
damage in a wide variety of autoimmune/immune complex mediated
syndromes such as systemic lupus erythematosus, rheumatoid
arthritis, hemolytic anemias, myasthenia gravis and others.
Inhibition of the complement systems is a desirable therapeutic
intervention in these cases. In some instances, specific inhibition
of the classical pathway alone by RCA analogs could be preferred
since long-term inhibition of the alternative pathway could lead to
side effects.
[0099] Inhibition of complement activation could also be desirable
in cases that involve tissue damage brought about by vascular
injury such as myocardial infarction, cerebral vascular accidents
or acute shock lung syndrome. In these cases, the complement system
may contribute to the destruction of partially damaged tissue as in
reperfusion injury. Highly stringent inhibition of complement for
relatively brief periods might be preferred in these instances and
soluble RCA analogs designed for higher potency may prove
especially useful.
[0100] Complement inhibition may also prove important in the
prevention of xenograft rejection. Organs derived from animals
transgenic for human DAF or MCP may be protected at least in part
from complement-mediated hyperacute rejection by the expression of
transgenic DAF or MCP on the cell surfaces of the xenograft.
Animals transgenic for RCA analogs designed for higher potency may
provide more successful xenografts. Soluble RCA analogs may also
prove useful in protecting the transplant in the recipient.
[0101] Soluble analogs having decreased activity may also be useful
as competitive inhibitors of the natural inhibitors, in cases where
an increased complement mediated response is desirable or where an
individual has a disorder in which their immunity is compromised by
overproduction of the natural inhibitors.
[0102] The analogs can be administered locally or systemically in
pharmaceutically acceptable carries such as saline, phosphate
buffered saline, or a controlled release formulation. The dosage
level and mode of administration of the analogs depend on the
nature of the analog, the nature of the condition to be treated,
and the history of the individual patient. Systemic administration
is generally required, which may be by injection or by transmucosal
or transdermal delivery. Administration by injection may be
intravenous, intramuscular, intraperitoneal or subcutaneous.
Formulations for injection are generally biocompatible solutions of
the active ingredient such as Hank's solution or Ringer's solution.
Formulations for transdermal or transmucosal administration
generally include penetrants such as fusidic acid or bile salts in
combination with detergents or surface-active agents. The
formulations can then be manufactured as aerosols, suppositories,
or patches. Oral administration is generally not favored for
protein or peptide active ingredients; however, if suitably
formulated so as to be protected from the digestive enzymes, oral
administration can also be employed.
[0103] Suitable formulations for a desired mode of administration
can be found in Remington's Pharmaceutical Sciences, latest
edition, Mack Publishing Co., Easton, Pa. The dosage levels and
precise formulations are obtainable by routine optimization
procedures as is generally known in the art.
[0104] Diagnostic Applications
[0105] The analogs which are capable of binding C3b and/or C4b are
useful as diagnostic tools in assessing the presence, absence or
amount of C3b or C4b or C3b/C4b-bearing immune complexes in
biological fluids. Such assays take advantage of the ability of the
analog specifically to bind C3b and/or C4b and can be conducted in
a variety of formats as is generally known. Formats for specific
binding partner assays include direct and competitive formats,
sandwich assays, and agglutination assays. Complexation between
members of the specific binding pair can be conducted in solution
or on a solid phase and can be detected using a variety of labeling
techniques including fluorescence, radioisotopes, chromophores, and
visible particles.
[0106] Typical reagent kits useful in assays for C3b and/or C4b
and/or C3b/C4b-bearing immune complexes include the analog
specifically binding to the analyte, optionally coupled to a solid
support and additional labeling reagents useful in the assay. For
example, one of many formats for the assay might include treating
the sample to be tested with a solid support to which is coupled
the analog as a specific binding partner, washing the support which
has been treated with sample suspected of containing analyte, and
then treating the washed support with anti-C3b or anti-C4b antibody
labeled with an enzyme such as horseradish peroxidase. The presence
of labeled enzyme on a support then is detected by addition of a
substrate solution which results in the development of a color in
the presence of the enzyme.
[0107] The present invention will be further understood by
reference to the following non-limiting examples.
EXAMPLE 1
Binding Specificity of Truncated CR1
[0108] The cDNA which encodes the first 543 amino acids of mature
CR1 (SCR 1-8 and 1/2 of SCR-9) was transfected into COS cells to
obtain a secreted protein designated CR1-4.
[0109] Two mouse monoclonal anti-CR1 antibodies were used to
determine the immunoreactivity of CR1-4, and as a method to assay
for this protein. Antibody E11 (Hogg, N., et al., Eur. J. Immunol.
(1984) 14:236-243), and 3D9 (O'Shea, J. J., et al., J. Immunol.
(1985) 134:2580-2587), recognized CR1 and bound to the
recombinantly produced CR1-4.
[0110] Binding to C4b or to C3b was tested using either C4b or iC3
(C3 containing a broken thioester bond analogous in reactivity and
function to C3b) bound to a Sepharose.TM. support as described by
Dykman, T., et al., Proc. Natl. Acad. Sci. USA (1983) 80:1698-1702,
and Cole, J. L., et al., Proc. Natl. Acad. Sci. USA (1985)
82:859-863. Binding to these solid-supported substrates was
verified by testing with an ELISA for CR1.
[0111] The C4b derivatized support was able to bind CR1-4. Most
efficient binding occurred at between 12.5 and 25 mM salt.
[0112] Solubilized C4b inhibited binding of the CR1-4 protein to
C4b-derivatized support but soluble iC3 did not inhibit this
binding. This confirms that CR1-4 binds primarily C4b but not iC3
(C3b).
EXAMPLE 2
Construction of CR1-4 Analogs
[0113] Various mutated forms of CR1-4 were constructed and tested
for binding activity to C4b and iC3 as described above in Example
1. Table 2 shows the analogs constructed and the binding and
cofactor activities of the analogs. In Table 2, the modifications
are shown by the number of the amino acid and the conversion
effected. The numbers correspond to those set forth in FIGS. 2A and
2B and FIGS. 3A and 3B which set forth the amino sequences of SCR-1
(SEQ ID NO: 1), SCR-2, (SEQ ID NO: 3) SCR-8 (SEQ ID NO: 2) and
SCR-9 (SEQ ID NO: 4) in CR1.
[0114] The combined changes shown in truncated forms should be
predictive of the activity of a modified full length protein. There
is strong evidence available that establishes three primary binding
and cofactor sites in the major polymorphic form (30 SCRs) of CR1.
One site interacts almost exclusively with C4b (SCRs 1-4) while the
other two sites, nearly identical in amino acid sequence, interact
with both C4b and C3b (SCRs 8-11 and SCRs 15-18) [Klickstein, L.
B., et al., J. Exp. Med. 168: 1699-1717 (1988); Kalli, et al., J.
Exp. Med. 174: 1451-1460 (1991); Makrides, S. C., et al., J. Biol.
Chem. 267:24754-24761 (1992)].
[0115] Electron microscope analysis of CR1 suggests that CR1 is a
semi-rigid rod composed of individual globular domains (the SCRs).
Thus, CR1 is pictured as an elongated protein bearing
well-separated active sites. These sites may not work
cooperatively. A comparison of several polymorphic forms,
containing the first site (SCRs 1-4) and one, two and three copies
of the second site, demonstrated little difference among them in
cofactor and decay acceleration assays, as reported by Seya, T., et
al., J. Immunology 135: 2661 (1985). Indeed, after the filing of
the application to which this claims priority, it was shown that
modified membrane forms of CR1 bearing a single active site alone
(SCR 1-4 or SCR 15-18) could protect CHO cells from human
complement-mediated lysis (Makrides, S. C., et al., J. Biol. Chem.
267: 24754-24761 (1992)). This provides evidence that a simple CR1
form with a single active site could be biologically active, and
that modifications that improve the activity of each primary site
in the constructs could be incorporated together in a multiple-site
CR1 form, yielding an improved regulator.
TABLE-US-00002 TABLE 2 CR 1-4 Mutants C4b C3b C4b- C3b- Cofactor
cofactor Designation Description.sup.a binding binding activity
activity CR1-4 original peptide +++ + ++ + .DELTA.SCR-1 amino acids
1-60 deleted - - ND ND .DELTA.SCR-2 amino acids 61-122 deleted - -
ND ND .DELTA.SCR-3 amino acids 123-194 deleted ND ND - - CR1-4 Stop
7 truncated after amino acid 449 ND + ++ + 1 35: G.fwdarw. E (amino
acid 35 of SEQ ID NOS: 1 and 2, respectively); + + - ++ 37:
S.fwdarw. Y (amino acid 37 of SEQ ID NOS: 1 and 2, respectively) 1a
35: G.fwdarw. E (amino acid 35 of SEQ ID NOS: 1 and 2,
respectively) + + + + 1b 37: S.fwdarw. Y (amino acid 37 of SEQ ID
NOS: 1 and 2, respectively) ++ ++ + + 2 44, 47, 49: I . . . K . . .
S.fwdarw. T . . . D . . . L +++ + ++ + (amino acids 44, 47 and 49
of SEQ ID NOS: 1 and 2, respectively) 3 52.fwdarw. 54:
T-G-A.fwdarw. S-S-P (amino acids 52-54 of SEQ ID NOS: 1 and 2,
respectively) +++ + ++ + 4 57, 59: R . . . R.fwdarw. V . . . K +++
+ ++ + (amino acids 57, 59 of SEQ ID NOS: 1 and 2, respectively) 5
64, 65: R-N.fwdarw. K-T (amino acids 4, 5 of SEQ ID NOS: 3 and 4,
respectively) + + - + 5a 64: R.fwdarw. K (amino acid 4 of SEQ ID
NOS: 3 and 4, respectively) +/- - ND ND 5b 65: N.fwdarw. T (amino
acid 5 of SEQ ID NOS: 3 and 4, respectively) +/- - ND ND 6 78, 79:
K-G.fwdarw. T-D (amino acids 18, 19 of SEQ ID NOS: 3 and 4,
respectively) +++ + ++ + 6b 79: G.fwdarw. D (amino acid 19 of SEQ
ID NOS: 3 and 4, respectively) +++ +++ +++ +++ 1b, 6b 37, 79:
S.fwdarw. Y; G.fwdarw. D (amino acid 37 of SEQ ID NOS: 1 and 2, and
amino acid 19 of +++ ++ + + SEQ ID NOS: 3 and 4, respectively) 7
85, 87: Q . . . K.fwdarw. R . . . N (amino acids 25, 27 of SEQ ID
NOS: 3 and 4) +++ + ++ + 8 92, 94: K . . . Y.fwdarw. T . . . H
(amino acids 32 and 34 of SEQ ID NOS: 3 and 4, respectively) + + -
+ 8a 92: K.fwdarw. T (amino acid 32 of SEQ ID NOS: 3 and 4,
respectively) +++ + +++ ++ 8b 94: Y.fwdarw. H (amino acid 34 of SEQ
ID NOS: 3 and 4, respectively) + + - + 9 99, 103: S . . . T.fwdarw.
H . . . E +++ + ++ + (amino acids 38 and 43 of SEQ ID NOS: 3 and 4,
respectively) 10 109-112: D-T-V-I.fwdarw. N-A-A-H ++++ ++ +++ ++
(amino acids 49-52 of SEQ ID NOS: 3 and 4, respectively) 10, 11
109-112: D-T-V-I.fwdarw. N-A-A-H; +++ +++ +++ +++ 114-117, 121:
D-N-E-T . . . D.fwdarw. S-T-K-P . . . Q (amino acids 49-52 of SEQ
ID NOS: 3 and 4, respectively); (amino acids 54-61 of SEQ ID NOS: 3
and 4, respectively) 11 114-117, 121: D-N-E-T . . . D.fwdarw.
S-T-K-P . . . Q +++ ++ ++ + (amino acids 54-61 of SEQ ID NOS: 3 and
4, respectively) 11a 114: D.fwdarw. S (amino acid 54 of SEQ ID NOS:
3 and 4, respectively) +++ + ND ND 11b 115: N.fwdarw. T (amino acid
55 of SEQ ID NOS: 3 and 4, respectively) +++ + ND ND 11c 116:
E.fwdarw. K (amino acid 56 of SEQ ID NOS: 3 and 4, respectively)
++++ ++ +++ +++ 11d 117: T-P (amino acid 57 of SEQ ID NOS: 3 and 4,
respectively) +++ + ND ND 11c, d 116, 117: E-T.fwdarw. K-P (amino
acids 56 and 57 of SEQ ID NOS: 3 and 4, respectively) ++++ ++ +++
+++ 11e 121: D.fwdarw. Q (amino acid 61 of SEQ ID NOS: 3 and 4,
respectively) +++ + ND ND 11.DELTA.SCR1 1-60 deleted, 114-117, 121:
++ ++ ++ ++ D-N-E-T . . . D.fwdarw. S-T-K-P . . . Q (amino acids
54-61 of SEQ ID NOS: 3 and 4, respectively) 12 1, 3: Q . . .
N.fwdarw. H . . . Q (amino acids 1-3 of SEQ ID NOS: 1 and 2,
respectively) + + ++ + 13 6-9: E-W-L-P.fwdarw. D-H-F-L (amino acids
of 6-9 of SEQ ID NOS: 1 and 2, respectively) + + ++ + 14 12-16,
18-21: R-P-T-N-L . . . D-E-F-E.fwdarw. K-L-K-T-Q . . . N-A-S-D ++++
++ +++ +++ (amino acids 12-16 and 18-21 of SEQ ID NOS: 1 and 2,
respectively) 27, 29: Y . . . N.fwdarw. S . . . K (amino acids 27
and 29 of SEQ ID NOS: 1 and 2, respectively) ++++ ++ +++ +++ ND Not
determined. . . . intervening amino acids which were not
changed.
[0116] As shown in Table 2, deletion of either SCR-1 or SCR-2
results in loss of C4b-binding activity; thus both regions are
required for binding to C4b. Binding to C3b is conferred by
insertion of the SCR-9 sequence S-T-K-P-(P-I-C)-Q (amino acids
54-61 of SEQ ID NO: 4) into SCR-2 (SEQ ID NO: 3); however, deletion
of SCR-1 (SEQ ID NO: 1) partially eliminates binding to C3b in this
analog. Thus, efficient binding to C3b requires not only the
relevant sequence in SCR-2 (SEQ ID NO: 3), but an additional
portion of SCR-1 (SEQ ID NO: 1). The ability to bind C3b conferred
by altering the sequence in SCR-2 (SEQ ID NO: 3) does not affect
binding to C4b.
[0117] As also shown by Table 2, it is possible to weaken or
destroy binding to C4b by altering amino acids 35, 64-65 or 94. It
is possible to strengthen C4b binding by altering amino acid
92.
[0118] These results indicate that by manipulation of C3b and C4b
binding sites in CR1 the affinity and specificity of CR1-4 can be
altered. Similar alterations may be made to corresponding regions
in additional members of the RCA protein family.
EXAMPLE 3
Construction of a More Potent Soluble CR1 Analog
[0119] A stringent inhibitor of the complement system has
applications where higher potency is required. In this example, the
sCR1 sequence is used as starting material for a family of analogs
of higher inhibitory capacity for both classical and alternative
pathways. Other CR1 truncated, full-length, recombined or hybrid
forms of CR1 could also be used.
[0120] As noted above, the sCR1 protein contains four corresponding
regions: SCRs 1-2 (SEQ ID NO: 1 and 3 respectively), SCRs 8-9 (SEQ
ID NO: 2 and 4 respectively), SCRs 15-16 and SCRs 22-23. The
primary C4b-binding site is within SCRs 1-2 (SEQ ID NO: 1 and 3
respectively) and the two primary C3b-binding sites are in SCRs 8-9
(SEQ ID NO: 2 and 4 respectively) and SCRs 15-16. SCRs 22-23 has no
reported binding activity although it is highly homologous in amino
acid sequence to the other three corresponding regions.
[0121] One or more substitutions are introduced into the
corresponding positions of soluble CR1 SCRs 1-2, 8-9, 15-16, and
22-23. Specific substitutions designed to increase C3b and C4b
cofactor activity and binding activity are selected from the
following:
TABLE-US-00003 positions substitution 35 G (amino acid 35 of SEQ ID
NO: 1) 64-65 R-N (amino acids 4-5 of SEQ ID NO: 3) 94 Y (amino acid
34 of SEQ ID NO: 3) 109-112 N-A-A-H (amino acids 49-52 of SEQ ID
NO: 4) 114-121 S-T-K-P-P-I-C-Q (amino acids 54-61 of SEQ ID NO: 4)
109-121 N-A-A-H-W-S-T-K-P-P-I-C-Q (amino acids 49-61 of SEQ ID NO:
4) 92 T (amino acid 32 of SEQ ID NO: 4) 79 D (amino acid 19 of SEQ
ID NO: 4) 12-16 K-L-K-T-Q (amino acids 12-16 of SEQ ID NO: 2) 18-21
N-A-S-D (amino acids 18-21 of SEQ ID NO: 2) 12-21
K-L-K-T-Q-T-N-A-S-D (amino acids 12-21 of SEQ ID NO: 2) 27-29 S-L-K
(amino acids 27-29 of SEQ ID NO: 2)
[0122] In some cases, some of these amino acids may already be
present in the corresponding position. In some cases structurally
similar amino acids may be substituted instead of those substituted
above.
[0123] The use of these substitutions at some or all four
corresponding positions will yield a full length soluble CR1 form
with more potent active sites, and with an additional active site
at SCR 22-24.
[0124] Substitutions described above were introduced into the four
corresponding regions of interest in order to increase the affinity
of sCR1 for its C3b and C4b-containing targets. These modifications
were designed to increase the use of existing coenzyme and
dissociation functions and potentially to establish new coenzyme
and dissociation functions. The introduction of amino acid
sequences significant to C3b binding into the C4b binding region
presumably would not interfere substantially with C4b activities
since such a substitution in CR1-4 did not detectably alter the C4b
binding properties of CR1-4. The introduction of amino acids
significant to C4b-binding into C3b-binding regions presumably
would not interfere substantially with C3b binding activities since
substantial C3b-binding occurs in CR1-4 mutant 11, in which many
amino acids specific to SCRs 1-2, including those significant in
C4b binding, are already present.
[0125] Specific substitutions as shown in FIGS. 2A and 2B and FIGS.
3A and 3B designed to increase affinity to C4b were described in
Example 3: SCRs 1-2 (SEQ ID NO: 1 and 3 respectively), the primary
C4b-binding site of CR1, modified by replacement of K (amino acid
32 of SEQ ID NO: 3) at position 92 with T (amino acid 32 of SEQ ID
NO: 4); SCRs 8-9 (SEQ ID NOS: 2 and 4) and the identical SCRs 15-16
modified by replacement of E (amino acid 35 of SEQ ID NO: 2) at a
position corresponding to 35 by G (amino acid 35 of SEQ ID NO:
1)(or A), H (amino acid 34 of SEQ ID NO: 4) at a position
corresponding to 94 by Y (amino acid 34 of SEQ ID NO: 3) (or F), K
(amino acid 4 of SEQ ID NO: 4) at a position corresponding to 64 by
R (amino acid 4 of SEQ ID NO: 3) and T (amino acid 5 of SEQ ID NO:
4) at a position corresponding to 65 by N (amino acid 5 of SEQ ID
NO: 3) (or Q). SCRs 22-23 were also modified by replacement of G at
a position corresponding to 64 by R (amino acid 4 of SEQ ID NO: 3)
(or K) (amino acid 4 of SEQ ID NO: 4), P at a position
corresponding to 65 with N (amino acid 5 of SEQ ID NO: 3) (or Q), E
at a position corresponding to 92 with T (amino acid 32 of SEQ ID
NO: 4) at a position corresponding to 94 with Y (amino acid 34 of
SEQ ID NO: 3).
[0126] Together, these modifications result in higher affinity for
C4b and may also result in the establishment of new coenzyme or
dissociation functions or the improvement in the capacity of the
coenzyme and dissociation functions present on the sCR1 starting
material.
[0127] As described in previous examples, substitution of a short
stretch of amino acids into SCRs 1-2 (SEQ ID NOS: 1 and 3) of CR1-4
by corresponding amino acids of SCRs 8-9 (amino acid 34 of SEQ ID
NO: 3) confers C3b binding capacity to CR1-4. Replacement of
D-N-E-T-P-I-C-D (amino acid 54-61 of SEQ ID NO: 3) at position
114-121 in SCR1 SCRs 1-2 by S-T-K-P-P-I-C-Q (amino acid 54-61 of
SEQ ID NO: 4) increases the affinity of sCR1 for C3b. Further,
replacement of D-K-K-A-P-I-C-D (SEQ ID NO: 5) at positions 114-121
in SCRs 22-23 by S-T-K-P-P-I-C-Q (amino acids 54-61 of SEQ ID NO:
4) increases the affinity of sCR1 for C3b. Some structurally
similar amino acids may also be substituted in the sequence.
[0128] A more potent complement inhibitor, in general, provides
increased C4b-binding and increased C3b-binding. This is achieved
by introducing all the modifications set forth above.
EXAMPLE 4
Construction Of Truncated CR1 Mutants Having Increased Activity
[0129] More than 25 mutants were assayed for cofactor activity at
several different salt concentrations. The assay was done as
described in Adams, E. M., Brown, M .C., Nunge, M., Krych, M., and
Atkinson, J. P. (1991) J. Immunology, 147: 3005-3011.
[0130] A number of mutations of the active sites in SCRs 1-3, the
site that normally interacts with C4b and not C3b, which improve on
the natural activity of this site, both for C4b and C3b cofactor
activity. At least one mutant in the active sites in SCR 8-11,
which is apparently the more potent natural active site, that
improves on the cofactor activity of this site for both C3b and C4b
was also detected.
[0131] In summary, specific CR1 proteins that have both C3b and C4b
cofactor activity and are in some ways an improvement on the full
length soluble CR1 have now been identified. These constructs are
less than one-fourth (seven SCR constructs) to a third the size of
soluble CR1 (sCR1) and have both C4b binding and cofactor activity
as well as C3b binding and cofactor activity. Constructs including
as few as three SCRs appear to have C3b cofactor activity,
indicative of C3b binding activity. In some cases the data
indicates that the modified active site appears to be more potent
than the natural active sites.
[0132] These forms and their activities are shown in Table 3:
TABLE-US-00004 TABLE 3 Activities of CR1-4 (8, 9) Mutants C4b C3b
C4b- C3b- cofactor cofactor Designation Description binding binding
activity activity CR1-4 (8, 9) * ++++ ++++ ++++ ++++ .DELTA.SGR10
Deletion of amino acids 123-194 ND ND -- -- ND ND -- --
.DELTA.SCR11, ** ND ND ND ++++ subst1 CR1-4 (8, 9) stop 7 truncated
after amino acid 449 ND ++++ ++++ ++++ 1br 37: Y.fwdarw.S (amino
acid 37 of SEQ ID NOS: 2 and 1, ++++ +++ ++++ ++ respectively) 2r
44, 47, 49: T . . . D . . . L.fwdarw.I . . . K . . . S ++++ +++
++++ +++ (amino acids 44, 47 and 49 of SEQ ID NOS: 2 and 1,
respectively) 3r 52-54, 57, 59: +++ ++ ++++ ++++ S-S-P . . . V . .
. K.fwdarw.T-G-A . . . R . . . R (amino acids 52-54, 57, 59 of SEQ
ID NOS: 2 and 1, respectively) 6r 78-79, 82: T-D . . . V.fwdarw.K-G
. . . F ++ ++++ ++++ ++++ (amino acids 18-19, 22 of SEQ ID NOS: 4
and 3, respectively) 7r 85, 87: R . . . N.fwdarw.Q . . . K ++++ +++
++++ ++++ (amino acids 25, 27 of SEQ ID NOS: 4 and 3, respectively)
8r 92, 94: T . . . H.fwdarw.K . . . Y ++++ ++++ ++++ ++++ (amino
acids 32-34 of SEQ ID NOS: 4 and 3, respectively) 9r 99, 103, 106:
++++ ++++ +++ ++++ H . . . E . . . L.fwdarw.S . . . T . . . I
(amino acids 39, 43 and 46 of SEQ ID NOS: 4 and 3, 4 and 3,
respectively) 10r 109-112: N-A-A-H.fwdarw.D-T-V-I +++ ++ ++ ++
(amino acids 49-52 of SEQ ID NOS: 4 and 3, respectively) 10br 110:
A.fwdarw.T (amino acid 50 of SEQ ID NOS: 4 and 3 ++++ ++++ ++++
++++ respectively) 10cr 111: A.fwdarw.V (amino acid 51 of SEQ ID
NOS: 4 and 3, ++++ ++++ ++++ ++++ respectively) 10dr 112:
H.fwdarw.I (amino acid 52 of SEQ ID NOS: 4 and 3, ++++ ++++ ++++
++++ respectively) 10ar 109: N.fwdarw.D (amino acid 49 of SEQ ID
NOS: 4 and 3, ++ ++ + ++ respectively) 11r 114-117, 121: S-T-K-P .
. . Q.fwdarw.D-N-E-T . . . D +++ ++ +++ + (amino acids 54-57, 61 of
SEQ ID NOS: 4 and 3, respectively) 11ar 114: S.fwdarw.D (amino acid
54 of SEQ ID NOS: 4 and 3, ++++ ++++ ++++ ++++ respectively) 11br
115: T.fwdarw.N (amino acid 55 of SEQ ID NOS: 4 and 3, ++++ ++++
++++ ++++ respectively) 11er 121: Q.fwdarw.D (amino acid 61 of SEQ
ID NOS: 4 and 3, ++++ ++++ ++++ ++++ respectively) 11cr 116:
K.fwdarw.E (amino acid 56 of SEQ ID NOS: 4 and 3, +++ +++ + +
respectively) 11dr 117: P.fwdarw.T (amino acid 57 of SEQ ID NOS: 4
and 3, +++ +++ ++++ ++++ respectively) 10, 11r 109-112, 114-117,
121: + + ++ -- N-A-A-H . . . S-T-K-P . . . Q.fwdarw.D-T-V-I . . .
D-N-E-T . . . D (amino acids 49-52, 54-57, 61 of SEQ ID NOS: 4 and
3, respectively) 12r 1, 3: H . . . Q.fwdarw.Q . . . N ++++ ++++
++++ ++++ (amino acids 1 and 3 of SEQ ID NOS: 2 and 1,
respectively) 13r 6-9: D-H-F-L.fwdarw.E-W-L-P ++++ ++++ ++++ ++++
(amino acids 6-9 of SEQ ID NOS: 2 and 1, respectively) 14r 12-16,
18-21: +++ +++ ++++ +++ K-L-K-T-Q- . . . N-A-S-D.fwdarw.R-P-T-N-L .
. . D-E-F-E (amino acids 12-16, 18-21 of SEQ ID NOS: 2 and 1,
respectively) 15r 27, 29: S . . . K.fwdarw.Y . . . N +++ ++++ ++++
+++ (amino acids 27 and 29 of SEQ ID NOS: 2 and 1, respectively)
1ar, 5r 35, 64-65, 94: E . . . K-T . . . H.fwdarw.G . . . R-N . . .
Y +++++ +++++ +++++ +++++ 8br (amino acid 35 of SEQ ID NOS: 2 and
1, amino acids 4-5 and 34 of SEQ ID NOS: 4 and 3 respectively)
*CR1-4 with its first 122 amino acids (SCR1-2) replaced with CR1
amino acids 497-618 (SCR 8-9), as enumerated by Klickstein, L. B.,
J. Exp. Med. 168: 1699-1717. **CR1-4 (8, 9) with deletion of
194-253; substitution of amino acids 271-543 with:
T-R-T-T-F-H-L-G-R-K-C-S-T-A-V-S-P-A-T-T-S-E-G-L-R-L-C-A-A-H-P-R-E-T-G-A-L-
-Q-P-P-H-V-K (SEQ ID NO: 11)
EXAMPLE 5
DAF Analogs
[0133] The membrane-bound complement regulator DAF facilitates the
dissociation of C3b and C4b-containing convertases but does not
bind C3b or C4b, nor does it serve as cofactor for factor
I-mediated proteolytic inactivation of C3b or C4b. It would be
desirable, under certain situations, to increase the complement
regulatory activity of DAF or of truncated, recombined or hybrid
forms of DAF.
[0134] Based on amino sequence homology, DAF SCRs 2-3 corresponds
to the CR1 active regions. Homology between DAF SCRs 2-3 and CR1
SCRs 1-2 is 40% while homology between DAF SCRs 2-3 and CR1 SCRs
8-9 is 39%. Since there are several unmatched amino acids in these
alignments, the position numbers of DAF do not precisely match
those of CR1.
[0135] One or more substitutions are introduced into the DAF SCRs
2-3. Specific substitutions designed to confer C3b and C4b cofactor
activity and binding activity are selected from the following:
[0136] DAF position(s) enumerated as in Lublin and Atkinson, Ann.
Rev. Immunol. 9:431 (1991):
TABLE-US-00005 DAF position DAF sequence Substitution 180-187
S-D-P-L-P-E-C-R S-T-K-P-P-I-C-Q (SEQ ID NO: 6) (amino acids 54-61
of SEQ ID NO: 4) 175-178 S-S-V-Q N-A-A-H (SEQ ID NO: 7) (amino
acids 49-52 of SEQ ID NO: 4) 175-187 S-D-P-L-P-E-C-R-S-S-V-Q
S-T-K-P-P-I-C-Q-N-A-A-H (SEQ ID NO: 8) (SEQ ID NO: 9) 130 P R 145 G
D 77-84 Q-P-Y-I-T-Q-N-Y K-L-K-T-Q-T-N-A-S-D (SEQ ID NO: 10) (amino
acids 12-21 of SEQ ID NO: 2) 90-92 V-V-E S-L-K (amino acids 27-29
of SEQ ID NO: 2)
[0137] Replacement of S-D-P-L-P-E-C-R (SEQ ID NO: 6) at DAF
position 180-187 (using the enumeration in Lublin and Atkinson,
Ann. Rev. Immunol. (1989) 7:35-58) with S-T-K-P-P-I-C-O (amino acid
54-61 of SEQ ID NO: 4) increases the affinity of DAF for C3b.
Replacement of S-D-P-L-P-E-C (amino acid 1-7 of SEQ ID NO: 6) with
S-T-K-P-P-I-C-Q (amino acid 54-61 of SEQ ID NO: 4) leaves R at the
end of this segment, since R is found in the corresponding
positions of CR1 SCR 1-2 (SEQ ID NOS: 1 and 3)and CR1 SCR 8-9 (SEQ
ID NOS: 2 and 4). Some structurally similar amino acids can be
substituted in the S-T-K-P-P-I-C-Q (amino acid 54-61 of SEQ ID NO:
4) sequence.
[0138] Replacement of P at DAF position 130 with R increases the
affinity of DAF for C4b. All other amino acids found important in
C4b interactions in CR1 are already present in DAF SCRs 2-3.
[0139] These substitutions, by increasing the affinity of DAF for
C3b and, possibly, C4b, enhance the respective 5 inhibitory effects
of DAF on complement activation.
EXAMPLE 6
Analogs of Factor H
[0140] Factor H is a plasma protein consisting solely of 20 SCRs.
Factor H exhibits C3b binding and C3b cofactor and dissociation
capacity but no apparent ability to inactivate or bind C4b. Since
factor H has already evolved as a plasma protein, it could be
advantageous to use it as the starting material for soluable RCA
analogs. Although the active sites of factor H are not precisely
known, a proteolytic fragment composed of the first 5.5 SCRs of
factor H exhibits C3b-binding and cofactor activity (Alsenz et al.,
Biochem. J. (1984) 389).
[0141] To increase the affinity of factor H for C3b binding,
N-A-A-H-W-S-T-K-P-P-I-C-Q (amino acid 49-61 of SEQ ID NO: 4) is
introduced into several of the SCRs, none bearing a close
correspondence to CR1 SCR 8-9. In one embodiment, the first six
SCRs are left unmodified, thus retaining the original active
site(s), and the remaining fourteen SCRs are modified by
substitution of the N-A-A-H-W-S-T-K-P-P-I-C-Q (amino acid 49-61 of
SEQ ID NO: 4) into the position(s) homologous to CR1. Some
structurally similar amino acids can substitute in the
N-A-A-H-W-S-T-K-P-P-I-C-Q (amino acid 49-61 of SEQ ID NO: 4)
sequence. Homologous positions are readily identified because the W
that precedes this C3b binding segment is found in most SCRs: in
all factor H SCRs except SCR 10 and SCR 20 while the C within the
C3b binding segment is found in all the known SCRs. While not all
substitutions will necessarily confer added binding activity, since
these factor H SCRs are less homologous to the CR1 SCR 1-2 and CR1
SCR 8-9 regions. However, at least some modified H SCRs gain
C3b-binding capacity, resulting in a factor H analog with much
higher affinity for C3b. As discussed above, higher affinity would
lead to the greater use of the active sites already present in the
first six H SCRs.
[0142] Modifications and variations of the present invention will
be obvious to those skilled in the art from the foregoing detailed
description. These modifications and variations are intended to
come within the scope of the following claims.
Sequence CWU 1
1
19160PRTHomo sapiensSCR-1 1Gln Cys Asn Ala Pro Glu Trp Leu Pro Phe
Ala Arg Pro Thr Asn Leu 1 5 10 15Thr Asp Glu Phe Glu Phe Pro Ile
Gly Thr Tyr Leu Asn Tyr Glu Cys 20 25 30Arg Pro Gly Tyr Ser Gly Arg
Pro Phe Ser Ile Ile Cys Leu Lys Asn 35 40 45Ser Val Trp Thr Gly Ala
Lys Asp Arg Cys Arg Arg 50 55 60260PRTHomo sapiensSCR-8 2His Cys
Gln Ala Pro Asp His Phe Leu Phe Ala Lys Leu Lys Thr Gln 1 5 10
15Thr Asn Ala Ser Asp Phe Pro Ile Gly Thr Ser Leu Lys Tyr Glu Cys
20 25 30Arg Pro Glu Tyr Tyr Gly Arg Pro Phe Ser Ile Thr Cys Leu Asp
Asn 35 40 45Leu Val Trp Ser Ser Pro Lys Asp Val Cys Lys Arg 50 55
60362PRTHomo sapiensSCR-2 3Lys Ser Cys Arg Asn Pro Pro Asp Pro Val
Asn Gly Met Val His Val 1 5 10 15Ile Lys Gly Ile Gln Phe Gly Ser
Gln Ile Lys Tyr Ser Cys Thr Lys 20 25 30Gly Tyr Arg Leu Ile Gly Ser
Ser Ser Ala Thr Cys Ile Ile Ser Gly 35 40 45Asp Thr Val Ile Trp Asp
Asn Glu Thr Pro Ile Cys Asp Arg 50 55 60462PRTHomo sapiensSCR-9
4Lys Ser Cys Lys Thr Pro Pro Asp Pro Val Asn Gly Met Val His Val 1
5 10 15Ile Thr Asp Ile Gln Val Gly Ser Arg Ile Asn Tyr Ser Cys Thr
Thr 20 25 30Gly His Arg Leu Ile Gly His Ser Ser Ala Glu Cys Ile Leu
Ser Gly 35 40 45Asn Ala Ala His Trp Ser Thr Lys Pro Pro Ile Cys Gln
Arg 50 55 6058PRTHomo sapiensPositions 114-121 in SCRs 22-23 5Asp
Lys Lys Ala Pro Ile Cys Asp1 568PRTHomo sapiensDAF Positions
180-187 6Ser Asp Pro Leu Pro Glu Cys Arg1 574PRTHomo sapiensDAF
Positions 175-178 7Ser Ser Val Gln1812PRTHomo sapiensDAF Positions
175-187 8Ser Asp Pro Leu Pro Glu Cys Arg Ser Ser Val Gln 1 5
10912PRTArtificial SequenceSubstitution at positions 175-187 of
human DAF 9Ser Thr Lys Pro Pro Ile Cys Gln Asn Ala Ala His 1 5
10108PRTHomo sapiensDAF Positions 77-84 10Gln Pro Tyr Ile Thr Gln
Asn Tyr1 51143PRTArtificial SequenceSubstitution at positions
271-543 in the delta SCR11 analog of human CR1 11Thr Arg Thr Thr
Phe His Leu Gly Arg Lys Cys Ser Thr Ala Val Ser 1 5 10 15Pro Ala
Thr Thr Ser Glu Gly Leu Arg Leu Cys Ala Ala His Pro Arg 20 25 30Glu
Thr Gly Ala Leu Gln Pro Pro His Val Lys 35 40126801DNAHomo
sapiensCR-1 12caa tgc aat gcc cca gaa tgg ctt cca ttt gcc agg cct
acc aac cta 48Gln Cys Asn Ala Pro Glu Trp Leu Pro Phe Ala Arg Pro
Thr Asn Leu1 5 10 15act gat gag ttt gag ttt ccc att ggg aca tat ctg
aac tat gaa tgc 96Thr Asp Glu Phe Glu Phe Pro Ile Gly Thr Tyr Leu
Asn Tyr Glu Cys20 25 30cgc cct ggt tat tcc gga aga ccg ttt tct atc
atc tgc cta aaa aac 144Arg Pro Gly Tyr Ser Gly Arg Pro Phe Ser Ile
Ile Cys Leu Lys Asn35 40 45tca gtc tgg act ggt gct aag gac agg tgc
aga cgt aaa tca tgt cgt 192Ser Val Trp Thr Gly Ala Lys Asp Arg Cys
Arg Arg Lys Ser Cys Arg50 55 60aat cct cca gat cct gtg aat ggc atg
gtg cat gtg atc aaa ggc atc 240Asn Pro Pro Asp Pro Val Asn Gly Met
Val His Val Ile Lys Gly Ile65 70 75 80cag ttc gga tcc caa att aaa
tat tct tgt act aaa gga tac cga ctc 288Gln Phe Gly Ser Gln Ile Lys
Tyr Ser Cys Thr Lys Gly Tyr Arg Leu85 90 95att ggt tcc tcg tct gcc
aca tgc atc atc tca ggt gat act gtc att 336Ile Gly Ser Ser Ser Ala
Thr Cys Ile Ile Ser Gly Asp Thr Val Ile100 105 110tgg gat aat gaa
aca cct att tgt gac aga att cct tgt ggg cta ccc 384Trp Asp Asn Glu
Thr Pro Ile Cys Asp Arg Ile Pro Cys Gly Leu Pro115 120 125ccc acc
atc acc aat gga gat ttc att agc acc aac aga gag aat ttt 432Pro Thr
Ile Thr Asn Gly Asp Phe Ile Ser Thr Asn Arg Glu Asn Phe130 135
140cac tat gga tca gtg gtg acc tac cgc tgc aat cct gga agc gga ggg
480His Tyr Gly Ser Val Val Thr Tyr Arg Cys Asn Pro Gly Ser Gly
Gly145 150 155 160aga aag gtg ttt gag ctt gtg ggt gag ccc tcc ata
tac tgc acc agc 528Arg Lys Val Phe Glu Leu Val Gly Glu Pro Ser Ile
Tyr Cys Thr Ser165 170 175aat gac gat caa gtg ggc atc tgg agc ggc
ccc gcc cct cag tgc att 576Asn Asp Asp Gln Val Gly Ile Trp Ser Gly
Pro Ala Pro Gln Cys Ile180 185 190ata cct aac aaa tgc acg cct cca
aat gtg gaa aat gga ata ttg gta 624Ile Pro Asn Lys Cys Thr Pro Pro
Asn Val Glu Asn Gly Ile Leu Val195 200 205tct gac aac aga agc tta
ttt tcc tta aat gaa gtt gtg gag ttt agg 672Ser Asp Asn Arg Ser Leu
Phe Ser Leu Asn Glu Val Val Glu Phe Arg210 215 220tgt cag cct ggc
ttt gtc atg aaa gga ccc cgc cgt gtg aag tgc cag 720Cys Gln Pro Gly
Phe Val Met Lys Gly Pro Arg Arg Val Lys Cys Gln225 230 235 240gcc
ctg aac aaa tgg gag ccg gag cta cca agc tgc tcc agg gta tgt 768Ala
Leu Asn Lys Trp Glu Pro Glu Leu Pro Ser Cys Ser Arg Val Cys245 250
255cag cca cct cca gat gtc ctg cat gct gag cgt acc caa agg gac aag
816Gln Pro Pro Pro Asp Val Leu His Ala Glu Arg Thr Gln Arg Asp
Lys260 265 270gac aac ttt tca cct ggg cag gaa gtg ttc tac agc tgt
gag ccc ggc 864Asp Asn Phe Ser Pro Gly Gln Glu Val Phe Tyr Ser Cys
Glu Pro Gly275 280 285tac gac ctc aga ggg gct gcg tct atg cgc tgc
aca ccc cag gga gac 912Tyr Asp Leu Arg Gly Ala Ala Ser Met Arg Cys
Thr Pro Gln Gly Asp290 295 300tgg agc cct gca gcc ccc aca tgt gaa
gtg aaa tcc tgt gat gac ttc 960Trp Ser Pro Ala Ala Pro Thr Cys Glu
Val Lys Ser Cys Asp Asp Phe305 310 315 320atg ggc caa ctt ctt aat
ggc cgt gtg cta ttt cca gta aat ctc cag 1008Met Gly Gln Leu Leu Asn
Gly Arg Val Leu Phe Pro Val Asn Leu Gln325 330 335ctt gga gca aaa
gtg gat ttt gtt tgt gat gaa gga ttt caa tta aaa 1056Leu Gly Ala Lys
Val Asp Phe Val Cys Asp Glu Gly Phe Gln Leu Lys340 345 350ggc agc
tct gct agt tac tgt gtc ttg gct gga atg gaa agc ctt tgg 1104Gly Ser
Ser Ala Ser Tyr Cys Val Leu Ala Gly Met Glu Ser Leu Trp355 360
365aat agc agt gtt cca gtg tgt gaa caa atc ttt tgt cca agt cct cca
1152Asn Ser Ser Val Pro Val Cys Glu Gln Ile Phe Cys Pro Ser Pro
Pro370 375 380gtt att cct aat ggg aga cac aca gga aaa cct ctg gaa
gtc ttt ccc 1200Val Ile Pro Asn Gly Arg His Thr Gly Lys Pro Leu Glu
Val Phe Pro385 390 395 400ttt gga aaa gca gta aat tac aca tgc gac
ccc cac cca gac aga ggg 1248Phe Gly Lys Ala Val Asn Tyr Thr Cys Asp
Pro His Pro Asp Arg Gly405 410 415acg agc ttc gac ctc att gga gag
agc acc atc cgc tgc aca agt gac 1296Thr Ser Phe Asp Leu Ile Gly Glu
Ser Thr Ile Arg Cys Thr Ser Asp420 425 430cct caa ggg aat ggg gtt
tgg agc agc cct gcc cct cgc tgt gga att 1344Pro Gln Gly Asn Gly Val
Trp Ser Ser Pro Ala Pro Arg Cys Gly Ile435 440 445ctg ggt cac tgt
caa gcc cca gat cat ttt ctg ttt gcc aag ttg aaa 1392Leu Gly His Cys
Gln Ala Pro Asp His Phe Leu Phe Ala Lys Leu Lys450 455 460acc caa
acc aat gca tct gac ttt ccc att ggg aca tct tta aag tac 1440Thr Gln
Thr Asn Ala Ser Asp Phe Pro Ile Gly Thr Ser Leu Lys Tyr465 470 475
480gaa tgc cgt cct gag tac tac ggg agg cca ttc tct atc aca tgt cta
1488Glu Cys Arg Pro Glu Tyr Tyr Gly Arg Pro Phe Ser Ile Thr Cys
Leu485 490 495gat aac ctg gtc tgg tca agt ccc aaa gat gtc tgt aaa
cgt aaa tca 1536Asp Asn Leu Val Trp Ser Ser Pro Lys Asp Val Cys Lys
Arg Lys Ser500 505 510tgt aaa act cct cca gat cca gtg aat ggc atg
gtg cat gtg atc aca 1584Cys Lys Thr Pro Pro Asp Pro Val Asn Gly Met
Val His Val Ile Thr515 520 525gac atc cag gtt gga tcc aga atc aac
tat tct tgt act aca ggg cac 1632Asp Ile Gln Val Gly Ser Arg Ile Asn
Tyr Ser Cys Thr Thr Gly His530 535 540cga ctc att ggt cac tca tct
gct gaa tgt atc ctc tcg ggc aat gct 1680Arg Leu Ile Gly His Ser Ser
Ala Glu Cys Ile Leu Ser Gly Asn Ala545 550 555 560gcc cat tgg agc
acg aag ccg cca att tgt caa cga att cct tgt ggg 1728Ala His Trp Ser
Thr Lys Pro Pro Ile Cys Gln Arg Ile Pro Cys Gly565 570 575cta ccc
ccc acc atc gcc aat gga gat ttc att agc acc aac aga gag 1776Leu Pro
Pro Thr Ile Ala Asn Gly Asp Phe Ile Ser Thr Asn Arg Glu580 585
590aat ttt cac tat gga tca gtg gtg acc tac cgc tgc aat cct gga agc
1824Asn Phe His Tyr Gly Ser Val Val Thr Tyr Arg Cys Asn Pro Gly
Ser595 600 605gga ggg aga aag gtg ttt gag ctt gtg ggt gag ccc tcc
ata tac tgc 1872Gly Gly Arg Lys Val Phe Glu Leu Val Gly Glu Pro Ser
Ile Tyr Cys610 615 620acc agc aat gac gat caa gtg ggc atc tgg agc
ggc ccg gcc cct cag 1920Thr Ser Asn Asp Asp Gln Val Gly Ile Trp Ser
Gly Pro Ala Pro Gln625 630 635 640tgc att ata cct aac aaa tgc acg
cct cca aat gtg gaa aat gga ata 1968Cys Ile Ile Pro Asn Lys Cys Thr
Pro Pro Asn Val Glu Asn Gly Ile645 650 655ttg gta tct gac aac aga
agc tta ttt tcc tta aat gaa gtt gtg gag 2016Leu Val Ser Asp Asn Arg
Ser Leu Phe Ser Leu Asn Glu Val Val Glu660 665 670ttt agg tgt cag
cct ggc ttt gtc atg aaa gga ccc cgc cgt gtg aag 2064Phe Arg Cys Gln
Pro Gly Phe Val Met Lys Gly Pro Arg Arg Val Lys675 680 685tgc cag
gcc ctg aac aaa tgg gag ccg gag cta cca agc tgc tcc agg 2112Cys Gln
Ala Leu Asn Lys Trp Glu Pro Glu Leu Pro Ser Cys Ser Arg690 695
700gta tgt cag cca cct cca gat gtc ctg cat gct gag cgt acc caa agg
2160Val Cys Gln Pro Pro Pro Asp Val Leu His Ala Glu Arg Thr Gln
Arg705 710 715 720gac aag gac aac ttt tca ccc ggg cag gaa gtg ttc
tac agc tgt gag 2208Asp Lys Asp Asn Phe Ser Pro Gly Gln Glu Val Phe
Tyr Ser Cys Glu725 730 735ccc ggc tat gac ctc aga ggg gct gcg tct
atg cgc tgc aca ccc cag 2256Pro Gly Tyr Asp Leu Arg Gly Ala Ala Ser
Met Arg Cys Thr Pro Gln740 745 750gga gac tgg agc cct gca gcc ccc
aca tgt gaa gtg aaa tcc tgt gat 2304Gly Asp Trp Ser Pro Ala Ala Pro
Thr Cys Glu Val Lys Ser Cys Asp755 760 765gac ttc atg ggc caa ctt
ctt aat ggc cgt gtg cta ttt cca gta aat 2352Asp Phe Met Gly Gln Leu
Leu Asn Gly Arg Val Leu Phe Pro Val Asn770 775 780ctc cag ctt gga
gca aaa gtg gat ttt gtt tgt gat gaa gga ttt caa 2400Leu Gln Leu Gly
Ala Lys Val Asp Phe Val Cys Asp Glu Gly Phe Gln785 790 795 800tta
aaa ggc agc tct gct agt tat tgt gtc ttg gct gga atg gaa agc 2448Leu
Lys Gly Ser Ser Ala Ser Tyr Cys Val Leu Ala Gly Met Glu Ser805 810
815ctt tgg aat agc agt gtt cca gtg tgt gaa caa atc ttt tgt cca agt
2496Leu Trp Asn Ser Ser Val Pro Val Cys Glu Gln Ile Phe Cys Pro
Ser820 825 830cct cca gtt att cct aat ggg aga cac aca gga aaa cct
ctg gaa gtc 2544Pro Pro Val Ile Pro Asn Gly Arg His Thr Gly Lys Pro
Leu Glu Val835 840 845ttt ccc ttt gga aaa gca gta aat tac aca tgc
gac ccc cac cca gac 2592Phe Pro Phe Gly Lys Ala Val Asn Tyr Thr Cys
Asp Pro His Pro Asp850 855 860aga ggg acg agc ttc gac ctc att gga
gag agc acc atc cgc tgc aca 2640Arg Gly Thr Ser Phe Asp Leu Ile Gly
Glu Ser Thr Ile Arg Cys Thr865 870 875 880agt gac cct caa ggg aat
ggg gtt tgg agc agc cct gcc cct cgc tgt 2688Ser Asp Pro Gln Gly Asn
Gly Val Trp Ser Ser Pro Ala Pro Arg Cys885 890 895gga att ctg ggt
cac tgt caa gcc cca gat cat ttt ctg ttt gcc aag 2736Gly Ile Leu Gly
His Cys Gln Ala Pro Asp His Phe Leu Phe Ala Lys900 905 910ttg aaa
acc caa acc aat gca tct gac ttt ccc att ggg aca tct tta 2784Leu Lys
Thr Gln Thr Asn Ala Ser Asp Phe Pro Ile Gly Thr Ser Leu915 920
925aag tac gaa tgc cgt cct gag tac tac ggg agg cca ttc tct atc aca
2832Lys Tyr Glu Cys Arg Pro Glu Tyr Tyr Gly Arg Pro Phe Ser Ile
Thr930 935 940tgt cta gat aac ctg gtc tgg tca agt ccc aaa gat gtc
tgt aaa cgt 2880Cys Leu Asp Asn Leu Val Trp Ser Ser Pro Lys Asp Val
Cys Lys Arg945 950 955 960aaa tca tgt aaa act cct cca gat cca gtg
aat ggc atg gtg cat gtg 2928Lys Ser Cys Lys Thr Pro Pro Asp Pro Val
Asn Gly Met Val His Val965 970 975atc aca gac atc cag gtt gga tcc
aga atc aac tat tct tgt act aca 2976Ile Thr Asp Ile Gln Val Gly Ser
Arg Ile Asn Tyr Ser Cys Thr Thr980 985 990ggg cac cga ctc att ggt
cac tca tct gct gaa tgt atc ctc tca ggc 3024Gly His Arg Leu Ile Gly
His Ser Ser Ala Glu Cys Ile Leu Ser Gly995 1000 1005aat act gcc cat
tgg agc acg aag ccg cca att tgt caa cga att cct 3072Asn Thr Ala His
Trp Ser Thr Lys Pro Pro Ile Cys Gln Arg Ile Pro1010 1015 1020tgt
ggg cta ccc cca acc atc gcc aat gga gat ttc att agc acc aac 3120Cys
Gly Leu Pro Pro Thr Ile Ala Asn Gly Asp Phe Ile Ser Thr Asn1025
1030 1035 1040aga gag aat ttt cac tat gga tca gtg gtg acc tac cgc
tgc aat ctt 3168Arg Glu Asn Phe His Tyr Gly Ser Val Val Thr Tyr Arg
Cys Asn Leu1045 1050 1055gga agc aga ggg aga aag gtg ttt gag ctt
gtg ggt gag ccc tcc ata 3216Gly Ser Arg Gly Arg Lys Val Phe Glu Leu
Val Gly Glu Pro Ser Ile1060 1065 1070tac tgc acc agc aat gac gat
caa gtg ggc atc tgg agc ggc ccc gcc 3264Tyr Cys Thr Ser Asn Asp Asp
Gln Val Gly Ile Trp Ser Gly Pro Ala1075 1080 1085cct cag tgc att
ata cct aac aaa tgc acg cct cca aat gtg gaa aat 3312Pro Gln Cys Ile
Ile Pro Asn Lys Cys Thr Pro Pro Asn Val Glu Asn1090 1095 1100gga
ata ttg gta tct gac aac aga agc tta ttt tcc tta aat gaa gtt 3360Gly
Ile Leu Val Ser Asp Asn Arg Ser Leu Phe Ser Leu Asn Glu Val1105
1110 1115 1120gtg gag ttt agg tgt cag cct ggc ttt gtc atg aaa gga
ccc cgc cgt 3408Val Glu Phe Arg Cys Gln Pro Gly Phe Val Met Lys Gly
Pro Arg Arg1125 1130 1135gtg aag tgc cag gcc ctg aac aaa tgg gag
cca gag tta cca agc tgc 3456Val Lys Cys Gln Ala Leu Asn Lys Trp Glu
Pro Glu Leu Pro Ser Cys1140 1145 1150tcc agg gtg tgt cag ccg cct
cca gaa atc ctg cat ggt gag cat acc 3504Ser Arg Val Cys Gln Pro Pro
Pro Glu Ile Leu His Gly Glu His Thr1155 1160 1165cca agc cat cag
gac aac ttt tca cct ggg cag gaa gtg ttc tac agc 3552Pro Ser His Gln
Asp Asn Phe Ser Pro Gly Gln Glu Val Phe Tyr Ser1170 1175 1180tgt
gag cct ggc tat gac ctc aga ggg gct gcg tct ctg cac tgc aca 3600Cys
Glu Pro Gly Tyr Asp Leu Arg Gly Ala Ala Ser Leu His Cys Thr1185
1190 1195 1200ccc cag gga gac tgg agc cct gaa gcc ccg aga tgt gca
gtg aaa tcc 3648Pro Gln Gly Asp Trp Ser Pro Glu Ala Pro Arg Cys Ala
Val Lys Ser1205 1210 1215tgt gat gac ttc ttg ggt caa ctc cct cat
ggc cgt gtg cta ttt cca 3696Cys Asp Asp Phe Leu Gly Gln Leu Pro His
Gly Arg Val Leu Phe Pro1220 1225 1230ctt aat ctc cag ctt ggg gca
aag gtg tcc ttt gtc tgt gat gaa ggg 3744Leu Asn Leu Gln Leu Gly Ala
Lys Val Ser Phe Val Cys Asp Glu Gly1235 1240 1245ttt cgc tta aag
ggc agt tcc gtt agt cat tgt gtc ttg gtt gga atg 3792Phe Arg Leu Lys
Gly Ser Ser Val Ser His Cys Val Leu Val Gly Met1250 1255 1260aga
agc ctt tgg aat aac agt gtt cct gtg tgt gaa cat atc ttt tgt 3840Arg
Ser Leu Trp Asn Asn Ser Val Pro Val Cys Glu His Ile Phe Cys1265
1270 1275 1280cca aat cct cca gct atc ctt aat ggg aga cac aca gga
act ccc tct 3888Pro Asn Pro Pro Ala Ile Leu Asn Gly Arg His Thr Gly
Thr Pro Ser1285 1290 1295gga gat att ccc tat gga aaa gaa ata tct
tac aca tgt gac ccc cac 3936Gly Asp Ile Pro Tyr Gly Lys Glu Ile Ser
Tyr Thr Cys Asp Pro His1300 1305 1310cca gac aga ggg atg acc ttc
aac ctc att ggg gag agc acc atc cgc 3984Pro Asp Arg Gly Met Thr Phe
Asn Leu Ile Gly Glu Ser Thr Ile Arg1315 1320 1325tgc aca agt gac
cct cat ggg aat ggg gtt tgg agc agc cct gcc cct 4032Cys Thr Ser Asp
Pro His Gly Asn Gly Val Trp Ser Ser Pro Ala Pro1330
1335 1340cgc tgt gaa ctt tct gtt cgt gct ggt cac tgt aaa acc cca
gag cag 4080Arg Cys Glu Leu Ser Val Arg Ala Gly His Cys Lys Thr Pro
Glu Gln1345 1350 1355 1360ttt cca ttt gcc agt cct acg atc cca att
aat gac ttt gag ttt cca 4128Phe Pro Phe Ala Ser Pro Thr Ile Pro Ile
Asn Asp Phe Glu Phe Pro1365 1370 1375gtc ggg aca tct ttg aat tat
gaa tgc cgt cct ggg tat ttt ggg aaa 4176Val Gly Thr Ser Leu Asn Tyr
Glu Cys Arg Pro Gly Tyr Phe Gly Lys1380 1385 1390atg ttc tct atc
tcc tgc cta gaa aac ttg gtc tgg tca agt gtt gaa 4224Met Phe Ser Ile
Ser Cys Leu Glu Asn Leu Val Trp Ser Ser Val Glu1395 1400 1405gac
aac tgt aga cga aaa tca tgt gga cct cca cca gaa ccc ttc aat 4272Asp
Asn Cys Arg Arg Lys Ser Cys Gly Pro Pro Pro Glu Pro Phe Asn1410
1415 1420gga atg gtg cat ata aac aca gat aca cag ttt gga tca aca
gtt aat 4320Gly Met Val His Ile Asn Thr Asp Thr Gln Phe Gly Ser Thr
Val Asn1425 1430 1435 1440tat tct tgt aat gaa ggg ttt cga ctc att
ggt tcc cca tct act act 4368Tyr Ser Cys Asn Glu Gly Phe Arg Leu Ile
Gly Ser Pro Ser Thr Thr1445 1450 1455tgt ctc gtc tca ggc aat aat
gtc aca tgg gat aag aag gca cct att 4416Cys Leu Val Ser Gly Asn Asn
Val Thr Trp Asp Lys Lys Ala Pro Ile1460 1465 1470tgt gag atc ata
tct tgt gag cca cct cca acc ata tcc aat gga gac 4464Cys Glu Ile Ile
Ser Cys Glu Pro Pro Pro Thr Ile Ser Asn Gly Asp1475 1480 1485ttc
tac agc aac aat aga aca tct ttt cac aat gga acg gtg gta act 4512Phe
Tyr Ser Asn Asn Arg Thr Ser Phe His Asn Gly Thr Val Val Thr1490
1495 1500tac cag tgc cac act gga cca gat gga gaa cag ctg ttt gag
ctt gtg 4560Tyr Gln Cys His Thr Gly Pro Asp Gly Glu Gln Leu Phe Glu
Leu Val1505 1510 1515 1520gga gaa cgg tca ata tat tgc acc agc aaa
gat gat caa gtt ggt gtt 4608Gly Glu Arg Ser Ile Tyr Cys Thr Ser Lys
Asp Asp Gln Val Gly Val1525 1530 1535tgg agc agc cct ccc cct cgg
tgt att tct act aat aaa tgc aca gct 4656Trp Ser Ser Pro Pro Pro Arg
Cys Ile Ser Thr Asn Lys Cys Thr Ala1540 1545 1550cca gaa gtt gaa
aat gca att aga gta cca gga aac agg agt ttc ttt 4704Pro Glu Val Glu
Asn Ala Ile Arg Val Pro Gly Asn Arg Ser Phe Phe1555 1560 1565tcc
ctc act gag atc atc aga ttt aga tgt cag ccc ggg ttt gtc atg 4752Ser
Leu Thr Glu Ile Ile Arg Phe Arg Cys Gln Pro Gly Phe Val Met1570
1575 1580gta ggg tcc cac act gtg cag tgc cag acc aat ggc aga tgg
ggg ccc 4800Val Gly Ser His Thr Val Gln Cys Gln Thr Asn Gly Arg Trp
Gly Pro1585 1590 1595 1600aag ctg cca cac tgc tcc agg gtg tgt cag
ccg cct cca gaa atc ctg 4848Lys Leu Pro His Cys Ser Arg Val Cys Gln
Pro Pro Pro Glu Ile Leu1605 1610 1615cat ggt gag cat acc cta agc
cat cag gac aac ttt tca cct ggg cag 4896His Gly Glu His Thr Leu Ser
His Gln Asp Asn Phe Ser Pro Gly Gln1620 1625 1630gaa gtg ttc tac
agc tgt gag ccc agc tat gac ctc aga ggg gct gcg 4944Glu Val Phe Tyr
Ser Cys Glu Pro Ser Tyr Asp Leu Arg Gly Ala Ala1635 1640 1645tct
ctg cac tgc acg ccc cag gga gac tgg agc cct gaa gcc cct aga 4992Ser
Leu His Cys Thr Pro Gln Gly Asp Trp Ser Pro Glu Ala Pro Arg1650
1655 1660tgt aca gtg aaa tcc tgt gat gac ttc ctg ggc caa ctc cct
cat ggc 5040Cys Thr Val Lys Ser Cys Asp Asp Phe Leu Gly Gln Leu Pro
His Gly1665 1670 1675 1680cgt gtg cta ctt cca ctt aat ctc cag ctt
ggg gca aag gtg tcc ttt 5088Arg Val Leu Leu Pro Leu Asn Leu Gln Leu
Gly Ala Lys Val Ser Phe1685 1690 1695gtt tgc gat gaa ggg ttc cga
tta aaa ggc agg tct gct agt cat tgt 5136Val Cys Asp Glu Gly Phe Arg
Leu Lys Gly Arg Ser Ala Ser His Cys1700 1705 1710gtc ttg gct gga
atg aaa gcc ctt tgg aat agc agt gtt cca gtg tgt 5184Val Leu Ala Gly
Met Lys Ala Leu Trp Asn Ser Ser Val Pro Val Cys1715 1720 1725gaa
caa atc ttt tgt cca aat cct cca gct atc ctt aat ggg aga cac 5232Glu
Gln Ile Phe Cys Pro Asn Pro Pro Ala Ile Leu Asn Gly Arg His1730
1735 1740aca gga act ccc ttt gga gat att ccc tat gga aaa gaa ata
tct tac 5280Thr Gly Thr Pro Phe Gly Asp Ile Pro Tyr Gly Lys Glu Ile
Ser Tyr1745 1750 1755 1760gca tgc gac acc cac cca gac aga ggg atg
acc ttc aac ctc att ggg 5328Ala Cys Asp Thr His Pro Asp Arg Gly Met
Thr Phe Asn Leu Ile Gly1765 1770 1775gag agc tcc atc cgc tgc aca
agt gac cct caa ggg aat ggg gtt tgg 5376Glu Ser Ser Ile Arg Cys Thr
Ser Asp Pro Gln Gly Asn Gly Val Trp1780 1785 1790agc agc cct gcc
cct cgc tgt gaa ctt tct gtt cct gct gcc tgc cca 5424Ser Ser Pro Ala
Pro Arg Cys Glu Leu Ser Val Pro Ala Ala Cys Pro1795 1800 1805cat
cca ccc aag atc caa aac ggg cat tac att gga gga cac gta tct 5472His
Pro Pro Lys Ile Gln Asn Gly His Tyr Ile Gly Gly His Val Ser1810
1815 1820cta tat ctt cct ggg atg aca atc agc tac act tgt gac ccc
ggc tac 5520Leu Tyr Leu Pro Gly Met Thr Ile Ser Tyr Thr Cys Asp Pro
Gly Tyr1825 1830 1835 1840ctg tta gtg gga aag ggc ttc att ttc tgt
aca gac cag gga atc tgg 5568Leu Leu Val Gly Lys Gly Phe Ile Phe Cys
Thr Asp Gln Gly Ile Trp1845 1850 1855agc caa ttg gat cat tat tgc
aaa gaa gta aat tgt agc ttc cca ctg 5616Ser Gln Leu Asp His Tyr Cys
Lys Glu Val Asn Cys Ser Phe Pro Leu1860 1865 1870ttt atg aat gga
atc tcg aag gag tta gaa atg aaa aaa gta tat cac 5664Phe Met Asn Gly
Ile Ser Lys Glu Leu Glu Met Lys Lys Val Tyr His1875 1880 1885tat
gga gat tat gtg act ttg aag tgt gaa gat ggg tat act ctg gaa 5712Tyr
Gly Asp Tyr Val Thr Leu Lys Cys Glu Asp Gly Tyr Thr Leu Glu1890
1895 1900ggc agt ccc tgg agc cag tgc cag gcg gat gac aga tgg gac
cct cct 5760Gly Ser Pro Trp Ser Gln Cys Gln Ala Asp Asp Arg Trp Asp
Pro Pro1905 1910 1915 1920ctg gcc aaa tgt acc tct cgt gca cat gat
gct ctc ata gtt ggc act 5808Leu Ala Lys Cys Thr Ser Arg Ala His Asp
Ala Leu Ile Val Gly Thr1925 1930 1935tta tct ggt acg atc ttc ttt
att tta ctc atc att ttc ctc tct tgg 5856Leu Ser Gly Thr Ile Phe Phe
Ile Leu Leu Ile Ile Phe Leu Ser Trp1940 1945 1950ata att cta aag
cac aga aaa ggc aat aat gca cat gaa aac cct aaa 5904Ile Ile Leu Lys
His Arg Lys Gly Asn Asn Ala His Glu Asn Pro Lys1955 1960 1965gaa
gtg gct atc cat tta cat tct caa gga ggc agc agc gtt cat ccc 5952Glu
Val Ala Ile His Leu His Ser Gln Gly Gly Ser Ser Val His Pro1970
1975 1980cga act ctg caa aca aat gaa gaa aat agc agg gtc ctt cct
5994Arg Thr Leu Gln Thr Asn Glu Glu Asn Ser Arg Val Leu Pro1985
1990 1995tgacaaagta ctatacagct gaagaacatc tcgaatacaa ttttggtggg
aaaggagcca 6054attgatttca acagaatcag atctgagctt cataaagtct
ttgaagtgac ttcacagaga 6114cgcagacatg tgcacttgaa gatgctgccc
cttccctggt acctagcaaa gctcctgcct 6174ctttgtgtgc gtcactgtga
aacccccacc cttctgcctc gtgctaaacg cacacagtat 6234ctagtcaggg
gaaaagactg catttaggag atagaaaata gtttggatta cttaaaggaa
6294taaggtgttg cctggaattt ctggtttgta aggtggtcac tgttcttttt
taaaatattt 6354gtaatatgga atgggctcag taagaagagc ttggaaaatg
cagaaagtta tgaaaaataa 6414gtcacttata attatgctac ctactgataa
ccactcctaa tattttgatt cattttctgc 6474ctatcttctt tcacatatgt
gtttttttac atacgtactt ttcccccctt agtttgtttc 6534cttttatttt
atagagcaga accctagtct tttaaacagt ttagagtgaa atatatgcta
6594tatcagtttt tactttctct agggagaaaa attaatttac tagaaaggca
tgaaatgatc 6654atgggaagag tggttaagac tactgaagag aaatatttgg
aaaataagat ttcgatatct 6714tctttttttt tgagatggag tctggctctg
tctcccaggc tggagtgcag tggcgtaatc 6774tcggctcact gcaacgtccg cctcccg
6801131998PRTHomo sapiensCR-1 13Gln Cys Asn Ala Pro Glu Trp Leu Pro
Phe Ala Arg Pro Thr Asn Leu 1 5 10 15Thr Asp Glu Phe Glu Phe Pro
Ile Gly Thr Tyr Leu Asn Tyr Glu Cys 20 25 30Arg Pro Gly Tyr Ser Gly
Arg Pro Phe Ser Ile Ile Cys Leu Lys Asn 35 40 45Ser Val Trp Thr Gly
Ala Lys Asp Arg Cys Arg Arg Lys Ser Cys Arg 50 55 60Asn Pro Pro Asp
Pro Val Asn Gly Met Val His Val Ile Lys Gly Ile65 70 75 80Gln Phe
Gly Ser Gln Ile Lys Tyr Ser Cys Thr Lys Gly Tyr Arg Leu 85 90 95Ile
Gly Ser Ser Ser Ala Thr Cys Ile Ile Ser Gly Asp Thr Val Ile 100 105
110Trp Asp Asn Glu Thr Pro Ile Cys Asp Arg Ile Pro Cys Gly Leu Pro
115 120 125Pro Thr Ile Thr Asn Gly Asp Phe Ile Ser Thr Asn Arg Glu
Asn Phe 130 135 140His Tyr Gly Ser Val Val Thr Tyr Arg Cys Asn Pro
Gly Ser Gly Gly145 150 155 160Arg Lys Val Phe Glu Leu Val Gly Glu
Pro Ser Ile Tyr Cys Thr Ser 165 170 175Asn Asp Asp Gln Val Gly Ile
Trp Ser Gly Pro Ala Pro Gln Cys Ile 180 185 190Ile Pro Asn Lys Cys
Thr Pro Pro Asn Val Glu Asn Gly Ile Leu Val 195 200 205Ser Asp Asn
Arg Ser Leu Phe Ser Leu Asn Glu Val Val Glu Phe Arg 210 215 220Cys
Gln Pro Gly Phe Val Met Lys Gly Pro Arg Arg Val Lys Cys Gln225 230
235 240Ala Leu Asn Lys Trp Glu Pro Glu Leu Pro Ser Cys Ser Arg Val
Cys 245 250 255Gln Pro Pro Pro Asp Val Leu His Ala Glu Arg Thr Gln
Arg Asp Lys 260 265 270Asp Asn Phe Ser Pro Gly Gln Glu Val Phe Tyr
Ser Cys Glu Pro Gly 275 280 285Tyr Asp Leu Arg Gly Ala Ala Ser Met
Arg Cys Thr Pro Gln Gly Asp 290 295 300Trp Ser Pro Ala Ala Pro Thr
Cys Glu Val Lys Ser Cys Asp Asp Phe305 310 315 320Met Gly Gln Leu
Leu Asn Gly Arg Val Leu Phe Pro Val Asn Leu Gln 325 330 335Leu Gly
Ala Lys Val Asp Phe Val Cys Asp Glu Gly Phe Gln Leu Lys 340 345
350Gly Ser Ser Ala Ser Tyr Cys Val Leu Ala Gly Met Glu Ser Leu Trp
355 360 365Asn Ser Ser Val Pro Val Cys Glu Gln Ile Phe Cys Pro Ser
Pro Pro 370 375 380Val Ile Pro Asn Gly Arg His Thr Gly Lys Pro Leu
Glu Val Phe Pro385 390 395 400Phe Gly Lys Ala Val Asn Tyr Thr Cys
Asp Pro His Pro Asp Arg Gly 405 410 415Thr Ser Phe Asp Leu Ile Gly
Glu Ser Thr Ile Arg Cys Thr Ser Asp 420 425 430Pro Gln Gly Asn Gly
Val Trp Ser Ser Pro Ala Pro Arg Cys Gly Ile 435 440 445Leu Gly His
Cys Gln Ala Pro Asp His Phe Leu Phe Ala Lys Leu Lys 450 455 460Thr
Gln Thr Asn Ala Ser Asp Phe Pro Ile Gly Thr Ser Leu Lys Tyr465 470
475 480Glu Cys Arg Pro Glu Tyr Tyr Gly Arg Pro Phe Ser Ile Thr Cys
Leu 485 490 495Asp Asn Leu Val Trp Ser Ser Pro Lys Asp Val Cys Lys
Arg Lys Ser 500 505 510Cys Lys Thr Pro Pro Asp Pro Val Asn Gly Met
Val His Val Ile Thr 515 520 525Asp Ile Gln Val Gly Ser Arg Ile Asn
Tyr Ser Cys Thr Thr Gly His 530 535 540Arg Leu Ile Gly His Ser Ser
Ala Glu Cys Ile Leu Ser Gly Asn Ala545 550 555 560Ala His Trp Ser
Thr Lys Pro Pro Ile Cys Gln Arg Ile Pro Cys Gly 565 570 575Leu Pro
Pro Thr Ile Ala Asn Gly Asp Phe Ile Ser Thr Asn Arg Glu 580 585
590Asn Phe His Tyr Gly Ser Val Val Thr Tyr Arg Cys Asn Pro Gly Ser
595 600 605Gly Gly Arg Lys Val Phe Glu Leu Val Gly Glu Pro Ser Ile
Tyr Cys 610 615 620Thr Ser Asn Asp Asp Gln Val Gly Ile Trp Ser Gly
Pro Ala Pro Gln625 630 635 640Cys Ile Ile Pro Asn Lys Cys Thr Pro
Pro Asn Val Glu Asn Gly Ile 645 650 655Leu Val Ser Asp Asn Arg Ser
Leu Phe Ser Leu Asn Glu Val Val Glu 660 665 670Phe Arg Cys Gln Pro
Gly Phe Val Met Lys Gly Pro Arg Arg Val Lys 675 680 685Cys Gln Ala
Leu Asn Lys Trp Glu Pro Glu Leu Pro Ser Cys Ser Arg 690 695 700Val
Cys Gln Pro Pro Pro Asp Val Leu His Ala Glu Arg Thr Gln Arg705 710
715 720Asp Lys Asp Asn Phe Ser Pro Gly Gln Glu Val Phe Tyr Ser Cys
Glu 725 730 735Pro Gly Tyr Asp Leu Arg Gly Ala Ala Ser Met Arg Cys
Thr Pro Gln 740 745 750Gly Asp Trp Ser Pro Ala Ala Pro Thr Cys Glu
Val Lys Ser Cys Asp 755 760 765Asp Phe Met Gly Gln Leu Leu Asn Gly
Arg Val Leu Phe Pro Val Asn 770 775 780Leu Gln Leu Gly Ala Lys Val
Asp Phe Val Cys Asp Glu Gly Phe Gln785 790 795 800Leu Lys Gly Ser
Ser Ala Ser Tyr Cys Val Leu Ala Gly Met Glu Ser 805 810 815Leu Trp
Asn Ser Ser Val Pro Val Cys Glu Gln Ile Phe Cys Pro Ser 820 825
830Pro Pro Val Ile Pro Asn Gly Arg His Thr Gly Lys Pro Leu Glu Val
835 840 845Phe Pro Phe Gly Lys Ala Val Asn Tyr Thr Cys Asp Pro His
Pro Asp 850 855 860Arg Gly Thr Ser Phe Asp Leu Ile Gly Glu Ser Thr
Ile Arg Cys Thr865 870 875 880Ser Asp Pro Gln Gly Asn Gly Val Trp
Ser Ser Pro Ala Pro Arg Cys 885 890 895Gly Ile Leu Gly His Cys Gln
Ala Pro Asp His Phe Leu Phe Ala Lys 900 905 910Leu Lys Thr Gln Thr
Asn Ala Ser Asp Phe Pro Ile Gly Thr Ser Leu 915 920 925Lys Tyr Glu
Cys Arg Pro Glu Tyr Tyr Gly Arg Pro Phe Ser Ile Thr 930 935 940Cys
Leu Asp Asn Leu Val Trp Ser Ser Pro Lys Asp Val Cys Lys Arg945 950
955 960Lys Ser Cys Lys Thr Pro Pro Asp Pro Val Asn Gly Met Val His
Val 965 970 975Ile Thr Asp Ile Gln Val Gly Ser Arg Ile Asn Tyr Ser
Cys Thr Thr 980 985 990Gly His Arg Leu Ile Gly His Ser Ser Ala Glu
Cys Ile Leu Ser Gly 995 1000 1005Asn Thr Ala His Trp Ser Thr Lys
Pro Pro Ile Cys Gln Arg Ile Pro 1010 1015 1020Cys Gly Leu Pro Pro
Thr Ile Ala Asn Gly Asp Phe Ile Ser Thr Asn1025 1030 1035 1040Arg
Glu Asn Phe His Tyr Gly Ser Val Val Thr Tyr Arg Cys Asn Leu 1045
1050 1055Gly Ser Arg Gly Arg Lys Val Phe Glu Leu Val Gly Glu Pro
Ser Ile 1060 1065 1070Tyr Cys Thr Ser Asn Asp Asp Gln Val Gly Ile
Trp Ser Gly Pro Ala 1075 1080 1085Pro Gln Cys Ile Ile Pro Asn Lys
Cys Thr Pro Pro Asn Val Glu Asn 1090 1095 1100Gly Ile Leu Val Ser
Asp Asn Arg Ser Leu Phe Ser Leu Asn Glu Val1105 1110 1115 1120Val
Glu Phe Arg Cys Gln Pro Gly Phe Val Met Lys Gly Pro Arg Arg 1125
1130 1135Val Lys Cys Gln Ala Leu Asn Lys Trp Glu Pro Glu Leu Pro
Ser Cys 1140 1145 1150Ser Arg Val Cys Gln Pro Pro Pro Glu Ile Leu
His Gly Glu His Thr 1155 1160 1165Pro Ser His Gln Asp Asn Phe Ser
Pro Gly Gln Glu Val Phe Tyr Ser 1170 1175 1180Cys Glu Pro Gly Tyr
Asp Leu Arg Gly Ala Ala Ser Leu His Cys Thr1185 1190 1195 1200Pro
Gln Gly Asp Trp Ser Pro Glu Ala Pro Arg Cys Ala Val Lys Ser 1205
1210 1215Cys Asp Asp Phe Leu Gly Gln Leu Pro His Gly Arg Val Leu
Phe Pro 1220 1225 1230Leu Asn Leu Gln Leu Gly Ala Lys Val Ser Phe
Val Cys Asp Glu Gly 1235 1240 1245Phe Arg Leu Lys Gly Ser Ser Val
Ser His Cys Val Leu Val Gly Met 1250 1255 1260Arg Ser Leu Trp Asn
Asn Ser Val Pro Val Cys Glu His Ile Phe Cys1265 1270 1275 1280Pro
Asn Pro Pro Ala Ile Leu Asn Gly Arg His Thr Gly Thr Pro Ser 1285
1290 1295Gly Asp Ile Pro Tyr Gly Lys Glu Ile Ser Tyr Thr
Cys Asp Pro His 1300 1305 1310Pro Asp Arg Gly Met Thr Phe Asn Leu
Ile Gly Glu Ser Thr Ile Arg 1315 1320 1325Cys Thr Ser Asp Pro His
Gly Asn Gly Val Trp Ser Ser Pro Ala Pro 1330 1335 1340Arg Cys Glu
Leu Ser Val Arg Ala Gly His Cys Lys Thr Pro Glu Gln1345 1350 1355
1360Phe Pro Phe Ala Ser Pro Thr Ile Pro Ile Asn Asp Phe Glu Phe Pro
1365 1370 1375Val Gly Thr Ser Leu Asn Tyr Glu Cys Arg Pro Gly Tyr
Phe Gly Lys 1380 1385 1390Met Phe Ser Ile Ser Cys Leu Glu Asn Leu
Val Trp Ser Ser Val Glu 1395 1400 1405Asp Asn Cys Arg Arg Lys Ser
Cys Gly Pro Pro Pro Glu Pro Phe Asn 1410 1415 1420Gly Met Val His
Ile Asn Thr Asp Thr Gln Phe Gly Ser Thr Val Asn1425 1430 1435
1440Tyr Ser Cys Asn Glu Gly Phe Arg Leu Ile Gly Ser Pro Ser Thr Thr
1445 1450 1455Cys Leu Val Ser Gly Asn Asn Val Thr Trp Asp Lys Lys
Ala Pro Ile 1460 1465 1470Cys Glu Ile Ile Ser Cys Glu Pro Pro Pro
Thr Ile Ser Asn Gly Asp 1475 1480 1485Phe Tyr Ser Asn Asn Arg Thr
Ser Phe His Asn Gly Thr Val Val Thr 1490 1495 1500Tyr Gln Cys His
Thr Gly Pro Asp Gly Glu Gln Leu Phe Glu Leu Val1505 1510 1515
1520Gly Glu Arg Ser Ile Tyr Cys Thr Ser Lys Asp Asp Gln Val Gly Val
1525 1530 1535Trp Ser Ser Pro Pro Pro Arg Cys Ile Ser Thr Asn Lys
Cys Thr Ala 1540 1545 1550Pro Glu Val Glu Asn Ala Ile Arg Val Pro
Gly Asn Arg Ser Phe Phe 1555 1560 1565Ser Leu Thr Glu Ile Ile Arg
Phe Arg Cys Gln Pro Gly Phe Val Met 1570 1575 1580Val Gly Ser His
Thr Val Gln Cys Gln Thr Asn Gly Arg Trp Gly Pro1585 1590 1595
1600Lys Leu Pro His Cys Ser Arg Val Cys Gln Pro Pro Pro Glu Ile Leu
1605 1610 1615His Gly Glu His Thr Leu Ser His Gln Asp Asn Phe Ser
Pro Gly Gln 1620 1625 1630Glu Val Phe Tyr Ser Cys Glu Pro Ser Tyr
Asp Leu Arg Gly Ala Ala 1635 1640 1645Ser Leu His Cys Thr Pro Gln
Gly Asp Trp Ser Pro Glu Ala Pro Arg 1650 1655 1660Cys Thr Val Lys
Ser Cys Asp Asp Phe Leu Gly Gln Leu Pro His Gly1665 1670 1675
1680Arg Val Leu Leu Pro Leu Asn Leu Gln Leu Gly Ala Lys Val Ser Phe
1685 1690 1695Val Cys Asp Glu Gly Phe Arg Leu Lys Gly Arg Ser Ala
Ser His Cys 1700 1705 1710Val Leu Ala Gly Met Lys Ala Leu Trp Asn
Ser Ser Val Pro Val Cys 1715 1720 1725Glu Gln Ile Phe Cys Pro Asn
Pro Pro Ala Ile Leu Asn Gly Arg His 1730 1735 1740Thr Gly Thr Pro
Phe Gly Asp Ile Pro Tyr Gly Lys Glu Ile Ser Tyr1745 1750 1755
1760Ala Cys Asp Thr His Pro Asp Arg Gly Met Thr Phe Asn Leu Ile Gly
1765 1770 1775Glu Ser Ser Ile Arg Cys Thr Ser Asp Pro Gln Gly Asn
Gly Val Trp 1780 1785 1790Ser Ser Pro Ala Pro Arg Cys Glu Leu Ser
Val Pro Ala Ala Cys Pro 1795 1800 1805His Pro Pro Lys Ile Gln Asn
Gly His Tyr Ile Gly Gly His Val Ser 1810 1815 1820Leu Tyr Leu Pro
Gly Met Thr Ile Ser Tyr Thr Cys Asp Pro Gly Tyr1825 1830 1835
1840Leu Leu Val Gly Lys Gly Phe Ile Phe Cys Thr Asp Gln Gly Ile Trp
1845 1850 1855Ser Gln Leu Asp His Tyr Cys Lys Glu Val Asn Cys Ser
Phe Pro Leu 1860 1865 1870Phe Met Asn Gly Ile Ser Lys Glu Leu Glu
Met Lys Lys Val Tyr His 1875 1880 1885Tyr Gly Asp Tyr Val Thr Leu
Lys Cys Glu Asp Gly Tyr Thr Leu Glu 1890 1895 1900Gly Ser Pro Trp
Ser Gln Cys Gln Ala Asp Asp Arg Trp Asp Pro Pro1905 1910 1915
1920Leu Ala Lys Cys Thr Ser Arg Ala His Asp Ala Leu Ile Val Gly Thr
1925 1930 1935Leu Ser Gly Thr Ile Phe Phe Ile Leu Leu Ile Ile Phe
Leu Ser Trp 1940 1945 1950Ile Ile Leu Lys His Arg Lys Gly Asn Asn
Ala His Glu Asn Pro Lys 1955 1960 1965Glu Val Ala Ile His Leu His
Ser Gln Gly Gly Ser Ser Val His Pro 1970 1975 1980Arg Thr Leu Gln
Thr Asn Glu Glu Asn Ser Arg Val Leu Pro1985 1990 1995143036DNAHomo
sapiensCR2 14att tct tgt ggc tct cct ccg cct atc cta aat ggc cgg
att agt tat 48Ile Ser Cys Gly Ser Pro Pro Pro Ile Leu Asn Gly Arg
Ile Ser Tyr1 5 10 15tat tct acc ccc att gct gtt ggt acc gtg ata agg
tac agt tgt tca 96Tyr Ser Thr Pro Ile Ala Val Gly Thr Val Ile Arg
Tyr Ser Cys Ser20 25 30ggt acc ttc cgc ctc att gga gaa aaa agt cta
tta tgc ata act aaa 144Gly Thr Phe Arg Leu Ile Gly Glu Lys Ser Leu
Leu Cys Ile Thr Lys35 40 45gac aaa gtg gat gga acc tgg gat aaa cct
gct cct aaa tgt gaa tat 192Asp Lys Val Asp Gly Thr Trp Asp Lys Pro
Ala Pro Lys Cys Glu Tyr50 55 60ttc aat aaa tat tct tct tgc cct gag
ccc ata gta cca gga gga tac 240Phe Asn Lys Tyr Ser Ser Cys Pro Glu
Pro Ile Val Pro Gly Gly Tyr65 70 75 80aaa att aga ggc tct aca ccc
tac aga cat ggt gat tct gtg aca ttt 288Lys Ile Arg Gly Ser Thr Pro
Tyr Arg His Gly Asp Ser Val Thr Phe85 90 95gcc tgt aaa acc aac ttc
tcc atg aac gga aac aag tct gtt tgg tgt 336Ala Cys Lys Thr Asn Phe
Ser Met Asn Gly Asn Lys Ser Val Trp Cys100 105 110caa gca aat aat
atg tgg ggg ccg aca cga cta cca acc tgt gta agt 384Gln Ala Asn Asn
Met Trp Gly Pro Thr Arg Leu Pro Thr Cys Val Ser115 120 125gtt ttc
cct ctc gag tgt cca gca ctt cct atg atc cac aat gga cat 432Val Phe
Pro Leu Glu Cys Pro Ala Leu Pro Met Ile His Asn Gly His130 135
140cac aca agt gag aat gtt ggc tcc att gct cca gga ttg tct gtg act
480His Thr Ser Glu Asn Val Gly Ser Ile Ala Pro Gly Leu Ser Val
Thr145 150 155 160tac agc tgt gaa tct ggt tac ttg ctt gtt gga gaa
aag atc att aac 528Tyr Ser Cys Glu Ser Gly Tyr Leu Leu Val Gly Glu
Lys Ile Ile Asn165 170 175tgt ttg tct tcg gga aaa tgg agt gct gtc
ccc ccc aca tgt gaa gag 576Cys Leu Ser Ser Gly Lys Trp Ser Ala Val
Pro Pro Thr Cys Glu Glu180 185 190gca cgc tgt aaa tct cta gga cga
ttt ccc aat ggg aag gta aag gag 624Ala Arg Cys Lys Ser Leu Gly Arg
Phe Pro Asn Gly Lys Val Lys Glu195 200 205cct cca att ctc cgg gtt
ggt gta act gca aac ttt ttc tgt gat gaa 672Pro Pro Ile Leu Arg Val
Gly Val Thr Ala Asn Phe Phe Cys Asp Glu210 215 220ggg tat cga ctg
caa ggc cca cct tct agt cgg tgt gta att gct gga 720Gly Tyr Arg Leu
Gln Gly Pro Pro Ser Ser Arg Cys Val Ile Ala Gly225 230 235 240cag
gga gtt gct tgg acc aaa atg cca gta tgt gaa gaa att ttt tgc 768Gln
Gly Val Ala Trp Thr Lys Met Pro Val Cys Glu Glu Ile Phe Cys245 250
255cca tca cct ccc cct att ctc aat gga aga cat ata ggc aac tca cta
816Pro Ser Pro Pro Pro Ile Leu Asn Gly Arg His Ile Gly Asn Ser
Leu260 265 270gca aat gtc tca tat gga agc ata gtc act tac act tgt
gac ccg gac 864Ala Asn Val Ser Tyr Gly Ser Ile Val Thr Tyr Thr Cys
Asp Pro Asp275 280 285cca gag gaa gga gtg aac ttc atc ctt att gga
gag agc act ctc cgt 912Pro Glu Glu Gly Val Asn Phe Ile Leu Ile Gly
Glu Ser Thr Leu Arg290 295 300tgt aca gtt gat agt cag aag act ggg
acc tgg agt ggc cct gcc cca 960Cys Thr Val Asp Ser Gln Lys Thr Gly
Thr Trp Ser Gly Pro Ala Pro305 310 315 320cgc tgt gaa ctt tct act
tct gcg gtt cag tgt cca cat ccc cag atc 1008Arg Cys Glu Leu Ser Thr
Ser Ala Val Gln Cys Pro His Pro Gln Ile325 330 335cta aga ggc cga
atg gta tct ggg cag aaa gat cga tat acc tat aac 1056Leu Arg Gly Arg
Met Val Ser Gly Gln Lys Asp Arg Tyr Thr Tyr Asn340 345 350gac act
gtg ata ttt gct tgc atg ttt ggc ttc acc ttg aag ggc agc 1104Asp Thr
Val Ile Phe Ala Cys Met Phe Gly Phe Thr Leu Lys Gly Ser355 360
365aag caa atc cga tgc aat gcc caa ggc aca tgg gag cca tct gca cca
1152Lys Gln Ile Arg Cys Asn Ala Gln Gly Thr Trp Glu Pro Ser Ala
Pro370 375 380gtc tgt gaa aag gaa tgc cag gcc cct cct aac atc ctc
aat ggg caa 1200Val Cys Glu Lys Glu Cys Gln Ala Pro Pro Asn Ile Leu
Asn Gly Gln385 390 395 400aag gaa gat aga cac atg gtc cgc ttt gac
cct gga aca tct ata aaa 1248Lys Glu Asp Arg His Met Val Arg Phe Asp
Pro Gly Thr Ser Ile Lys405 410 415tat agc tgt aac cct ggc tat gtg
ctg gtg gga gaa gaa tcc ata cag 1296Tyr Ser Cys Asn Pro Gly Tyr Val
Leu Val Gly Glu Glu Ser Ile Gln420 425 430tgt acc tct gag gtg tgg
aca ccc cct gta ccc caa tgc aaa gtg gca 1344Cys Thr Ser Glu Val Trp
Thr Pro Pro Val Pro Gln Cys Lys Val Ala435 440 445gcg tgt gaa gct
aca gga agg caa ctc ttg aca aaa ccc cag cac caa 1392Ala Cys Glu Ala
Thr Gly Arg Gln Leu Leu Thr Lys Pro Gln His Gln450 455 460ttt gtt
aga cca gat gtc aac tct tct tgt ggt gaa ggg tac aag tta 1440Phe Val
Arg Pro Asp Val Asn Ser Ser Cys Gly Glu Gly Tyr Lys Leu465 470 475
480agt ggg agt gtt tat cag gag tgt caa ggc aca att cct tgg ttt atg
1488Ser Gly Ser Val Tyr Gln Glu Cys Gln Gly Thr Ile Pro Trp Phe
Met485 490 495gag att cgt ctt tgt aaa gaa atc acc tgc cca cca ccc
cct gtt atc 1536Glu Ile Arg Leu Cys Lys Glu Ile Thr Cys Pro Pro Pro
Pro Val Ile500 505 510tac aat ggg gca cac acc ggg agt tcc tta gaa
gat ttt cca tat gga 1584Tyr Asn Gly Ala His Thr Gly Ser Ser Leu Glu
Asp Phe Pro Tyr Gly515 520 525acc acg gtc act tac aca tgt aac cct
ggg cca gaa aga gga gtg gaa 1632Thr Thr Val Thr Tyr Thr Cys Asn Pro
Gly Pro Glu Arg Gly Val Glu530 535 540ttc agc ctc att gga gag agc
acc atc cgt tgt aca agc aat gat caa 1680Phe Ser Leu Ile Gly Glu Ser
Thr Ile Arg Cys Thr Ser Asn Asp Gln545 550 555 560gaa aga ggc acc
tgg agt ggc cct gct ccc ctg tgt aaa ctt tcc ctc 1728Glu Arg Gly Thr
Trp Ser Gly Pro Ala Pro Leu Cys Lys Leu Ser Leu565 570 575ctt gct
gtc cag tgc tca cat gtc cat att gca aat gga tac aag ata 1776Leu Ala
Val Gln Cys Ser His Val His Ile Ala Asn Gly Tyr Lys Ile580 585
590tct ggc aag gaa gcc cca tat ttc tac aat gac act gtg aca ttc aag
1824Ser Gly Lys Glu Ala Pro Tyr Phe Tyr Asn Asp Thr Val Thr Phe
Lys595 600 605tgt tat agt gga ttt act ttg aag ggc agt agt cag att
cgt tgc aaa 1872Cys Tyr Ser Gly Phe Thr Leu Lys Gly Ser Ser Gln Ile
Arg Cys Lys610 615 620gct gat aac acc tgg gat cct gaa ata cca gtt
tgt gaa aaa gaa aca 1920Ala Asp Asn Thr Trp Asp Pro Glu Ile Pro Val
Cys Glu Lys Glu Thr625 630 635 640tgc cag cat gtg aga cag agt ctt
caa gaa ctt cca gct ggt tca cgt 1968Cys Gln His Val Arg Gln Ser Leu
Gln Glu Leu Pro Ala Gly Ser Arg645 650 655gtg gag cta gtt aat acg
tcc tgc caa gat ggg tac cag ttg act gga 2016Val Glu Leu Val Asn Thr
Ser Cys Gln Asp Gly Tyr Gln Leu Thr Gly660 665 670cat gct tat cag
atg tgt caa gat gct gaa aat gga att tgg ttc aaa 2064His Ala Tyr Gln
Met Cys Gln Asp Ala Glu Asn Gly Ile Trp Phe Lys675 680 685aag att
cca ctt tgt aaa gtt att cac tgt cac cct cca cca gtg att 2112Lys Ile
Pro Leu Cys Lys Val Ile His Cys His Pro Pro Pro Val Ile690 695
700gtc aat ggg aag cac aca ggc atg atg gca gaa aac ttt cta tat gga
2160Val Asn Gly Lys His Thr Gly Met Met Ala Glu Asn Phe Leu Tyr
Gly705 710 715 720aat gaa gtc tct tat gaa tgt gac caa gga ttc tat
ctc ctg gga gag 2208Asn Glu Val Ser Tyr Glu Cys Asp Gln Gly Phe Tyr
Leu Leu Gly Glu725 730 735aaa aaa ttg cag tgc aga agt gat tct aaa
gga cat gga tct tgg agc 2256Lys Lys Leu Gln Cys Arg Ser Asp Ser Lys
Gly His Gly Ser Trp Ser740 745 750ggg cct tcc cca cag tgc tta cga
tct cct cct gtg act cgc tgc cct 2304Gly Pro Ser Pro Gln Cys Leu Arg
Ser Pro Pro Val Thr Arg Cys Pro755 760 765aat cca gaa gtc aaa cat
ggg tac aag ctc aat aaa aca cat tct gca 2352Asn Pro Glu Val Lys His
Gly Tyr Lys Leu Asn Lys Thr His Ser Ala770 775 780tat tcc cac aat
gac ata gtg tat gtt gac tgc aat cct ggc ttc atc 2400Tyr Ser His Asn
Asp Ile Val Tyr Val Asp Cys Asn Pro Gly Phe Ile785 790 795 800atg
aat ggt agt cgc gtg att agg tgt cat act gat aac aca tgg gtg 2448Met
Asn Gly Ser Arg Val Ile Arg Cys His Thr Asp Asn Thr Trp Val805 810
815cca ggt gtg cca act tgt atc aaa aaa gcc ttc ata ggg tgt cca cct
2496Pro Gly Val Pro Thr Cys Ile Lys Lys Ala Phe Ile Gly Cys Pro
Pro820 825 830ccg cct aag acc cct aac ggg aac cat act ggt gga aac
ata gct cga 2544Pro Pro Lys Thr Pro Asn Gly Asn His Thr Gly Gly Asn
Ile Ala Arg835 840 845ttt tct cct gga atg tca atc ctg tac agc tgt
gac caa ggc tac ctg 2592Phe Ser Pro Gly Met Ser Ile Leu Tyr Ser Cys
Asp Gln Gly Tyr Leu850 855 860ctg gtg gga gag gca ctc ctt ctt tgc
aca cat gag gga acc tgg agc 2640Leu Val Gly Glu Ala Leu Leu Leu Cys
Thr His Glu Gly Thr Trp Ser865 870 875 880caa cct gcc cct cat tgt
aaa gag gta aac tgt agc tca cca gca gat 2688Gln Pro Ala Pro His Cys
Lys Glu Val Asn Cys Ser Ser Pro Ala Asp885 890 895atg gat gga atc
cag aaa ggg ctg gaa cca agg aaa atg tat cag tat 2736Met Asp Gly Ile
Gln Lys Gly Leu Glu Pro Arg Lys Met Tyr Gln Tyr900 905 910gga gct
gtt gta act ctg gag tgt gaa gat ggg tat atg ctg gaa ggc 2784Gly Ala
Val Val Thr Leu Glu Cys Glu Asp Gly Tyr Met Leu Glu Gly915 920
925agt ccc cag agc cag tgc caa tcg gat cac caa tgg aac cct ccc ctg
2832Ser Pro Gln Ser Gln Cys Gln Ser Asp His Gln Trp Asn Pro Pro
Leu930 935 940gcg gtt tgc aga tcc cgt tca ctt gct cct gtc ctt tgt
ggt att gct 2880Ala Val Cys Arg Ser Arg Ser Leu Ala Pro Val Leu Cys
Gly Ile Ala945 950 955 960gca ggt ttg ata ctt ctt acc ttc ttg att
gtc gtt acc tta tac gtg 2928Ala Gly Leu Ile Leu Leu Thr Phe Leu Ile
Val Val Thr Leu Tyr Val965 970 975ata tca aaa cac aga gca cgc aat
tat tat aca gat aca agc cag aaa 2976Ile Ser Lys His Arg Ala Arg Asn
Tyr Tyr Thr Asp Thr Ser Gln Lys980 985 990gaa gct ttt cat tta gaa
gca cga gaa gta tat tct gtt gat cca tac 3024Glu Ala Phe His Leu Glu
Ala Arg Glu Val Tyr Ser Val Asp Pro Tyr995 1000 1005aac cca gcc agc
3036Asn Pro Ala Ser1010151012PRTHomo sapiensCR2 15Ile Ser Cys Gly
Ser Pro Pro Pro Ile Leu Asn Gly Arg Ile Ser Tyr1 5 10 15Tyr Ser Thr
Pro Ile Ala Val Gly Thr Val Ile Arg Tyr Ser Cys Ser 20 25 30Gly Thr
Phe Arg Leu Ile Gly Glu Lys Ser Leu Leu Cys Ile Thr Lys 35 40 45Asp
Lys Val Asp Gly Thr Trp Asp Lys Pro Ala Pro Lys Cys Glu Tyr 50 55
60Phe Asn Lys Tyr Ser Ser Cys Pro Glu Pro Ile Val Pro Gly Gly Tyr65
70 75 80Lys Ile Arg Gly Ser Thr Pro Tyr Arg His Gly Asp Ser Val Thr
Phe 85 90 95Ala Cys Lys Thr Asn Phe Ser Met Asn Gly Asn Lys Ser Val
Trp Cys 100 105 110Gln Ala Asn Asn Met Trp Gly Pro Thr Arg Leu Pro
Thr Cys Val Ser 115 120 125Val Phe Pro Leu Glu Cys Pro Ala Leu Pro
Met Ile His Asn Gly His 130 135 140His Thr Ser Glu Asn Val Gly Ser
Ile Ala Pro Gly Leu Ser Val Thr145 150 155 160Tyr Ser Cys Glu Ser
Gly Tyr Leu Leu Val Gly Glu Lys Ile Ile Asn 165 170 175Cys Leu Ser
Ser Gly Lys Trp Ser Ala Val Pro Pro Thr Cys Glu Glu 180 185 190Ala
Arg Cys Lys Ser Leu Gly Arg Phe Pro Asn Gly Lys Val Lys Glu 195
200 205Pro Pro Ile Leu Arg Val Gly Val Thr Ala Asn Phe Phe Cys Asp
Glu 210 215 220Gly Tyr Arg Leu Gln Gly Pro Pro Ser Ser Arg Cys Val
Ile Ala Gly225 230 235 240Gln Gly Val Ala Trp Thr Lys Met Pro Val
Cys Glu Glu Ile Phe Cys 245 250 255Pro Ser Pro Pro Pro Ile Leu Asn
Gly Arg His Ile Gly Asn Ser Leu 260 265 270Ala Asn Val Ser Tyr Gly
Ser Ile Val Thr Tyr Thr Cys Asp Pro Asp 275 280 285Pro Glu Glu Gly
Val Asn Phe Ile Leu Ile Gly Glu Ser Thr Leu Arg 290 295 300Cys Thr
Val Asp Ser Gln Lys Thr Gly Thr Trp Ser Gly Pro Ala Pro305 310 315
320Arg Cys Glu Leu Ser Thr Ser Ala Val Gln Cys Pro His Pro Gln Ile
325 330 335Leu Arg Gly Arg Met Val Ser Gly Gln Lys Asp Arg Tyr Thr
Tyr Asn 340 345 350Asp Thr Val Ile Phe Ala Cys Met Phe Gly Phe Thr
Leu Lys Gly Ser 355 360 365Lys Gln Ile Arg Cys Asn Ala Gln Gly Thr
Trp Glu Pro Ser Ala Pro 370 375 380Val Cys Glu Lys Glu Cys Gln Ala
Pro Pro Asn Ile Leu Asn Gly Gln385 390 395 400Lys Glu Asp Arg His
Met Val Arg Phe Asp Pro Gly Thr Ser Ile Lys 405 410 415Tyr Ser Cys
Asn Pro Gly Tyr Val Leu Val Gly Glu Glu Ser Ile Gln 420 425 430Cys
Thr Ser Glu Val Trp Thr Pro Pro Val Pro Gln Cys Lys Val Ala 435 440
445Ala Cys Glu Ala Thr Gly Arg Gln Leu Leu Thr Lys Pro Gln His Gln
450 455 460Phe Val Arg Pro Asp Val Asn Ser Ser Cys Gly Glu Gly Tyr
Lys Leu465 470 475 480Ser Gly Ser Val Tyr Gln Glu Cys Gln Gly Thr
Ile Pro Trp Phe Met 485 490 495Glu Ile Arg Leu Cys Lys Glu Ile Thr
Cys Pro Pro Pro Pro Val Ile 500 505 510Tyr Asn Gly Ala His Thr Gly
Ser Ser Leu Glu Asp Phe Pro Tyr Gly 515 520 525Thr Thr Val Thr Tyr
Thr Cys Asn Pro Gly Pro Glu Arg Gly Val Glu 530 535 540Phe Ser Leu
Ile Gly Glu Ser Thr Ile Arg Cys Thr Ser Asn Asp Gln545 550 555
560Glu Arg Gly Thr Trp Ser Gly Pro Ala Pro Leu Cys Lys Leu Ser Leu
565 570 575Leu Ala Val Gln Cys Ser His Val His Ile Ala Asn Gly Tyr
Lys Ile 580 585 590Ser Gly Lys Glu Ala Pro Tyr Phe Tyr Asn Asp Thr
Val Thr Phe Lys 595 600 605Cys Tyr Ser Gly Phe Thr Leu Lys Gly Ser
Ser Gln Ile Arg Cys Lys 610 615 620Ala Asp Asn Thr Trp Asp Pro Glu
Ile Pro Val Cys Glu Lys Glu Thr625 630 635 640Cys Gln His Val Arg
Gln Ser Leu Gln Glu Leu Pro Ala Gly Ser Arg 645 650 655Val Glu Leu
Val Asn Thr Ser Cys Gln Asp Gly Tyr Gln Leu Thr Gly 660 665 670His
Ala Tyr Gln Met Cys Gln Asp Ala Glu Asn Gly Ile Trp Phe Lys 675 680
685Lys Ile Pro Leu Cys Lys Val Ile His Cys His Pro Pro Pro Val Ile
690 695 700Val Asn Gly Lys His Thr Gly Met Met Ala Glu Asn Phe Leu
Tyr Gly705 710 715 720Asn Glu Val Ser Tyr Glu Cys Asp Gln Gly Phe
Tyr Leu Leu Gly Glu 725 730 735Lys Lys Leu Gln Cys Arg Ser Asp Ser
Lys Gly His Gly Ser Trp Ser 740 745 750Gly Pro Ser Pro Gln Cys Leu
Arg Ser Pro Pro Val Thr Arg Cys Pro 755 760 765Asn Pro Glu Val Lys
His Gly Tyr Lys Leu Asn Lys Thr His Ser Ala 770 775 780Tyr Ser His
Asn Asp Ile Val Tyr Val Asp Cys Asn Pro Gly Phe Ile785 790 795
800Met Asn Gly Ser Arg Val Ile Arg Cys His Thr Asp Asn Thr Trp Val
805 810 815Pro Gly Val Pro Thr Cys Ile Lys Lys Ala Phe Ile Gly Cys
Pro Pro 820 825 830Pro Pro Lys Thr Pro Asn Gly Asn His Thr Gly Gly
Asn Ile Ala Arg 835 840 845Phe Ser Pro Gly Met Ser Ile Leu Tyr Ser
Cys Asp Gln Gly Tyr Leu 850 855 860Leu Val Gly Glu Ala Leu Leu Leu
Cys Thr His Glu Gly Thr Trp Ser865 870 875 880Gln Pro Ala Pro His
Cys Lys Glu Val Asn Cys Ser Ser Pro Ala Asp 885 890 895Met Asp Gly
Ile Gln Lys Gly Leu Glu Pro Arg Lys Met Tyr Gln Tyr 900 905 910Gly
Ala Val Val Thr Leu Glu Cys Glu Asp Gly Tyr Met Leu Glu Gly 915 920
925Ser Pro Gln Ser Gln Cys Gln Ser Asp His Gln Trp Asn Pro Pro Leu
930 935 940Ala Val Cys Arg Ser Arg Ser Leu Ala Pro Val Leu Cys Gly
Ile Ala945 950 955 960Ala Gly Leu Ile Leu Leu Thr Phe Leu Ile Val
Val Thr Leu Tyr Val 965 970 975Ile Ser Lys His Arg Ala Arg Asn Tyr
Tyr Thr Asp Thr Ser Gln Lys 980 985 990Glu Ala Phe His Leu Glu Ala
Arg Glu Val Tyr Ser Val Asp Pro Tyr 995 1000 1005Asn Pro Ala Ser
1010161930DNAHomo sapiensDAF 16gac tgt ggc ctt ccc cca gat gta cct
aat gcc cag cca gct ttg gaa 48Asp Cys Gly Leu Pro Pro Asp Val Pro
Asn Ala Gln Pro Ala Leu Glu1 5 10 15ggc cgt aca agt ttt ccc gag gat
act gta ata acg tac aaa tgt gaa 96Gly Arg Thr Ser Phe Pro Glu Asp
Thr Val Ile Thr Tyr Lys Cys Glu20 25 30gaa agc ttt gtg aaa att cct
ggc gag aag gac tca gtg acc tgc ctt 144Glu Ser Phe Val Lys Ile Pro
Gly Glu Lys Asp Ser Val Thr Cys Leu35 40 45aag ggc atg caa tgg tca
gat att gaa gag ttc tgc aat cgt agc tgc 192Lys Gly Met Gln Trp Ser
Asp Ile Glu Glu Phe Cys Asn Arg Ser Cys50 55 60gag gtg cca aca agg
cta aat tct gca tcc ctc aaa cag cct tat atc 240Glu Val Pro Thr Arg
Leu Asn Ser Ala Ser Leu Lys Gln Pro Tyr Ile65 70 75 80act cag aat
tat ttt cca gtc ggt act gtt gtg gaa tat gag tgc cgt 288Thr Gln Asn
Tyr Phe Pro Val Gly Thr Val Val Glu Tyr Glu Cys Arg85 90 95cca ggt
tac aga aga gaa cct tct cta tca cca aaa cta act tgc ctt 336Pro Gly
Tyr Arg Arg Glu Pro Ser Leu Ser Pro Lys Leu Thr Cys Leu100 105
110cag aat tta aaa tgg tcc aca gca gtc gaa ttt tgt aaa aag aaa tca
384Gln Asn Leu Lys Trp Ser Thr Ala Val Glu Phe Cys Lys Lys Lys
Ser115 120 125tgc cct aat ccg gga gaa ata cga aat ggt cag att gat
gta cca ggt 432Cys Pro Asn Pro Gly Glu Ile Arg Asn Gly Gln Ile Asp
Val Pro Gly130 135 140ggc ata tta ttt ggt gca acc atc tcc ttc tca
tgt aac aca ggg tac 480Gly Ile Leu Phe Gly Ala Thr Ile Ser Phe Ser
Cys Asn Thr Gly Tyr145 150 155 160aaa tta ttt ggc tcg act tct agt
ttt tgt ctt att tca ggc agc tct 528Lys Leu Phe Gly Ser Thr Ser Ser
Phe Cys Leu Ile Ser Gly Ser Ser165 170 175gtc cag tgg agt gac ccg
ttg cca gag tgc aga gaa att tat tgt cca 576Val Gln Trp Ser Asp Pro
Leu Pro Glu Cys Arg Glu Ile Tyr Cys Pro180 185 190gca cca cca caa
att gac aat gga ata att caa ggg gaa cgt gac cat 624Ala Pro Pro Gln
Ile Asp Asn Gly Ile Ile Gln Gly Glu Arg Asp His195 200 205tat gga
tat aga cag tct gta acg tat gca tgt aat aaa gga ttc acc 672Tyr Gly
Tyr Arg Gln Ser Val Thr Tyr Ala Cys Asn Lys Gly Phe Thr210 215
220atg att gga gag cac tct att tat tgt act gtg aat aat gat gaa gga
720Met Ile Gly Glu His Ser Ile Tyr Cys Thr Val Asn Asn Asp Glu
Gly225 230 235 240gag tgg agt ggc cca cca cct gaa tgc aga gga aaa
tct cta act tcc 768Glu Trp Ser Gly Pro Pro Pro Glu Cys Arg Gly Lys
Ser Leu Thr Ser245 250 255aag gtc cca cca aca gtt cag aaa cct acc
aca gta aat gtt cca act 816Lys Val Pro Pro Thr Val Gln Lys Pro Thr
Thr Val Asn Val Pro Thr260 265 270aca gaa gtc tca cca act tct cag
aaa acc acc aca aaa acc acc aca 864Thr Glu Val Ser Pro Thr Ser Gln
Lys Thr Thr Thr Lys Thr Thr Thr275 280 285cca aat gct caa gca aca
cgg agt aca cct gtt tcc agg aca acc aag 912Pro Asn Ala Gln Ala Thr
Arg Ser Thr Pro Val Ser Arg Thr Thr Lys290 295 300cat ttt cat gaa
aca acc cca aat aaa gga agt gga acc act tca ggt 960His Phe His Glu
Thr Thr Pro Asn Lys Gly Ser Gly Thr Thr Ser Gly305 310 315 320act
acc cgt ctt cta tct ggg cac acg tgt ttc acg ttg aca ggt ttg 1008Thr
Thr Arg Leu Leu Ser Gly His Thr Cys Phe Thr Leu Thr Gly Leu325 330
335ctt ggg acg cta gta acc atg ggc ttg ctg act tagccaaaga
agagttaaga 1061Leu Gly Thr Leu Val Thr Met Gly Leu Leu Thr340
345agaaaataca cacaagtata cagactgttc ctagtttctt agacttatct
gcatattgga 1121taaaataaat gcaattgtgc tcttcattta ggatgctttc
attgtcttta agatgtgtta 1181ggaatgtcaa cagagcaagg agaaaaaagg
cagtcctgga atcacattct tagcacacct 1241gcgcctcttg aaaatagaac
aacttgcaga attgagagtg attcctttcc taaaagtgta 1301agaaagcata
gagatttgtt cgtattaaga atgggatcac gaggaaaaga gaaggaaagt
1361gatttttttc cacaagatct gaaatgatat ttccacttat aaaggaaata
aaaaatgaaa 1421aacattattt ggatatcaaa agcaaataaa aacccaattc
agtctcttct aagcaaaatt 1481gctaaagaga gatgaccaca ttataaagta
atctttggct aaggcatttt catctttcct 1541tcggttggca aaatatttta
aaggtaaaac atgctggtga accagggtgt tgatggtgat 1601aagggaggaa
tatagaatga aagactgaat cttcctttgt tgcacaaata gagtttggaa
1661aaagcctgtg aaaggtgtct tctttgactt aatgtcttta aaagtatcca
gagatactac 1721aatattaaca taagaaaaga ttatatatta tttctgaatc
gagatgtcca tagtcaaatt 1781tgtaaatctt attcttttgt aatatttatt
tatatttatt tatgacagtg aacattctga 1841ttttacatgt aaaacaagaa
aagttgaaga agatatgtga agaaaaatgt atttttccta 1901aatagaaata
aatgatccca ttttttggt 193017347PRTHomo sapiensDAF 17Asp Cys Gly Leu
Pro Pro Asp Val Pro Asn Ala Gln Pro Ala Leu Glu 1 5 10 15Gly Arg
Thr Ser Phe Pro Glu Asp Thr Val Ile Thr Tyr Lys Cys Glu 20 25 30Glu
Ser Phe Val Lys Ile Pro Gly Glu Lys Asp Ser Val Thr Cys Leu 35 40
45Lys Gly Met Gln Trp Ser Asp Ile Glu Glu Phe Cys Asn Arg Ser Cys
50 55 60Glu Val Pro Thr Arg Leu Asn Ser Ala Ser Leu Lys Gln Pro Tyr
Ile65 70 75 80Thr Gln Asn Tyr Phe Pro Val Gly Thr Val Val Glu Tyr
Glu Cys Arg 85 90 95Pro Gly Tyr Arg Arg Glu Pro Ser Leu Ser Pro Lys
Leu Thr Cys Leu 100 105 110Gln Asn Leu Lys Trp Ser Thr Ala Val Glu
Phe Cys Lys Lys Lys Ser 115 120 125Cys Pro Asn Pro Gly Glu Ile Arg
Asn Gly Gln Ile Asp Val Pro Gly 130 135 140Gly Ile Leu Phe Gly Ala
Thr Ile Ser Phe Ser Cys Asn Thr Gly Tyr145 150 155 160Lys Leu Phe
Gly Ser Thr Ser Ser Phe Cys Leu Ile Ser Gly Ser Ser 165 170 175Val
Gln Trp Ser Asp Pro Leu Pro Glu Cys Arg Glu Ile Tyr Cys Pro 180 185
190Ala Pro Pro Gln Ile Asp Asn Gly Ile Ile Gln Gly Glu Arg Asp His
195 200 205Tyr Gly Tyr Arg Gln Ser Val Thr Tyr Ala Cys Asn Lys Gly
Phe Thr 210 215 220Met Ile Gly Glu His Ser Ile Tyr Cys Thr Val Asn
Asn Asp Glu Gly225 230 235 240Glu Trp Ser Gly Pro Pro Pro Glu Cys
Arg Gly Lys Ser Leu Thr Ser 245 250 255Lys Val Pro Pro Thr Val Gln
Lys Pro Thr Thr Val Asn Val Pro Thr 260 265 270Thr Glu Val Ser Pro
Thr Ser Gln Lys Thr Thr Thr Lys Thr Thr Thr 275 280 285Pro Asn Ala
Gln Ala Thr Arg Ser Thr Pro Val Ser Arg Thr Thr Lys 290 295 300His
Phe His Glu Thr Thr Pro Asn Lys Gly Ser Gly Thr Thr Ser Gly305 310
315 320Thr Thr Arg Leu Leu Ser Gly His Thr Cys Phe Thr Leu Thr Gly
Leu 325 330 335Leu Gly Thr Leu Val Thr Met Gly Leu Leu Thr 340
345181050DNAHomo sapiensMCP 18tgt gag gag cca cca aca ttt gaa gct
atg gag ctc att ggt aaa cca 48Cys Glu Glu Pro Pro Thr Phe Glu Ala
Met Glu Leu Ile Gly Lys Pro1 5 10 15aaa ccc tac tat gag att ggt gaa
cga gta gat tat aag tgt aaa aaa 96Lys Pro Tyr Tyr Glu Ile Gly Glu
Arg Val Asp Tyr Lys Cys Lys Lys20 25 30gga tac ttc tat ata cct cct
ctt gcc acc cat act att tgt gat cgg 144Gly Tyr Phe Tyr Ile Pro Pro
Leu Ala Thr His Thr Ile Cys Asp Arg35 40 45aat cat aca tgg cta cct
gtc tca gat gac gcc tgt tat aga gaa aca 192Asn His Thr Trp Leu Pro
Val Ser Asp Asp Ala Cys Tyr Arg Glu Thr50 55 60tgt cca tat ata cgg
gat cct tta aat ggc caa gca gtc cct gca aat 240Cys Pro Tyr Ile Arg
Asp Pro Leu Asn Gly Gln Ala Val Pro Ala Asn65 70 75 80ggg act tac
gag ttt ggt tat cag atg cac ttt att tgt aat gag ggt 288Gly Thr Tyr
Glu Phe Gly Tyr Gln Met His Phe Ile Cys Asn Glu Gly85 90 95tat tac
tta att ggt gaa gaa att cta tat tgt gaa ctt aaa gga tca 336Tyr Tyr
Leu Ile Gly Glu Glu Ile Leu Tyr Cys Glu Leu Lys Gly Ser100 105
110gta gca att tgg agc ggt aag ccc cca ata tgt gaa aag gtt ttg tgt
384Val Ala Ile Trp Ser Gly Lys Pro Pro Ile Cys Glu Lys Val Leu
Cys115 120 125aca cca cct cca aaa ata aaa aat gga aaa cac acc ttt
agt gaa gta 432Thr Pro Pro Pro Lys Ile Lys Asn Gly Lys His Thr Phe
Ser Glu Val130 135 140gaa gta ttt gag tat ctt gat gca gta act tat
agt tgt gat cct gca 480Glu Val Phe Glu Tyr Leu Asp Ala Val Thr Tyr
Ser Cys Asp Pro Ala145 150 155 160cct gga cca gat cca ttt tca ctt
att gga gag agc acg att tat tgt 528Pro Gly Pro Asp Pro Phe Ser Leu
Ile Gly Glu Ser Thr Ile Tyr Cys165 170 175ggt gac aat tca gtg tgg
agt cgt gct gct cca gag tgt aaa gtg gtc 576Gly Asp Asn Ser Val Trp
Ser Arg Ala Ala Pro Glu Cys Lys Val Val180 185 190aaa tgt cga ttt
cca gta gtc gaa aat gga aaa cag ata tca gga ttt 624Lys Cys Arg Phe
Pro Val Val Glu Asn Gly Lys Gln Ile Ser Gly Phe195 200 205gga aaa
aaa ttt tac tac aaa gca aca gtt atg ttt gaa tgc gat aag 672Gly Lys
Lys Phe Tyr Tyr Lys Ala Thr Val Met Phe Glu Cys Asp Lys210 215
220ggt ttt tac ctc gat ggc agc gac aca att gtc tgt gac agt aac agt
720Gly Phe Tyr Leu Asp Gly Ser Asp Thr Ile Val Cys Asp Ser Asn
Ser225 230 235 240act tgg gat ccc cca gtt cca aag tgt ctt aaa gtg
tcg act tct tcc 768Thr Trp Asp Pro Pro Val Pro Lys Cys Leu Lys Val
Ser Thr Ser Ser245 250 255act aca aaa tct cca gcg tcc agt gcc tca
ggt cct agg cct act tac 816Thr Thr Lys Ser Pro Ala Ser Ser Ala Ser
Gly Pro Arg Pro Thr Tyr260 265 270aag cct cca gtc tca aat tat cca
gga tat cct aaa cct gag gaa gga 864Lys Pro Pro Val Ser Asn Tyr Pro
Gly Tyr Pro Lys Pro Glu Glu Gly275 280 285ata ctt gac agt ttg gat
gtt tgg gtc att gct gtg att gtt att gcc 912Ile Leu Asp Ser Leu Asp
Val Trp Val Ile Ala Val Ile Val Ile Ala290 295 300ata gtt gtt gga
gtt gca gta att tgt gtt gtc ccg tac aga tat ctt 960Ile Val Val Gly
Val Ala Val Ile Cys Val Val Pro Tyr Arg Tyr Leu305 310 315 320caa
agg agg aag aag aaa ggg aaa gca gat ggt gga gct gaa tat gcc 1008Gln
Arg Arg Lys Lys Lys Gly Lys Ala Asp Gly Gly Ala Glu Tyr Ala325 330
335act tac cag act aaa tca acc act cca gca gag cag aga ggc 1050Thr
Tyr Gln Thr Lys Ser Thr Thr Pro Ala Glu Gln Arg Gly340 345
35019350PRTHomo sapiensMCP 19Cys Glu Glu Pro Pro Thr Phe Glu Ala
Met Glu Leu Ile Gly Lys Pro1 5 10 15Lys Pro Tyr Tyr Glu Ile Gly Glu
Arg Val Asp Tyr Lys Cys Lys Lys 20 25 30Gly Tyr Phe Tyr Ile Pro Pro
Leu Ala Thr His Thr Ile Cys Asp Arg 35 40 45Asn His Thr Trp Leu Pro
Val Ser Asp Asp Ala Cys Tyr Arg Glu Thr 50 55 60Cys Pro Tyr Ile Arg
Asp Pro Leu Asn Gly Gln Ala Val Pro Ala Asn65 70
75 80Gly Thr Tyr Glu Phe Gly Tyr Gln Met His Phe Ile Cys Asn Glu
Gly 85 90 95Tyr Tyr Leu Ile Gly Glu Glu Ile Leu Tyr Cys Glu Leu Lys
Gly Ser 100 105 110Val Ala Ile Trp Ser Gly Lys Pro Pro Ile Cys Glu
Lys Val Leu Cys 115 120 125Thr Pro Pro Pro Lys Ile Lys Asn Gly Lys
His Thr Phe Ser Glu Val 130 135 140Glu Val Phe Glu Tyr Leu Asp Ala
Val Thr Tyr Ser Cys Asp Pro Ala145 150 155 160Pro Gly Pro Asp Pro
Phe Ser Leu Ile Gly Glu Ser Thr Ile Tyr Cys 165 170 175Gly Asp Asn
Ser Val Trp Ser Arg Ala Ala Pro Glu Cys Lys Val Val 180 185 190Lys
Cys Arg Phe Pro Val Val Glu Asn Gly Lys Gln Ile Ser Gly Phe 195 200
205Gly Lys Lys Phe Tyr Tyr Lys Ala Thr Val Met Phe Glu Cys Asp Lys
210 215 220Gly Phe Tyr Leu Asp Gly Ser Asp Thr Ile Val Cys Asp Ser
Asn Ser225 230 235 240Thr Trp Asp Pro Pro Val Pro Lys Cys Leu Lys
Val Ser Thr Ser Ser 245 250 255Thr Thr Lys Ser Pro Ala Ser Ser Ala
Ser Gly Pro Arg Pro Thr Tyr 260 265 270Lys Pro Pro Val Ser Asn Tyr
Pro Gly Tyr Pro Lys Pro Glu Glu Gly 275 280 285Ile Leu Asp Ser Leu
Asp Val Trp Val Ile Ala Val Ile Val Ile Ala 290 295 300Ile Val Val
Gly Val Ala Val Ile Cys Val Val Pro Tyr Arg Tyr Leu305 310 315
320Gln Arg Arg Lys Lys Lys Gly Lys Ala Asp Gly Gly Ala Glu Tyr Ala
325 330 335Thr Tyr Gln Thr Lys Ser Thr Thr Pro Ala Glu Gln Arg Gly
340 345 350
* * * * *