U.S. patent number RE42,704 [Application Number 09/232,488] was granted by the patent office on 2011-09-13 for production of fibrinogen in transgenic animals.
This patent grant is currently assigned to Pharming Intellectual Property B.V.. Invention is credited to Donald C. Foster, Donna E. Prunkard.
United States Patent |
RE42,704 |
Prunkard , et al. |
September 13, 2011 |
Production of fibrinogen in transgenic animals
Abstract
Materials and methods for producing fibrinogen in transgenic
non-human mammals are disclosed. DNA segments encoding A.alpha.,
B.beta. and .gamma. chains of fibrinogen are introduced into the
germ line of a non-human mammal, and the mammal or its female
progeny produces milk containing fibrinogen expressed from the
introduced DNA segments. Non-human mammalian embryos and transgenic
non-human mammals carrying DNA segments encoding heterologous
fibrinogen polypeptide chains are also disclosed.
Inventors: |
Prunkard; Donna E. (Seattle,
WA), Foster; Donald C. (Lake Forest Park, WA) |
Assignee: |
Pharming Intellectual Property
B.V. (Al Leiden, NL)
|
Family
ID: |
22765288 |
Appl.
No.: |
09/232,488 |
Filed: |
January 15, 1999 |
Related U.S. Patent Documents
|
|
|
|
|
|
|
Application
Number |
Filing Date |
Patent Number |
Issue Date |
|
Reissue of: |
08206176 |
Mar 3, 1994 |
5639940 |
Jun 17, 1997 |
|
|
Current U.S.
Class: |
800/7; 800/16;
800/18; 800/17; 435/320.1; 800/25; 800/14; 800/15 |
Current CPC
Class: |
C12N
15/8509 (20130101); A01K 67/0278 (20130101); C12N
15/87 (20130101); C07K 14/4732 (20130101); C12N
15/89 (20130101); C07K 14/4717 (20130101); C07K
14/75 (20130101); C07K 14/47 (20130101); A01K
2217/00 (20130101); A01K 2227/105 (20130101); A01K
2207/15 (20130101); A01K 2227/103 (20130101); C07K
2319/00 (20130101); A01K 2267/01 (20130101) |
Current International
Class: |
C12P
21/00 (20060101); C12N 15/00 (20060101); A01K
67/00 (20060101) |
Field of
Search: |
;800/7,14-18,25
;435/320.1 |
References Cited
[Referenced By]
U.S. Patent Documents
Foreign Patent Documents
|
|
|
|
|
|
|
WO88/00239 |
|
Jan 1988 |
|
WO |
|
WO 90/05188 |
|
May 1990 |
|
WO |
|
WO91/03216 |
|
Jun 1991 |
|
WO |
|
WO91/08216 |
|
Jun 1991 |
|
WO |
|
WO92/11358 |
|
Jul 1992 |
|
WO |
|
WO92/11757 |
|
Jul 1992 |
|
WO |
|
Other References
Garner's Preliminary Motion No. 1, Aug. 16, 2001. cited by other
.
Declaration of Alan Colman, Aug. 16, 2001. cited by other .
Declaration of David Toman, Aug. 16, 2001. cited by other .
Declaration of Lothar Hennighausen, Aug. 16, 2001. cited by other
.
Declaration of Jeffrey Rosen, Aug. 16, 2001. cited by other .
Yarus et al, Genetic Engineering, 18: 57-81 (1996). cited by other
.
Reddy et al., Animal Biotechnology 2: 15-29 (1991). cited by other
.
Tojo et al., J. Reprod. Develop. 39: 145-155 (1993). cited by other
.
Farrell et al., Biochemistry, 30: 9414-9420 (1991). cited by other
.
Velander's Preliminary Motion No. 1, Aug. 16, 2001. cited by other
.
Garner's Opposition to Velander's Preliminary Motion No. 1, Aug.
16, 2001. cited by other .
Prunkard et al., Nature Biotechnology, 14:867-871, 870 (1996).
cited by other .
Hurtley et al., Annu. Rev. Cell Biol. 5: 277-307 (1989). cited by
other .
Prunkard et al., (Abstract) J. Cell Biochem., Suppl. 17C (1993).
cited by other .
Drohan et al., Transgenic Research, 3: 355-364 (1994). cited by
other .
Velander et al., PNAS (USA), 89: 12003-12007 (1992). cited by other
.
Pittius et al., PNAS USA, 85: 5874 (1988). cited by other .
Simons et al., Biotechnology, 6:179 (1988). cited by other .
Clark et al., Biotechnology, 7:487 (1989). cited by other .
Garner's Motion to Correct Inventorship, Aug. 16, 2001. cited by
other .
Garner's Preliminary Motion No. 2, Aug. 16, 2001. cited by other
.
Hennighausen, Protein Expression and Publication 1: 3-8 (1990).
cited by other .
Garner Exhibit No. 28--Typed Notes, Aug. 16, 2001. cited by other
.
Masoud et al., Ann. Zootech. 45: -9 (1996). cited by other .
Garner's Reply in Support of Garner's Preliminary Motion 1, Aug.
16, 2001. cited by other .
Second Declaration of Alan Colman, Aug. 16, 2001. cited by other
.
Declaration of Richard F. Lathe, Aug. 16, 2001. cited by other
.
Behringer et al., "Synthesis Of Functional Human Hemoglobin In
Transgenic Mice," Science, 245:971-973 (1989). cited by other .
Binnie et al, "Characterization Of Purified Recombinant Fibrinogen:
Partial Phosphorylation Of Fibrinopeptide A," Biochemistry,
32:107-113 (1993). cited by other .
Bolyard, M.G., et al., "Expression In Escherichia Coli Of The Human
Fibrinogen B.beta. Chains And Its Cleavage By Thrombin," Blood,
73:1202-1206, 1989. cited by other .
Burdon et al., "Expression Of A Whey Acidic Protein Transgene
During Mammary Development," The Journal of Biological Chemistry,
266(11):6909-6914 (1991). cited by other .
Chung, Dominic W., et al., "Characterization Of A Complementary
Deoxyribonucleic Acid and Genomic Deoxyribonucleic Acid For The
.beta. Chain Of Human Fibrinogen," Biochemistry, 22:3244-3250.
cited by other .
Chung et al., "Characterization Of A Complementary Deoxyribonucleic
Acid Coding For The .gamma. Chain Of Human Fibrinogen,"
Biochemistry, 22(13):3250-3256 (1983). cited by other .
Chung et al., "Nucleotide Sequences Of The Three Genes Coding For
Human Fibrinogen," Adv. Exp. Med., 201:39-48 (1990). cited by other
.
Clark et al., "Pharmaceuticals From Transgenic Livestock," Tibtech,
5:20-24 (1987). cited by other .
Clark et al., "Expression of human anti-hemophilic factor IX in the
milk of transgenic sheep," Bio/Technology, 7:487-492 (1989). cited
by other .
Clark et al., "Rescuing Transgene Expression By Co-Integration,"
Bio/Technology, 10:1450-1454. cited by other .
Danishefsky, K., et al., "Intracellular Fate Of Fibrinogen B.beta.
Chain Expressed in COS Cells," Biochem, Biophys. Acta, 1048:202-208
(1990). cited by other .
James Darnell, Molecular Cell Biology abstract "The Organization Of
DNA Differs Greatly In Prokaryotic and Eukaryotic Cells," p. 137
(Scientific American Books, 1986). cited by other .
DiTullio et al., "Production Of Cystic Fibrosis Transmembrane
Conductance Regulator In The Milk of Transgenic Mice,"
Bio/Technology, 10:74-77 (1992). cited by other .
Dobrovolsky et al, "Human .gamma.-Interferon Expression In The
Mammary Gland Of Transgenic Mice," FEBS Letters, 319:181-184
(1993). cited by other .
Drohan et al., "Inefficient Processing Of Human Protein C In The
Mouse Mammary Gland," Transgenic Research, 3:355-364 (1994). cited
by other .
Farrell, D.H. et al., "Binding Of Recombinant Fibrinogen Mutants To
Platelets," J. Biol. Chem., 269:226-231 (1994). cited by other
.
Farrell et al., "Recombinant Human Fibrinogen And Sulfation Of The
65 ' Chain," Biochemistry, 30(39):9414-9420 (1991). cited by other
.
Greenberg et al., "Expression Of Biologically Active Heterodimeric
Bovine Follicle-Stimulating Hormone In Milk Of Transgenic Mice,"
Biochemistry, 88:8327-8331 (1991). cited by other .
Grosschedl, R., et al., "Introduction Of A .mu. Immunoglobulin Gene
Into The Mouse Germ Line; Specific Expression In Lymphoid Cells And
Synthesis Of Functional Antibody," Cell, 38:647-658 (1984). cited
by other .
Hartwig, R. et al., "Studies On The Assembly And Secretion Of
Fibrinogen," J. Biol. Chem., 266:6578-6585 (1991). cited by other
.
Hennighausen, Lothar, "The Mammary Gland As A Bioreactor:
Production Of Foreign Proteins In Milk," Protein Expression And
Purification, 1:3-8 (1990). cited by other .
Hurtley et al., "Protein Obligomerization In The Endoplasmic
Reticulum," Annu. Rev. Cell Biol, 5: 277-307 (1989). cited by other
.
Lee, S.H., et al., "Production Of Biomedical Proteins In The Milk
Of Transgenic Dairy Cows; The State Of The Art," J. Controlled
Release, 29:213-221 (1994). cited by other .
Lord, Susan T., "Expression Of A Cloned Human Fibrinogen cDNA In
Escherichia Coli: Synthesis Of An A Alpha Polypeptide," DNA,
4:33-38 (1980). cited by other .
Lord, Susan T., et al., "Expression Of A Fibrinogen Fusion Peptide
In Escherichia Coli: A Model Thrombin Substrate For
Structure/Function Analysis," Blood, 73:166-171 (1989). cited by
other .
Massoud et al., "The Deleterious Effects Of Human Erythropoietin
Gene Driven By The Rabbit Whey Acidic Protein Gene Promoter In
Transgenic Rabbits," Ann Zootech, 45:1-9 (1996). cited by other
.
McClenaghan et al., "Secretory Proteins Compete For Production In
The Mammary Gland Of Transgenic Mice," Biochemical Journal,
310:637-641 (1995). cited by other .
Mishina et al., "Molecular Distinction Between Fetal And Adult
Forms Of Muscle Acetylcholine Receptor," Nature, 321:406-411
(1986). cited by other .
Morocol et al., "The Porcine Mammary Gland As A Bioreactor For
Complex Proteins," Recombinant DNA Technology II, 721:21-233
(1994). cited by other .
Nicoll et al., "Estimates Of Parenchymal, Stromal, And Lymph Node
Deoxyribonucleic Acid In Mammary Glands Of C311/Crgl/2 MICE," Life
Sciences, 4(9):993-1001 (1965). cited by other .
Palmiter, Richard D., et al., "Transgenic Mice," Cell, 41:343-345
(1985). cited by other .
Pittius et al., "A Milk Protein Gene Promoter Directs The
Expression Of Human Tissue Plasminogen Activator cDNA To The
Mammary Gland In Transgenic Mice," Biochemistry, 85:5874-5878
(1988). cited by other .
Prunkard et al. UCLA Symposia On Molecular And Cellular Biology
(Feb. 8-14, 1993, Taos, New Mexico), "High Level Secretion Of
Recombinant Human Fibrinogen In BHK Cells Is Limited By A
Post-Transcriptional Process," Journal of Ceilular Biochemistry,
vol. 17C, Abstract H123 (1993). cited by other .
Prunkard et al., "High-Level Expression Of Recombinant Human
Fibrinogen In The Milk Of Transgenic Mice," Nature Biotechnology,
14:867-871 (1996). cited by other .
Reddy et al., "Expression Of Human Growth Hormone In The Milk Of
Transgenic Mice," Animal Biotechnology, 2(1):15-29. (1991). cited
by other .
Redman, C., et al., "Recombinant Production, Secretion, And
Clotting Behavior Of Fibrinogen And Cell Line Used Therein,"
Chemical Abstracts, 115: 526 (1991). cited by other .
Rixon, M.W., et al., "Nucleotide Sequence Of The Gene For The
.gamma. Chain Of Human Fibrinogen," Biochemistry, 24:2077-2086
(1985). cited by other .
Rixon et al., "Characterization Of A Complementary Deoxyribonucleic
Acid Coding For The .alpha. Chain Of Human Fibrinogen,"
Biochemistry, 22(13):3237-3244 (1983). cited by other .
Roy et al., "Assembly And Secretion Of Recombinant Human
Fibrinogen," The Journal of Biological Chemistry, 266(8):4758-4763
(1991). cited by other .
Shani et al., "Expression Of Human Serum Albumin In The Milk Of
Transgenic Mice," Transgenic Research, 1(5):195-208 (1992). cited
by other .
Simons et al., "Gene Transfer Into Sheep," Bio/Technology,
6:179-183 (1988). cited by other .
Sinai Yarus et al., "Engineering Transgenes For Use In The Mammary
Gland," Genetic Engineering, 18:57-81 (1996). cited by other .
Storb et ai., "Transgenic Mice With .mu. And .notlessthan. Genes
Encoding Antiphosphorylcholine Antibodies," The Journal of
Experimental Medicine, 164:627-641 (1986). cited by other .
Tojo et al., "Production And Characterization Of Transgenic Mice
Expressing A hGH Fusion Gene Driven By The Promoter Of Mouse Whey
Acidic Protein (mWAP) Putatively Specific To Mammary Gland,"
Journal of Reproduction and Development, 39(2):145-155 (1993).
cited by other .
Turner et ai., "Proteins Are Secreted By Both Constitutive And
Regulated Secretory Pathways In Lactating Mouse Mammary Epithelial
Cells," The Journal of Cell Biology, 117:269-278 (1992). cited by
other .
Velander et al., "High-Level Expression Of A Heterologous Protein
In The Milk Of Transgenic Swine USing The cDNA Encoding Human
Protein C," Applied Biological Sciences, 89:12003-12007 (1992).
cited by other .
Whitelaw et al., "Position-Independent Expression Of The Ovine
.beta.-Lactoglobulin Gene In Transgenic Mice," Biochemical Journal,
286:31-39 (1992). cited by other .
Wilkins et al., "Isolation Of Recombinant Proteins From Milk,"
Journal of Cellular Biochemistry, 49:333-338 (1992). cited by other
.
Yu et al., "Fibrinogen Precursors," The Journal of Biological
Chemistry, 259(16)10574-10581 (1984). cited by other .
Yu et al., "Functional Human CD4 Protein Produced In Milk Of
Transgenic Mice," Mol. Biol. Med., 6:255-261 (1989). cited by other
.
Bolyard et al., Gene 66: 183-192, 1988. cited by other .
Wall, 1996, Themogeneology 45:57-68. cited by other .
Houdebine, 1994, Journal of Biotechnology 34:269-287. cited by
other .
Genes, Lewin, ed., John Wiley and Sons, N.Y. pp. 87-96. cited by
other .
D.F. Hosher, "Disorders of Blood Coagulation" in Cecil Textbook of
Medicine, 18.sup.th Ed., Wyngaarden et al., eds., W.B. Saunders
Co., Philadelphia, 1988. pp. 1060-1065. cited by other.
|
Primary Examiner: Crouch; Deborah
Attorney, Agent or Firm: Alston and Bird LLP
Claims
We claim:
1. A method for producing biocompetent fibrinogen comprising:
providing a first DNA segment encoding a secretion signal operably
linked to a heterologous fibrinogen A.alpha. chain, .Iadd.the DNA
segment comprising genomic DNA encoding the A.alpha. chain,
.Iaddend.a second DNA segment encoding a secretion signal operably
linked to a heterologous fibrinogen B.beta. chain, .Iadd.the DNA
segment comprising genomic DNA encoding the B.beta. chain,
.Iaddend.and a third DNA segment encoding a secretion signal
operably linked to a heterologous fibrinogen .gamma. chain,
.Iadd.the DNA segment comprising genomic DNA encoding the .gamma.
chain, .Iaddend.wherein each chain is from the same species, and
wherein each of said first, second and third segments is operably
linked to additional DNA segments required for its expression in
the mammary gland of a host female mammal .Iadd.and the first,
second, third segments are linked in a single vector.Iaddend.;
introducing said DNA segments into a fertilized egg of a non-human
mammalian species heterologous to the species of origin of said
fibrinogen chains; inserting said egg into an oviduct or uterus of
a female of said mammalian species to obtain offspring carrying
said DNA segments; breeding said offspring to produce female
progeny that express said first, second and third DNA segments and
produce milk containing biocompetent fibrinogen encoded by said
segments; collecting milk from said female progeny; and and
recovering the biocompetent fibrinogen from the milk.
2. A method according to claim 1 wherein said species into which
said DNA segments are introduced is selected from the group
consisting of sheep, pigs, goats, and cattle.
.[.3. A method according to claim 1 wherein each of said first,
second and third DNA segments comprises an intron..].
.[.4. A method according to claim 1 wherein the molar ratio of said
first, second and third DNA segments is within the range of
0.5-1:0.5-1:0.5-1..].
5. A method according to claim 1 wherein each of said first, second
and third DNA segments is operably linked to a transcription
promoter selected from the group consisting of casein,
.beta.-lactoglobulin, .alpha.-lactalbumin and whey acidic protein
gene promoters.
6. A method according to claim 1 wherein said first, second and
third DNA segments are expressed under the control of a
.beta.-lactoglobulin promoter.
7. A method according to claim 1 wherein said introducing step
comprises injecting said first, second and third DNA segments into
a pronucleus of said fertilized egg.
8. A method according to claim 1 wherein said fibrinogen is human
fibrinogen.
9. A method according to claim 1 wherein said second DNA segment
comprises a sequence of nucleotides as shown in SEQ ID NO: 3 from
nucleotide 470 to nucleotide 8100.
10. A method according to claim 1 wherein said second DNA segment
comprises a sequence of nucleotides as shown in SEQ ID NO: 3 from
nucleotide 512 to nucleotide 8100.
11. A method according to claim 1 wherein said species into which
said DNA segments is introduced is sheep.
12. A method of producing biocompetent fibrinogen comprising:
incorporating .Iadd.into operable linkage .Iaddend.a .[.first.].
DNA segment encoding a secretion signal .[.operably linked
to.]..Iadd., a genomic DNA segment encoding .Iaddend.an A.alpha.
chain of fibrinogen .[.into a .beta.-lactoglobulin gene.].
.Iadd.and an additional segment required for expression of the
A.alpha..Iaddend. chain in the mammary gland of a mammal to produce
a first gene fusion.[.comprising a .beta.-lactoglobulin promoter
operably linked to the first DNA segment.].; incorporating
.Iadd.into operable linkage .Iaddend.a .[.second.]. DNA segment
encoding a secretion signal .[.operably linked to.]..Iadd., a
genomic DNA segment encoding .Iaddend.a B.beta. chain of fibrinogen
.[.into a .beta.-lactoglobulin gene.]. .Iadd.and an additional
segment required for expression of the B.beta. chain .Iaddend.to
produce a second gene fusion .[.comprising a .beta.-lactoglobulin
promoter operably linked to the second DNA segment.].;
incorporating .Iadd.into operable linkage .Iaddend.a .[.third.].
DNA segment encoding a secretion signal .[.operably linked
to.]..Iadd., a genomic DNA segment encoding .Iaddend.a .gamma.
chain of fibrinogen .Iadd.and an additional segment required for
expression of the .Iaddend..gamma. chain .[.into a
.beta.-lactoglobulin gene.]. to produce a third gene fusion.Iadd.,
.Iaddend..[.comprising a .beta.-lactoglobulin promoter operably
linked to the third DNA segment.]. wherein each of said first,
second and third segments are of the same species; .Iadd.linking
the first, second and third gene fusions in a single vector;
.Iaddend.introducing said first, second and third gene fusions into
the germ line of a non-human mammal so that said DNA segments are
expressed in a mammary gland of said mammal or its female progeny
and biocompetent fibrinogen is secreted into milk of said mammal or
its female progeny; obtaining milk from said mammal or its female
progeny; and recovering said fibrinogen from said milk.
13. A method according to claim 12 wherein said mammal is a sheep,
pig, goat or cow.
.[.14. A method according to claim 12 wherein each of said first,
second and third gene fusions comprises an intron..].
.[.15. A method according to claim 12 wherein the molar ratio of
said first, second and third gene fusions introduced is within the
range of 0.5-1:0.5-1:0.5-1..].
16. A method according to claim 12 wherein said introducing step
comprises injecting said first, second and third gene fusions into
a pronucleus of a fertilized egg and inserting said egg into an
oviduct of a pseudopregnant female to produce female offspring
carrying said gene fusions in the germ line, wherein said egg and
said pseudopregnant female are of the same species.
17. A method according to claim 12 wherein said mammal is a
sheep.
.[.18. A method for producing biocompetent fibrinogen comprising:
providing a transgenic female non-human mammal carrying in its
germline heterologous DNA segments encoding A.alpha., B.beta. and
.gamma. chains of flbrinogen, wherein said segments are expressed
in a mammary gland of said mammal and biocompetent fibrinogen
encoded by said segments is secreted into milk of said mammal;
collecting milk from said mammal; and recovering said biocompetent
fibrinogen from said milk..].
.[.19. A method according to claim 18 wherein said mammal is a
sheep, pig, goat or cow..].
.[.20. A method according to claim 18 wherein said mammal is a
sheep..].
.[.21. A transgenic non-human female mammal that produces
recoverable amounts of biocompetent human fibrinogen in its milk,
wherein said mammal comprises: a first DNA segment encoding a
secretion signal operably linked to a heterologous fibrinogen
A.alpha. chain, a second DNA segment encoding a secretion signal
operably linked to a heterologous fibrinogen B.beta. chain, and a
third DNA segment encoding a secretion signal operably linked to a
heterologous fibrinogen .gamma. chain, and further wherein each
chain is derived from the same species and is operably linked to
additional DNA segments required for its expression in the mammary
gland of a host female mammal..].
.[.22. A mammal according to claim 21 wherein said mammal is a
sheep..].
23. A process for producing a transgenic offspring of a mammal
comprising: providing a first DNA segment encoding a secretion
signal operably linked to a heterologous fibrinogen A.alpha. chain,
.Iadd.the DNA segment comprising genomic DNA encoding the A.alpha.
chain; .Iaddend.a second DNA segment encoding a secretion signal
operably linked to a heterologous fibrinogen B.beta. chain,
.Iadd.the DNA segment comprising genomic DNA encoding the B.beta.
chain; .Iaddend.and a third DNA segment encoding a secretion signal
operably linked to a heterologous fibrinogen .gamma. chain,
.Iadd.the DNA segment comprising genomic DNA encoding the .gamma.
chain; .Iaddend.wherein each chain is derived from the same
species, and wherein each of said first, second and third segments
is operably linked to additional DNA segments required for its
expression in the mammary gland of a host female mammal;
.Iadd.linking the first, second and third segments in a single
vector;.Iaddend. introducing said DNA segments into a fertilized
egg of a non-human mammalian species heterologous to the species of
origin of said fibrinogen chains; inserting said fertilized egg
into an oviduct or uterus of a female of said mammalian species;
and allowing said fertilized egg to develop thereby producing
transgenic offspring carrying said first, second and third DNA
segments, wherein female progeny of said mammal express said DNA
segments in a mammary gland to produce biocompetent fibrinogen.
24. A process according to claim 23 wherein said offspring is
female.
25. A process according to claim 23 wherein said offspring is
male.
.[.26. A non-human mammal produced according to the process of
claim 23..].
.[.27. A non-human mammal according to claim 26 wherein said mammal
is female..].
.[.28. A non-human female mammal according to claim 27 that
produces milk containing biocompetent fibrinogen encoded by said
DNA segments..].
.[.29. A non-human mammal according to claim 26 wherein said mammal
is male..].
.[.30. A non-human mammal carrying in its germline DNA segments
encoding human A.alpha., B.beta. and .gamma. chains of fibrinogen,
wherein female progeny of said mammal express said DNA segments in
a mammary gland to produce biocompetent human fibrinogen..].
.[.31. A mammal non-human according to claim 30 wherein said mammal
is female..].
.[.32. A mammal non-human according to claim 30 wherein said mammal
is male..].
.[.33. A mammal according to claim 30, wherein said mammal is a
sheep..].
.Iadd.34. A set of DNA sequences comprising: a first DNA segment
encoding a secretion signal operably linked to a heterologous
fibrinogen A.alpha. chain, the DNA segment comprising genomic DNA
encoding the A.alpha. chain; a second DNA segment encoding a
secretion signal operably linked to a heterologous fibrinogen
B.beta. chain, the DNA segment comprising genomic DNA encoding the
B.beta. chain; and a third DNA segment encoding a secretion signal
operably linked to a heterologous fibrinogen .gamma. chain, the DNA
segment comprising genomic DNA encoding the .gamma. chain, wherein
each chain is from the same species, and wherein each of said
first, second and third segments is operably linked to additional
DNA segments required for its expression in the mammary gland of a
host female mammal; and the first, second, third segments are
linked in a single vector..Iaddend.
Description
BACKGROUND OF THE INVENTION
The final step in the blood coagulation cascade is the
thrombin-catalyzed conversion of the soluble plasma protein
fibrinogen to insoluble fibrin. Thrombin cleaves a small peptide
(fibrinopeptide A) from one of the three component chains (the
A.alpha.-chain) of fibrinogen. Fibrin monomers subsequently
polymerize and are cross-linked by activated factor XIII to form a
stable clot.
Fibrinogen is a key component of biological tissue glues (see,
e.g., U.S. Pat. Nos. 4,377,572 and 4,442,655), which mimic the
formation of natural blood clots to promote hemostasis and repair
damaged tissue. Tissue glues provide an adjunct or alternative to
sutures, staples and other mechanical means for wound closure.
However, the principal ingredients of these products (fibrinogen,
factor XIII and thrombin) are prepared from pooled human plasma by
cryoprecipitation (e.g. U.S. Pat. Nos. 4,377,572; 4,362,567;
4,909,251) or ethanol precipitation (e.g. U.S. Pat. No. 4,442,655)
or from single donor plasma (e.g. U.S. Pat. No. 4,627,879; Spotnitz
et al., Am. Surg. 55: 166-168, 1989). The resultant
fibrinogen/factor XIII preparation is mixed with bovine thrombin
immediately before use to convert the fibrinogen to fibrin and
activate the factor XIII, thus initiating coagulation of the
adhesive.
Commercially available adhesives are of pooled plasma origin.
Because blood-derived products have been associated with the
transmission of human immunodeficiency virus (HIV), hepatitis virus
and other etiologic agents, the acceptance and availability of such
adhesives is limited. At present they are not approved for use in
the United States.
While the use of autologous plasma reduces the risk of disease
transmission, autologous adhesives can only be used in elective
surgery when the patient is able to donate the necessary blood in
advance.
As noted above, fibrinogen consists of three polypeptide chains,
each of which is present in two copies in the assembled molecule.
These chains, designated the A.alpha., B.beta. and .gamma.-chains,
are coordinately expressed, assembled and secreted by the liver.
While it might be expected that recombinant DNA technology could
provide an alternative to the isolation of fibrinogen from plasma,
this goal has proven to be elusive. The three fibrinogen chains
have been individually expressed in E. coli (Lord, DNA 4: 33-38,
1985; Bolyard and Lord, Gene 66: 183-192, 1988; Bolyard and Lord,
Blood 73: 1202-1206), but functional fibrinogen has not been
produced in a prokaryotic system. Expression of biologically
competent fibrinogen in yeast has not been reported. Cultured
transfected mammalian cells have been used to express biologically
active fibrinogen (Farrell et al., Blood 74: 55a, 1989; Hartwig and
Danishefsky, J. Biol. Chem. 266: 6578-6585, 1991; Farrell et al.,
Biochemistry 30: 9414-9420, 1991), but expression levels have been
so low that production of recombinant fibrinogen in commercial
quantities is not feasible. Experimental evidence suggests that
lower transcription rates in cultured cells as compared to liver
may be a factor in the low expression rates achieved to date, but
increasing the amount of fibrinogen chain mRNA in transfected BHK
cells did not produce corresponding increases in fibrinogen protein
secretion (Prunkard and Foster, XIV Congress of the International
Society on Thrombosis and Haemostasis, 1993). These latter results
suggest that proper assembly and processing of fibrinogen involves
tissue-specific mechanisms not present in common laboratory cell
lines.
There remains a need in the art for methods of producing large
quantities of high quality fibrinogen for use in tissue adhesives
and other applications. There is a further need for fibrinogen that
is free of blood-borne pathogens. The present invention fulfills
these needs and provides other, related advantages.
SUMMARY OF THE INVENTION
It is an object of the present invention to provide commercially
useful quantities of recombinant fibrinogen, particularly
recombinant human fibrinogen. It is a further object of the
invention to provide materials and methods for expressing
fibrinogen in the mammary tissue of transgenic animals,
particularly livestock animals such as cattle, sheep, pigs and
goats.
Within one aspect, the present invention provides a method for
producing fibrinogen comprising (a) providing a first DNA segment
encoding a secretion signal operably linked to a fibrinogen
A.alpha. chain, a second DNA segment encoding a secretion signal
operably linked to a fibrinogen B.beta. chain, and a third DNA
segment encoding a secretion signal operably linked to a fibrinogen
.gamma. chain, wherein each of the first, second and third segments
is operably linked to additional DNA segments required for its
expression in the mammary gland of a host female mammal; (b)
introducing the DNA segments into a fertilized egg of a non-human
mammalian species; (c) inserting the egg into an oviduct or uterus
of a female of the species to obtain offspring carrying the DNA
constructs; (d) breeding the offspring to produce female progeny
that express the first, second and third DNA segments and produce
milk containing biocompetent fibrinogen encoded by the segments;
(e) collecting milk from the female progeny; and (f) recovering the
fibrinogen from the milk. Within one embodiment, the egg containing
the introduced segments is cultured for a period of time prior to
insertion.
Within another aspect, the invention provides a method of producing
fibrinogen comprising the steps of (a) incorporating a first DNA
segment encoding a secretion signal operably linked to an A.alpha.
chain of fibrinogen into a .beta.-lactoglobulin gene to produce a
first gene fusion; (b) incorporating a second DNA segment encoding
a secretion signal operably linked to a B.beta. chain of fibrinogen
into a .beta.-lactoglobulin gene to produce a second gene fusion;
(c) incorporating a third DNA segment encoding a secretion signal
operably linked to a .gamma. chain of fibrinogen into a
.beta.-lactoglobulin gene to produce a third gene fusion; (d)
introducing the first, second and third gene fusions into the germ
line of a non-human mammal so that the DNA segments are expressed
in a mammary gland of the mammal or its female progeny and
biocompetent fibrinogen is secreted into milk of the mammal or its
female progeny; (e) obtaining milk from the mammal or its female
progeny; and (f) recovering the fibrinogen from the milk. Within
preferred embodiments, the mammal is a sheep, pig, goat or
bovine.
Within another aspect, the invention provides a method for
producing fibrinogen comprising the steps of (a) providing a
transgenic female non-human mammal carrying in its germline
heterologous DNA segments encoding A.alpha., B.beta. and .gamma.
chains of fibrinogen, wherein the DNA segments are expressed in a
mammary gland of the mammal and fibrinogen encoded by the DNA
segments is secreted into milk of the mammal; (b) collecting milk
from the mammal; and (c) recovering the fibrinogen from the
milk.
Within another aspect, the invention provides a non-human mammalian
embryo containing in its nucleus heterologous DNA segments encoding
A.alpha., B.beta. and .gamma. chains of fibrinogen. Within a
related aspect, the invention provides a transgenic non-human
female mammal that produces recoverable amounts of human fibrinogen
in its milk.
Within another aspect, the invention provides a method for
producing a transgenic offspring of a mammal comprising the steps
of (a) providing a first DNA segment encoding a fibrinogen A.alpha.
chain, a second DNA segment encoding a fibrinogen B.beta. chain,
and a third DNA segment encoding a fibrinogen .gamma. chain,
wherein each of said first, second and third segments is operably
linked to additional DNA segments required for its expression in a
mammary gland of a host female mammal and secretion into milk of
the host female mammal; (b) introducing the DNA segments into a
fertilized egg of a mammal of a non-human species; (c) inserting
the egg into an oviduct or uterus of a female of the non-human
species to obtain an offspring carrying the first, second and third
DNA segments. In a related aspect, the invention provides non-human
mammals produced according to this process.
Within an additional aspect, the invention provides a non-human
mammal carrying its germline DNA segments encoding heterologous
A.alpha., B.beta. and .gamma. chains of fibrinogen, wherein female
progeny of the mammal express the DNA segments in a mammary gland
to produce biocompetent fibrinogen.
These and other aspects of the invention will become evident to the
skilled practitioner upon reference to the following detailed
description and the attached drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 illustrates the subcloning of a human fibrinogen A.alpha.
chain DNA sequence.
FIG. 2 is a partial restriction map of the vector Zem228. Symbols
used are MT-1p, mouse metallothionein promoter; SV40t, SV40
terminator; and SV40p, SV40 promoter.
FIG. 3 illustrates the subcloning of a human fibrinogen B.beta.
chain DNA sequence.
FIG. 4 illustrates the subcloning of a human fibrinogen .gamma.
chain DNA sequence.
FIG. 5 is a partial restriction map of the vector Zem219b. Symbols
used are MT-1p, mouse metallothionein promoter; hGHt, human growth
hormone terminator; SV40p, SV40 promoter; DHFR, dihydrofolate
reductase gene; and SV40t, SV40 terminator.
DETAILED DESCRIPTION OF THE INVENTION
Prior to setting forth the invention in detail, it will be helpful
to define certain terms used herein:
As used herein, the term "biocompetent fibrinogen" is used to
denote fibrinogen that polymerizes when treated with thrombin to
form insoluble fibrin.
The term "egg" is used to denote an unfertilized ovum, a fertilized
ovum prior to fusion of the pronuclei or an early stage embryo
(fertilized ovum with fused pronuclei).
A "female mammal that produces milk containing biocompetent
fibrinogen" is one that, following pregnancy and delivery,
produces, during the lactation period, milk containing recoverable
amounts of biocompetent fibrinogen. Those skilled in the art will
recognized that such animals will produce milk and therefore the
fibrinogen, discontinuously.
The term "progeny" is used in its usual sense to include children
and descendants.
The term "heterologous" is used to denote genetic material
originating from a different species than that into which it has
been introduced, or a protein produced from such genetic
material.
Within the present invention, transgenic animal technology is
employed to produce fibrinogen within the mammary glands of a host
female mammal. Expression in the mammary gland and subsequent
secretion of the protein of interest into the milk overcomes many
difficulties encountered in isolating proteins from other sources.
Milk is readily collected, available in large quantities, and well
characterized biochemically. Furthermore, the major milk proteins
are present in milk at high concentrations (from about 1 to 15
g/l).
From a commercial point of view, it is clearly preferable to use as
the host a species that has a large milk yield. While smaller
animals such as mice and rats can be used (and are preferred at the
proof-of-concept stage), within the present invention it is
preferred to use livestock mammals including, but not limited to,
pigs, goats, sheep and cattle. Sheep are particularly preferred due
to such factors as the previous history of transgenesis in this
species, milk yield, cost and the ready availability of equipment
for collecting sheep milk. See WO 88/00239 for a comparison of
factors influencing the choice of host species. It is generally
desirable to select a breed of host animal that has been bred for
dairy use, such as East Friesland sheep, or to introduce dairy
stock by breeding of the transgenic line at a later date. In any
event, animals of known, good health status should be used.
Fibrinogen produced according to the present invention may be human
fibrinogen or fibrinogen of a non-human animal. For medical uses,
it is preferred to employ proteins native to the patient. The
present invention thus provides fibrinogen for use in both human
and veterinary medicine. Cloned DNA molecules encoding the
component chains of human fibrinogen are disclosed by Rixon et al.
(Biochem. 22: 3237, 1983), Chung et al. (Biochem. 22: 3244, 1983),
Chung et al. (Biochem. 22: 3250, 1983), Chung et al. (Adv. Exp.
Med. Biol. 281: 39-48, 1990) and Chung et al. (Ann. NY Acad. Sci.
408: 449-456, 1983). Bovine fibrinogen clones are disclosed by
Brown et al. (Nuc. Acids Res. 17: 6397, 1989) and Chung et al.
(Proc. Natl. Acad. Sci. USA 78: 1466-1470, 1981). Other mammalian
fibrinogen clones are disclosed by Murakawa et al. (Thromb.
Haemost. 69: 351-360, 1993). Representative sequences of human
A.alpha., B.beta. and .gamma. chain genes are shown in SEQ ID NOS:
1, 3 and 5, respectively. Those skilled in the art will recognize
that allelic variants of these sequences will exist; that
additional variants can be generated by amino acid substitution,
deletion, or insertion; and that such variants are useful within
the present invention. In general, it is preferred that any
engineered variants comprise only a limited number of amino acid
substitutions, deletions, or insertions, and that any substitutions
are conservative. Thus, it is preferred to produce fibrinogen chain
polypeptides that are at least 90%, preferably at least 95, and
more preferably 99% or more identical in sequence to the
corresponding native chains. The term ".gamma. chain" is meant to
include the alternatively spliced .gamma.' chain of fibrinogen
(Chung et al., Biochem. 23: 4232-4236, 1984). A human .gamma.'
chain amino acid sequence is shown in SEQ ID NO: 6. The shorter
.gamma. chain is produced by alternative splicing at nucleotides
9511 and 10054 of SEQ ID NO: 5, resulting in translation
terminating after nucleotide 10065 of SEQ ID NO: 5.
To obtain expression in the mammary gland, a transcription promoter
from a milk protein gene is used. Milk protein genes include those
genes encoding caseins, beta-lactoglobulin (BLG),
.alpha.-lactalbumin, and whey acidic protein. The
beta-lactoglobulin promoter is preferred. In the case of the ovine
beta-lactoglobulin gene, a region of at least the proximal 406 bp
of 5' flanking sequence of the ovine BLG gene (contained within
nucleotides 3844 to 4257 of SEQ ID NO:7) will generally be used.
Larger portions at the 5' flanking sequence, up to about 5 kbp, are
preferred. A larger DNA segment encompassing the 5' flanking
promoter region and the legion encoding the 5' non-coding portion
of the beta-lactoglobulin gene (contained within nucleotides 1 to
4257 of SEQ ID NO:7) is particularly preferred. See Whitelaw et
al., Biochem J. 28: 31-39, 1992. Similar fragments of promoter DNA
from other species are also suitable.
Other regions of the beta-lactoglobulin gene may also be
incorporated in constructs, as may genomic regions of the gene to
be expressed. It is generally accepted in the art that constructs
lacking introns, for example, express poorly in comparison with
those that contain such DNA sequences (see Brinster et al., Proc.
Natl. Acad. Sci. USA 85: 836-840, 1988; Palmiter et al., Proc.
Natl. Acad. Sci. USA 88: 478-482, 1991; Whitelaw et al., Transgenic
Res. 1: 3-13, 1991; WO 89/01343; WO 91/02318). In this regard, it
is generally preferred, where possible, to use genomic sequences
containing all or some of the native introns of a gene encoding the
protein or polypeptide of interest. Within certain embodiments of
the invention, the further inclusion of at least some introns from
the beta-lactoglobulin gene is preferred. One such region is a DNA
segment which provides for intron splicing and RNA polyadenylatiom
from the 3' non-coding region of the ovine beta-lactoglobulin gene.
When substituted for the natural 3' non-coding sequences of a gene,
this ovine beta-lactoglobulin segment can both enhance and
stabilize expression levels of the protein or polypeptide of
interest. Within other embodiments, the region surrounding the
initiation ATG of one or more of the fibrinogen sequences is
replaced with corresponding sequences from a milk specific protein
gene. Such replacement provides a putative tissue-specific
initiation environment to enhance expression. It is convenient to
replace the entire fibrinogen chain pre-pro and 5' non-coding
sequences with those of, for example, the BLG gene, although
smaller regions may be replaced.
For expression of fibrinogen, DNA segments encoding each of the
three component polypeptide chains of fibrinogen are operably
linked to additional DNA segments required for their expression to
produce expression units. Such additional segments include the
above-mentioned milk protein gene promoter, as well as sequences
which provide for termination of transcription and polyadenylation
of mRNA. The expression units will further include a DNA segment
encoding a secretion signal operably linked to the segment encoding
the fibrinogen polypeptide chain. The secretion signal may be a
native fibrinogen secretion signal or may be that of another
protein, such as a milk protein. The term "secretion signal" is
used herein to denote that portion of a protein that directs it
through the secretory pathway of a cell to the outside. Secretion
signals are most commonly found at the amino-termini of proteins.
See, for example, von Heinje, Nuc. Acids Res. 14: 4683-4690, 1986;
and Meade et al., U.S. Pat. No. 4,873,316, which are incorporated
herein by reference.
Construction of expression units is conveniently carried out by
inserting a fibrinogen chain sequence into a plasmid or phage
vector containing the additional DNA segments, although the
expression unit may be constructed by essentially any sequence of
ligations. It is particularly convenient to provide a vector
containing a DNA segment encoding a milk protein and to replace the
coding sequence for the milk protein with that of a fibrinogen
chain (including a secretion signal), thereby creating a gene
fusion that includes the expression control sequences of the milk
protein gene. In any event, cloning of the expression units in
plasmids or other vectors facilitates the amplification of the
fibrinogen sequences. Amplification is conveniently carried out in
bacterial (e.g. E. coli) host cells, thus the vectors will
typically include an origin of replication and a selectable marker
functional in bacterial host cells.
In view of the size of the fibrinogen chain genes it is most
practical to prepare three separate expression units, mix them, and
introduce the mixture into the host. However, those skilled in the
art will recognize that other protocols may be followed. For
example, expression units for the three chains can be introduced
individually into different embryos to be combined later by
breeding. In a third approach, the three expression units can be
linked in a single suitable vector, such as a yeast artificial
chromosome or phage P1 clone. Coding sequences for two or three
chains can be combined in polycistronic expression units (see,
e.g., Levinson et al., U.S. Pat. No. 4,713,339).
The expression unit(s) is(are) then introduced into fertilized eggs
(including early-stage embryos) of the chosen host species.
Introduction of heterologous DNA can be accomplished by one of
several routes, including microinjection (e.g. U.S. Pat. No
4,873,191), retroviral infection (Jaenisch, Science 240: 1468-1474,
1988) or site-directed integration using embryonic stem (ES) cells
(reviewed by Bradley et al., Bio/Technology 10: 534-539, 1992). The
eggs are then implanted into the oviducts or uteri of
pseudopregnant females and allowed to develop to term. Offspring
carrying the introduced DNA in their germ line can pass the DNA on
to their progeny in the normal, Mendelian fashion, allowing the
development of transgenic herds. General procedures for producing
transgenic animals are known in the art. See, for example, Hogan et
al., Manipulating the Mouse Embryo: A Laboratory Manual, Cold
Spring Harbor Laboratory, 1986; Simons et al., Bio/Technology 6:
179-183, 1988; Wall et al., Biol. Reprod. 32: 645-651, 1985; Buhler
et al., Bio/Technology: 140-143, 1990; Ebert et al.,
Bio/Technology: 835-838, 1991; Krimpenfort et al., Bio/Technology
9: 844-847, 1991; Wall et al. J. Cell. Biochem. 49: 113-120, 1992;
and WIPO publications WO 88/00239, WO 90/05188, WO 92/11757; and GB
87/00458, which are incorporated herein by reference. Techniques
for introducing foreign DNA sequences into mammals and their germ
cells were originally developed in the mouse. See, e.g., Gordon et
al., Proc. Natl. Acad. Sci. USA 77: 7380-7384, 1980; Gordon and
Ruddle, Science 214: 1244-1246, 1981; Palmiter and Brinster, Cell
41: 343-345, 1985; Brinster et al., Proc. Natl. Acad. Sci. USA 82:
4438-1442, 1985; and Hogan et al. (ibid.). These techniques were
subsequently adapted for use with larger animals, including
livestock species (see e.g., WIPO publications WO 88/00239, WO
90/05188, and WO 92/11757; and Simons et al., Bio/Technology 6:
179-183, 1988). To summarize, in the most efficient route used to
date in the generation of transgenic mice or livestock, several
hundred linear molecules of the DNA of interest are injected into
one of the pro-nuclei of a fertilized egg. Injection of DNA into
the cytoplasm of a zygote can also be employed.
It is preferred to obtain a balanced expression of each fibrinogen
chain to allow for efficient formation of the mature protein.
Ideally, the three expression units should be on the same DNA
molecule for introduction into eggs. This approach, however, may
generate technical problems at, for example, the injection and
manipulation stages. For example, the size of fibrinogen expression
units may necessitate the use of yeast artificial chromosomes
(YACs) or phage P1 to amplify and manipulate the DNA prior to
injection. If this approach is followed, segments of DNA to be
injected, containing all three expression units, would be very
large, thus requiring modification of the injection procedure
using, for example, larger bore needles. In a more simple approach,
a mixture of each individual expression unit is used. It is
preferred to combine equimolar amounts of the three expression
units, although those skilled in the art will recognize that this
ratio may be varied to compensate for the characteristics of a
given expression unit. Some expression, generally a reduced level,
will be obtained when lesser molar amounts of one or two chains are
used, and expression efficiencies can generally be expected to
decline in approximate proportion to the divergence from the
preferred equimolar ratio. In any event, it is preferred to use a
mixture having a ratio of A.alpha.:B.beta.:.gamma. expression units
in the range of 0.5-1:0.5-1:0.5-1. When the ratio is varied from
equimolar, it is preferred to employ relatively more of the B.beta.
expression unit. Alternatively, one or a mixture of two of the
expression units is introduced into individual eggs. However,
animals derived by this approach will express only one or two
fibrinogen chains. To generate an intact fibrinogen molecule by
this approach requires a subsequent breeding program designed to
combine all three expression units in individuals of a group of
animals.
In general, female animals are superovulated by treatment with
follicle stimulating hormone, then mated. Fertilized eggs are
collected, and the heterologous DNA is injected into the eggs using
known methods. See, for example, U.S. Pat No. 4,873,191; Gordon et
al, Proc. Natl. Acad. Sci. USA 77: 7380-7384, 1980; Gordon and
Ruddle, Science 214: 1244-1246, 1981; Palmiter and Brinster, Cell.
41: 343-345, 1985; Brinster et al., Proc. Natl. Acad. Sci. USA 82:
4438-4442, 1985; Hogan et al., Manipulating the Mouse Embryo: A
Laboratory Manual, Cold Spring Harbor Laboratory, 1986; Simons et
al. Bio/Technology 6:179-183, 1988; Wall et al., Biol. Reprod. 32:
645-651, 1985; Buhler et al., Bio/Technology 8: 140-143, 1990;
Ebert et al., Bio/Technology 9: 835-838, 1991; Krimpenfort et al.,
Bio/Technology 9: 844-847, 1991; Wall et al., J. Cell. Biochem. 49:
113-120, 1992; WIPO publications WO 88/00239, WO 90/05118, and WO
92/11757; and GB 87/00458, which are incorporated herein by
reference.
For injection into fertilized eggs, the expression units are
removed from their respective vectors by digestion with appropriate
restriction enzymes. For convenience, it is preferred to design the
vectors to that the expression units are removed by cleavage with
enzymes that do not cut either within the expression units or
elsewhere in the vectors. The expression units are recovered by
conventional methods, such as electro-elution followed by phenol
extraction and ethanol precipitation, sucrose density gradient
centrifugation, or combinations of these approaches.
DNA is injected into eggs essentially as described in Hogan et al.,
ibid. In a typical injection, eggs in a dish of an embryo culture
medium are located using a stereo room microscope (.times.50 or
.times.63 magnification preferred). Suitable media include Hepes
(N-2-hydroxyethylpiperazine-N'-2-ethanesulphonic acid) or
bicarbonate buffered media such as M2 or M16 (available from Sigma
Chemical Co., St Louis, USA) or synthetic oviduct medium (disclosed
below). The eggs are secured and transferred to the center of a
glass slide on an injection rig using, for example, a drummond
pipette complete with capillary tube. Viewing at lower (e.g.
.times.4) magnification is used at this stage. Using the holding
pipette of the injection rig, the eggs are positioned centrally on
the slide. Individual eggs are sequentially secured to the holding
pipette for injection. For each injection process, the holding
pipette/egg is positioned in the center of the viewing field. The
injection needle is then positioned directly below the egg.
Preferably using .times.40 Nomarski objectives, both manipulator
heights are adjusted to focus both the egg and the needle. The
pronuclei are located by rotating the egg and adjusting the holding
pipette assembly as necessary. Once the pronucleus has been
located, the height of the manipulator is altered to focus the
pronuclear membrane. The injection needle is positioned below the
egg such that the needle tip is in a position below the center of
the pronucleus. The position of the needle is then altered using
the injection manipulator assembly to bring the needle and the
pronucleus into the same focal plane. The needle is moved, via the
joy stick on the injection manipulator assembly, to a position to
the right of the egg. With a short, continuous jabbing movement,
the pronuclear membrane is pierced to leave the needle tip inside
the pronucleus. Pressure is applied to the injection needle via the
glass syringe until the pronucleus swells to approximately twice
its volume. At this point, the needle is slowly removed. Reverting
to lower (e.g. .times.4) magnification, the injected egg is moved
to a different area of the slide, and the process is repeated with
another egg.
After the DNA is injected, the eggs may be cultured to allow the
pronuclei to fuse, producing one-cell or later stage embryos. In
general, the eggs are cultured at approximately the body
temperature of the species used in a buffered medium containing
balanced salts and serum. Surviving embryos are then transferred to
pseudopregnant recipient females, typically by inserting them into
the oviduct or uterus, and allowed to develop to term. During
embryogenesis, the injected DNA integrates in a random fashion in
the genomes of a small number of the developing embryos.
Potential transgenic offspring are screened via blood samples
and/or tissue biopsies. DNA is prepared from these samples and
examined for the presence of the injected construct by techniques
such as polymerase chain reaction (PCR; see Mullis, U.S. Pat. No.
4,683,202) and Southern blotting (Southern, J. Mol. Biol. 98:503,
1975; Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory, 1982). Founder transgenic animals, or
G0s, may be wholly transgenic, having transgenes in all of their
cells, or mosaic, having transgenes in only a subset of cells (see,
for example, Wilkie et al., Develop. Biol. 118: 9-18, 1986). In the
latter case, groups of germ cells may be wholly or partially
transgenic. In the latter case, the number of transgenic progeny
from a founder animal will be less than the expected 50% predicted
from Mendelian principles. Founder G0 animals are grown to sexual
maturity and mated to obtain offspring, or G1s. The G1s are also
examined for the presence of the transgene to demonstrate
transmission from founder G0 animals. In the case of male G0s,
these may be mated with several non-transgenic females to generate
many offspring. This increases the chances of observing transgene
transmission. Female G0 founders may be mated naturally,
artificially inseminated or superovulated to obtain many eggs which
are transferred to surrogate mothers. The latter course gives the
best chance of observing transmission in animals having a limited
number of young. The above-described breeding procedures are used
to obtain animals that can pass the DNA on to subsequent
generations of offspring in the normal, Mendelian fashion, allowing
the development of, for example, colonies (mice), flocks (sheep),
or herds (pigs, goats and cattle) of transgenic animals.
The milk from lactating G0 and G1 females is examined for the
expression of the heterologous protein using immunological
techniques such as ELISA (see Harlow and Lane, Antibodies, A
Laboratory Manual, Cold Spring Harbor Laboratory, 1988) and Western
blotting (Towbin et al., Proc. Natl. Acad. Sci. USA 76: 4350-4354,
1979). For a variety of reasons known in the art, expression levels
of the heterologous protein will be expected to differ between
individuals.
A satisfactory family of animals should satisfy three criteria:
they should be derived from the same founder G0 animal; they should
exhibit stable transmission of the transgene; and they should
exhibit stable expression levels from generation to generation and
from lactation to lactation of individual animals. These principles
have been demonstrated and discussed (Carver et al., Bio/Technology
11: 1263-1270, 1993). Animals from such a suitable family are
referred to as a "line." Initially, male animals, G0 or G1, are
used to derive a flock or herd of producer animals by natural or
artificial insemination. In this way, many female animals
containing the same transgene integration event can be quickly
generated from which a supply of milk can be obtained.
The fibrinogen is recovered from milk using standard practices such
as skimming, precipitation, filtration and protein chromatography
techniques.
Fibrinogen produced according to the present invention is useful
within human and veterinary medicine, such as in the formulation of
surgical adhesives. Adhesives of this type are known in the art.
See, for example, U.S. Pat. Nos. 4,377,572; 4,442,655; 4,462,567;
and 4,627,879, which are incorporated herein by reference. In
general, fibrinogen and factor XIII are combined to form a first
component that is mixed just prior to use with a second component
containing thrombin. The thrombin converts the fibrinogen to
fibrin, causing the mixture to gel, and activates the factor XIII.
The activated factor XIII cross links the fibrin to strengthen and
stabilize the adhesive matrix. Such adhesives typically contain
from about 30 mg/ml to about 100 mg/ml fibrinogen and from about 50
.mu.g/ml to about 500 .mu.g/ml factor XIII. They may also contain
additional ingredients, such as aprotinin, albumin, fibronectin,
bulking agents, and solubilizers. Methods for producing factor XIII
are known in the art. See, for example, U.S. Pat No. 5,204,447. The
fibrinogen is also useful for coating surfaces of polymeric
articles, e.g. synthetic vascular grafts, as disclosed in U.S. Pat.
No. 5,272,074 (incorporated herein by reference).
The invention is further illustrated by the following non-limiting
examples.
EXAMPLES
Example I
The multiple cloning site of the vector pUC18 (Yanisch-Perron et
al., Gene 33:103-119, 1985) was removed and replaced with a
synthetic double stranded oligonucleotide (the strands of which are
shown in SEQ ID NO: 8 and SEQ ID NO: 27) containing the restriction
sites Pvu I/Mlu I/Eco RV/Xba I/Pvu I/Mlu I, and flanked by 5'
overhangs compatible with the restriction sites Eco RI and Hind
III. pUC18 was cleaved with both Eco RI and Hind III, the 5'
terminal phosphate groups were removed with calf intestinal
phosphatase, and the oligonucleotide was ligated into the vector
backbone. The DNA sequence across the junction was confirmed by
sequencing, and the new plasmid was called pUCPM.
The .beta.-lactoglobulin (BLG) gene sequences from pSS1tgXS
(disclosed in WIPO publication WO 88/00239) were excised as a Sal
I-Xba I fragment and recloned into the vector pUCPM that had been
cut with Sal I and Xba I to construct vector pUCXS. pUCXS is thus a
pUC18 derivative containing the entire BLG gene from the Sal I site
to the Xba I site of phage SS1 (Ali and Clark, J. Mol. Biol. 199:
415-426, 1988).
The plasmid pSS1tgSE (disclosed in WIPO publication WO 88/00239)
contains a 1290 bp BLG fragment flanked by Sph I and EcoR I
restriction sites, a region spanning a unique Not I site and a
single Pvu II site which lies in the 5' untranslated leader of the
BLG mRNA. Into this Pvu II site was ligated a double stranded, 8 bp
DNA linker (5'-GGATATCC-3') encoding the recognition site for the
enzyme Eco RV. This plasmid was called pSS1tgSE/RV. DNA sequences
bounded by Sph I and Not I restriction sites in pSS1tgSE/RV were
excised by enzymatic digestion and used to replace the equivalent
fragment in pUCXS. The resulting plasmid was called pUCXSRV. The
sequence of the BLG insert in pUCSXRV is shown in SEQ ID NO: 7,
with the unique Eco RV site at nucleotide 4245 in the 5'
untranslated leader region of the BLG gene. This site allows
insertion of any additional DNA sequences under the control of the
BLG promoter 3' to the transcription initiation site.
Using the primers BLGAMP3 (5'-TGG ATC CCC TGC CGG TGC CTC TGG-3';
SEQ ID NO: 9) and BLGAMP4 (5'-AAC GCG TCA TCC TCT GTG AGC CAG-3';
SEQ ID NO: 10) a PCR fragment of approximately 650 bp was produced
from sequences immediately 3' to the stop codon of the BLG gene in
pUCXSRV. The PCR fragment was engineered to have a BamH I site at
its 5' end and an Mlu I site at its 3' end and was cloned as such
into BamH I and Mlu I cut pGEM7zf(+) (Promega) to give
pDAM200(+).
pUCXSRV was digested with Kpn I, and the largest, vector containing
band was gel purified. This band contained the entire pUC plasmid
sequences and some 3' non-coding sequences from the BLG gene. Into
this backbone was ligated the small Kpn I fragment from pDAM200(+)
which, in the correct orientation, effectively engineered a BamH I
site at the extreme 5' end of the 2.6 Kbp of the BLG 3' flanking
region. This plasmid was called pBLAC200. A 2.6 Kbp Cla I-Xba I
fragment from pBLAC200 was ligated into Cla I-Xba I cut pSP72
vector (Promega), thus placing an EcoR V site immediately upstream
of the BLG sequences. This plasmid was called pBLAC210.
The 2.6 Kbp Eco RV-Xba I fragment from pBLAC210 was ligated into
Eco RV-Xba I cut pUCXSRV to form pMAD6. This, in effect, excised
all coding and intron sequences from pUCXSRV, forming a BLG
minigene consisting of 4.3 Kbp of 5' promoter and 2.6 Kbp of 3'
downstream sequences flanking a unique EcoR V site. An
oligonucleotide linker (ZC6839: ACTACGTAGT; SEQ ID NO: 11) was
inserted into the Eco RV site of pMAD6. This modification destroyed
the Eco RV site and created a Sna BI site to be used for cloning
purposes. The vector was designated pMAD6-Sna. Messenger RNA
initiates upstream of the Sna BI site and terminates downstream of
the Sna BI site. The precursor transcript will encode a single
BLG-derived intron, intron 6, which is entirely within the 3'
untranslated region of the gene.
Example II
Clones encoding the individual fibrinogen chains were obtained from
the laboratory of Dr. Earl W. Davie, University of Washington,
Seattle. A genomic fibrinogen A.alpha.-chain clone (Chung et al.,
1990, ibid.) was obtained from the plasmid BS4. This plasmid
contains the A.alpha. clone inserted into the Sal I and Bam HI
sites of the vector pUC18, but lacks the coding sequence for the
first four amino acids of the A.alpha. chain. A genomic
B.beta.-chain DNA (Chung et al., ibid.) was isolated from a lambda
Charon 4A phage clone (designated .beta..lamda.4) as two EcoRI
fragments of ca. 5.6 Kbp each. The two fragments were cloned
separately into pUC19 that had been digested with Eco RI and
treated with calf intestinal phosphatase. The resulting clones were
screened by digestion with the restriction enzyme Pvu II to
distinguish plasmids with the 5' and 3' Bp inserts (designated
Beta5'RI/puc and Beta3'RI/puc, respectively). Genomic .gamma.-chain
clones were isolated as described by Rixon et al. (Biochemistry 24:
2077-2086, 1985). Clone p.gamma.12A9 comprises 5' non-coding
sequences and approximately 4535 bp of .gamma.-chain coding
sequence. Clone p.gamma.12F3 comprises the remaining coding
sequence and 3' non-coding nucleotides. Both are pBR322-based
plasmids with the fibrinogen sequences inserted at the EcoRI site.
These plasmids were used as templates for the respective PCR
reactions.
The fibrinogen chain coding sequences were tailored for insertion
into expression vectors using the polymerase chain reaction (PCR)
as generally described by Mullis (U.S. Pat. No. 4,683,202). This
procedure removed native 5' and 3' untranslated sequences, added a
9 base sequence (CCT GCA GCC) upstream of the first ATG of each
coding sequence, supplied the first four codons for the
A.alpha.-chain sequence, removed an internal Mlu I site in the
A.alpha. sequence and added restriction sites to facilitate
subsequent cloning steps.
Referring to FIG. 1, the 5' end of the A.alpha. coding sequence was
tailored in a PCR reaction containing 20 pmole for each of primers
ZC6632 (SEQ ID NO: 12) and ZC6627 (SEQ ID NO: 13), approximately 10
ng of plasmid BS4 template DNA, 10 .mu.l of a mix containing 2.5 mM
each dNTP, 7.5 .mu.l 10.times. Pyrococcus furiosus (Pfu) DNA
polymerase buffer #1 (200 mM Tris-HCl, pH 8.2, 100 mM KCl, 60 mM
(NH.sub.4).sub.2SO.sub.4, 20 mM MgCl.sub.2, 1% Triton X-100, 100
.mu.g/ml nuclease free bovine serum albumin)(Stratagene, La Jolla,
Calif.), and water to 75 .mu.l. The mixture was heated to
94.degree. C. in a DNA thermal cycler (Perkin-Elmer Corp., Norwalk,
Conn.). To the heated mixture was added 25 .mu.l of a mixture
containing 2.5 .mu.l 10.times.Pfu buffer #1, 22 .mu.l H.sub.2O and
1 .mu.l 2.5 units/.mu.l Pfu DNA polymerase (Stratagem). The
reactions were run in a DNA thermal cycler (Perkin-Elmer) for five
cycles of 94.degree., 45 seconds; 40.degree., 90 seconds;
72.degree., 120 seconds; 20 cycles of 94.degree., 45 seconds;
45.degree., 90 seconds; 72.degree., 120 seconds; then incubated at
72.degree. for 7 minutes. The 5' PCR-generated fragment was
digested with Bam HI and Hind III, and the Bam HI-Hind III fragment
was then ligated to an internal 2.91 Kbp Hind III-Xba I fragment
and Bam HI, Xba I-digested pUC18. PCR-generated exon sequences were
sequenced.
Referring again to FIG. 1, the 3' end of the A.alpha. coding
sequence was tailored in a series of steps in which the Mlu I site
563 bases upstream from the stop codon of the A.alpha. sequence was
mutated using an overlap extension PCR reaction (Ho et al., Gene
77: 51-59, 1989). In the first reaction 40 pmole of each of primers
ZC6521 (SEQ ID NO: 14) and ZC6520 (SEQ ID NO: 15) were combined
with approximately 10 ng of plasmid BS4 template DNA in a reaction
mixture as described above. The reaction was run for 5 cycles of
94.degree., 45 seconds; 40.degree., 60 seconds; 72.degree., 120
seconds; 15 cycles of 94.degree., 45 seconds; 45.degree., 60
seconds; 72.degree., 120 seconds; then incubated at 72.degree. for
7 minutes. A second reaction was carried out in the same manner
using 40 pmole of each of primers ZC6519 (SEQ ID NO: 16) and ZC6518
(SEQ ID NO: 17) and BS4 as template. The PCR-generated DNA
fragments from the first and second reactions were isolated by gel
electrophoresis and elution from the gel. Approximately 1/10 of
each recovered reaction product was combined with 40 pmole of each
of primers ZC6521 (SEQ ID NO: 14) and ZC6518 (SEQ ID NO: 17) in a
PCR reaction in which the complementary 3' ends of each fragment
(containing the single base change) annealed and served as a primer
for the 3' extension of the complementary strand. PCR was carried
out using the same reaction conditions as in the first and second
3' PCR steps. The reaction product was then digested with Xba I and
Bam HI, and the Xba I-Bam HI fragment was cloned into Xba I, Bam
HI-digested pUC18. PCR-generated exons were sequenced.
As shown in FIG. 1, the 5' Bam HI-Xba I fragment (3.9 Kbp) and the
3' Xba I-Bam HI fragment (1.3 Kbp) were inserted into the Bam HI
site of the vector Zem228. Zem228 is a pUC18 derivative comprising
a Bam HI cloning site between a mouse MT-1 promoter and SV40
terminator, and a neomycin resistance marker flanked by SV40
promoter and terminator sequences. See European Patent Office
Publication EP 319,944 and FIG. 2. The entire A.alpha. coding
sequence was isolated from the Zem228 vector as an Sna BI fragment,
which was inserted into the Sna BI site of the plasmid
pMAD6-Sna.
Referring to FIG. 3, the 5' end of the B.beta.-chain was tailored
by PCR using the oligonucleotides ZC6629 (SEQ ID NO: 18), ZC6630
(SEQ ID NO: 19) and ZC6625 (SEQ ID NO: 20). These primers were used
in pairwise combinations (ZC6629+ZC6625 or ZC6630+ZC6625) to
generate B.beta. coding sequences beginning at the first ATG codon
(position 470 in SEQ ID NO: 3)(designated N1-Beta) or the third ATG
codon (position 512 in SEQ ID NO: 3)(designated N3-Beta).
Approximately 5 ng of Beta5'RI/puc template DNA was combined with
20 pmole of each of the primers (N1-Beta:ZC6629, SEQ ID NO:
18+ZC6625, SEQ ID NO: 20; or N3-Beta:ZC6630, SEQ ID NO: 19+ZC6625,
SEQ ID NO: 20) in a reaction mixture as described above. The
mixtures were incubated for 5 cycles of 94.degree., 45 seconds;
40.degree., 120 seconds; (N1-Beta) or 90 seconds (N3-Beta);
72.degree., 120 seconds; 20 cycles of 94.degree., 45 seconds;
45.degree., 120 seconds; (N1-Beta) or 90 seconds (N3-Beta);
72.degree., 120 seconds; then incubated at 72.degree. for 7
minutes. The two reaction products N1, 555 bp or N3, 510 bp) were
each digested with Eco RI and Bgl II, and the fragments were
ligated to the internal Bgl II-Xba I fragment and Eco RI+Xba
I-digested pUC19. The 3' end of the B.beta. sequence was tailored
in a reaction mixture as described above using the oligonucleotide
primers ZC6626 (SEQ ID NO: 21) and ZC6624 (SEQ ID NO: 22) and
approximately 5 ng of Beta3'RI/puc template. The mixtures were
incubated for 5 cycles of 94.degree., 45 seconds; 40.degree., 90
seconds; 72.degree., 120 seconds; 15 cycles of 94.degree., 45
seconds; 45.degree., 90 seconds; 72.degree., 120 seconds; then
incubated at 72.degree. for 7 minutes. A 990 bp Bgl II-Eco RI
fragment was isolated. This 3' fragment was ligated to the adjacent
coding fragment (340 bp, SphI-Bgl II) and Sph I+Eco RI-digested
pUC19. The 3' and 5' PCR-generated exons were sequenced. A third
intermediate vector was constructed by combining two internal
fragments (4285 bp Xba I-Eco RI and 383 kb Eco RI-Sph I) in Xba
I+Sph I-digested pUC19. The entire B.beta. coding sequence (two
forms) was then assembled by ligating one of the 5' Eco RI-Xba I
fragments, the internal Xba I-Sph I fragment, the 3' Sph I-Eco RI
fragment and Eco RI-digested vector pUC19. The B.beta. sequence was
then isolated as a 7.6 Kbp Sna BI fragment and inserted into the
Sna BI site of pMAD6-Sna.
Referring to FIG. 4, the 5' end of the gamma chain sequence was
tailored by PCR using the oligonucleotide primers ZC6514 (SEQ ID
NO: 23) and ZC6517 (SEQ ID NO: 24) and approximately 50 ng of
p.gamma.12A9 as template. The PCR reaction was run as described
above using 40 pM of each primer. The reaction was run for 5 cycles
of 94.degree., 45 seconds; 40.degree., 60 seconds, 72.degree., 120
seconds, followed by 15 cycles of 94.degree., 45 seconds;
45.degree., 60 seconds; 72.degree., 120 seconds. The resulting 213
by fragment was digested with Bam HI and Spe I, and the resulting
restriction fragment was ligated with the adjacent downstream 4.4
kb Spe I-Eco RI fragment and Bam HI+Eco RI digested pUC19. The 3'
end of the gamma chain sequence was tailored using oligonucleotide
primers ZC6516 (SEQ ID NO: 25) and ZC6515 (SEQ ID NO: 26) using 40
pM of each primer, approximately 50 ng of p.gamma.12F3 template and
the same thermal cycling schedule as used for the 5' fragment. The
resulting 500 bp fragment was digested with Spe I and Bam HI, and
the resulting restriction fragment was ligated with the upstream
2.77 kb Eco RI-Spe I fragment and Eco RI+Bam HI-digested pUC19. All
PCR-generated exons were sequenced. The entire .gamma.'-chain
coding sequence was then assembled by ligating a 4.5 Kbp Bam HI-Eco
RI 5' fragment, a 1.1 Kbp Eco RI-Pst I internal fragment and a 2.14
Kbp Pst I-Xba I 3' fragment in Bam HI+Xba I-digested Zem219b.
Zem219b is a pUC18-derived vector containing a mouse
metallothionein promoter and a DHFR selectable marker operably
linked to an SV40 promoter (FIG. 5). Plasmid Zem219b has been
deposited with American Type Culture Collection as an E. coli
XL1-blue transformant under Accession No. 68979. The entire
.gamma.-chain coding sequence was then isolated as a 7.8 Kbp Sna B1
fragment and inserted into the Sna BI site of pMAD6-Sna.
Example III
Mice for initial breeding stocks (C57BL6J, CBACA) were obtained
from Harlan Olac Ltd. (Bicester, UK). These were mated in pairs to
produce F1 hybrid cross (B6CBAF1) for recipient female,
superovulated females, stud males and vasectomized males. All
animals were kept on a 14 hour light/10 hour dark cycle and fed
water and food (Special Diet Services RM3, Edinburgh, Scotland) ad
libitum.
Transgenic mice were generated essentially as described in Hogan et
al., Manipulating the Mouse Embryo: A Laboratory Manual, Cold
Spring Harbor Laboratory, 1986, which is incorporated herein by
reference in its entirety. Female B6CBAF1 animals were
superovulated at 4-5 weeks of age by an i.p. injection of pregnant
mares' serum gonadotrophin (FOLLIGON, Vet-Drug, Falkirk, Scotland)
(5 iu) followed by an i.p. injection of human chorionic
gonadotrophin (CHORULON, Vet-Drug, Falkirk, Scotland) (5 iu) 45
hours later. They were then mated with a stud male overnight. Such
females were next examined for copulation plugs. Those that had
mated were sacrificed, and their eggs were collected for
microinjection.
DNA was injected into the fertilized eggs as described in Hogan et
al. (ibid.) Briefly, each of the vectors containing the A.alpha.,
B.beta. and .gamma. expression units was digested with Mlu I, and
the expression units were isolated by sucrose gradient
centrifugation. All chemicals used were reagent grade (Sigma
Chemical Co., St. Louis, Mo., U.S.A.), and all solutions were
sterile and nuclease-free. Solutions of 20% and 40% sucrose in 1M
NaCl, 20 mM Tris pH 8.0, 5 mM EDTA were prepared using UHP water
and filter sterilized. A 30% sucrose solution was prepared by
mixing equal volumes of the 20% and 40% solutions. A gradient was
prepared by layering 0.5 ml steps of the 40%, 30% and 20% sucrose
solutions into a 2 ml polyallomer tube and allowed to stand for one
hour. 100 .mu.l of DNA solution (max. 8 .mu.g DNA) was loaded onto
the top of the gradient, and the gradient was centrifuged for 17-20
hours at 26,000 rpm, 15.degree. C. in a Beckman TL100
ultracentrifuge using a TLS-55 rotor (Beckman Instruments,
Fullerton, Calif., USA). Gradients were fractionated by puncturing
the tube bottom with a 20 ga. needle and collecting drops in a 96
well microliter plate. 3 .mu.l aliquots were analyzed on a 1%
agarose mini-gel. Fractions containing the desired DNA fragment
were pooled and ethanol precipitated overnight at -20.degree. C. in
0.3M sodium acetate. DNA pellets were resuspended in 50-100 .mu.l
UHP water and quantitated by fluorimetry. The expression units were
diluted in Dulbecco's phosphate buffered saline without calcium and
magnesium (containing, per liter, 0.2 g KCl, 0.2 g
KH.sub.2PO.sub.4, 8.0 g NaCl, 1.15 g Na.sub.2HPO.sub.4), mixed
(using either the N1-Beta or N3-Beta expression unit) in a 1:1:1
molar ratio, concentration adjusted to about 6 .mu.g/ml, and
injected into the eggs (.about.2 pl total DNA solution per
egg).
Recipient females of 6-8 weeks of age are prepared by mating
B6CBAF1 females in natural estrus with vasectomized males. Females
possessing copulation plugs are then kept for transfer of
microinjected eggs.
Following birth of potential transgenic animals, tail biopsies are
taken, under anesthesia, at four weeks of age. Tissue samples are
placed in 2 ml of tail buffer (0.3M Na acetate, 50 mM HCl, 1.5 mM
MgCl.sub.2, 10 mM Tris-HCl, pH 8.5, 0.5% NP40, 0.5% Tween 20)
containing 200 .mu.g/ml proteinase K (Boehringer Mannheim,
Mannheim, Germany) and vortexed. The samples are shaken (250 rpm)
at 55.degree.-60.degree. for 3 hours to overnight. DNA prepared
from biopsy samples is examined for the presence of the injected
constructs by PCR and Southern blotting. The digested tissue is
vigorously vortexed, and 5 .mu.l aliquots are placed in 0.5 ml
microcentrifuge tubes. Positive and negative tail samples are
included as controls. Forty .mu.l of silicone oil (BDH, Poole, UK)
is added to each tube, and the tubes are briefly centrifuged. The
tubes are incubated in the heating block of a thermal cycler (e.g.
Omni-gene, Hybaid, Teddington, UK) to 95.degree. C. for 10 minutes.
Following this, each tube has a 45 .mu.l aliquot of PCR mix added
such that the final composition of each reaction mix is: 50 mM KCl;
2 mM MgCl.sub.2; 10 mM Tris-HCl (pH 8.3); 0.01% gelatin; 0.1% NP40,
10% DMSO; 500 nM each primer, 200 .mu.M dNTPs; 0.02 U/.mu.l Taq
polymerase (Boehringer Mannheim, Mannheim, Germany). The tubes are
then cycled through 30 repeated temperature changes as required by
the particular primers used. The primers may be varied but in all
cases must target the BLG promoter region. This is specific for the
injected DNA fragments because the mouse does not have a BLG gene.
Twelve .mu.l of 5.times. loading buffer containing Orange G marker
dye (0.25% Orange G [Sigma] 15% Ficoll type 400 [Pharmacia
Biosystems Ltd., Milton Keynes, UK]) is then added to each tube,
and the reaction mixtures are electrophoresed on a 1.6% agarose gel
containing ethidium bromide (Sigma) until the marker dye has
migrated 2/3 of the length of the gel. The gel is visualized with a
UV light source emitting a wavelength of 254 nm. Transgenic mice
having one or more of the injected DNA fragments are identified by
this approach.
Positive tail samples are processed to obtain pure DNA. The DNA
samples are screened by Southern blotting using a BLG promoter
probe (nucleotides 2523-4253 of SEQ ID NO: 7). Specific cleavages
with appropriate restriction enzymes (e.g. Eco RI) allow the
distinction of the three constructs containing the A.alpha.,
B.beta. and .gamma. sequences.
Southern blot analysis of transgenic mice prepared essentially as
described above demonstrated that more than 50% of progeny
contained all three fibrinogen sequences. Examination of milk from
positive animals by reducing SDS polyacrylamide gel electrophoresis
demonstrated the presence of all three protein chains at
concentrations up to 1 mg/ml. The amount of fully assembled
fibrinogen was related to the ratios of individual subunits present
in the milk. No apparent phenotype was associated with high
concentrations of human fibrinogen in mouse milk.
Example IV
Donor ewes are treated with an intravaginal
progesterone-impregnated sponge (CHRONOGEST Goat Sponge, Intervet,
Cambridge, UK) on day 0. Sponges are left in situ for ten or twelve
days.
Superovulation is induced by treatment of donor ewes with a total
of one unit of ovine follicle stimulating hormone (OFSH) (OVAGEN,
Horizon Animal Reproduction Technology Pty. Ltd., New Zealand)
administered in eight intramuscular injections of 0.125 units per
injection starting at 5:00 pm on day -4 and ending at 8:00 am on
day 0. Donors are injected intramuscularly with 0.5 ml of a
luteolytic agent (ESTRUMATE, Vet-Drug) on day -4 to cause
regression of the corpus luteum, to allow return to estrus and
ovulation. To synchronize ovulation, the donor animals are injected
intramuscularly with 2 ml of a synthetic releasing hormone analog
(RECEPTAL, Vet-Drug) at 5:00 pm on day 0.
Donors are starved of food and water for at least 12 hours before
artificial insemination (A.I.). The animals are artificially
inseminated by intrauterine laparoscopy under sedation and local
anesthesia on day 1. Either xylazine (ROMPUN, Vet-Drug) at a dose
rate of 0.05-0.1 ml per 10 kg bodyweight or ACP injection 10 mg/ml
(Vet-Drug) at a dose rate of 0.1 ml per 10 kg bodyweight is
injected intramuscularly approximately fifteen minutes before A.I.
to provide sedation. A.I. is carried out using freshly collected
semen from a Poll Dorset ram. Semen is diluted with equal parts of
filtered phosphate buffered saline, and 0.2 ml of the diluted semen
is injected per uterine horn. Immediately pre- or post-A.I., donors
are given an intramuscular injection of AMOXYPEN (Vet-Drug).
Fertilized eggs are recovered on day 2 following starvation of
donors of food and water from 5:00 pm on day 1. Recovery is carried
out under general anesthesia induced by an intravenous injection of
5% thiopentone sodium (INTRAVAL SODIUM, Vet-Drug) at a dose rate of
3 ml per 10 kg bodyweight. Anesthesia is maintained by inhalation
of 1-2% Halothane/O.sub.2/N.sub.2O after intubation. To recover the
fertilized eggs, a laparotomy incision is made, and the uterus is
exteriorized. The eggs are recovered by retrograde flushing of the
oviducts with Ovum Culture Medium (Advanced Protein Products,
Brierly Hill, West Midlands, UK) supplemented with bovine serum
albumin of New Zealand origin. After flushing, the uterus is
returned to the abdomen, and the incision is closed. Donors are
allowed to recover post-operatively or are euthanized. Donors that
are allowed to recover are given an intramuscular injection of
Amoxypen L.A. at the manufacturer's recommended dose rate
immediately pre- or post-operatively.
Plasmids containing the three fibrinogen chain expression units are
digested with Mlu I, and the expression unit fragments are
recovered and purified on sucrose density gradients. The fragment
concentrations are determined by fluorimetry and diluted in
Dulbecco's phosphate buffered saline without calcium and magnesium
as described above. The concentration is adjusted to 6 .mu.g/ml and
approximately 2 pl of the mixture is microinjected into one
pronucleus of each fertilized eggs with visible pronuclei.
All fertilized eggs surviving pronuclear microinjection are
cultured in vitro at 38.5.degree. C. in an atmosphere of 5%
CO.sub.2:5% O.sub.2:90% N.sub.2 and about .about.100% humidity in a
bicarbonate buffered synthetic oviduct medium (see Table)
supplemented with 20% v/v vasectomized ram serum. The serum may be
heat inactivated at 56.degree. C. for 30 minutes and stored frozen
at -20.degree. C. prior to use. The fertilized eggs are cultured
for a suitable period of time to allow early embryo mortality
(caused by the manipulation techniques) to occur. These dead or
arrested embryos are discarded. Embryos having developed to 5 or 6
cell divisions are transferred to synchronized recipient ewes.
TABLE-US-00001 TABLE Synthetic Oviduct Medium Stock A (Lasts 3
Months) NaCl 6.29 g KCl 0.534 g KH.sub.2SO.sub.4 0.162 g
MgSO.sub.4:7H.sub.2O 0.182 g Penicillin 0.06 g Sodium Lactate 60%
syrup 0.6 mls Super H.sub.2O 99.4 mls Stock B (Lasts 2 weeks)
NaHCO.sub.3 0.21 g Phenol red 0.001 g Super H.sub.2O 10 mls Stock C
(Lasts 2 weeks) Sodium Pyruvate 0.051 g Super H.sub.2O 10 mls Stock
D (Lasts 3 months) CaCl.sub.2.cndot.2H.sub.2O 0.262 g Super
H.sub.2O 10 mls Stock E (Lasts 3 months) Hepes 0.651 g Phenol red
0.001 g Super H.sub.2O 10 mls To make up 10 mls of Bicarbonate
Buffered Medium STOCK A 1 ml STOCK B 1 ml STOCK C 0.07 ml STOCK D
0.1 ml Super H.sub.2O 7.83 ml Osmolarity should be 265-285 mOsm.
Add 2.5 ml of heat inactivated sheep serum and filter sterilize. To
make up 10 mls HEPES Buffered Medium STOCK A 1 ml STOCK B 0.2 ml
STOCK C 0.07 ml STOCK D 0.1 ml STOCK E 0.8 ml Super H.sub.2O 7.83
ml Osmolarity should be 265-285 mOsm. Add 2.5 ml of heat
inactivated sheep serum and filter sterilize.
Recipient ewes are treated with an intravaginal
progesterone-impregnated sponge (Chronogest Ewe Sponge or
Chronogest Ewe-Lamb Sponge, Intervet) left in situ for 10 or 12
days. The ewes are injected intramuscularly with 1.5 ml (300 iu) of
a follicle stimulating hormone substitute (P.M.S.G., Intervet) and
with 0.5 ml of a luteolytic agent (Estrumate, Coopers Pitman-Moore)
at sponge removal on day -1. The ewes are tested for estrus with a
vasectomized ram between 8:00 am and 5:00 pm on days 0 and 1.
Embryos surviving in vitro culture are returned to recipients
(starved from 5:00 pm on day 5 or 6) on day 6 or 7. Embryo transfer
is carried out under general anesthesia as described above. The
uterus is exteriorized via a laparotomy incision with or without
laparoscopy. Embryos are returned to one or both uterine horns only
in ewes with at least one suitable corpora lutea. After replacement
of the uterus, the abdomen is closed, and the recipients are
allowed to recover. The animals are given an intramuscular
injection of Amoxypen L.A. at the manufacturer's recommended dose
rate immediately pre- or post-operatively.
Lambs are identified by ear tags and left with their dams for
rearing. Ewes and lambs are either housed and fed complete diet
concentrates and other supplements and or ad lib. hay, or are let
out to grass.
Within the first week of life (or as soon thereafter as possible
without prejudicing health), each lamb is tested for the presence
of the heterologous DNA by two sampling procedures. A 10 ml blood
sample is taken from the jugular vein into an EDTA vacutainer. If
fit enough, the lambs also have a second 10 ml blood sample taken
within one week of the first. Tissue samples are taken by tail
biopsy as soon as possible after the tail has become desensitized
after the application of a rubber elastrator ring to its proximal
third (usually within 200 minutes after "tailing"). The tissue is
placed immediately in a solution of tail buffer. Tail samples are
kept at room temperature and analyzed on the day of collection. All
lambs are given an intramuscular injection of Amoxypen L.A. at the
manufacturer's recommended dose rate immediately post-biopsy, and
the cut end of the tail is sprayed with an antibiotic spray.
DNA is extracted from sheep blood by first separating white blood
cells. A 10 ml sample of blood is diluted in 20 ml of Hank's
buffered saline (HBS; obtained from Sigma Chemical Co.). Ten ml of
the diluted blood is layered over 5 ml of Histopaque (Sigma) in
each of two 15 ml screw-capped tubes. The tubes are centrifuged at
3000 rpm (2000.times.g max.), low brake for 15 minutes at room
temperature. White cell interfaces are removed to a clean 15 ml
tube and diluted to 15 ml in HBS. The diluted cells are spun at
3000 rpm for 10 minutes at room temperature, and the cell pellet is
recovered and resuspended in 2-5 ml of tail buffer.
To extract DNA from the white cells, 10% SDS is added to the
resuspended cells to a final concentration of 1%, and the tube is
inverted to mix the solution. One mg of fresh proteinase K solution
is added, and the mixture is incubated overnight at 45.degree. C.
DNA is extracted using an equal volume of phenol/chloroform
(.times.3) and chloroform/isoamyl alcohol (.times.1). The DNA is
then precipitated by adding 0.1 volume of 3M NaOAc and 2 volumes of
ethanol, and the tube is inverted to mix. The precipitated DNA is
spooled out using a clean glass rod with a sealed end. The spool is
washed in 70% ethanol, and the DNA is allowed to partially dry,
then is redissolved in TE (10 mM Tris-HCl, 1 mM EDTA, pH 7.4).
DNA samples from blood and tail are analyzed by Southern blotting
using probes for the BLG promoter region and the fibrinogen chain
coding regions.
From the foregoing, it will be appreciated that, although specific
embodiments of the invention have been described herein for
purposes of illustration, various modifications may be made without
deviating from the spirit and scope of the invention. Accordingly,
the invention is not limited except as by the appended claims.
SEQUENCE LISTINGS
1
275943 base pairsnucleic aciddoublelinearDNA (genomic)Human
Fibrinogen A-alpha chainCDS join(31..84, 1154..1279, 1739..1922,
3055..3200, 3786..5210) 1GTCTAGGAGC CAGCCCCACC CTTAGAAAAG ATG TTT
TCC ATG AGG ATC GTC TGC 54 Met Phe Ser Met Arg Ile Val Cys 1 5CTA
GTT CTA AGT GTG GTG GGC ACA GCA TGG GTATGGCCCT TTTCATTTTT 104Leu
Val Leu Ser Val Val Gly Thr Ala Trp 10 15TCTTCTTGCT TTCTCTCTGG
TGTTTATTCC ACAAAGAGCC TGGAGGTCAG AGTCTACCTG 164CTCTATGTCC
TGACACACTC TTAGCTTTAT GACCCCAGGC CTGGGAGGAA ATTTCCTGGG
224TGGGCTTGAC ACCTCAAGAA TACAGGGTAA TATGACACCA AGAGGAAGAT
CTTAGATGGA 284TGAGAGTGTA CAACTACAAG GGAAACTTTA GCATCTGTCA
TTCAGTCTTA CCACATTTTG 344TTTTGTTTTG TTTTAAAAAG GGCAAGAATT
ATTTGCCATC CTTGTACCTA TAAAGCCTTG 404GTGCATTATA ATGCTAGTTA
ATGGAATAAA ACATTTTATG GTAAGATTTG TTTTCTTTAG 464TTATTAATTT
CTTGCTACTT GTCCATAATA AGCAGAACTT TTAGTGTTAG TACAGTTTTG
524CTGAAAGGTT ATTGTTGTGT TTGTCAAGAC AGAAGAAAAA GCAAACGAAT
TATCTTTGGA 584AATATCTTTG CAGTATCAGA AGAGATTAGT TAGTAAGGCA
ATACGCTTTT CCGCAGTAAT 644GGTATTCTTT TAAATTATGA ATCCATCTCT
AAAGGTTACA TAGAAACTTG AAGGAGAGAG 704GAACATTCAG TTAAGATAGT
CTAGGTTTTT CTACTGAAGC AGCAATTACA GGAGAAAGAG 764CTCTACAGTA
GTTTTCAACT TTCTGTCTGC AGTCATTAGT AAAAATGAAA AGGTAAAATT
824TAACTGATTT TATAGATTCA AATAATTTTC CTTTTAGGAT GGATTCTTTA
AAACTCCTAA 884TATTTATCAA ATGCTTATTT AAGTGTCACA CACAGTTAAG
AAATTTGTAC ACCTTGTCTC 944CTTTAATTCT CATAACAACT CCATAAAATG
GGTCCTAGGA TTTCCATTTG AAGATAAGAA 1004ACCTGAAGCT TGCCGAAGCC
CTGTGTCTGC TCTCCTTAAT CTCTGTGAGA GTGCCATCTC 1064TTCCTGGGGA
CTTGTAGGCA TGCCACTGTC TCCTCTTCTG GCTAACATTG CTGTTGCTCT
1124CTTTTGTGTA TGTGAATGAA TCTTTAAAG ACT GCA GAT AGT GGT GAA GGT GAC
1177 Thr Ala Asp Ser Gly Glu Gly Asp 20 25TTT CTA GCT GAA GGA GGA
GGC GTG CGT GGC CCA AGG GTT GTG GAA AGA 1225Phe Leu Ala Glu Gly Gly
Gly Val Arg Gly Pro Arg Val Val Glu Arg 30 35 40CAT CAA TCT GCC TGC
AAA GAT TCA GAC TGG CCC TTC TGC TCT GAT GAA 1273His Gln Ser Ala Cys
Lys Asp Ser Asp Trp Pro Phe Cys Ser Asp Glu 45 50 55GAC TGG
GTAAGCAGTC AGCGGGGGAA GCAGGAGATT CCTTCCCTCT GATGCTAGAG 1329Asp Trp
60GGGCTCACAG GCTGACCTGA TTGGTCCCAG AAACTTTTTT AAATAGAAAA TAATTGAATA
1389GTTACCTACA TAGCAAATAA AGAAAAGGAA CCTACTCCCA AGAGCACTGT
TTATTTACCT 1449CCCCAACTCT GGATCATTAG TGGGTGAACA GACAGGATTT
CAGTTGCATG CTCAGGCAAA 1509ACCAGGCTCC TGAGTATTGT GGCCTCAATT
TCCTGGCACC TATTTATGGC TAAGTGGACC 1569CTCATTCCAG AGTTTCTCTG
CGACCTCTAA CTAGTCCTCT TACCTACTTT TAAGCCAACT 1629TATCTGGAAG
AGAAAGGGTA GGAAGAAATG GGGGCTGCAT GGAAACATGC AAAATTATTC
1689TGAATCTGAG AGATAGATCC TTACTGTAAT TTTCTCCCTT CACTTTCAG AAC TAC
1744 Asn TyrAAA TGC CCT TCT GGC TGC AGG ATG AAA GGG TTG ATT GAT GAA
GTC AAT 1792Lys Cys Pro Ser Gly Cys Arg Met Lys Gly Leu Ile Asp Glu
Val Asn 65 70 75CAA GAT TTT ACA AAC AGA ATA AAT AAG CTC AAA AAT TCA
CTA TTT GAA 1840Gln Asp Phe Thr Asn Arg Ile Asn Lys Leu Lys Asn Ser
Leu Phe Glu 80 85 90TAT CAG AAG AAC AAT AAG GAT TCT CAT TCG TTG ACC
ACT AAT ATA ATG 1888Tyr Gln Lys Asn Asn Lys Asp Ser His Ser Leu Thr
Thr Asn Ile Met 95 100 105 110GAA ATT TTG AGA GGC GAT TTT TCC TCA
GCC AAT A GTAAGTATTA 1932Glu Ile Leu Arg Gly Asp Phe Ser Ser Ala
Asn 115 120CATATTTACT TCTTTGACTT TATAACAGAA ACAACAAAAA TCCTAAATAA
ATATGATATC 1992CGCTTATATC TATGACAATT TCATCCCAAA GTACTTAGTG
TAGAAACACA TACCTTCATA 2052ATATCCCTGA AAATTTTAAG AGGGAGCTTT
TGTTTTCGTT ATTTTTTCAA AGTAAAAGAT 2112GTTAACTGAG ATTGTTTAAG
GTCACAAAAT AAGTCAGAAT TTTGGATTAA AACAAGAATT 2172TAAATGTGTT
CTTTTCAACA GTATATACTG AAAGTAGGAT GGGTCAGACT CTTTGAGTTG
2232ATATTTTTGT TTCTGCTTTG TAAAGGTGAA AACTGAGAGG TCAAGGAACT
TGTTCAAAGA 2292CACAGAGCTG GGAATTCAAC TCCCAGACTC CACTGAGCTG
ATTAGGTAGA TTTTTAAATT 2352TAAAATATAG GGTCAAGCTA CGTCATTCTC
ACAGTCTACT CATTAGGGTT AGGAAACATT 2412GCATTCACTC TGGGCATGGA
CAGCGAGTCT AGGGAGTCCT CAGTTTCTCA AGTTTTGCTT 2472TGCCTTTTTA
CACCTTCACA AACACTTGAC ATTTAAAATC AGTGATGCCA ACACTAGCTG
2532GCAAGTGAGT GATCCTGTTG ACCCAAAACA GCTTAGGAAC CATTTCAAAT
CTATAGAGTT 2592AAAAAGAAAA GCTCATCAGT AAGAAAATCC AATATGTTCA
AGTCCCTTGA TTAAGGATGT 2652TATAAAATAA TTGAAATGCA ATCAAACCAA
CTATTTTAAC TCCAAATTAC ACCTTTAAAA 2712TTCCAAAGAA AGTTCTTCTT
CTATATTTCT TTGGGATTAC TAATTGCTAT TAGGACATCT 2772TAACTGGCAT
TCATGGAAGG CTGCAGGGCA TAACATTATC CAAAAGTCAA ATGCCCCATA
2832GGTTTTGAAC TCACAGATTA AACTGTAACC AAAATAAAAT TAGGCATATT
TACAAGCTAG 2892TTTCTTTCTT TCTTTTTTCT CTTTCTTTCT TTCTTTCTTT
CTTTCTTTCT TTCTTTCTTT 2952CTTTCTTTCT TTCTCCTTCC TTCCTTTCTT
CCTTTCTTTT TTGCTGGCAA TTACAGACAA 3012ATCACTCAGC AGCTACTTCA
ATAACCATAT TTTCGATTTC AG AC CGT GAT AAT 3065 Asn Arg Asp Asn 125ACC
TAC AAC CGA GTG TCA GAG GAT CTG AGA AGC AGA ATT GAA GTC CTG 3113Thr
Tyr Asn Arg Val Ser Glu Asp Leu Arg Ser Arg Ile Glu Val Leu 130 135
140AAG CGC AAA GTC ATA GAA AAA GTA CAG CAT ATC CAG CTT CTG CAG AAA
3161Lys Arg Lys Val Ile Glu Lys Val Gln His Ile Gln Leu Leu Gln Lys
145 150 155AAT GTT AGA GCT CAG TTG GTT GAT ATG AAA CGA CTG GAG
GTAAGTATGT 3210Asn Val Arg Ala Gln Leu Val Asp Met Lys Arg Leu Glu
160 165 170GGCTGTGGTC CCGAGTGTCC TTGTTTTTGA GTAGAGGGAA AAGGAAGGCG
ATAGTTATGC 3270ACTGAGTGTC TACTATATGC AGAGAAAAGT GTTATATCCA
TCATCTACCT AAAAGTAGGT 3330ATTATTTTCC TCACTCCACA GTTGAAGAAA
AAAAAATTCA GAGATATTAA GTAAATTTTC 3390CAACGTACAT AGATAGTAAT
TCAAAGCAAT GTTCAGTCCC TGTCTATTCC AAGCCATTAC 3450ATCACCACAC
CTCTGAGCCC TCAGCCTGAG TTCACCAAGG ATCATTTAAT TAGCGTTTCC
3510TTTGAGAGGG AATAGCACCT TACTCTTGAT CCATTCTGAG GCTAAGATGA
ATTAAACAGC 3570ATCCATTGCT TATCCTGGCT AGCCCTGCAA TACCCAACAT
CTCTTCCACT GAGGGTGCTC 3630GATAGGCAGA AAACAGAGAA TATTAAGTGG
TAGGTCTCCG AGTCAAAAAA AATGAAACCA 3690GTTTCCAGAA GGAAAATTAA
CTACCAGGAA CTCAATAGAC GTAGTTTATG TATTTGTATC 3750TACATTTTCT
CTTTATTTTT CTCCCCTCTC TCTAG GTG GAC ATT GAT ATT AAG 3803 Val Asp
Ile Asp Ile Lys 175ATC CGA TCT TGT CGA GGG TCA TGC AGT AGG GCT TTA
GCT CGT GAA GTA 3851Ile Arg Ser Cys Arg Gly Ser Cys Ser Arg Ala Leu
Ala Arg Glu Val 180 185 190GAT CTG AAG GAC TAT GAA GAT CAG CAG AAG
CAA CTT GAA CAG GTC ATT 3899Asp Leu Lys Asp Tyr Glu Asp Gln Gln Lys
Gln Leu Glu Gln Val Ile 195 200 205GCC AAA GAC TTA CTT CCC TCT AGA
GAT AGG CAA CAC TTA CCA CTG ATA 3947Ala Lys Asp Leu Leu Pro Ser Arg
Asp Arg Gln His Leu Pro Leu Ile 210 215 220AAA ATG AAA CCA GTT CCA
GAC TTG GTT CCC GGA AAT TTT AAG AGC CAG 3995Lys Met Lys Pro Val Pro
Asp Leu Val Pro Gly Asn Phe Lys Ser Gln225 230 235 240CTT CAG AAG
GTA CCC CCA GAG TGG AAG GCA TTA ACA GAC ATG CCG CAG 4043Leu Gln Lys
Val Pro Pro Glu Trp Lys Ala Leu Thr Asp Met Pro Gln 245 250 255ATG
AGA ATG GAG TTA GAG AGA CCT GGT GGA AAT GAG ATT ACT CGA GGA 4091Met
Arg Met Glu Leu Glu Arg Pro Gly Gly Asn Glu Ile Thr Arg Gly 260 265
270GGC TCC ACC TCT TAT GGA ACC GGA TCA GAG ACG GAA AGC CCC AGG AAC
4139Gly Ser Thr Ser Tyr Gly Thr Gly Ser Glu Thr Glu Ser Pro Arg Asn
275 280 285CCT AGC AGT GCT GGA AGC TGG AAC TCT GGG AGC TCT GGA CCT
GGA AGT 4187Pro Ser Ser Ala Gly Ser Trp Asn Ser Gly Ser Ser Gly Pro
Gly Ser 290 295 300ACT GGA AAC CGA AAC CCT GGG AGC TCT GGG ACT GGA
GGG ACT GCA ACC 4235Thr Gly Asn Arg Asn Pro Gly Ser Ser Gly Thr Gly
Gly Thr Ala Thr305 310 315 320TGG AAA CCT GGG AGC TCT GGA CCT GGA
AGT GCT GGA AGC TGG AAC TCT 4283Trp Lys Pro Gly Ser Ser Gly Pro Gly
Ser Ala Gly Ser Trp Asn Ser 325 330 335GGG AGC TCT GGA ACT GGA AGT
ACT GGA AAC CAA AAC CCT GGG AGC CCT 4331Gly Ser Ser Gly Thr Gly Ser
Thr Gly Asn Gln Asn Pro Gly Ser Pro 340 345 350AGA CCT GGT AGT ACC
GGA ACC TGG AAT CCT GGC AGC TCT GAA CGC GGA 4379Arg Pro Gly Ser Thr
Gly Thr Trp Asn Pro Gly Ser Ser Glu Arg Gly 355 360 365AGT GCT GGG
CAC TGG ACC TCT GAG AGC TCT GTA TCT GGT AGT ACT GGA 4427Ser Ala Gly
His Trp Thr Ser Glu Ser Ser Val Ser Gly Ser Thr Gly 370 375 380CAA
TGG CAC TCT GAA TCT GGA AGT TTT AGG CCA GAT AGC CCA GGC TCT 4475Gln
Trp His Ser Glu Ser Gly Ser Phe Arg Pro Asp Ser Pro Gly Ser385 390
395 400GGG AAC GCG AGG CCT AAC AAC CCA GAC TGG GGC ACA TTT GAA GAG
GTG 4523Gly Asn Ala Arg Pro Asn Asn Pro Asp Trp Gly Thr Phe Glu Glu
Val 405 410 415TCA GGA AAT GTA AGT CCA GGG ACA AGG AGA GAG TAC CAC
ACA GAA AAA 4571Ser Gly Asn Val Ser Pro Gly Thr Arg Arg Glu Tyr His
Thr Glu Lys 420 425 430CTG GTC ACT TCT AAA GGA GAT AAA GAG CTC AGG
ACT GGT AAA GAG AAG 4619Leu Val Thr Ser Lys Gly Asp Lys Glu Leu Arg
Thr Gly Lys Glu Lys 435 440 445GTC ACC TCT GGT AGC ACA ACC ACC ACG
CGT CGT TCA TGC TCT AAA ACC 4667Val Thr Ser Gly Ser Thr Thr Thr Thr
Arg Arg Ser Cys Ser Lys Thr 450 455 460GTT ACT AAG ACT GTT ATT GGT
CCT GAT GGT CAC AAA GAA GTT ACC AAA 4715Val Thr Lys Thr Val Ile Gly
Pro Asp Gly His Lys Glu Val Thr Lys465 470 475 480GAA GTG GTG ACC
TCC GAA GAT GGT TCT GAC TGT CCC GAG GCA ATG GAT 4763Glu Val Val Thr
Ser Glu Asp Gly Ser Asp Cys Pro Glu Ala Met Asp 485 490 495TTA GGC
ACA TTG TCT GGC ATA GGT ACT CTG GAT GGG TTC CGC CAT AGG 4811Leu Gly
Thr Leu Ser Gly Ile Gly Thr Leu Asp Gly Phe Arg His Arg 500 505
510CAC CCT GAT GAA GCT GCC TTC TTC GAC ACT GCC TCA ACT GGA AAA ACA
4859His Pro Asp Glu Ala Ala Phe Phe Asp Thr Ala Ser Thr Gly Lys Thr
515 520 525TTC CCA GGT TTC TTC TCA CCT ATG TTA GGA GAG TTT GTC AGT
GAG ACT 4907Phe Pro Gly Phe Phe Ser Pro Met Leu Gly Glu Phe Val Ser
Glu Thr 530 535 540GAG TCT AGG GGC TCA GAA TCT GGC ATC TTC ACA AAT
ACA AAG GAA TCC 4955Glu Ser Arg Gly Ser Glu Ser Gly Ile Phe Thr Asn
Thr Lys Glu Ser545 550 555 560AGT TCT CAT CAC CCT GGG ATA GCT GAA
TTC CCT TCC CGT GGT AAA TCT 5003Ser Ser His His Pro Gly Ile Ala Glu
Phe Pro Ser Arg Gly Lys Ser 565 570 575TCA AGT TAC AGC AAA CAA TTT
ACT AGT AGC ACG AGT TAC AAC AGA GGA 5051Ser Ser Tyr Ser Lys Gln Phe
Thr Ser Ser Thr Ser Tyr Asn Arg Gly 580 585 590GAC TCC ACA TTT GAA
AGC AAG AGC TAT AAA ATG GCA GAT GAG GCC GGA 5099Asp Ser Thr Phe Glu
Ser Lys Ser Tyr Lys Met Ala Asp Glu Ala Gly 595 600 605AGT GAA GCC
GAT CAT GAA GGA ACA CAT AGC ACC AAG AGA GGC CAT GCT 5147Ser Glu Ala
Asp His Glu Gly Thr His Ser Thr Lys Arg Gly His Ala 610 615 620AAA
TCT CGC CCT GTC AGA GGT ATC CAC ACT TCT CCT TTG GGG AAG CCT 5195Lys
Ser Arg Pro Val Arg Gly Ile His Thr Ser Pro Leu Gly Lys Pro625 630
635 640TCC CTG TCC CCC TAGACTAAGT TAAATATTTC TGCACAGTGT TCCCATGGCC
5247Ser Leu Ser ProCCTTGCATTT CCTTCTTAAC TCTCTGTTAC ACGTCATTGA
AACTACACTT TTTTGGTCTG 5307TTTTTGTGCT AGACTGTAAG TTCCTTGGGG
GCAGGGCCTT TGTCTGTCTC ATCTCTGTAT 5367TCCCAAATGC CTAACAGTAC
AGAGCCATGA CTCAATAAAT ACATGTTAAA TGGATGAATG 5427AATTCCTCTG
AAACTCTATT TGAGCTTATT TAGTCAAATT CTTTCACTAT TCAAAGTGTG
5487TGCTATTAGA ATTGTCACCC AACTGATTAA TCACATTTTT AGTATGTGTC
TCAGTTGACA 5547TTTAGGTCAG GCTAAATACA AGTTGTGTTA GTATTAAGTG
AGCTTAGCTA CCTGTACTGG 5607TTACTTGCTA TTAGTTTGTG CAAGTAAAAT
TCCAAATACA TTTGAGGAAA ATCCCCTTTG 5667CAATTTGTAG GTATAAATAA
CCGCTTATTT GCATAAGTTC TATCCCACTG TAAGTGCATC 5727CTTTCCCTAT
GGAGGGAAGG AAAGGAGGAA GAAAGAAAGG AAGGGAAAGA AACAGTATTT
5787GCCTTATTTA ATCTGAGCCG TGCCTATCTT TGTAAAGTTA AATGAGAATA
ACTTCTTCCA 5847ACCAGCTTAA TTTTTTTTTT AGACTGTGAT GATGTCCTCC
AAACACATCC TTCAGGTACC 5907CAAAGTGGCA TTTTCAATAT CAAGCTATCC GGATCC
5943644 amino acidsamino acidlinearprotein 2Met Phe Ser Met Arg Ile
Val Cys Leu Val Leu Ser Val Val Gly Thr 1 5 10 15Ala Trp Thr Ala
Asp Ser Gly Glu Gly Asp Phe Leu Ala Glu Gly Gly 20 25 30Gly Val Arg
Gly Pro Arg Val Val Glu Arg His Gln Ser Ala Cys Lys 35 40 45Asp Ser
Asp Trp Pro Phe Cys Ser Asp Glu Asp Trp Asn Tyr Lys Cys 50 55 60Pro
Ser Gly Cys Arg Met Lys Gly Leu Ile Asp Glu Val Asn Gln Asp 65 70
75 80Phe Thr Asn Arg Ile Asn Lys Leu Lys Asn Ser Leu Phe Glu Tyr
Gln 85 90 95Lys Asn Asn Lys Asp Ser His Ser Leu Thr Thr Asn Ile Met
Glu Ile 100 105 110Leu Arg Gly Asp Phe Ser Ser Ala Asn Asn Arg Asp
Asn Thr Tyr Asn 115 120 125Arg Val Ser Glu Asp Leu Arg Ser Arg Ile
Glu Val Leu Lys Arg Lys 130 135 140Val Ile Glu Lys Val Gln His Ile
Gln Leu Leu Gln Lys Asn Val Arg145 150 155 160Ala Gln Leu Val Asp
Met Lys Arg Leu Glu Val Asp Ile Asp Ile Lys 165 170 175Ile Arg Ser
Cys Arg Gly Ser Cys Ser Arg Ala Leu Ala Arg Glu Val 180 185 190Asp
Leu Lys Asp Tyr Glu Asp Gln Gln Lys Gln Leu Glu Gln Val Ile 195 200
205Ala Lys Asp Leu Leu Pro Ser Arg Asp Arg Gln His Leu Pro Leu Ile
210 215 220Lys Met Lys Pro Val Pro Asp Leu Val Pro Gly Asn Phe Lys
Ser Gln225 230 235 240Leu Gln Lys Val Pro Pro Glu Trp Lys Ala Leu
Thr Asp Met Pro Gln 245 250 255Met Arg Met Glu Leu Glu Arg Pro Gly
Gly Asn Glu Ile Thr Arg Gly 260 265 270Gly Ser Thr Ser Tyr Gly Thr
Gly Ser Glu Thr Glu Ser Pro Arg Asn 275 280 285Pro Ser Ser Ala Gly
Ser Trp Asn Ser Gly Ser Ser Gly Pro Gly Ser 290 295 300Thr Gly Asn
Arg Asn Pro Gly Ser Ser Gly Thr Gly Gly Thr Ala Thr305 310 315
320Trp Lys Pro Gly Ser Ser Gly Pro Gly Ser Ala Gly Ser Trp Asn Ser
325 330 335Gly Ser Ser Gly Thr Gly Ser Thr Gly Asn Gln Asn Pro Gly
Ser Pro 340 345 350Arg Pro Gly Ser Thr Gly Thr Trp Asn Pro Gly Ser
Ser Glu Arg Gly 355 360 365Ser Ala Gly His Trp Thr Ser Glu Ser Ser
Val Ser Gly Ser Thr Gly 370 375 380Gln Trp His Ser Glu Ser Gly Ser
Phe Arg Pro Asp Ser Pro Gly Ser385 390 395 400Gly Asn Ala Arg Pro
Asn Asn Pro Asp Trp Gly Thr Phe Glu Glu Val 405 410 415Ser Gly Asn
Val Ser Pro Gly Thr Arg Arg Glu Tyr His Thr Glu Lys 420 425 430Leu
Val Thr Ser Lys Gly Asp Lys Glu Leu Arg Thr Gly Lys Glu Lys 435 440
445Val Thr Ser Gly Ser Thr Thr Thr Thr Arg Arg Ser Cys Ser Lys Thr
450 455 460Val Thr Lys Thr Val Ile Gly Pro Asp Gly His Lys Glu Val
Thr Lys465 470 475 480Glu Val Val Thr Ser Glu Asp Gly Ser Asp Cys
Pro Glu Ala Met Asp 485 490 495Leu Gly Thr Leu Ser Gly Ile Gly Thr
Leu Asp Gly Phe Arg His Arg 500 505 510His Pro Asp Glu Ala Ala Phe
Phe Asp Thr Ala Ser Thr Gly Lys Thr 515 520 525Phe Pro Gly Phe Phe
Ser Pro Met Leu Gly Glu Phe Val Ser Glu Thr 530 535 540Glu Ser Arg
Gly Ser Glu Ser Gly Ile Phe Thr Asn Thr Lys Glu Ser545 550 555
560Ser Ser His His Pro Gly Ile Ala Glu Phe Pro Ser Arg Gly Lys Ser
565 570 575Ser Ser Tyr Ser Lys Gln Phe Thr Ser Ser Thr Ser Tyr Asn
Arg Gly 580 585 590Asp Ser Thr Phe Glu Ser Lys Ser Tyr Lys Met Ala
Asp Glu Ala Gly 595 600 605Ser Glu Ala Asp His Glu Gly Thr His Ser
Thr Lys Arg Gly His Ala 610 615 620Lys Ser Arg Pro Val Arg Gly Ile
His Thr Ser Pro Leu Gly Lys Pro625 630 635 640Ser Leu Ser Pro8878
base pairsnucleic aciddoublelinearDNA (genomic)human fibrinogen
B-beta chainmisc_RNA 1..469exon 470..583intron 584..3257exon
3258..3449intron 3450..3938exon 3939..4122intron 4123..5042exon
5043..5270intron 5271..5830exon 5831..5944intron 5945..6632exon
6633..6758intron 6759..6966exon 6967..7252intron 7253..7870exon
7871..81023'UTR 8103..8537misc_RNA 8538..8878CDS
join(470..583, 3258..3449, 3939..4122, 5831..5944, 6633..6758,
6967..7252, 7871..8102) 3GAATTCATGC CCCTTTTGAA ATAGACTTAT
GTCATTGTCA GAAAACATAA GCATTTATGG 60TATATCATTA ATGAGTCACG ATTTTAGTGG
TTGCCTTGTG AGTAGGTCAA ATTTACTAAG 120CTTAGATTTG TTTTCTCACA
TATTCTTTCG GAGCTTGTGT AGTTTCCACA TTAATTTACC 180AGAAACAAGA
TACACACTCT CTTTGAGGAG TGCCCTAACT TCCCATCATT TTGTCCAATT
240AAATGAATTG AAGAAATTTA ATGTTTCTAA ACTAGACCAA CAAAGAATAA
TAGTTGTATG 300ACAAGTAAAT AAGCTTTGCT GGGAAGATGT TGCTTAAATG
ATAAAATGGT TCAGCCAACA 360AGTGAACCAA AAATTAAATA TTAACTAAGG
AAAGGTAACC ATTTCTGAAG TCATTCCTAG 420CAGAGGACTC AGATATATAT
AGGATTGAAG ATCTCTCAGT TAAGTCTAC ATG AAA 475 Met Lys 1AGG ATG GTT
TCT TGG AGC TTC CAC AAA CTT AAA ACC ATG AAA CAT CTA 523Arg Met Val
Ser Trp Ser Phe His Lys Leu Lys Thr Met Lys His Leu 5 10 15TTA TTG
CTA CTA TTG TGT GTT TTT CTA GTT AAG TCC CAA GGT GTC AAC 571Leu Leu
Leu Leu Leu Cys Val Phe Leu Val Lys Ser Gln Gly Val Asn 20 25 30GAC
AAT GAG GAG GTGAATTTTT TAAAGCATTA TTATATTATT AGTAGTATTA 623Asp Asn
Glu Glu 35TTAATATAAG ATGTAACATA ATCATATTAT GTGCTTATTT TAATGAAATT
AGCATTGCTT 683ATAGTTATGA AATGGAATTG TTAACCTCTG ACTTATTGTA
TTTAAAGAAT GTTTCATAGT 743ATTTCTTATA TAAAAACAAA GTAATTTCTT
GTTTTCTAGT TTATCACCTT TGTTTTCTTA 803AGATGAGGAT GGCTTAGCTA
ATGTAAGATG TGTTTTTCTC ACTTGCTATT CTGAGTACTG 863TGATTTTCAT
TTACTTCTAG CAATACAGGA TTACAATTAA GAGGACAAGA TCTGAAAATC
923TCACAAACTA TAAAATAATA AAAGAGCAGA ATTTTAAGAT AAAAGAAACT
GGTGGTAGGT 983AGATTGTTCT TTGGTGAAGG AAGGTAATAT ATATTGTTAC
TGAGATTACT ATTTATAAAA 1043ATTATAACTA AGCCTAAAAG CAAAATACAT
CAAGTGTAAT GATAGAAAAT GAAATATTGC 1103TTTTTTCAGA TGAAAAGTTC
AAATTAGAGT TAGTGTGTAT TGTTATTATT AATAGTTATG 1163AAACACGGTT
CAGTCTAATT TATTTATTTG TAGAACAGTT TGTCCTCAAC TATTATTTTT
1223GCTGACTTAT TGCTGTTAAT TTGCAGTTAC TAAAAATACA GAAATGCATT
TAGGACAATG 1283GATATTTAAG AAATTTAAAT TTTATCATCA AACGTATCAT
GGCCAAATTT CTTACATATA 1343GCATAGTATC ATTAAACTAG AAATAAGAAT
ACACAATAAT ATTTAAATGA AGTGATTCAT 1403TTCGGATCAT TATTGAGTTT
CAAGGGAACT TGAGTGTTGT ACTTATCAGA CTCTACATGT 1463AAGAACATAT
AGTTAATCTG GTTGTGTGTG TAAAAACATA TGGTTAATCT GGTTAAGTCT
1523GGTTAATCAT ATTAGGTAAG AAAAATGTAA AGAATGTGTA AGACGAAATT
TTTGTAAAGT 1583ACTCTGCAAA GCACTTTCAC ATTTCTGCTT ATCAACTAAA
CCTCACAGAG ATAGTTTAAT 1643AGTTTAGGCT TTAAAATGGA TTTTGATTAT
TCAACAAGTG GCCTTCATAA TTTCTTTAAG 1703TGTTTTTCTT TAAGTATATA
CTTTCTTTAA ATATTTTTTA AAATTTCCTT TTCTCTAGTA 1763AAGCCAGACC
ATCCATGCTA CCTCTCTAGT GGCACTCTGA AATAAAAAGA AAATAGTTTT
1823CTCTGTTATA ATTGTATTTG TAATAAGCAG ATGAATCACA TTTCTTAAAA
TTTGTTTTAG 1883AGAGGGTAAG CTCTGACTAG GACCATGACT TCAATGTGAA
ATATGTATAT ATCCTCCGAA 1943TCTTTACATA TTAAGAATGT ATATAGTCAA
CTGGTTAAAC AGGAAAATCT GGAACAGCCT 2003GGCTGGGTTT TAATCTTAGC
ACCATCCTAC TAAATGTTAA ATAATATTAT AATCTAATGA 2063ATAAATGACA
ATGCAATTCC AAATAGAGTT CATCTGATGA CTTCTAGACT CACAAAATTG
2123CAAGAGAGCT CAGTTGTTGC TCAGTTGTTC CAAATCATGT CGTTTGTTAA
TTTGTAATTA 2183AGCTCCAAAG GATGTATAGC TACTGACAAA AAAAAAAATG
AGAATGTAGT TAATCCAAAT 2243CAAAACTTTC CTATTGCAAT GCGTATTTTC
TGCTTCATTA TCCTTTAATA TAATATTTTA 2303AGTTAGCAAG TAATTTTAAT
TACAATGCAC AAGCCTTGAG AATTATTTTA AATATAAGAA 2363AATCATAATG
TTTGATAAAG AAATCATGTA AGAAATTTCA AGATAATGGT TTAACAAATA
2423ATTTTGTTGA TAGAAGATAA GACTAAAAGT GAAATTCGAA GTGGAGAGGA
CACTTAAACT 2483GTAGTACTTG TTATGTGTGA TTCCAGTAAA AATAGTAATG
AGCACTTATT ATTGCCAAGT 2543ACTGTTCTGA GGGTACCATA TGCAATAAGT
TATTTAATCC TTACAATAAT CTTGTAAGGC 2603AGATTCAAAC TATCATTACA
CTTATTTTAC AGATGAGAAA ACTGGGGCAC AGATAAAGCA 2663ACTTGCCCAA
GGTCTCATAG CTGTAAGTCA ACCCTACGGT CAAGACCTAC AAGTAGCCGA
2723GCTCCAGAGT ACATTATGAG GGTCAAAGAT TGTCTTATTA CAAATAAATT
CCAAGTAGAA 2783TCAACCTTTA ATAAGTCTTT AATGTCTCTT AAATATGTTT
ATATAGGAGT CTAATCACCA 2843ATTCACAAAA ATGAAAGTAG GGAAATGATT
AACAATAATC ATAGGAATCT AACAATCCAA 2903GTGGCTTGAG AATATTCATT
CTTCTTGACA GTATAGATTC TTTACAATTT CGTAAGTTCC 2963AATGTATGTT
TTAGGAATAT GAGGTCATTA CTATTCATAA TCTGATACAG CTTTATCCTA
3023AGGCCTCTCT TTAAAAACTA CACTGCATCA TAGCTTTTTT GTGCAGTTGG
TCTTTCTACT 3083GTTACTGAAC AGTAAGCAAC CTACAGATTC ACTATCACCA
ACCAGCCAGT TGATGGATCT 3143TAAGCAAATT ATCAAGCTTG TGATAACCTA
AATTATAAAA TGAGGGTGTT GGAATAGTTA 3203CATTCCAAAT CTTCTATAAC
ACTCTGTATT ATATTTCTGC CTCATTCCTT GTAG GGT 3260 GlyTTC TTC AGT GCC
CGT GGT CAT CGA CCC CTT GAC AAG AAG AGA GAA GAG 3308Phe Phe Ser Ala
Arg Gly His Arg Pro Leu Asp Lys Lys Arg Glu Glu 40 45 50 55GCT CCC
AGC CTG AGG CCT GCC CCA CCG CCC ATC AGT GGA GGT GGC TAT 3356Ala Pro
Ser Leu Arg Pro Ala Pro Pro Pro Ile Ser Gly Gly Gly Tyr 60 65 70CGG
GCT CGT CCA GCC AAA GCA GCT GCC ACT CAA AAG AAA GTA GAA AGA 3404Arg
Ala Arg Pro Ala Lys Ala Ala Ala Thr Gln Lys Lys Val Glu Arg 75 80
85AAA GCC CCT GAT GCT GGA GGC TGT CTT CAC GCT GAC CCA GAC CTG
3449Lys Ala Pro Asp Ala Gly Gly Cys Leu His Ala Asp Pro Asp Leu 90
95 100GTGGGTGCAC TGATGTTTCT TGCAGTGGTG GCTCTCTCAT GCAGAGAAAG
CCTGTAGTCA 3509TGGCAGTCTG CTAATGTTTC ACTGACCCAC ATTACCATCA
CTGTTATTTT GTTTGTTTAT 3569TTTGGAAATA AAATTCAAAA CATAAACATA
TTGGGCCTTT GGTTTAGGCT TTCTTTCTTG 3629TTTTCTTTGG TCTGGGCCCA
AAATTTCAAA TTAGGATATG TGGGTGCCAC CTTTCCATTT 3689GTATTTTGCC
ACTGCCTTTG TTTAGTTGGT AAAATTTTCA TAGCCCAATT ATATTTTTTC
3749TGGGGTAAGT AATATTTTAA ATCTCTATGA GAGTATGATG ATGACTTTCG
AATTTCTGGT 3809CTTACAGAAA ACCAAATAAT AAATTTTTAT GTTGGCTAAT
CGTATCGCTG AATTTTCCTA 3869TGTGCTATTT TAACAAATGT CCATGACCCA
AATCCTTCAT CTAATGCCTG CTATTTTCTT 3929TGTTTTTAG GGG GTG TTG TGT CCT
ACA GGA TGT CAG TTG CAA GAG GCT 3977 Gly Val Leu Cys Pro Thr Gly
Cys Gln Leu Gln Glu Ala 105 110 115TTG CTA CAA CAG GAA AGG CCA ATC
AGA AAT AGT GTT GAT GAG TTA AAT 4025Leu Leu Gln Gln Glu Arg Pro Ile
Arg Asn Ser Val Asp Glu Leu Asn 120 125 130AAC AAT GTG GAA GCT GTT
TCC CAG ACC TCC TCT TCT TCC TTT CAG TAC 4073Asn Asn Val Glu Ala Val
Ser Gln Thr Ser Ser Ser Ser Phe Gln Tyr 135 140 145ATG TAT TTG CTG
AAA GAC CTG TGG CAA AAG AGG CAG AAG CAA GTA AAA G 4122Met Tyr Leu
Leu Lys Asp Leu Trp Gln Lys Arg Gln Lys Gln Val Lys 150 155
160GTAGATATCC TTGTGCTTTC CATTCGATTT TCAGCTATAA AATTGGAACC
GTTAGACTGC 4182CACGAGAATG CATGGTTGTG AGAAGATTAA CATTTCTGGG
TTAGTGAATA GCATTCATAC 4242GCTTTTGGGC ACCTTCCCCT GCAACTTGCC
AGATAAGCAC TATTCAGCTC TTATTCCCAG 4302TCTGACATCA GCAAGTGTGA
TTTTCTATGA AAAATTCTAC TATGACTCCT TATTTTAAGT 4362ATACAAGAAA
CTTGTGACTC AGAAGATAAT ATTTACAGAG TGGAAAAAAA CCCCTAGCAT
4422TTATAGTTTT AACATTTGAG GTTTTGAATG AGAGAGTTAT CCATAATATA
TTCAATTGTG 4482TTGTGGATAA TGACACCTAA CCTGTGAATC TTGAGGTCAG
AATGTTGAGT GCTGTTGACT 4542TGGTGGTCAG GAAACAGCTA GTGCGTGAGC
CTGGCACAGG CATCTCAGTG AGTAGCATAC 4602CCACAGTTGG AAATTTTTCA
AAGAAATCAA AGGAATCATG ACATCTTATA AATTTCAAGG 4662TTCTGCTATA
CTTATGTGAA ATGGATAAAT AAATCAAGCA TATCCACTCT GTAAGATTGA
4722ACTTCTCAGA TGGAAGACCC CAATACTGCT TTCTCCTCTT TTCCCTCACC
AAAGAAATAA 4782ACAACCTATT TCATTTATTA CTGGACACAA TCTTTAGCGT
ATACCTATGG TAAATTACTA 4842GTATGGTGGT TAGGATTTAT GTTAATTTGT
ATATGTCATG CGCCAAATCA TTTCCACTAA 4902ATATGACTAT ATATCATAAC
TGCTTGGTGA TAGCTCAGTG TTTAATAGTT TATTCTCAGA 4962AAATCAAAAT
TGTATAGTTA AATACATTAG TTTTATGAGG CAAAAATGCT AACTATTTCT
5022ACATAATTTC ATTTTTCCAG AT AAT GAA AAT GTA GTC AAT GAG TAC TCC
5071 Asp Asn Glu Asn Val Val Asn Glu Tyr Ser 165 170TCA GAA CTG GAA
AAG CAC CAA TTA TAT ATA GAT GAG ACT GTG AAT AGC 5119Ser Glu Leu Glu
Lys His Gln Leu Tyr Ile Asp Glu Thr Val Asn Ser 175 180 185AAT ATC
CCA ACT AAC CTT CGT GTG CTT CGT TCA ATC CTG GAA AAC CTG 5167Asn Ile
Pro Thr Asn Leu Arg Val Leu Arg Ser Ile Leu Glu Asn Leu190 195 200
205AGA AGC AAA ATA CAA AAG TTA GAA TCT GAT GTC TCA GCT CAA ATG GAA
5215Arg Ser Lys Ile Gln Lys Leu Glu Ser Asp Val Ser Ala Gln Met Glu
210 215 220TAT TGT CGC ACC CCA TGC ACT GTC AGT TGC AAT ATT CCT GTG
GTG TCT 5263Tyr Cys Arg Thr Pro Cys Thr Val Ser Cys Asn Ile Pro Val
Val Ser 225 230 235GGC AAA G GTAACTGATT CATAAACATA TTTTTAGAGA
GTTCCAGAAG AACTCACACA 5320Gly LysCCAAAAATAA GAGAACAACA ACAACAACAA
AAATGCTAAG TGGATTTTCC CAACAGATCA 5380TAATGACATT ACAGTACATC
ATAAAAATAT CCTTAGCCAG TTGTGTTTTG GACTGGCCTG 5440GTGCATTTGC
TGGTTTTGAT GAGCAGGATG GGGCACAGGT AGTCCCAGGG GTGGCTGATG
5500TGTGCATCTG CGTACTGGCT TGAACAGATG GCAGAACCAC AGATAGATGT
AGAAGTTTCT 5560CCATTTTGTG TGTTCTGGGA GCTCATGGAT ATTCCAGGAC
ACAAAAGGTG GAGAAGAGCT 5620TTGTTCATCC TCTTAGCAGA TAAACGTCCT
CAAAACTGGG TTGGACTTAC TAAAGTAAAA 5680TGAAAATCTA ATATTTGTTA
TATTATTTTC AAAGGTCTAT AATAACACAC TCCTTAGTAA 5740CTTATGTAAT
GTTATTTTAA AGAATTGGTG ACTAAATACA AAGTAATTAT GTCATAAACC
5800CCTGAACATA ATGTTGTCTT ACATTTGCAG AA TGT GAG GAA ATT ATC AGG AAA
5853 Glu Cys Glu Glu Ile Ile Arg Lys 240 245GGA GGT GAA ACA TCT GAA
ATG TAT CTC ATT CAA CCT GAC AGT TCT GTC 5901Gly Gly Glu Thr Ser Glu
Met Tyr Leu Ile Gln Pro Asp Ser Ser Val 250 255 260AAA CCG TAT AGA
GTA TAC TGT GAC ATG AAT ACA GAA AAT GGA G 5944Lys Pro Tyr Arg Val
Tyr Cys Asp Met Asn Thr Glu Asn Gly 265 270 275GTAAGCTTTC
GACAGTTGTT GACCTGTTGA TCTGTAATTA TTTGGATACC GTAAAATGCC
6004AGGAAACAAG GCCAGGTGTG GTGGCTCATA CCTGTAATTC CAGCACCTTG
GGAGGCCAAA 6064GTGGGCTGAT AGCTTGAGCC TAGGAGTTTG AAACTAGCCT
GGGCAACATA ATGAGACCCT 6124AACTCTACAA AAAAAAAAAA AATACCAAAA
AAAAAAAAAA AATCAGCTGT GTTGGTAGTA 6184TGTGCCTGTA GTCCCAGCTA
TCCAGGAGGC TGAGATGGGA GATCACCTGA GCCCACAACC 6244TGGAGTCTTG
ATCATGCTAC TGAACTGTAG CCTGGGCAAC AGAGGATAGT GAGATCCTGT
6304CTCAAAAAAA AAAATTAATT AAAAAGCCAG GAAACAAGAC TTAGCTCTAA
CATCTAACAT 6364AGCTGACAAA GGAGTAATTT GATGTGGAAT TCAACCTGAT
ATTTAAAAGT TATAAAATAT 6424CTATAATTCA CAATTTGGGG TAAGATAAAG
CACTTGCAGT TTCCAAAGAT TTTACAAGTT 6484TACCTCTCAT ATTTATTTCC
TTATTGTGTC TATTTTAGAG CACCAAATAT ATACTAAATG 6544GAATGGACAG
GGGATTCAGA TATTATTTTC AAAGTGACAT TATTTGCTGT TGGTTAATAT
6604ATGCTCTTTT TGTTTCTGTC AACCAAAG GA TGG ACA GTG ATT CAG AAC CGT
6655 Gly Trp Thr Val Ile Gln Asn Arg 280 285CAA GAC GGT AGT GTT GAC
TTT GGC AGG AAA TGG GAT CCA TAT AAA CAG 6703Gln Asp Gly Ser Val Asp
Phe Gly Arg Lys Trp Asp Pro Tyr Lys Gln 290 295 300GGA TTT GGA AAT
GTT GCA ACC AAC ACA GAT GGG AAG AAT TAC TGT GGC 6751Gly Phe Gly Asn
Val Ala Thr Asn Thr Asp Gly Lys Asn Tyr Cys Gly 305 310 315CTA CCA
G GTAACGAACA GGCATGCAAA ATAAAATCAT TCTATTTGAA ATGGGATTTT 6808Leu
ProTTTTAATTAA AAAACATTCA TTGTTGGAAG CCTGTTTTAG GCAGTTAAGA
GGAGTTTCCT 6868GACAAAAATG TGGAAGCTAA AGATAAGGGA AGAAAGGCAG
TTTTTAGTTT CCCAAAATTT 6928TATTTTTGGT GAGAGATTTT ATTTTGTTTT TCTTTTAG
GT GAA TAT TGG CTT 6980 Gly Glu Tyr Trp Leu 320GGA AAT GAT AAA ATT
AGC CAG CTT ACC AGG ATG GGA CCC ACA GAA CTT 7028Gly Asn Asp Lys Ile
Ser Gln Leu Thr Arg Met Gly Pro Thr Glu Leu325 330 335 340TTG ATA
GAA ATG GAG GAC TGG AAA GGA GAC AAA GTA AAG GCT CAC TAT 7076Leu Ile
Glu Met Glu Asp Trp Lys Gly Asp Lys Val Lys Ala His Tyr 345 350
355GGA GGA TTC ACT GTA CAG AAT GAA GCC AAC AAA TAC CAG ATC TCA GTG
7124Gly Gly Phe Thr Val Gln Asn Glu Ala Asn Lys Tyr Gln Ile Ser Val
360 365 370AAC AAA TAC AGA GGA ACA GCC GGT AAT GCC CTC ATG GAT GGA
GCA TCT 7172Asn Lys Tyr Arg Gly Thr Ala Gly Asn Ala Leu Met Asp Gly
Ala Ser 375 380 385CAG CTG ATG GGA GAA AAC AGG ACC ATG ACC ATT CAC
AAC GGC ATG TTC 7220Gln Leu Met Gly Glu Asn Arg Thr Met Thr Ile His
Asn Gly Met Phe 390 395 400TTC AGC ACG TAT GAC AGA GAC AAT GAC GGC
TG GTATGTGTGG 7262Phe Ser Thr Tyr Asp Arg Asp Asn Asp Gly Trp405
410 415CACTCTTTGC TCCTGCTTTA AAAATCACAC TAATATCATT ACTCAGAATC
ATTAACAATA 7322TTTTTAATAG CTACCACTTC CTGGGCACTT ACTGTCAGCC
ACTGTCCTAA GCTCTTTATG 7382CATCACTCGA AAGCATTTCA ACTATAAGGT
AGACATTCTT ATTCTCATTT TACAGATGAG 7442ATTTAGAGAG ATTACGTGAT
TTGTCCAATG TCACACAACT ACCCAGAGAT AAAACTAGAA 7502TTTGAGCACA
GTTACTTTCT GAATAATGAG CATTTAGATA AATACCTATA TCTCTATATT
7562CTAAAGTGTG TGTGAAAACT TTCATTTTCA TTTCCAGGGT TCTCTGATAC
TAAGGGTTGT 7622AAAAGCTATT ATTCCAGTAT AAAGTAACAA ACACAGTCCC
TAGATGGATT GCCACAAAGG 7682CCCAGTTATC TCTCTTTCTT GCTATAGGGC
ACAGGAGGTC TTTGGTGTAT TAGTGTGACT 7742CTATGTATAG CACCCAAAGG
AAAGACTACT GTGCACACGA GTGTAGCAGT CTTTTATGGG 7802TAATCTGCAA
AACGTAACTT GACCACCGTA GTTCTGTTTC TAATAACGCC AAACACATTT 7862TCTTTCAG
G TTA ACA TCA GAT CCC AGA AAA CAG TGT TCT AAA GAA GAC 7910 Leu Thr
Ser Asp Pro Arg Lys Gln Cys Ser Lys Glu Asp 420 425GGT GGT GGA TGG
TGG TAT AAT AGA TGT CAT GCA GCC AAT CCA AAC GGC 7958Gly Gly Gly Trp
Trp Tyr Asn Arg Cys His Ala Ala Asn Pro Asn Gly 430 435 440AGA TAC
TAC TGG GGT GGA CAG TAC ACC TGG GAC ATG GCA AAG CAT GGC 8006Arg Tyr
Tyr Trp Gly Gly Gln Tyr Thr Trp Asp Met Ala Lys His Gly445 450 455
460ACA GAT GAT GGT GTA GTA TGG ATG AAT TGG AAG GGG TCA TGG TAC TCA
8054Thr Asp Asp Gly Val Val Trp Met Asn Trp Lys Gly Ser Trp Tyr Ser
465 470 475ATG AGG AAG ATG AGT ATG AAG ATC AGG CCC TTC TTC CCA CAG
CAA 8099Met Arg Lys Met Ser Met Lys Ile Arg Pro Phe Phe Pro Gln Gln
480 485 490TAGTCCCCAA TACGTAGATT TTTGCTCTTC TGTATGTGAC AACATTTTTG
TACATTATGT 8159TATTGGAATT TTCTTTCATA CATTATATTC CTCTAAAACT
CTCAAGCAGA CGTGAGTGTG 8219ACTTTTTGAA AAAAGTATAG GATAAATTAC
ATTAAAATAG CACATGATTT TCTTTTGTTT 8279TCTTCATTTC TCTTGCTCAC
CCAAGAAGTA ACAAAAGTAT AGTTTTGACA GAGTTGGTGT 8339TCATAATTTC
AGTTCTAGTT GATTGCGAGA ATTTTCAAAT AAGGAAGAGG GGTCTTTTAT
8399CCTTGTCGTA GGAAAACCAT GACGGAAAGG AAAAACTGAT GTTTAAAAGT
CCACTTTTAA 8459AACTATATTT ATTTATGTAG GATCTGTCAA AGAAAACTTC
CAAAAAGATT TATTAATTAA 8519ACCAGACTCT GTTGCAATAA GTTAATGTTT
TCTTGTTTTG TAATCCACAC ATTCAATGAG 8579TTAGGCTTTG CACTTGTAAG
GAAGGAGAAG CGTTCACAAC CTCAAATAGC TAATAAACCG 8639GTCTTGAATA
TTTGAAGATT TAAAATCTGA CTCTAGGACG GGCACGGTGG CTCACGACTA
8699TAATCCCAAC ACTTTGGGAG GCTGAGGCGG GCGGTCACAA GGTCAGGAGT
TCAAGACCAG 8759CCTGACCAAT ATGGTGAAAC CCCATCTCTA CTAAAAATAC
AAAAATTAGC CAGGCGTGGT 8819GGCAGGTGCC TGTAGGTCCC AGCTAGCCTG
TGAGGTGGAG ATTGCATTGA GCCAAGATC 8878491 amino acidsamino
acidlinearprotein 4Met Lys Arg Met Val Ser Trp Ser Phe His Lys Leu
Lys Thr Met Lys 1 5 10 15His Leu Leu Leu Leu Leu Leu Cys Val Phe
Leu Val Lys Ser Gln Gly 20 25 30Val Asn Asp Asn Glu Glu Gly Phe Phe
Ser Ala Arg Gly His Arg Pro 35 40 45Leu Asp Lys Lys Arg Glu Glu Ala
Pro Ser Leu Arg Pro Ala Pro Pro 50 55 60Pro Ile Ser Gly Gly Gly Tyr
Arg Ala Arg Pro Ala Lys Ala Ala Ala 65 70 75 80Thr Gln Lys Lys Val
Glu Arg Lys Ala Pro Asp Ala Gly Gly Cys Leu 85 90 95His Ala Asp Pro
Asp Leu Gly Val Leu Cys Pro Thr Gly Cys Gln Leu 100 105 110Gln Glu
Ala Leu Leu Gln Gln Glu Arg Pro Ile Arg Asn Ser Val Asp 115 120
125Glu Leu Asn Asn Asn Val Glu Ala Val Ser Gln Thr Ser Ser Ser Ser
130 135 140Phe Gln Tyr Met Tyr Leu Leu Lys Asp Leu Trp Gln Lys Arg
Gln Lys145 150 155 160Gln Val Lys Asp Asn Glu Asn Val Val Asn Glu
Tyr Ser Ser Glu Leu 165 170 175Glu Lys His Gln Leu Tyr Ile Asp Glu
Thr Val Asn Ser Asn Ile Pro 180 185 190Thr Asn Leu Arg Val Leu Arg
Ser Ile Leu Glu Asn Leu Arg Ser Lys 195 200 205Ile Gln Lys Leu Glu
Ser Asp Val Ser Ala Gln Met Glu Tyr Cys Arg 210 215 220Thr Pro Cys
Thr Val Ser Cys Asn Ile Pro Val Val Ser Gly Lys Glu225 230 235
240Cys Glu Glu Ile Ile Arg Lys Gly Gly Glu Thr Ser Glu Met Tyr Leu
245 250 255Ile Gln Pro Asp Ser Ser Val Lys Pro Tyr Arg Val Tyr Cys
Asp Met 260 265 270Asn Thr Glu Asn Gly Gly Trp Thr Val Ile Gln Asn
Arg Gln Asp Gly 275 280 285Ser Val Asp Phe Gly Arg Lys Trp Asp Pro
Tyr Lys Gln Gly Phe Gly 290 295 300Asn Val Ala Thr Asn Thr Asp Gly
Lys Asn Tyr Cys Gly Leu Pro Gly305 310 315 320Glu Tyr Trp Leu Gly
Asn Asp Lys Ile Ser Gln Leu Thr Arg Met Gly 325 330 335Pro Thr Glu
Leu Leu Ile Glu Met Glu Asp Trp Lys Gly Asp Lys Val 340 345 350Lys
Ala His Tyr Gly Gly Phe Thr Val Gln Asn Glu Ala Asn Lys Tyr 355 360
365Gln Ile Ser Val Asn Lys Tyr Arg Gly Thr Ala Gly Asn Ala Leu Met
370
375 380Asp Gly Ala Ser Gln Leu Met Gly Glu Asn Arg Thr Met Thr Ile
His385 390 395 400Asn Gly Met Phe Phe Ser Thr Tyr Asp Arg Asp Asn
Asp Gly Trp Leu 405 410 415Thr Ser Asp Pro Arg Lys Gln Cys Ser Lys
Glu Asp Gly Gly Gly Trp 420 425 430Trp Tyr Asn Arg Cys His Ala Ala
Asn Pro Asn Gly Arg Tyr Tyr Trp 435 440 445Gly Gly Gln Tyr Thr Trp
Asp Met Ala Lys His Gly Thr Asp Asp Gly 450 455 460Val Val Trp Met
Asn Trp Lys Gly Ser Trp Tyr Ser Met Arg Lys Met465 470 475 480Ser
Met Lys Ile Arg Pro Phe Phe Pro Gln Gln 485 49010564 base
pairsnucleic aciddoublelinearDNA (genomic)human fibrinogen gamma
chainCDS join(1799..1876, 1973..2017, 2207..2390, 2510 ..2603,
4211..4341, 4645..4778, 5758..5942, 7426 ..7703, 9342..9571)
5CTACACACTT CTTGAAGGCA AAGGCAATGC TGAAGTCACC TTTCATGTTC AAATCATATT
60AAAAAGTTAG CAAGATGTAA TTATCAGTGT ACTATGTAAA TCTTTGTGAA TGATCAATAA
120TTACATATTT TCATTATATA TATTTTAGTA GATAATATTT ATATACATTC
AACATTCTAA 180ATATAGAAAG TTTACAGAGA AAAATAAAGC CTTTTTTTCC
AATCCTGTCC TCCACCTCTG 240CATCCCATTC TTCTTCACAG AGGCAACTGA
TTCAAGTCAT TACATAGTTA TTGAGTGTTA 300ACTACAACTA TGTTAAGTAC
AGCTATATAT GTTAGATGCC GTAGCCACAG AAATCAGTTT 360ACAATCTAAT
GCAGTGGATA CAGCATGTAT ACATATAATA TAAGGTTGCT ACAAATGCTA
420TCTGAGGTAG AGCTGTTTGA AAGAATACTA ATACTTAAAT GTTTAATTCA
ACTGACTTGA 480TTGACAACTG ATTAGCTGAG TGGAAAAGAT GGATGAGAAA
GATTGTGAGA CTTAATTGGC 540TGGTGGTATG GTGATATGAT TGACAATAAC
TGCTAAGTCA GAGAGGGATA TATTAAGGAG 600GAGAAGAAAA GCAACAAATC
TGGTTTTGAT GTGTTCACTT TGTTATAATT ATTGATTATT 660TACTGAATAT
GAATATTTAT CTTTGTTTTT GAGTCAATAA ATATACCTTT GTAAAGACAG
720AATTAAAGTA TTAGTATTTC TTTCAAACTG GAGGCATTTC TCCCACTAAC
ATATTTCATC 780AAAACTTATA ATAAGCTTGG TTCCAGAGGA AGAAATGAGG
GATAACCAAA AATAGAGACA 840TTAATAATAG TGTAACGCCC AGTGATAAAT
CTCAATAGGC AGTGATGACA GACATGTTTT 900CCCAAACACA AGGATGCTGT
AAGGGCCAAA CAGAAATGAT GGCCCCTCCC CAGCACCTCA 960TTTTGCCCCT
TCCTTCAGCT ATGCCTCTAC TCTCCTTTAG ATACAAGGGA GGTGGATTTT
1020TCTCTTCTCT GAGATAGCTT GATGGAACCA CAGGAACAAT GAAGTGGGCT
CCTGGCTCTT 1080TTCTCTGTGG CAGATGGGGT GCCATGCCCA CCTTCAGACA
AAGGGAAGAT TGAGCTCAAA 1140AGCTCCCTGA GAAGTGAGAG CCTATGAACA
TGGTTGACAC AGAGGGACAG GAATGTATTT 1200CCAGGGTCAT TCATTCCTGG
GAATAGTGAA CTGGGACATG GGGGAAGTCA GTCTCCTCCT 1260GCCACAGCCA
CAGATTAAAA ATAATAATGT TAACTGATCC CTAGGCTAAA ATAATAGTGT
1320TAACTGATCC CTAAGCTAAG AAAGTTCTTT TGGTAATTCA GGTGATGGCA
GCAGGACCCA 1380TCTTAAGGAT AGACTAGGTT TGCTTAGTTC GAGGTCATAT
CTGTTTGCTC TCAGCCATGT 1440ACTGGAAGAA GTTGCATCAC ACAGCCTCCA
GGACTGCCCT CCTCCTCACA GCAATGGATA 1500ATGCTTCACT AGCCTTTGCA
GATAATTTTG GATCAGAGAA AAAACCTTGA GCTGGGCCAA 1560AAAGGAGGAG
CTTCAACCTG TGTGCAAAAT CTGGGAACCT GACAGTATAG GTTGGGGGCC
1620AGGATGAGGA AAAAGGAACG GGAAAGACCT GCCCACCCTT CTGGTAAGGA
GGCCCCGTGA 1680TCAGCTCCAG CCATTTGCAG TCCTGGCTAT CCCAGGAGCT
TACATAAAGG GACAATTGGA 1740GCCTGAGAGG TGACAGTGCT GACACTACAA
GGCTCGGAGC TCCGGGCACT CAGACATC 1798ATG AGT TGG TCC TTG CAC CCC CGG
AAT TTA ATT CTC TAC TTC TAT GCT 1846Met Ser Trp Ser Leu His Pro Arg
Asn Leu Ile Leu Tyr Phe Tyr Ala 1 5 10 15CTT TTA TTT CTC TCT TCA
ACA TGT GTA GCA GTAAGTGTGC TCTTCACAAA 1896Leu Leu Phe Leu Ser Ser
Thr Cys Val Ala 20 25ACGTTGTTTA AAATGGAAAG CTGGAAAATA AAACAGATAA
TAAACTAGTG AAATTTTCGT 1956ATTTTTTCTC TTTTAG TAT GTT GCT ACC AGA GAC
AAC TGC TGC ATC TTA 2005 Tyr Val Ala Thr Arg Asp Asn Cys Cys Ile
Leu 30 35GAT GAA AGA TTC GTAAGTAGTT TTTATGTTTC TCCCTTTGTG
TGTGAACTGG 2057Asp Glu Arg Phe 40AGAGGGGCAG AGGAATAGAA ATAATTCCCT
CATAAATATC ATCTGGCACT TGTAACTTTT 2117TAAAAACATA GTCTAGGTTT
TACCTATTTT TCTTAATAGA TTTTAAGAGT AGCATCTGTC 2177TACATTTTTA
ATCACTGTTA TATTTTCAG GGT AGT TAT TGT CCA ACT ACC TGT 2230 Gly Ser
Tyr Cys Pro Thr Thr Cys 45GGC ATT GCA GAT TTC CTG TCT ACT TAT CAA
ACC AAA GTA GAC AAG GAT 2278Gly Ile Ala Asp Phe Leu Ser Thr Tyr Gln
Thr Lys Val Asp Lys Asp 50 55 60 65CTA CAG TCT TTG GAA GAC ATC TTA
CAT CAA GTT GAA AAC AAA ACA TCA 2326Leu Gln Ser Leu Glu Asp Ile Leu
His Gln Val Glu Asn Lys Thr Ser 70 75 80GAA GTC AAA CAG CTG ATA AAA
GCA ATC CAA CTC ACT TAT AAT CCT GAT 2374Glu Val Lys Gln Leu Ile Lys
Ala Ile Gln Leu Thr Tyr Asn Pro Asp 85 90 95GAA TCA TCA AAA CCA A
GTGAGAAAAT AAAGACTACT GACCAAAAAA 2420Glu Ser Ser Lys Pro
100TAATAATAAT AATCTGTGAA GTTCTTTTGC TGTTGTTTTA GTTGTTCTAT
TTGCTTAAGG 2480ATTTTTATGT CTCTGATCCT ATATTACAG AT ATG ATA GAC GCT
GCT ACT TTG 2532 Asn Met Ile Asp Ala Ala Thr Leu 105 110AAG TCC AGG
ATA ATG TTA GAA GAA ATT ATG AAA TAT GAA GCA TCG ATT 2580Lys Ser Arg
Ile Met Leu Glu Glu Ile Met Lys Tyr Glu Ala Ser Ile 115 120 125TTA
ACA CAT GAC TCA AGT ATT CG GTAAGGATTT TTGTTTTAAT TTGCTCTGCA 2633Leu
Thr His Asp Ser Ser Ile Arg 130AGACTGATTT AGTTTTTATT TAATATTCTA
TACTTGAGTG AAAGTAATTT TTAATGTGTT 2693TTCCCCATTT ATAATATCCC
AGTGACATTA TGCCTGATTA TGTTGAGCAT AGTAGAGATA 2753GAAGTTTTTA
GTGCAATATA AATTATACTG GGTTATAATT GCTTATTAAT AATCACATTG
2813AAGAAAGATG TTCTAGATGT CTTCAAATGC TAGTTTGACC ATATTTATCA
AAAATTTTTT 2873CCCCATCCCC CATTTATCTT ACAACATAAA ATCAATCTCA
TAGGAATTTG GGTGTTGAAA 2933ATAAAATCCT CTTTATAAAA ATGCTGACAA
ATTGGTGGTT AAAAAAATTA GCAAGCAGAG 2993GCATAGTAAG GATTTTGGCT
CCTAAAGTAA ATTATATTGA ATGTGGAGCA GGAAGAAACA 3053TGTCTTGAGA
GACTAAGTGT GGCAAATATT GCAAAGCTCA TATTGATCAT TGCAGAATGA
3113ACCTGCATAG TCTCTTCCCT TCATTTGGAA GTGAATGTCT CTGTTAAAGC
TTCTCAGGGA 3173CTCATAAACT TTCTGAACAT AAGGTCTCAG ATACAGTTTT
AATATTTTTC CCCAATTTTT 3233TTTTCTGAAT TTTTCTCAAA GCAGCTTGAG
AAATTGAGAT AAATAGTAGC TAGGGAGAAG 3293TGGCCCAGGA AAGATTTCTC
CTCTTTTTGC TATCAGAGGG CCCTTGTTAT TATTGTTATT 3353ATTATTACTT
GCATTATTAT TGTCCATCAT TGAAGTTGAA GGAGGTTATT GTACAGAAAT
3413TGCCTAAGAC AAGGTAGAGG GAAAACGTGG ACAAATAGTT TGTCTACCCT
TTTTTACTTC 3473AAAGAAAGAA CGGTTTATGC ATTGTAGACA GTTTTCTATC
ATTTTTGGAT ATTTGCAAGC 3533CACCCTGTAA GTAACTACAA AAGGAGGGTT
TTTACTTCCC CCAGTCCATT CCCAAAGCTA 3593TGTAACCAGA AGCATTAAAG
AAGAAAGGGG AAGTATCTGT TGTTTTATTT TACATACAAT 3653AACGTTCCAG
ATCATGTCCC TGTGTAAGTT ATATTTTAGA TTGAAGCTTA TATGTATAGC
3713CTCAGTAGAT CCACAAGTGA AAGGTATACT CCTTCAGCAC ATGTGAATTA
CTGAACTGAG 3773CTTTTCCTGC TTCTAAAGCA TCAGGGGGTG TTCCTATTAA
CCAGTCTCGC CACTCTTGCA 3833GGTTGCTATC TGCTGTCCCT TATGCATAAA
GTAAAAAGCA AAATGTCAAT GACATTTGCT 3893TATTGACAAG GACTTTGTTA
TTTGTGTTGG GAGTTGAGAC AATATGCCCC ATTCTAAGTA 3953AAAAGATTCA
GGTCCACATT GTATTCCTGT TTTAATTGAT TTTTTGATTT GTTTTTCTTT
4013TTCAAAAAGT TTATAATTTT AATTCATGTT AATTTAGTAA TATAATTTTA
CATTTTCCTC 4073AAGAATGGAA TAATTTATCA GAAAGCACTT CTTAAGAAAA
TACTTAGCAG TTTCCAAAGA 4133AAATATAAAA TTACTCTTCT GAAAGGAATA
CTTATTTTTG TCTTCTTATT TTTGTTATCT 4193TATGTTTCTG TTTGTAG A TAT TTG
CAG GAA ATA TAT AAT TCA AAT AAT CAA 4244 Tyr Leu Gln Glu Ile Tyr
Asn Ser Asn Asn Gln 135 140 145AAG ATT GTT AAC CTG AAA GAG AAG GTA
GCC CAG CTT GAA GCA CAG TGC 4292Lys Ile Val Asn Leu Lys Glu Lys Val
Ala Gln Leu Glu Ala Gln Cys 150 155 160CAG GAA CCT TGC AAA GAC ACG
GTG CAA ATC CAT GAT ATC ACT GGG AAA G 4341Gln Glu Pro Cys Lys Asp
Thr Val Gln Ile His Asp Ile Thr Gly Lys 165 170 175GTAACTGATG
AAGGTTATAT TGGGATTAGG TTCATCAAAG TAAGTAATGT AAAGGAGAAA
4401GTATGTACTG GAAAGTATAG GAATAGTTTA GAAAGTGGCT ACCCATTAAG
TCTAAGAATT 4461TCAGTTGTCT AGACCTTTCT TGAATAGCTA AAAAAAACAG
TTTAAAAGGA ATGCTGATGT 4521GAAAAGTAAG AAAATTATTC TTGGAAAATG
AATAGTTTAC TACATGTTAA AAGCTATTTT 4581TCAAGGCTGG CACAGTCTTA
CCTGCATTTC AAACCACAGT AAAAGTCGAT TCTCCTTCTC 4641TAG AT TGT CAA GAC
ATT GCC AAT AAG GGA GCT AAA CAG AGC GGG CTT 4688 Asp Cys Gln Asp
Ile Ala Asn Lys Gly Ala Lys Gln Ser Gly Leu 180 185 190TAC TTT ATT
AAA CCT CTG AAA GCT AAC CAG CAA TTC TTA GTC TAC TGT 4736Tyr Phe Ile
Lys Pro Leu Lys Ala Asn Gln Gln Phe Leu Val Tyr Cys 195 200 205GAA
ATC GAT GGG TCT GGA AAT GGA TGG ACT GTG TTT CAG AAG 4778Glu Ile Asp
Gly Ser Gly Asn Gly Trp Thr Val Phe Gln Lys 210 215 220GTAATTTTTT
CCCCACCATG TGTATTTAAT AAATTCCTAC ATTGTTTCTG CCATATGGCA
4838GATACTTTTC TAAGCACCTT GTGAACCGTA GCTCATTTAA TCCTTGCAAT
AGCCCTAAGA 4898GGAAGGTACT TCTGTTACTC CTATTTACAG AAAAGGAAAC
TGAGGCACAC AAGGTTAAAT 4958AACTTGCCCA AGACCACATA ACTAATAAGC
AACAGAGTCA GCATTTGAAC CTAGGCAGTA 5018TAGTTTCAGA GTTTGTGACT
TGACTCTATA TTGTACTGGC ACTGACTTTG TAGATTCATG 5078GTGGCACATA
ATCATAGTAC CACAGTGACA AATAAAAAGA AGGAAACTCT TTTGTCAGGT
5138AGGTCAAGAC CTGAGGTTTC CCATCACAAG ATGAGGAAGC CCAACACCAC
CCCCCACCAC 5198CCCACCACCA TCACCACCCT TTCACACACC AGAGGATACA
CTTGGGCTGC TCCAAGACAA 5258GGAACCTGTG TTGCATCTGC CACTTGCTGA
TACCCACTAG GAATCTTGGC TCCTTTACTT 5318TCTGTTTACC TCCCACCACT
GTTATAACTG TTTCTACAGG GGGCGCTCAG AGGGAATGAA 5378TGGTGGAAGC
ATTAGTTGCC AGACACCGAT TGAGCAATGG GTTCCATCAT AAGTGTAAGA
5438ATCAGTAATA TCCAGCTAGA GTTCTGAAGT CGTCTAGGTG TCTTTTTAAT
ATTACCACTC 5498ATTTAGAATT TATGATGTGC CAGAAACCCT CTTAAGTATT
TCTCTTATAT TCTCTCTCAT 5558GATCCTTGCA GCAACCCTAA GAAGTAACCA
TCATTTTTCC TATTTGATAC ATGAGGAAAC 5618TGAGGTAGCT TGGCCAAGAT
CACTTAGTTG GGAGTTGATA GAACCAGTGC TCTGTATTTT 5678TGACAAAATG
TTGACAGCAT TCTCTTTACA TGCATTGATA GTCTATTTTC TCCTTTTGCT
5738CTTGCAAATG TGTAATTAG AGA CTT GAT GGC AGT GTA GAT TTC AAG AAA
AAC 5790 Arg Leu Asp Gly Ser Val Asp Phe Lys Lys Asn 225 230TGG ATT
CAA TAT AAA GAA GGA TTT GGA CAT CTG TCT CCT ACT GGC ACA 5838Trp Ile
Gln Tyr Lys Glu Gly Phe Gly His Leu Ser Pro Thr Gly Thr 235 240
245ACA GAA TTT TGG CTG GGA AAT GAG AAG ATT CAT TTG ATA AGC ACA CAG
5886Thr Glu Phe Trp Leu Gly Asn Glu Lys Ile His Leu Ile Ser Thr
Gln250 255 260 265TCT GCC ATC CCA TAT GCA TTA AGA GTG GAA CTG GAA
GAC TGG AAT GGC 5934Ser Ala Ile Pro Tyr Ala Leu Arg Val Glu Leu Glu
Asp Trp Asn Gly 270 275 280AGA ACC AG GTACTGTTTT GAAATGACTT
CCAACTTTTT ATTGTAAAGA 5982Arg Thr SerTTGCCTGGAA TGTGCACTTT
CCAACTATCA ATAGACAATG GCAAATGCAG CCTGACAAAT 6042GCAAACAGCA
CATCCAGCCA CCATTTTCTC CAGGAGTCTG TTTGGTTCTT GGGCAATCCA
6102AAAAGGTAAA TTCTATTCAG GATGAATCTA AGTGTATTGG TACAATCTAA
TTACCCTGGA 6162ACCATTCAGA GTAATAGCTA ATTACTGAAC TTTTAATCAG
TCCCAGGAAT TGAGCATAAA 6222ATTATAATTT TATCTAGTCT AAATTACTAT
TTCATGAAGC AGGTATTATT ATTAATCCCA 6282TTTTATAGAT TAACTTGCTC
AAAGTCACAT TGCTGATAAG TGGTAGAGGT AGAATTCAGA 6342CTCAAGTAGT
TTAACTTTAG AGCCTGTCCT CTTAACAACT ATCCTGGTTG AAAAGCAAAT
6402ACAGCCTCTT CAGACTTCTC AGTGCCTTGA TGGCCATTTA TTCTGTCAAA
TCATGAGCTA 6462CCCTAAAAGT AAACCAGCTA GCTCTTTTGA TGATCTAGAG
GCTTCTTTTT GCTTGAGATA 6522TTTGAAGGTT TTAAGCATTG TTACCTAATT
AAAATGCAGA AAAATATCCA ACCCTCTTGT 6582TATGTTTAAG GAATAGTGAA
ATATATTGTC TTCAAACACA TGGACTTTTT TTTATTGCTT 6642GGTTGGTTTT
TAATCCAGAA AGTGCTATAG TCAGTAGACC TTCTTCTAGG AAAGGACCTT
6702CCATTTCCCA GCCACTGGAG ATTAGAAAAT AAGCTAAATA TTTTCTGGAA
ATTTCTGTTC 6762ATTCATTAAG GCCCATCCTT TCCCCCACTC TATAGAAGTG
TTGTCCACTT GCACAATTTT 6822TTCCAGGAAA GAATCTCTCT AACTCCTTCA
GCTCACATGC TTTGGACCAC ACAGGGAAGA 6882CTTTGATTGT GTAATGCCCT
CAGAAGCTCT CCTTCTTGCC ACTACCACAC TGATTTGAGG 6942AAGAAAATCC
CTTTAGCACC TAACCCTTCA GGTGCTATGA GTGGCTAATG GAACTGTACC
7002TCCTTCAAGT TTTGTGCAAT AATTAAGGGT CACTCACTGT CAGATACTTT
CTGTGATCTA 7062TGATAATGTG TGTGCAACAC ATAACATTTC AATAAAAGTA
GAAAATATGA AATTAGAGTC 7122ATCTACACAT CTGGATTTGA TCTTAGAATG
AAACAAGCAA AAAAGCATCC AAGTGAGTGC 7182AATTATTAGT TTTCAGAGAT
GCTTCAAAGG CTTCTAGGCC CATCCCGGGA AGTGTTAATG 7242AGCTGTGGAC
TGGTTCACAT ATCTATTGCC TCTTGCCAGA TTTGCAAAAA ACTTCACTCA
7302ATGAGCAAAT TTCAGCCTTA AGAAACAAAG TCAAAAATTC CAAGGAAGCA
TCCTACGAAA 7362GAGGGAACTT CTGAGATCCC TGAGGAGGGT CAGCATGTGA
TGGTTGTATT TCCTTCTTCT 7422CAG T ACT GCA GAC TAT GCC ATG TTC AAG GTG
GGA CCT GAA GCT GAC 7468 Thr Ala Asp Tyr Ala Met Phe Lys Val Gly
Pro Glu Ala Asp 285 290 295AAG TAC CGC CTA ACA TAT GCC TAC TTC GCT
GGT GGG GAT GCT GGA GAT 7516Lys Tyr Arg Leu Thr Tyr Ala Tyr Phe Ala
Gly Gly Asp Ala Gly Asp 300 305 310GCC TTT GAT GGC TTT GAT TTT GGC
GAT GAT CCT AGT GAC AAG TTT TTC 7564Ala Phe Asp Gly Phe Asp Phe Gly
Asp Asp Pro Ser Asp Lys Phe Phe315 320 325 330ACA TCC CAT AAT GGC
ATG CAG TTC AGT ACC TGG GAC AAT GAC AAT GAT 7612Thr Ser His Asn Gly
Met Gln Phe Ser Thr Trp Asp Asn Asp Asn Asp 335 340 345AAG TTT GAA
GGC AAC TGT GCT GAA CAG GAT GGA TCT GGT TGG TGG ATG 7660Lys Phe Glu
Gly Asn Cys Ala Glu Gln Asp Gly Ser Gly Trp Trp Met 350 355 360AAC
AAG TGT CAC GCT GGC CAT CTC AAT GGA GTT TAT TAC CAA G 7703Asn Lys
Cys His Ala Gly His Leu Asn Gly Val Tyr Tyr Gln 365 370
375GTATGTTTTC CTTTCTTAGA TTCCAAGTTA ATGTATAGTG TATACTATTT
TCATAAAAAA 7763TAATAAATAG ATATGAAGAA ATGAAGAATA ATTTATAAAG
ATAGTAGGGA TTTTATCATG 7823TTCTTTATTT CAACTAAGTT CTTTGAAACT
GGAAGTGGAT AATACCAAGT TCATGCCTAA 7883AATTAGCCCT TCTAAAGAAA
TCCACCTGCT GCAAAATATC CAGTAGTTTG GCATTATATG 7943TGAAACTATC
ACCATCATAG CTGGCACTGT GGGTTGTGGG ATCTCCTTTA GACATACAAC
8003ATAAATGATC TGGATGGATT AACATTACTA CATGGATGCT TGTTGACACA
TTAACCTGGC 8063TTCCCATGAG CTTTGTGTCA GATACACGCA GTGAACAGGT
GTTTGGAGGA ACAGAATAAA 8123GAGAAGGCAA GCACTGGTAA GGGCAGGGGT
TTGTGAAAGC TTGAGAGAAG AGACCAGTCT 8183GAGGACAGTA GACACTTATT
TTAGGATGGG GGTTGGATGA GGAGGCTATA GTTTGCTATA 8243AGCTTGGAAT
GGTTTGGAAC ACTGGTTTCA CTCACCTACC CAGCAGTTAT GTGTGGGGAA
8303GCCTTACCGA TGCTAAAGGA TCCATGTTAC AATAATGGCA TTATTTGGAA
ATCCCAGTGG 8363TATTCCATGA ATAAAACCAC TATGAAGATA ATCCCACTCA
ACAGACTCTC CGTTGGAGAA 8423GGACAGCAAC ACCACCCTGG GAAAGCCAAA
CAGTCAGACC AGACCTGTTT AGCATCAGTA 8483GGACTTCCCT ACCATATCTG
CTGGGTAGAT GAGTGAAACC AGTGTTCCAA ACCACTCCGG 8543GCTTGTAGCA
AACCATAGTC TCCTCATCTA CCAAGATGAG CAACCTTACC TCCTGATGTC
8603CTAGCCAATC ACCAACTAGG AAACTTTGCA CAGTTTATTT AAAGTAACAG
TTTGATTTTC 8663ACAATATTTT TAAATTGGAG AAACATAACT TATCTTTGCA
CTCACAAACC ACATAATGAG 8723AAGAAACTCT AAGGGAAAAT GCTTGATCTG
TGTGACCCGG GGCGCCATGC CAGAGCTGTA 8783GTTCATGCCA GTGTTGTGCT
CTGACAAGCC TTTTACAGAA TTACATGAGA TCTGCTTCCC 8843TAGGACAAGG
AGAAGGCAAA TCAACAGAGG CTGCACTTTA AAATGGAGAC ATAAAATAAC
8903ATGCCAGAAC CATTTCCTAA AGCTCCTCAA TCAACCAACA AAATTGTGCT
TTCAAATAAC 8963CTGAGTTGAC CTCATCAGGA ATTTTGTGGC TCCTTCTCTT
CTAACCTGCC TGAAGAAAGA 9023TGGTCCACAG CAGCTGAGTC CGGGATGGAT
AAGCTTAGGG ACAGAGGCCA ATTAGGGAAC 9083TTTGGGTTTC TAGCCCTACT
AGTAGTGAAT AAATTTAAAG TGTGGATGTG ACTATGAGTC 9143ACAGCACAGA
TGTTGTTTAA TAATATGTTT ATTTTATAAA TTGATATTTT AGGAATCTTT
9203GGAGATATTT TCAGTTAGCA GATAATACTA TAAATTTTAT GTAACTGGCA
ATGCACTTCG 9263TAATAGACAG CTCTTCATAG ACTTGCAGAG GTAAAAAGAT
TCCAGAATAA TGATATGTAC 9323ATCTACGACT TGTTTTAG GT GGC ACT TAC TCA
AAA GCA TCT ACT CCT AAT 9373 Gly Gly Thr Tyr Ser Lys Ala Ser Thr
Pro Asn 380 385GGT TAT GAT AAT GGC ATT ATT TGG GCC ACT TGG AAA ACC
CGG TGG TAT 9421Gly Tyr Asp Asn Gly Ile Ile Trp Ala Thr Trp Lys Thr
Arg Trp Tyr 390 395 400TCC ATG AAG AAA ACC ACT ATG AAG ATA ATC CCA
TTC AAC AGA CTC ACA 9469Ser Met Lys Lys Thr Thr Met Lys Ile Ile Pro
Phe Asn Arg Leu Thr 405 410 415ATT GGA GAA GGA CAG CAA CAC CAC CTG
GGG GGA GCC AAA CAG GTC AGA 9517Ile Gly Glu Gly Gln Gln His His Leu
Gly Gly Ala Lys Gln Val Arg420 425 430 435CCA GAG CAC CCT GCG GAA
ACA GAA TAT GAC TCA CTT TAC CCT GAG GAT 9565Pro Glu His Pro Ala Glu
Thr Glu Tyr Asp Ser Leu Tyr Pro Glu Asp 440 445 450GAT TTG
TAGAAAATTA ACTGCTAACT TCTATTGACC CACAAAGTTT CAGAAATTCT 9621Asp
LeuCTGAAAGTTT CTTCCTTTTT TCTCTTACTA TATTTATTGA TTTCAAGTCT
TCTATTAAGG 9681ACATTTAGCC TTCAATGGAA ATTAAAACTC ATTTAGGACT
GTATTTCCAA ATTACTGATA 9741TCAGAGTTAT TTAAAAATTG TTTATTTGAG
GAGATAACAT TTCAACTTTG TTCCTAAATA 9801TATAATAATA AAATGATTGA
CTTTATTTGC ATTTTTATGA CCACTTGTCA TTTATTTTGT 9861CTTCGTAAAT
TATTTTCATT ATATCAAATA TTTTAGTATG TACTTAATAA AATAGGAGAA
9921CATTTTAGAG TTTCAAATTC CCAGGTATTT TCCTTGTTTA TTACCCCTAA
ATCATTCCTA 9981TTTAATTCTT CTTTTTAAAT GGAGAAAATT ATGTCTTTTT
AATATGGTTT TTGTTTTGTT 10041ATATATTCAC AGGCTGGAGA CGTTTAAAAG
ACCGTTTCAA AAGAGATTTA CTTTTTTAAA 10101GGACTTTATC TGAACAGAGA
GATATAATAT TTTTCCTATT GGACAATGGA CTTGCAAAGC 10161TTCACTTCAT
TTTAAGAGCA AAAGACCCCA TGTTGAAAAC TCCATAACAG TTTTATGCTG
10221ATGATAATTT ATCTACATGC ATTTCAATAA ACCTTTTGTT TCCTAAGACT
AGATACATGG 10281TACCTTTATT GACCATTAAA AAACCACCAC TTTTTGCCAA
TTTACCAATT ACAATTGGGC 10341AACCATCAGT AGTAATTGAG TCCTCATTTT
ATGCTAAATG TTATGCCTAA CTCTTTGGGA 10401GTTACAAAGG AAATAGCAAT
TATGGCTTTT GCCCTCTAGG AGATACAGGA CAAATACAGG 10461AAAATACAGC
AACCCAAACT GACAATACTC TATACAAGAA CATAATCACT AAGCAGGAGT
10521CACAGCCACA CAACCAAGAT GCATAGTATC CAAAGTGCAG CTG 10564453 amino
acidsamino acidlinearprotein 6Met Ser Trp Ser Leu His Pro Arg Asn
Leu Ile Leu Tyr Phe Tyr Ala 1 5 10 15Leu Leu Phe Leu Ser Ser Thr
Cys Val Ala Tyr Val Ala Thr Arg Asp 20 25
30Asn Cys Cys Ile Leu Asp Glu Arg Phe Gly Ser Tyr Cys Pro Thr Thr
35 40 45Cys Gly Ile Ala Asp Phe Leu Ser Thr Tyr Gln Thr Lys Val Asp
Lys 50 55 60Asp Leu Gln Ser Leu Glu Asp Ile Leu His Gln Val Glu Asn
Lys Thr 65 70 75 80Ser Glu Val Lys Gln Leu Ile Lys Ala Ile Gln Leu
Thr Tyr Asn Pro 85 90 95Asp Glu Ser Ser Lys Pro Asn Met Ile Asp Ala
Ala Thr Leu Lys Ser 100 105 110Arg Ile Met Leu Glu Glu Ile Met Lys
Tyr Glu Ala Ser Ile Leu Thr 115 120 125His Asp Ser Ser Ile Arg Tyr
Leu Gln Glu Ile Tyr Asn Ser Asn Asn 130 135 140Gln Lys Ile Val Asn
Leu Lys Glu Lys Val Ala Gln Leu Glu Ala Gln145 150 155 160Cys Gln
Glu Pro Cys Lys Asp Thr Val Gln Ile His Asp Ile Thr Gly 165 170
175Lys Asp Cys Gln Asp Ile Ala Asn Lys Gly Ala Lys Gln Ser Gly Leu
180 185 190Tyr Phe Ile Lys Pro Leu Lys Ala Asn Gln Gln Phe Leu Val
Tyr Cys 195 200 205Glu Ile Asp Gly Ser Gly Asn Gly Trp Thr Val Phe
Gln Lys Arg Leu 210 215 220Asp Gly Ser Val Asp Phe Lys Lys Asn Trp
Ile Gln Tyr Lys Glu Gly225 230 235 240Phe Gly His Leu Ser Pro Thr
Gly Thr Thr Glu Phe Trp Leu Gly Asn 245 250 255Glu Lys Ile His Leu
Ile Ser Thr Gln Ser Ala Ile Pro Tyr Ala Leu 260 265 270Arg Val Glu
Leu Glu Asp Trp Asn Gly Arg Thr Ser Thr Ala Asp Tyr 275 280 285Ala
Met Phe Lys Val Gly Pro Glu Ala Asp Lys Tyr Arg Leu Thr Tyr 290 295
300Ala Tyr Phe Ala Gly Gly Asp Ala Gly Asp Ala Phe Asp Gly Phe
Asp305 310 315 320Phe Gly Asp Asp Pro Ser Asp Lys Phe Phe Thr Ser
His Asn Gly Met 325 330 335Gln Phe Ser Thr Trp Asp Asn Asp Asn Asp
Lys Phe Glu Gly Asn Cys 340 345 350Ala Glu Gln Asp Gly Ser Gly Trp
Trp Met Asn Lys Cys His Ala Gly 355 360 365His Leu Asn Gly Val Tyr
Tyr Gln Gly Gly Thr Tyr Ser Lys Ala Ser 370 375 380Thr Pro Asn Gly
Tyr Asp Asn Gly Ile Ile Trp Ala Thr Trp Lys Thr385 390 395 400Arg
Trp Tyr Ser Met Lys Lys Thr Thr Met Lys Ile Ile Pro Phe Asn 405 410
415Arg Leu Thr Ile Gly Glu Gly Gln Gln His His Leu Gly Gly Ala Lys
420 425 430Gln Val Arg Pro Glu His Pro Ala Glu Thr Glu Tyr Asp Ser
Leu Tyr 435 440 445Pro Glu Asp Asp Leu 45010807 base pairsnucleic
aciddoublelinearovine beta-lactoglobulin 7ACGCGTGTCG ACCTGCAGGT
CAACGGATCT CTGTGTCTGT TTTCATGTTA GTACCACACT 60GTTTTGGTGG CTGTAGCTTT
CAGCTACAGT CTGAAGTCAT AAAGCCTGGT ACCTCCAGCT 120CTGTTCTCTC
TCAAGATTGT GTTCTGCTGT TTGGGTCTTT AGTGTCTCCA CACAATTTTT
180AGAATTGTTT GTTCTAGTTC TGTGAAAAAT GATGCTGGTA TTTTGATAAG
GATTGCATTG 240AATCTGTAAA GCTACAGATA TAGTCATTGG GTAGTACAGT
CACTTTAACA ATATTAACTC 300TTCACATCTG TGAGCATGAT ATATTTTCCC
CCTCTATATC ATCTTCAATT CCTCCTATCA 360GTTTCTTTCA TTGCAGTTTT
CTGAGTACAG GTCTTACACC TCCTTGGTTA GAGTCATTCC 420TCAGTATTTT
ATTCCTTTGA TACAATTGTG AATGAGGTAA TTTTCTTAGT TTCTCTTTCT
480GATAGCTCAT TGTTAGTGTA TATATAGAAA AGCAACAGAT TTCTATGTAT
TAATTTTGTA 540TCCTGCAACA GATTTCTATG TATTAATTTT GTATCCTGCT
ACTTTACGGA ATTCACTTAT 600TAGCTTTTTG GTGACATCTT GAGGATTTTC
TGAAGAAAAT GGCATGGTAT GGTAGGACAA 660GGTGTCATGT CATCTGCAAA
CAGTGGCAGT TTTCCTTCTT CCCTTCCAAC CTGGATTTCT 720TTGATTTCTT
TCTGTCTGAG TACGACTAGG ATTCCCAATA CTATACCGAA TAAAAGTGGC
780AAGAGTGGAC ATCCTTGTCT TATTTTTCTG ACCTTAGAGG AAATGCTTTC
AGTTTTTCAC 840CATTAATTAT AATGTTTACT GTGGGCTTGT CATATGTGGC
CTTCATTATA TGGAGGTCTA 900TTCCCTCTAT ACCCACCTTG TTGAGAGTTT
TTATCATAAA AGTATGTTGA ATTTTGTCAA 960AAGTTTTTCC TGCATCTATT
GAGATGATTT TTACTCTTCA ATTCATTAAT GATTTTTATT 1020CTTCATTTTG
TTAATGATTT CCATTCTTCA ATTTGTTAAC GTGGTATATC ACATTGATTG
1080ATTTGTGGAT ACCTTTGTAT CCCTGGGATA AACCTCACTT GATCATGAGC
TTTCAATGTA 1140TTTTTGAATT CACTTTGCTA ATATTCTGTT GGGTATTTTT
GCATCTCTAT TCATCAATGA 1200TATTGGCCTA AGAAAGGTTT TGTCTGGTTT
TAGTATCAGG GTGATGCTGG CCTCATAGAG 1260AGAGTTTAGA AGCATTTCCT
CCTCTTTGAT TTTTCGGAAT AGTTTGAGTA GGATAGGTAT 1320TAACTCTTCT
TTAAATGTTT GGGGACTTCC CTGGTGAGCC GGTGGTTGAG AATCCGCCTC
1380AGGGATGTGG GTTTGATCCC TGGTCAGGGA ACCATTAATA AGATCCCACA
TGCTGCAGGC 1440AACAAGCCCC CAAGCTGCAA CCACTGAGCT GCAACCGCTG
CAGTGCCCAC AGGCCACGAC 1500CAGAGAAAGC CCACATACAG CAGGGAAGAC
CCAGCACAAC CGGAAAAAGG AGTTTGGTGG 1560AATACAGCTG TGAAGCCGTC
TGGTCCTGGA CTCCTGCTTG AGGGAATTTT TTAAAAATTA 1620TTGATTCAAT
TTCATTACTG GTAACTGGTC TGTTCATATT TTCTATTTCT TCCGGGTTCA
1680GTCTTGGGAG ATTGTACATG CCTAGGAATG TGTCCGTTTC TTCTAGGTTG
TCCATTTTAT 1740TGGACATGCA TGGGAGCACA CAGCACCGAC CAGCGAGACT
CATGCTGGCT TCCTGGGGCC 1800AGGCTGGGGC CCCAAGCAGC ATGGCATCCT
AGAGTGTGTG AAAGCCCACT GACCCTGCCC 1860AGCCCCACAA TTTCATTCTG
AGAAGTGATT CCTTGCTTCT GCACTTACAG GCCCAGGATC 1920TGACCTGCTT
CTGAGGAGCA GGGGTTTTGG CAGGACGGGG AGATGCTGAG AGCCGACGGG
1980GGTCCAGGTC CCCTCCCAGG CCCCCCTGTC TGGGGCAGCC CTTGGGAAAG
ATTGCCCCAG 2040TCTCCCTCCT ACAGTGGTCA GTCCCAGCTG CCCCAGGCCA
GAGCTGCTTT ATTTCCGTCT 2100CTCTCTCTGG ATGGTATTCT CTGGAAGCTG
AAGGTTCCTG AAGTTATGAA TAGCTTTGCC 2160CTGAAGGGCA TGGTTTGTGG
TCACGGTTCA CAGGAACTTG GGAGACCCTG CAGCTCAGAC 2220GTCCCGAGAT
TGGTGGCACC CAGATTTCCT AAGCTCGCTG GGGAACAGGG CGCTTGTTTC
2280TCCCTGGCTG ACCTCCCTCC TCCCTGCATC ACCCAGTTCT GAAAGCAGAG
CGGTGCTGGG 2340GTCACAGCCT CTCGCATCTA ACGCCGGTGT CCAAACCACC
CGTGCTGGTG TTCGGGGGGC 2400TACCTATGGG GAAGGGCTTC TCACTGCAGT
GGTGCCCCCC GTCCCCTCTG AGATCAGAAG 2460TCCCAGTCCG GACGTCAAAC
AGGCCGAGCT CCCTCCAGAG GCTCCAGGGA GGGATCCTTG 2520CCCCCCCGCT
GCTGCCTCCA GCTCCTGGTG CCGCACCCTT GAGCCTGATC TTGTAGACGC
2580CTCAGTCTAG TCTCTGCCTC CGTGTTCACA CGCCTTCTCC CCATGTCCCC
TCCGTGTCCC 2640CGTTTTCTCT CACAAGGACA CCGGACATTA GATTAGCCCC
TGTTCCAGCC TCACCTGAAC 2700AGCTCACATC TGTAAAGACC TAGATTCCAA
ACAAGATTCC AACCTGAAGT TCCCGGTGGA 2760TGTGAGTTCT GGGGCGACAT
CCTTCAACCC CATCACAGCT TGCAGTTCAT CGCAAAACAT 2820GGAACCTGGG
GTTTATCGTA AAACCCAGGT TCTTCATGAA ACACTGAGCT TCGAGGCTTG
2880TTGCAAGAAT TAAAGGTGCT AATACAGATC AGGGCAAGGA CTGAAGCTGG
CTAAGCCTCC 2940TCTTTCCATC ACAGGAAAGG GGGGCCTGGG GGCGGCTGGA
GGTCTGCTCC CGTGAGTGAG 3000CTCTTTCCTG CTACAGTCAC CAACAGTCTC
TCTGGGAAGG AAACCAGAGG CCAGAGAGCA 3060AGCCGGAGCT AGTTTAGGAG
ACCCCTGAAC CTCCACCCAA GATGCTGACC AGCCAGCGGG 3120CCCCCTGGAA
AGACCCTACA GTTCAGGGGG GAAGAGGGGC TGACCCGCCA GGTCCCTGCT
3180ATCAGGAGAC ATCCCCGCTA TCAGGAGATT CCCCCACCTT GCTCCCGTTC
CCCTATCCCA 3240ATACGCCCAC CCCACCCCTG TGATGAGCAG TTTAGTCACT
TAGAATGTCA ACTGAAGGCT 3300TTTGCATCCC CTTTGCCAGA GGCACAAGGC
ACCCACAGCC TGCTGGGTAC CGACGCCCAT 3360GTGGATTCAG CCAGGAGGCC
TGTCCTGCAC CCTCCCTGCT CGGGCCCCCT CTGTGCTCAG 3420CAACACACCC
AGCACCAGCA TTCCCGCTGC TCCTGAGGTC TGCAGGCAGC TCGCTGTAGC
3480CTGAGCGGTG TGGAGGGAAG TGTCCTGGGA GATTTAAAAT GTGAGAGGCG
GGAGGTGGGA 3540GGTTGGGCCC TGTGGGCCTG CCCATCCCAC GTGCCTGCAT
TAGCCCCAGT GCTGCTCAGC 3600CGTGCCCCCG CCGCAGGGGT CAGGTCACTT
TCCCGTCCTG GGGTTATTAT GACTCTTGTC 3660ATTGCCATTG CCATTTTTGC
TACCCTAACT GGGCAGCAGG TGCTTGCAGA GCCCTCGATA 3720CCGACCAGGT
CCTCCCTCGG AGCTCGACCT GAACCCCATG TCACCCTTGC CCCAGCCTGC
3780AGAGGGTGGG TGACTGCAGA GATCCCTTCA CCCAAGGCCA CGGTCACATG
GTTTGGAGGA 3840GCTGGTGCCC AAGGCAGAGG CCACCCTCCA GGACACACCT
GTCCCCAGTG CTGGCTCTGA 3900CCTGTCCTTG TCTAAGAGGC TGACCCCGGA
AGTGTTCCTG GCACTGGCAG CCAGCCTGGA 3960CCCAGAGTCC AGACACCCAC
CTGTGCCCCC GCTTCTGGGG TCTACCAGGA ACCGTCTAGG 4020CCCAGAGGGG
ACTTCCTGCT TGGCCTTGGA TGGAAGAAGG CCTCCTATTG TCCTCGTAGA
4080GGAAGCCACC CCGGGGCCTG AGGATGAGCC AAGTGGGATT CCGGGAACCG
CGTGGCTGGG 4140GGCCCAGCCC GGGCTGGCTG GCCTGCATGC CTCCTGTATA
AGGCCCCAAG CCTGCTGTCT 4200CAGCCCTCCA CTCCCTGCAG AGCTCAGAAG
CACGACCCCA GGGATATCCC TGCAGCCATG 4260AAGTGCCTCC TGCTTGCCCT
GGGCCTGGCC CTCGCCTGTG GCGTCCAGGC CATCATCGTC 4320ACCCAGACCA
TGAAAGGCCT GGACATCCAG AAGGTTCGAG GGTTGGCCGG GTGGGTGAGT
4380TGCAGGGCGG GCAGGGGAGC TGGGCCTCAG AGAGCCAAGA GAGGCTGTGA
CGTTGGGTTC 4440CCATCAGTCA GCTAGGGCCA CCTGACAAAT CCCCGCTGGG
GCAGCTTCAA CCAGGCGTTC 4500ACTGTCTTGC ATTCTGGAGG CTGGAAGCCC
AAGATCCAGG TGTTGGCAGG GCTGGCTTCT 4560CCTGCGGCCG CTCTCTGGGG
AGCAGACGGC CGTCTTCTCC AGTCCTCTGC GCGCCCTGAT 4620TTCCTCTTCC
TGTGAGGCCA CCAGGCCTGC TGGAAACACG CCTGCCTGCG CAGCTTCACA
4680CGACCTTTGT CATCTCTTTA AAGGCCATGT CTCCAGAGTC ATGTGTTGAA
GTTCTGGGGG 4740TTAGTGGGAC ACAGTTCAGC CCCTAAAAGA GTCTCTCTGC
CCCTCAAATT TTCCCCACCT 4800CCAGCCATGT CTCCCCAAGA TCCAAATGTT
GCTACATGTG GGGGGGCTCA TCTGGGTCCC 4860TCTTTGGGTT CAGTGTGAGT
CTGGGGAGAG CATTCCCCAG GGTGCAGAGT TGGGGGGAGT 4920ATCTCAGGGC
TGCCCAGGCC GGGGTGGGAC AGAGAGCCCA CTGTGGGGCT GGGGGCCCCT
4980TCCCACCCCC AGAGTGCAAC TCAAGGTCCC TCTCCAGGTG GCGGGGACTT
GGCACTCCTT 5040GGCTATGGCG GCCAGCGACA TCTCCCTGCT GGATGCCCAG
AGTGCCCCCC TGAGAGTGTA 5100CGTGGAGGAG CTGAAGCCCA CCCCCGAGGG
CAACCTGGAG ATCCTGCTGC AGAAATGGTG 5160GGCGTCTCTC CCCAACATGG
AACCCCCACT CCCCAGGGCT GTGGACCCCC CGGGGGGTGG 5220GGTGCAGGAG
GGACCAGGGC CCCAGGGCTG GGGAAGAGGG CTCAGAGTTT ACTGGTACCC
5280GGCGCTCCAC CCAAGGCTGC CCACCCAGGG CTTTTTTTTT TTTTAAACTT
TTATTAATTT 5340GATGCTTCAG AACATCATCA AACAAATGAA CATAAAACAT
TCATTTTTGT TTACTTGGAA 5400GGGGAGATAA AATCCTCTGA AGTGGAAATG
CATAGCAAAG ATACATACAA TGAGGCAGGT 5460ATTCTGAATT CCCTGTTAGT
CTGAGGATTA CAAGTGTATT TGAGCAACAG AGAGACATTT 5520TCATCATTTC
TAGTCTGAAC ACCTCAGTAT CTAAAATGAA CAAGAAGTCC TGGAAACGAA
5580GCAGTGTGGG GATAGGCCCG TGTGAAGGCT GCTGGGAGGC AGCAGACCTG
GGTCTTCGGG 5640CTCAAGCAGT TCCCGCTACC AGCCCTGTCC ACCTCAGACG
GGGGTCAGGG TGCAGGAGAG 5700AGCTGGATGG GTGTGGGGGC AGAGATGGGG
ACCTGAACCC CAGGGCTGCC TTTTGGGGGT 5760GCCTGTGGTC AAGGCTCTCC
CTGACCTTTT CTCTCTGGCT TCATCTGACT TCTCCTGGCC 5820CATCCACCCG
GTCCCCTGTG GCCTGAGGTG ACAGTGAGTG CGCCGAGGCT AGTTGGCCAG
5880CTGGCTCCTA TGCCCATGCC ACCCCCCTCC AGCCCTCCTG GGCCAGCTTC
TGCCCCTGGC 5940CCTCAGTTCA TCCTGATGAA AATGGTCCAT GCCAATGGCT
CAGAAAGCAG CTGTCTTTCA 6000GGGAGAACGG CGAGTGTGCT CAGAAGAAGA
TTATTGCAGA AAAAACCAAG ATCCCTGCGG 6060TGTTCAAGAT CGATGGTGAG
TCCGGGTCCC TGGGGGACAC CCACCACCCC CGCCCCCGGG 6120GACTGTGGAC
AGGTTCAGGG GGCTGGCGTC GGGCCCTGGG ATGCTAAGGG ACTGGTGGTG
6180ATGAAGACAC TGCCTTGACA CCTGCTTCAC TTGCCTCCCC TGCCACCTGC
CCGGGGCCTT 6240GGGGCGGTGG CCATGGGCAG GTCCCGGCTG GCGGGCTAAC
CCACCAGGGT GACACCCGAG 6300CTCTCTTTGC TGGGGGGCGG GCGGTGCTCT
GGGCCCTCAG GCTGAGCTCA GGAGGTACCT 6360GTGCCCTCCC AGGGGTAACC
GAGAGCCGTT GCCCACTCCA GGGGCCCAGG TGCCCCACGA 6420CCCCAGCCCG
CTCCACAGCT CCTTCATCTC CTGGAGACAA ACTCTGTCCG CCCTCGCTCA
6480TTCACTTGTT CGTCCTAAAT CCGAGATGAT AAAGCTTCGA GGGGGGGTTG
GGGTTCCATC 6540AGGGCTGCCC TTCCGCCGGG CAGCCTGGGC CACATCTGCC
CTTGGCCCCC TCAGGACTCA 6600CTCTGACTGG AGGCCCTGCA CTGACTGACG
CCAGGGTGCC CAGCCCAGGG TCTCTGGCGC 6660CATCCAGCTG CACTGGGTTT
GGGTGCTGGT CCTGCCCCCA AGCTGCCCGG ACACCACAGG 6720CAGCCGGGGC
TGCCCACTGG CCTCGGTCAG GGTGAGCCCC AGCTGCCCCC GCTCAGGGCT
6780TGCCCCGACA ATGACCCCAT CCTCAGGACG CACCCCCCTT CCCTTGCTGG
GCAGTGTCCA 6840GCCCCACCCG AGATCGGGGG AAGCCCTATT TCTTGACAAC
TCCAGTCCCT GGGGGAGGGG 6900GCCTCAGACT GAGTGGTGAG TGTTCCCAAG
TCCAGGAGGT GGTGGAGGGT CCTGGCGGAT 6960CCAGAGTTGA CAGTGAGGGC
TTCCTGGGCC CCATGCGCCT GGCAGTGGCA GCAGGGAAGA 7020GGAAGCACCA
TTTCAGGGGT GGGGGATGCC AGAGGCGCTC CCCACCCCGT CTTCGCCGGG
7080TGGTGACCCC GGGGGAGCCC CGCTGGTCGT GGAGGGTGCT GGGGGCTGAC
TAGCAACCCC 7140TCCCCCCCCG TTGGAACTCA CTTTTCTCCC GTCTTGACCG
CGTCCAGCCT TGAATGAGAA 7200CAAAGTCCTT GTGCTGGACA CCGACTACAA
AAAGTACCTG CTCTTCTGCA TGGAAAACAG 7260TGCTGAGCCC GAGCAAAGCC
TGGCCTGCCA GTGCCTGGGT GGGTGCCAAC CCTGGCTGCC 7320CAGGGAGACC
AGCTGCGTGG TCCTTGCTGC AACAGGGGGT GGGGGGTGGG AGCTTGATCC
7380CCAGGAGGAG GAGGGGTGGG GGGTCCCTGA GTCCCGCCAG GAGAGAGTGG
TCGCATACCG 7440GGAGCCAGTC TGCTGTGGGC CTGTGGGTGG CTGGGGACGG
GGGCCAGACA CACAGGCCGG 7500GAGACGGGTG GGCTGCAGAA CTGTGACTGG
TGTGACCGTC GCGATGGGGC CGGTGGTCAC 7560TGAATCTAAC AGCCTTTGTT
ACCGGGGAGT TTCAATTATT TCCCAAAATA AGAACTCAGG 7620TACAAAGCCA
TCTTTCAACT ATCACATCCT GAAAACAAAT GGCAGGTGAC ATTTTCTGTG
7680CCGTAGCAGT CCCACTGGGC ATTTTCAGGG CCCCTGTGCC AGGGGGGCGC
GGGCATCGGC 7740GAGTGGAGGC TCCTGGCTGT GTCAGCCGGC CCAGGGGGAG
GAAGGGACCC GGACAGCCAG 7800AGGTGGGGGG CAGGCTTTCC CCCTGTGACC
TGCAGACCCA CTGCACTGCC CTGGGAGGAA 7860GGGAGGGGAA CTAGGCCAAG
GGGGAAGGGC AGGTGCTCTG GAGGGCAAGG GCAGACCTGC 7920AGACCACCCT
GGGGAGCAGG GACTGACCCC CGTCCCTGCC CCATAGTCAG GACCCCGGAG
7980GTGGACAACG AGGCCCTGGA GAAATTCGAC AAAGCCCTCA AGGCCCTGCC
CATGCACATC 8040CGGCTTGCCT TCAACCCGAC CCAGCTGGAG GGTGAGCACC
CAGGCCCCGC CCTTCCCCAG 8100GGCAGGAGCC ACCCGGCCCC GGGACGACCT
CCTCCCATGG TGACCCCCAG CTCCCCAGGC 8160CTCCCAGGAG GAAGGGGTGG
GGTGCAGCAC CCCGTGGGGG CCCCCTCCCC ACCCCCTGCC 8220AGGCCTCTCT
TCCCGAGGTG TCCAGTCCCA TCCTGACCCC CCCATGACTC TCCCTCCCCC
8280ACAGGGCAGT GCCACGTCTA GGTGAGCCCC TGCCGGTGCC TCTGGGGTAA
GCTGCCTGCC 8340CTGCCCCACG TCCTGGGCAC ACACATGGGG TAGGGGGTCT
TGGTGGGGCC TGGGACCCCA 8400CATCAGGCCC TGGGGTCCCC CCTGTGAGAA
TGGCTGGAAG CTGGGGTCCC TCCTGGCGAC 8460TGCAGAGCTG GCTGGCCGCG
TGCCACTCTT GTGGGTGACC TGTGTCCTGG CCTCACACAC 8520TGACCTCCTC
CAGCTCCTTC CAGCAGAGCT AAGGCTAAGT GAGCCAGAAT GGTACCTAAG
8580GGGAGGCTAG CGGTCCTTCT CCCGAGGAGG GGCTGTCCTG GAACCACCAG
CCATGGAGAG 8640GCTGGCAAGG GTCTGGCAGG TGCCCCAGGA ATCACAGGGG
GGCCCCATGT CCATTTCAGG 8700GCCCGGGAGC CTTGGACTCC TCTGGGGACA
GACGACGTCA CCACCGCCCC CCCCCCATCA 8760GGGGGACTAG AAGGGACCAG
GACTGCAGTC ACCCTTCCTG GGACCCAGGC CCCTCCAGGC 8820CCCTCCTGGG
GCTCCTGCTC TGGGCAGCTT CTCCTTCACC AATAAAGGCA TAAACCTGTG
8880CTCTCCCTTC TGAGTCTTTG CTGGACGACG GGCAGGGGGT GGAGAAGTGG
TGGGGAGGGA 8940GTCTGGCTCA GAGGATGACA GCGGGGCTGG GATCCAGGGC
GTCTGCATCA CAGTCTTGTG 9000ACAACTGGGG GCCCACACAC ATCACTGCGG
CTCTTTGAAA CTTTCAGGAA CCAGGGAGGG 9060ACTCGGCAGA GACATCTGCC
AGTTCACTTG GAGTGTTCAG TCAACACCCA AACTCGACAA 9120AGGACAGAAA
GTGGAAAATG GCTGTCTCTT AGTCTAATAA ATATTGATAT GAAACTCAAG
9180TTGCTCATGG ATCAATATGC CTTTATGATC CAGCCAGCCA CTACTGTCGT
ATCAACTCAT 9240GTACCCAAAC GCACTGATCT GTCTGGCTAA TGATGAGAGA
TTCCCAGTAG AGAGCTGGCA 9300AGAGGTCACA GTGAGAACTG TCTGCACACA
CAGCAGAGTC CACCAGTCAT CCTAAGGAGA 9360TCAGTCCTGG TGTTCATTGG
AGGACTGATG TTGAAGCTGA AACTCCAATG CTTTGGCCAC 9420CTGATGTGAA
GAGCTGACTC ATTTGAAAAG ACCCTGATGC TGGGAAAGAT TGAGGGCAGG
9480AGGAGAAGGG GACGACAGAG GATGAGATGG TTGGATGGCA TCACCAACAC
AATGGACATG 9540GGTTTGGGTG GACTCCAGGA GTTGGTGATG GACAGGGAGG
CCTGGCGTGC TACGGAAGCG 9600GTTTATGGGG TCACAAAGAC TGAGTGACTG
AACTGAGCTG AACTGAATGG AAATGAGGTA 9660TACAGCAAAG TGGGGATTTT
TTAGATAATA AGAATATACA CATAACATAG TGTATACTCA 9720TATTTTTATG
CATACCTGAA TGCTCAGTCA CTCAGTCGTA TCTGACTCTG TGACCTATGG
9780ACCGTAGCCT TCCAGGTTTC TTCTGTCCAC AGAATTCTCC AAGGCAAGAA
TACTGGAGTG 9840GGTAGCCATT TCCTCCTCCA GGGGATCCTC CCGACCCAGG
GATTGAACCG GCATCTCCTG 9900TATTGGCAGG TGGATTCTTT ACCACTGTGC
CACCAGGGAA GCCCGTGTTA CTCTCTATGT 9960CCCACTTAAT TACCAAAGCT
GCTCCAAGAA AAAGCCCCTG TGCCCTCTGA GCTTCCCGGC 10020CTGCAGAGGG
TGGTGGGGGT AGACTGTGAC CTGGGAACAC CCTCCCGCTT CAGGACTCCC
10080GGGCCACGTG ACCCACAGTC CTGCAGACAG CCGGGTAGCT CTGCTCTTCA
AGGCTCATTA 10140TCTTTAAAAA AAACTGAGGT CTATTTTGTG ACTTCGCTGC
CGTAACTTCT GAACATCCAG 10200TGCGATGGAC AGGACCTCCT CCCCAGGCCT
CAGGGGCTTC AGGGAGCCAG CCTTCACCTA 10260TGAGTCACCA GACACTCGGG
GGTGGCCCCG CCTTCAGGGT GCTCACAGTC TTCCCATCGT 10320CCTGATCAAA
GAGCAAGACC AATGACTTCT TAGGAGCAAG CAGACACCCA CAGGACACTG
10380AGGTTCACCA GAGCTGAGCT GTCCTTTTGA ACCTAAAGAC ACACAGCTCT
CGAAGGTTTT 10440CTCTTTAATC TGGATTTAAG GCCTACTTGC CCCTCAAGAG
GGAAGACAGT CCTGCATGTC 10500CCCAGGACAG CCACTCGGTG GCATCCGAGG
CCACTTAGTA TTATCTGACC GCACCCTGGA 10560ATTAATCGGT CCAAACTGGA
CAAAAACCTT GGTGGGAAGT TTCATCCCAG AGGCCTCAAC 10620CATCCTGCTT
TGACCACCCT GCATCTTTTT TTCTTTTATG TGTATGCATG TATATATATA
10680TATATATTTT TTTTTTTTTC ATTTTTTGGC TGTGCTGGCT GTTCGTTGCA
GTTCGGTGCG 10740CAGGCTTCTC TCTAGTTTCT CTCTAGTCTT CTCTTATCAC
AGAGCAGTCT CTAGACGATC 10800GACGCGT 1080747 base pairsnucleic
acidsinglelinear 8AATTCCGATC GACGCGTCGA CGATATACTC TAGACGATCG
ACGCGTA 4724 base pairsnucleic acidsinglelinearBLGAMP3 9TGGATCCCCT
GCCGGTGCCT CTGG 2424 base pairsnucleic acidsinglelinearBLGAMP4
10AACGCGTCAT CCTCTGTGAG CCAG 2410 base pairsnucleic
acidsinglelinearZC6839 11ACTACGTAGT 1042 base pairsnucleic
acidsinglelinearZC6632 12CGACGCGGAT CCTACGTACC TGCAGCCATG
TTTTCCATGA GG 4221 base pairsnucleic acidsinglelinearZC6627
13AGGGCTTCGG CAAGCTTCAG G 2124 base pairsnucleic
acidsinglelinearZC6521 14GCCAAAGACT TACTTCCCTC TAGA 2430 base
pairsnucleic acidsinglelinearZC6520 15GCATGAACGT CGCGTGGTGG
TTGTGCTACC 3030 base pairsnucleic acidsinglelinearZC6519
16ACCACGCGAC GTTCATGCTC TAAAACCGTT 3036 base pairsnucleic
acidsinglelinearZC6518 17GCTGCGGGAT CCTACGTACT AGGGGGACAG GGAAGG
3645 base pairsnucleic acidsinglelinearZC6629 18CGACGCGAAT
TCTACGTACC TGCAGCCATG AAAAGGATGG TTTCT 4545 base pairsnucleic
acidsinglelinearZC6630 19CGACGCGAAT TCTACGTACC TGCAGCCATG
AAACATCTAT TATTG 4521 base pairsnucleic acidsinglelinearZC6625
20GTGAGATTTT CAGATCTTGT C 2121 base pairsnucleic
acidsinglelinearZC6626 21AAGAATTACT GTGGCCTACC A 2133 base
pairsnucleic acidsinglelinearZC6624 22GCTGCGGAAT TCTACGTACT
ATTGCTGTGG GAA 3345 base pairsnucleic acidsinglelinearZC6514
23CGACGCGGAT CCTACGTACC TGCAGCCATG AGTTGGTCCT
TGCAC 4521 base pairsnucleic acidsinglelinearzc6517 24GTCTCTGGTA
GCAACATACT A 2122 base pairsnucleic acidsinglelinearzc6516
25GGGTTTCTAG CCCTACTAGT AG 2222 base pairsnucleic
acidsinglelinearzc6515 26GGGTTTCTAG CCCTACTAGT AG 2247 base
pairsnucleic acidsinglelinear 27AAGCTACGCG TCGATCGTCT AGAGTATATC
GTCGACGCGT CGATCGG 47
* * * * *