Single-chain Multivalent Binding Protein Compositions And Methods BENATUIL; Lorenzo ; et al. [AbbVie Inc.]

Single-chain Multivalent Binding Protein Compositions And Methods

BENATUIL; Lorenzo ; et al.

Patent Application Summary

U.S. patent application number 15/252115 was filed with the patent office on 2017-03-30 for single-chain multivalent binding protein compositions and methods. The applicant listed for this patent is AbbVie Inc.. Invention is credited to Lorenzo BENATUIL, Chung-Ming HSIEH.

Application Number	20170088611 15/252115
Document ID	/
Family ID	51259765
Filed Date	2017-03-30

United States Patent Application	20170088611
Kind Code	A1
BENATUIL; Lorenzo ; et al.	March 30, 2017

SINGLE-CHAIN MULTIVALENT BINDING PROTEIN COMPOSITIONS AND METHODS

Abstract

Provided are protein, nucleic acid, and cellular libraries of single chain multivalent binding proteins (e.g., scDVD and scDVDFab molecules) and methods of using these of these libraries for the screening of single chain multivalent binding proteins using cell surface display technology (e.g., yeast display).

Inventors:

BENATUIL; Lorenzo; (Northborough, MA) ; HSIEH; Chung-Ming; (Newton, MA)

Applicant:

Name	City	State	Country	Type
AbbVie Inc.	North Chicago	IL	US

Family ID:

51259765

Appl. No.:

15/252115

Filed:

August 30, 2016

Related U.S. Patent Documents


Application Number	Filing Date	Patent Number
14141500	Dec 27, 2013	9458244
15252115
61746659	Dec 28, 2012

Current U.S. Class:	1/1
Current CPC Class:	C07K 16/245 20130101; C07K 16/468 20130101; C07K 2317/56 20130101; C07K 2317/35 20130101; C40B 40/10 20130101; C07K 16/22 20130101; C07K 16/244 20130101; C07K 2317/626 20130101; C12N 15/1082 20130101; C07K 2317/622 20130101; C07K 16/241 20130101
International Class:	C07K 16/24 20060101 C07K016/24; C12N 15/10 20060101 C12N015/10; C07K 16/22 20060101 C07K016/22

Claims

1. A single chain multivalent binding protein having the general formula VH1-(X1)n-VH2-X2-VL1-(X3)n-VL2, wherein VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites.

2. A single chain multivalent binding protein having the general formula (VL1-(X1)n-VL2-X2-VH1-(X3)n-VH2, wherein VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding site.

3. The binding protein of claim 1 or 2 which is a single-chain dual variable domain immunoglobulin molecules (scDVD).

4. The binding protein of any one of the preceding claims, further comprising a cell surface anchoring moiety linked to the N and/or C terminus.

5. The binding protein of claim 4, wherein the anchoring moiety comprises the Aga2p polypeptide.

6. A polynucleotide encoding a binding protein of any one of the preceding claims.

7. A host cell expressing a binding protein of any one of the preceding claims.

8. A diverse library of binding proteins comprising a polypeptide chain having the general formula VH1-(X1)n-VH2-X2-VL1-(X3)n-VL2, wherein VH1 is a first heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second heavy chain variable domain, X2 is a linker, VL1 is a first light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second light chain variable domain, and n is 0 or 1, wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites, and wherein the amino acid sequences of VH1, X1, VH2, X2, VL1, X3, and/or VL2 independently vary within the library.

9. A diverse library of binding proteins comprising a polypeptide chain having the general formula (VL1-(X1)n-VL2-X2-VH1-(X3)n-VH2, wherein VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, and n is 0 or 1, wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites, and wherein the amino acid sequences of VL1, X1, VL2, X2, VH1, X3, and/or VH2 independently vary within the library.

10. The diverse library of claim 8 or 9, wherein each binding proteins further comprises a cell surface anchoring moiety linked to the N or C terminus.

11. The diverse library of claim 10, wherein the anchoring moiety is a cell surface protein.

12. The diverse library of claim 10, wherein the anchoring moiety is Aga2p.

13. The diverse library of any one of the preceding claims, wherein the polypeptide chain is a scDVD.

14. The library of any one of the preceding claims, wherein the amino acid sequence of at least one CDR of VH1, VH2, VL1 or VL2 independently varies within the library.

15. The library of any one of the preceding claims, wherein the amino acid sequence of HCDR3 of VH1, VH2 independently vary within the library.

16. The library of any one of the preceding claims, wherein the amino acid sequence of HCDR1 and HCDR2 of VH1 or VH2 independently vary within the library.

17. The library of any one of the preceding claims, wherein the amino acid sequence of HCDR1, HCDR2 and HCDR3 of VH1 or VH2 independently vary within the library.

18. The library of any one of the preceding claims, wherein the amino acid sequence of HCDR3 of VL1 or VL2 independently vary within the library.

19. The library of any one of the preceding claims, wherein the amino acid sequence of HCDR1 and HCDR2 of VL1 or VL2 independently vary within the library.

20. The library of any one of the preceding claims, wherein the amino acid sequence of HCDR1, HCDR2 and HCDR3 of VL1 or VL2 independently vary within the library.

21. The library of any one of the preceding claims, wherein X1 independently varies within the library and wherein X1 is selected from the amino acid sequences set forth in FIG. 2.

22. The library of any one of the preceding claims, wherein X2 independently varies within the library and wherein X2 is (G.sub.4S)n, where n=1-10.

23. The library of any one of the preceding claims, wherein X3 independently varies within the library and wherein X3 is selected from the amino acid sequences set forth in FIG. 2.

24. The library of any one of the preceding claims, wherein the library of binding proteins share at least 70, 75, 80, 85, 90, 95, 96, 97, 98, or 99 amino acid sequence identity with a reference binding protein.

25. The library of any one of the preceding claims, wherein VH1 and VH2 of the reference binding protein specifically bind to different antigens.

26. A diverse library of polynucleotides encoding the diverse library of binding proteins of any one of the preceding claims.

27. A diverse library of expression vectors comprising the diverse library of polynucleotides of claim 26.

28. A library of transformed host cells, expressing the diverse library of binding proteins of any one of the preceding claims.

29. The library of transformed host cells of claim 28, wherein the binding proteins are anchored on the cell surface.

30. The library of transformed host cells of claim 28, wherein the binding proteins are anchored on the cell surface through Aga1p.

31. The library of transformed host cells of claim 28, wherein the host cells are eukaryotic.

32. The library of transformed host cells of claim 31, wherein the host cells are yeast.

33. The library of transformed host cells of claim 31, wherein the yeast is selected from the group consisting of Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Candida albicans, Candida kefyr, Candida tropicalis, Cryptococcus laurentii, Cryptococcus neoformans, Hansenula anomala, Hansenula polymorpha, Kluyveromyces fragilis, Kluyveromyces lactis, Kluyveromyces marxianus, Pichia pastoris, Rhodotorula rubra, Schizosaccharomyces pombe and Yarrowia lipolytica.

34. The library of transformed host cells of claim 31, wherein the yeast is Saccharomyces cerevisiae.

35. A method of selecting a binding protein that specifically binds to a target antigen, the method comprising: a) providing a diverse library of transformed host cells expressing the diverse library of binding proteins of any one of claims 8-25; b) contacting the host cells with the target antigen; and c) selecting a host cell that bind to the target antigen, thereby identifying a binding protein that specifically binds to a target antigen.

36. A method of selecting a binding protein that specifically binds to a first and a second target antigen simultaneously, the method comprising: a) providing a diverse library of transformed host cells expressing the diverse library of binding proteins of any one of claims 8-25; b) contacting the host cells with the first and second target antigen; and c) selecting a host cell that bind to the first and second target antigen, thereby identifying a binding protein that specifically binds to a first and a second target antigen simultaneously.

37. The method of claim 35 or 36, wherein host cells that bind to the first and/or second antigen are selected by Magnetic Activated Cell Sorting using magnetically labeled antigen.

38. The method of claim 35, 36, or 37, wherein host cells that bind to the first and/or second antigen are selected by Fluorescence Activated Cell Sorting using fluorescently labeled antigen.

39. The method of any one of claims 35-38, further comprising isolating the binding protein-encoding polynucleotide sequences from the host cells selected in step (c).

40. A method of producing a binding protein, comprising expressing in a host cell a binding protein that was selected using the methods of any of claims 8-25.

41. A method of producing a diverse library of binding proteins that specifically binds to a target antigen, the method comprising: a) providing a first diverse library of scDVD molecules, wherein the amino acid sequence of a first region of the scDVD molecules is varied in the library, and wherein each member of the library binds to the target antigen; b) providing a second diverse library of scDVD molecules, wherein the amino acid sequence of a second region of the scDVD molecules is varied in the library, and wherein each member of the library binds to the target antigen; c) recombining the first and second libraries to produce a third diverse library of scDVD molecules, wherein the third library comprises the first regions from the first library and the second region from the second library, thereby producing a diverse library of binding proteins that specifically binds to a target antigen.

42. The method of claim 41, wherein the first and second libraries are recombined by yeast gap repair of polynucleotides encoding the libraries.

Description

RELATED APPLICATIONS

[0001] This application is a continuation of U.S. patent application Ser. No. 14/141,500, filed on Dec. 27, 2013, which claims priority from U.S. Provisional Patent Application Ser. No. 61/746,659, filed on Dec. 28, 2012, which are hereby incorporated by reference in their entirety.

BACKGROUND

[0002] I. Field

[0003] The present disclosure pertains to methods and compositions for producing single chain multivalent binding proteins that specifically bind to one or more desired target antigens. More specifically, the disclosure relates to protein, nucleic acid, and cellular libraries of single chain multivalent binding proteins (e.g., scDVD molecules) and methods of using these libraries for the screening of single chain multivalent binding proteins using cell surface display technology (e.g., yeast display).

[0004] II. Description of Related Art

[0005] A wide variety of multispecific antibody formats have been developed (see Kriangkum, J., et al., Biomol Eng, 2001. 18(2): p. 31-40). Amongst them tandem single-chain Fv molecules and diabodies, and various derivatives there of, are the most widely used formats for the construction of recombinant bispecific antibodies. More recently diabodies have been fused to Fc to generate more Ig-like molecules, named di-diabodies (see Lu, D., et al., J Biol Chem, 2004. 279(4): p. 2856-65). In addition, multivalent antibody construct comprising two Fab repeats in the heavy chain of an IgG and capable of binding four antigen molecules has been described (see WO 0177342A1, and Miller, K., et al., J Immunol, 2003. 170(9): p. 4854-61).

[0006] Despite the many bispecific antibody formats available to the skilled artisan, there is often a need for the skilled artisan to improve the affinity of the bispecific antibody through affinity maturation. However, conventional affinity maturation approaches rely upon screening for affinity matured variants of the component binding domains of the multispecific antibody followed by their reassembly into the original multispecific format. Such reassembly often results in a loss of the desired improvement in binding affinity or other desirable binding characteristics. Accordingly, there is a need in the art for improved constructs, formats, and screening methodologies for identifying affinity variants of multivalent binding proteins in their desired multivalent format.

SUMMARY

[0007] The present disclosure provides a novel compositions and methods useful for the generation of improved single-chain multivalent binding proteins (e.g., scDVD) capable of binding two or more antigens simultaneously with high affinity.

[0008] Accordingly, in one aspect, the disclosure provides a single chain multivalent binding protein.

[0009] In certain embodiments, the single chain multivalent binding protein has the general formula VH1-(X1)n-VH2-X2-VL1-(X3)n-VL2, wherein VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites.

[0010] In certain embodiments, the single chain binding protein has the formula CH1-X0-VH1-(X1)n-VH2-X2-CL1-X4-VL1-(X3)n-VL2, wherein CH1 is a heavy chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, CL1 is a light chain heavy domain, X4 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (h.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa..

[0011] In certain embodiments, X2 is a GS-rigid linker sequence. The GS rigid linker sequence can comprise an amino acid sequence selected from the group consisting of SEQ ID NOs:1-4.

[0012] In certain embodiments, the single chain multivalent binding protein has the general formula (VL1-(X1)n-VL2-X2-VH1-(X3)n-VH2, wherein VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding site.

[0013] In certain embodiments, the single chain binding protein has the formula CL1-X0-VL1-(X1)n-VL2-X2-CH1-X4-VH1-(X3)n-VH2, wherein CL1 is a light chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, CH1 is a heavy chain constant domain, X4 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding site. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (h.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa..

[0014] In certain embodiments, X2 is a GS-rigid linker sequence. The GS rigid linker sequence can comprise an amino acid sequence selected from the group consisting of SEQ ID NOs:1-4.

[0015] In certain embodiments, the single chain multivalent binding protein is a single-chain dual variable domain immunoglobulin molecules (scDVD).

[0016] In certain embodiments, the single chain multivalent binding protein further comprising a cell surface anchoring moiety linked to the N and/or C terminus. In one embodiment, the anchoring moiety comprises the Aga2p polypeptide.

[0017] In another aspect, the disclosure provides a polynucleotide encoding a binding protein disclosed herein.

[0018] In another aspect, the disclosure provides a host cell expressing a binding protein disclosed herein.

[0019] In another aspect, the disclosure provides a diverse library of binding proteins.

[0020] In certain embodiments, the diverse library of binding proteins comprises a polypeptide chain having the general formula VH1-(X1)n-VH2-X2-VL1-(X3)n-VL2, wherein VH1 is a first heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second heavy chain variable domain, X2 is a linker, VL1 is a first light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second light chain variable domain, and n is 0 or 1, wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites, and wherein the amino acid sequences of VH1, X1, VH2, X2, VL1, X3, and/or VL2 independently vary within the library.

[0021] In certain embodiments, the diverse library of binding proteins comprises a polypeptide chain having the general formula CH1-X0-VH1-(X1)n-VH2-X2-CL1-X4-VL1-(X3)n-VL2, wherein CH1 is a heavy chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, CL1 is a light chain constant domain, X4 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, the VH2 and VL2 respectively combine to form two functional antigen binding sites, and wherein the amino acid sequences of VH1, X1, VH2, X2, VL1, X3, and/or VL2 independently vary within the library. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (hc.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa..

[0022] In certain embodiments, X2 is a GS-rigid linker sequence. The GS rigid linker sequence can comprise an amino acid sequence selected from the group consisting of SEQ ID NOs:1-4.

[0023] In certain embodiments, the diverse library of binding proteins comprises a polypeptide chain having the general formula (VL1-(X1)n-VL2-X2-VH1-(X3)n-VH2, wherein VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, and n is 0 or 1, wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites, and wherein the amino acid sequences of VL1, X1, VL2, X2, VH1, X3, and/or VH2 independently vary within the library.

[0024] In certain embodiments, the diverse library of binding proteins comprises a polypeptide chain having the general formula CL1-X0-VL1-(X1)n-VL2-X2-CH1-X4-VH1-(X3)n-VH2, wherein CL1 is a light chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, CH1 is a heavy chain constant domain, X4 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, the VH2 and VL2 respectively combine to form two functional antigen binding site, and wherein the amino acid sequences of VH1, X1, VH2, X2, VL1, X3, and/or VL2 independently vary within the library. In certain embodiments, the CL1 light chain. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (hc.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa..

[0025] In certain embodiments, X2 is a GS-rigid linker sequence. The GS rigid linker sequence can comprise an amino acid sequence selected from the group consisting of SEQ ID NOs:1-4.

[0026] In certain embodiments, each binding proteins further comprises a cell surface anchoring moiety linked to the N or C terminus. In certain embodiments, the anchoring moiety is a cell surface protein. In one embodiment, the anchoring moiety is Aga2p.

[0027] In certain embodiments, the polypeptide chain is a scDVD or scDVDFab.

[0028] In certain embodiments, the amino acid sequence of at least one CDR of VH1, VH2, VL1 or VL2 independently varies within the library. In one embodiment, the amino acid sequence of HCDR3 of VH1, VH2 independently vary within the library. In one embodiment, the amino acid sequence of HCDR1 and HCDR2 of VH1 or VH2 independently vary within the library. In one embodiment, the amino acid sequence of HCDR1, HCDR2 and HCDR3 of VH1 or VH2 independently vary within the library. In one embodiment, the amino acid sequence of HCDR3 of VL1 or VL2 independently vary within the library. In one embodiment, the amino acid sequence of HCDR1 and HCDR2 of VL1 or VL2 independently vary within the library. In one embodiment, the amino acid sequence of HCDR1, HCDR2 and HCDR3 of VL1 or VL2 independently vary within the library.

[0029] In certain embodiments, X1 independently varies within the library and wherein X1 is selected from the amino acid sequences set forth in FIG. 2. In certain embodiments, X2 independently varies within the library and wherein X2 is (G.sub.4S)n, where n=1-10 (SEQ ID NO: 53). In other embodiments, X2 is selected from the amino acid sequences set forth in FIG. 11B. In specific embodiments, X2 is selected from the amino acid sequences set forth in FIG. 11B when the polypeptide chain includes CH and CL domain. In certain embodiments, X3 independently varies within the library and X3 is selected from the amino acid sequences set forth in FIG. 2.

[0030] In certain embodiments, the library of binding proteins share at least 70, 75, 80, 85, 90, 95, 96, 97, 98, or 99 amino acid sequence identity with a reference binding protein. In certain embodiments, VH1 and VH2 of the reference binding protein specifically bind to different antigens.

[0031] In another aspect, the disclosure provides a diverse library of polynucleotides encoding a diverse library of binding proteins disclosed herein.

[0032] In another aspect, the disclosure provides a diverse library of expression vectors comprising a diverse library of polynucleotides disclosed herein.

[0033] In another aspect, the disclosure provides a library of transformed host cells, expressing the diverse library of binding proteins disclosed herein.

[0034] In certain embodiments, the binding proteins are anchored on the cell surface of a transformed host cell. In certain embodiments, the binding proteins are anchored on the cell surface through Aga1p.

[0035] In certain embodiments, the host cells are eukaryotic. In certain embodiments, the host cells are yeast, e.g., Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Candida albicans, Candida kefyr, Candida tropicalis, Cryptococcus laurentii, Cryptococcus neoformans, Hansenula anomala, Hansenula polymorpha, Kluyveromyces fragilis, Kluyveromyces lactis, Kluyveromyces marxianus, Pichia pastoris, Rhodotorula rubra, Schizosaccharomyces pombe and Yarrowia lipolytica. In one embodiment, the yeast is Saccharomyces cerevisiae.

[0036] In another aspect, the disclosure provides a method of selecting a binding protein that specifically binds to a target antigen, the method comprising: providing a diverse library of transformed host cells expressing a diverse library of binding proteins disclosed herein; contacting the host cells with the target antigen; and selecting a host cell that bind to the target antigen, thereby identifying a binding protein that specifically binds to a target antigen.

[0037] In another aspect, the disclosure provides a method of selecting a binding protein that specifically binds to a first and a second target antigen simultaneously, the method comprising: providing a diverse library of transformed host cells expressing a diverse library of binding proteins disclosed herein; contacting the host cells with the first and second target antigen; and selecting a host cell that bind to the first and second target antigen, thereby identifying a binding protein that specifically binds to a first and a second target antigen simultaneously.

[0038] In certain embodiments of the methods disclosed herein, host cells that bind to the first and/or second antigen are selected by Magnetic Activated Cell Sorting using magnetically labeled antigen. In certain embodiments of the methods disclosed herein, host cells that bind to the first and/or second antigen are selected by Fluorescence Activated Cell Sorting using fluorescently labeled antigen.

[0039] In certain embodiments, the methods disclosed herein further comprise isolating the binding protein-encoding polynucleotide sequences from the selected host cells.

[0040] In another aspect, the disclosure provides a method of producing a binding protein comprising expressing in a host cell a binding protein that was selected using the methods disclosed herein.

[0041] In another aspect, the disclosure provides method of producing a diverse library of binding proteins that specifically binds to a target antigen, the method comprising: providing a first diverse library of scDVD or scDVDFab molecules, wherein the amino acid sequence of a first region of the scDVD or scDVDFab molecules is varied in the library, and wherein each member of the library binds to the target antigen; providing a second diverse library of scDVD or scDVDFab molecules, wherein the amino acid sequence of a second region of the scDVD or scDVDFab molecules is varied in the library, and wherein each member of the library binds to the target antigen; recombining the first and second libraries to produce a third diverse library of scDVD or scDVDFab molecules, wherein the third library comprises the first regions from the first library and the second region from the second library, thereby producing a diverse library of binding proteins that specifically binds to a target antigen.

[0042] In certain embodiments, the first and second libraries are recombined by yeast gap repair of polynucleotides encoding the libraries.

BRIEF DESCRIPTION OF THE DRAWINGS

[0043] FIG. 1A depicts an exemplary single chain dual variable domain (scDVD) molecules (FIG. 1A discloses "(G.sub.4S).sub.n" as SEQ ID NO: 54), FIG. 1B depicts an exemplary full-length DVD-Ig molecule, and FIG. 1C depicts an exemplary a single chain Fv molecule.

[0044] FIG. 2 is a schematic representation of an scDVD molecule and exemplary inter-variable domain linker amino acid sequences. The linkers between the VH1 and VH2 domains have amino acid sequences of SEQ ID NOs:9-30 listed from top to bottom. The linkers between the VL1 and VL2 domains have amino acid sequences of SEQ ID NOs:31-52 listed from top to bottom. FIG. 2 discloses "(G.sub.4S).sub.n" as SEQ ID NO: 54.

[0045] FIG. 3 depicts the results of flow cytometry assays measuring the cell surface expression of scDVD or scFv on yeast cells.

[0046] FIG. 4A depicts the results of flow cytometry assays measuring the binding of DLL4 and/or VEGF to yeast cells expressing cell surface DLL4/VEGF-binding scDVD, and FIG. 4B depicts SOST and/or TNFa to yeast cells expressing cell surface SOST/TNFa-binding scDVD.

[0047] FIG. 5 depicts the results of flow cytometry assays measuring the binding of SOST and/or TNFa to yeast cells expressing cell surface SOST/TNFa-binding scDVD tagged with various epitope tags.

[0048] FIG. 6A depicts the amino acid sequence of an exemplary SOST/TNFa-binding scDVD molecule (SEQ ID NO:57) (FIG. 6A discloses "(G.sub.4S).sub.n" as SEQ ID NO: 54). FIG. 6B depicts an exemplary SOST/TNFa-binding scDVD library design, with the VH3-9, SOST VH, V1-16 and MSL10VL sequences represented by SEQ ID NOs: 58-61, respectively (FIG. 6B discloses "(G.sub.4S).sub.n" as SEQ ID NO: 54). FIG. 6C depicts the results of flow cytometry assays measuring the binding of SOST to yeast cells expressing parental or affinity matured cell surface SOST/TNFa-binding scDVD. FIG. 6D depicts the results of flow cytometry assays measuring the binding of SOST to yeast cells expressing parental or affinity matured cell surface SOST/TNFa-binding scDVD.

[0049] FIG. 7A depicts a schematic representation of an scDVD molecule and exemplary inter-VL domain linker amino acid sequences of SEQ ID NOs:62-73 listed from top to bottom (FIG. 7A discloses "(G.sub.4S).sub.n" as SEQ ID NO: 54), and FIG. 7B depicts and results (as fold enrichment) of yeast display screens of SOST/TNFa-binding scDVD library comprising various inter-VL domain linker amino acid sequences.

[0050] FIG. 8 is a schematic representation of exemplary scDVD libraries disclosed herein and multiplexing methods of using these libraries.

[0051] FIG. 9 is a schematic representation of exemplary scDVD libraries disclosed herein.

[0052] FIG. 10A depicts an exemplary single chain dual variable domain Fab (scDVDFab) molecules, FIG. 10B depicts an exemplary full-length DVD-Ig molecule, and FIG. 10C depicts an exemplary a single chain DVD molecule (FIG. 10C discloses "(G.sub.4S).sub.n" as SEQ ID NO: 54).

[0053] FIG. 11A depicts a schematic representation of an scDVDFab molecule, FIG. 11B depicts GS-rigid linker amino acid sequences (SEQ ID NOs:1-4), and FIG. 11C depicts a schematic of a scDVDFab with a GS-rigid linker (FIG. 11C discloses "G.sub.3SG.sub.3" as SEQ ID NO: 96 and "G.sub.2SG.sub.2" as SEQ ID NO: 97).

[0054] FIG. 12 depicts the results of flow cytometry assays measuring the expression of scDVDFab on the surface of yeast.

[0055] FIG. 13 depicts the results of flow cytometry assays showing that 1B/IL17 scDVDFab expressed on yeast retains its ability to bind both IL1B and/or IL17.

[0056] FIG. 14 depicts the results of flow cytometry assays showing that scDVDFab and DVD-Fab had similar binding profiles binding to both IL1B and IL17 on the surface of yeast.

DETAILED DESCRIPTION

[0057] The present disclosure provides a novel compositions and methods useful for the generation of improved single-chain multivalent binding proteins (e.g., scDVD) capable of binding two or more antigens simultaneously with high affinity.

I. DEFINITIONS

[0058] Unless otherwise defined herein, scientific and technical terms used in connection with the present disclosure shall have the meanings that are commonly understood by those of ordinary skill in the art. The meaning and scope of the terms should be clear, however, in the event of any latent ambiguity, definitions provided herein take precedent over any dictionary or extrinsic definition. Further, unless otherwise required by context, singular terms shall include pluralities and plural terms shall include the singular. Generally, nomenclature used in connection with, and techniques of, cell and tissue culture, molecular biology, immunology, microbiology, genetics and protein and nucleic acid chemistry and hybridization described herein are those well known and commonly used in the art.

[0059] In order that the disclosure may be more readily understood, certain terms are first defined.

[0060] The term "multivalent binding protein" is used throughout this specification to denote a binding protein comprising two or more antigen binding sites, each of which can bind independently bind to an antigen.

[0061] The terms "dual variable domain immunoglobulin" or "DVD-Ig" refer to the multivalent binding proteins disclosed in, e.g., U.S. Pat. No. 8,258,268, which is herein incorporated by reference in its entirety.

[0062] The terms "single chain dual variable domain immunoglobulin" or "scDVD" refer to the antigen binding fragment of a DVD molecule that is analogous to an antibody single chain Fv fragment. scDVD are generally of the formula VH1-(X1)n-VH2-X2-VL1-(X3)n-VL2, where VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, where the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites. An exemplary scDVD is depicted in FIGS. 1A-1C herein.

[0063] The terms "single chain dual variable domain immunoglobulin Fab" or "scDVDFab" refer to the antigen binding fragment of a DVD molecule that includes the variable heavy (VH) and light (VL) chains of a DVD-Ig. scDVD are generally of the formula CH1-X0-VH1-(X1)n-VH2-X2-CL1-X4-VL1-(X3)n-VL2, where CH1 is a heavy chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, CL1 is a light chain constant domain, X4 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, where the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (hc.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa.. An exemplary scDVDFab is depicted in FIG. 10A, herein.

[0064] The term "antibody", as used herein, broadly refers to any immunoglobulin (Ig) molecule comprised of four polypeptide chains, two heavy (H) chains and two light (L) chains, or any functional fragment, mutant, variant, or derivation thereof, which retains the essential epitope binding features of an Ig molecule. Such mutant, variant, or derivative antibody formats are known in the art. Non-limiting embodiments of which are discussed below.

[0065] In a full-length antibody, each heavy chain is comprised of a heavy chain variable region (abbreviated herein as HCVR or VH) and a heavy chain constant region. The heavy chain constant region is comprised of three domains, CH1, CH2 and CH3. Each light chain is comprised of a light chain variable region (abbreviated herein as LCVR or VL) and a light chain constant region. The light chain constant region is comprised of one domain, CL. The VH and VL regions can be further subdivided into regions of hypervariability, termed complementarity determining regions (CDR), interspersed with regions that are more conserved, termed framework regions (FR). Each VH and VL is composed of three CDRs and four FRs, arranged from amino-terminus to carboxy-terminus in the following order: FR1, CDR1, FR2, CDR2, FR3, CDR3, FR4. Immunoglobulin molecules can be of any type (e.g., IgG, IgE, IgM, IgD, IgA and IgY), class (e.g., IgG 1, IgG2, IgG 3, IgG4, IgA1 and IgA2) or subclass.

[0066] The term "Fc region" is used to define the C-terminal region of an immunoglobulin heavy chain, which may be generated by papain digestion of an intact antibody. The Fc region may be a native sequence Fc region or a variant Fc region. The Fc region of an immunoglobulin generally comprises two constant domains, a CH2 domain and a CH3 domain, and optionally comprises a CH4 domain. Replacements of amino acid residues in the Fc portion to alter antibody effector function are known in the art (Winter, et al. U.S. Pat. Nos. 5,648,260; 5,624,821). The Fc portion of an antibody mediates several important effector functions e.g. cytokine induction, ADCC, phagocytosis, complement dependent cytotoxicity (CDC) and half-life/clearance rate of antibody and antigen-antibody complexes. In some cases these effector functions are desirable for therapeutic antibody but in other cases might be unnecessary or even deleterious, depending on the therapeutic objectives. Certain human IgG isotypes, particularly IgG1 and IgG3, mediate ADCC and CDC via binding to Fc.gamma.Rs and complement C1q, respectively. Neonatal Fc receptors (FcRn) are the critical components determining the circulating half-life of antibodies. In still another embodiment at least one amino acid residue is replaced in the constant region of the antibody, for example the Fc region of the antibody, such that effector functions of the antibody are altered. The dimerization of two identical heavy chains of an immunoglobulin is mediated by the dimerization of CH3 domains and is stabilized by the disulfide bonds within the hinge region (Huber et al. Nature; 264: 415-20; Thies et al 1999 J Mol Biol; 293: 67-79.). Mutation of cysteine residues within the hinge regions to prevent heavy chain-heavy chain disulfide bonds will destabilize dimeration of CH3 domains. Residues responsible for CH3 dimerization have been identified (Dall'Acqua 1998 Biochemistry 37: 9266-73.). Therefore, it is possible to generate a monovalent half-Ig. Interestingly, these monovalent half Ig molecules have been found in nature for both IgG and IgA subclasses (Seligman 1978 Ann Immunol 129: 855-70; Biewenga et al 1983 Clin Exp Immunol 51: 395-400). The stoichiometry of FcRn: Ig Fc region has been determined to be 2:1 (West et al 2000 Biochemistry 39: 9698-708), and half Fc is sufficient for mediating FcRn binding (Kim et al 1994 Eur J Immunol; 24: 542-548.). Mutations to disrupt the dimerization of CH3 domain may not have greater adverse effect on its FcRn binding as the residues important for CH3 dimerization are located on the inner interface of CH3 b sheet structure, whereas the region responsible for FcRn binding is located on the outside interface of CH2-CH3 domains. However the half Ig molecule may have certain advantage in tissue penetration due to its smaller size than that of a regular antibody. In one embodiment at least one amino acid residue is replaced in the constant region of the binding protein disclosed herein, for example the Fc region, such that the dimerization of the heavy chains is disrupted, resulting in half DVD Ig molecules.

[0067] The term "antigen-binding portion" of an antibody (or simply "antibody portion"), as used herein, refers to one or more fragments of an antibody that retain the ability to specifically bind to an antigen. It has been shown that the antigen-binding function of an antibody can be performed by fragments of a full-length antibody. Such antibody embodiments may also be bispecific, dual specific, or multi-specific formats; specifically binding to two or more different antigens. Examples of binding fragments encompassed within the term "antigen-binding portion" of an antibody include (i) a Fab fragment, a monovalent fragment consisting of the VL, VH, CL and CH1 domains; (ii) a F(ab').sub.2 fragment, a bivalent fragment comprising two Fab fragments linked by a disulfide bridge at the hinge region; (iii) a Fd fragment consisting of the VH and CH1 domains; (iv) a Fv fragment consisting of the VL and VH domains of a single arm of an antibody, (v) a dAb fragment (Ward et al., (1989) Nature 341:544-546, Winter et al., PCT publication WO 90/05144 A1 herein incorporated by reference), which comprises a single variable domain; and (vi) an isolated complementarity determining region (CDR). Furthermore, although the two domains of the Fv fragment, VL and VH, are coded for by separate genes, they can be joined, using recombinant methods, by a synthetic linker that enables them to be made as a single protein chain in which the VL and VH regions pair to form monovalent molecules (known as single chain Fv (scFv); see e.g., Bird et al. (1988) Science 242:423-426; and Huston et al. (1988) Proc. Natl. Acad. Sci. USA 85:5879-5883). Such single chain antibodies are also intended to be encompassed within the term "antigen-binding portion" of an antibody. Other forms of single chain antibodies, such as diabodies are also encompassed. Diabodies are bivalent, bispecific antibodies in which VH and VL domains are expressed on a single polypeptide chain, but using a linker that is too short to allow for pairing between the two domains on the same chain, thereby forcing the domains to pair with complementary domains of another chain and creating two antigen binding sites (see e.g., Holliger, P., et al. (1993) Proc. Natl. Acad. Sci. USA 90:6444-6448; Poljak, R. J., et al. (1994) Structure 2:1121-1123). Such antibody binding portions are known in the art (Kontermann and Dubel eds., Antibody Engineering (2001) Springer-Verlag. New York. 790 pp. (ISBN 3-540-41354-5). In addition single chain antibodies also include "linear antibodies" comprising a pair of tandem Fv segments (VH-CH1-VH-CH1) which, together with complementary light chain polypeptides, form a pair of antigen binding regions (Zapata et al. Protein Eng. 8(10):1057-1062 (1995); and U.S. Pat. No. 5,641,870).

[0068] As used herein, the terms "VH domain" and "VL domain" refer to single antibody variable heavy and light domains, respectively, comprising FR (Framework Regions) 1, 2, 3 and 4 and CDR (Complementary Determinant Regions) 1, 2 and 3 (see Kabat et al. (1991) Sequences of Proteins of Immunological Interest. (NIH Publication No. 91-3242, Bethesda).

[0069] As used herein, the terms "CH1 domain" and "CL1 domain" refer to single antibody heavy and light constant regions. A CL1 domain can be a C.kappa. or a C.lamda., domain.

[0070] As used herein, the term "CDR" or "complementarity determining region" means the noncontiguous antigen combining sites found within the variable region of both heavy and light chain polypeptides. These particular regions have been described by Kabat et al., J. Biol. Chem. 252, 6609-6616 (1977) and Kabat et al., Sequences of protein of immunological interest. (1991), and by Chothia et al., J. Mol. Biol. 196:901-917 (1987) and by MacCallum et al., J. Mol. Biol. 262:732-745 (1996) where the definitions include overlapping or subsets of amino acid residues when compared against each other. The amino acid residues which encompass the CDRs as defined by each of the above cited references are set forth for comparison. Preferably, the term "CDR" is a CDR as defined by Kabat, based on sequence comparisons.

[0071] As used herein the term "framework (FR) amino acid residues" refers to those amino acids in the framework region of an immunogobulin chain. The term "framework region" or "FR region" as used herein, includes the amino acid residues that are part of the variable region, but are not part of the CDRs (e.g., using the Kabat definition of CDRs).

[0072] As used herein, the term "specifically binds to" refers to the ability of a binding polypeptide to bind to an antigen with an Kd of at least about 1.times.10.sup.-6 M, 1.times.10.sup.-7 M, 1.times.10.sup.-8 M, 1.times.10.sup.-9 M, 1.times.10.sup.-10 M, 1.times.10.sup.-11 M, 1.times.10.sup.-12 M, or more, and/or bind to an antigen with an affinity that is at least two-fold greater than its affinity for a nonspecific antigen. It shall be understood, however, that the binding polypeptide are capable of specifically binding to two or more antigens which are related in sequence. For example, the binding polypeptides disclosed herein can specifically bind to both human and a non-human (e.g., mouse or non-human primate) orthologs of an antigen.

[0073] The term "Polypeptide" as used herein, refers to any polymeric chain of amino acids. The terms "peptide" and "protein" are used interchangeably with the term polypeptide and also refer to a polymeric chain of amino acids. The term "polypeptide" encompasses native or artificial proteins, protein fragments and polypeptide analogs of a protein sequence. A polypeptide may be monomeric or polymeric.

[0074] The term "linker" is used to denote polypeptides comprising two or more amino acid residues joined by peptide bonds and are used to link one or more antigen binding portions. Such linker polypeptides are well known in the art (see e.g., Holliger, P., et al. (1993) Proc. Natl. Acad. Sci. USA 90:6444-6448; Poljak, R. J., et al. (1994) Structure 2:1121-1123). Preferred linkers include, but are not limited to, the amino acid linkers set forth in Table 7 herein.

[0075] The term "K.sub.on", as used herein, is intended to refer to the on rate constant for association of an antibody to the antigen to form the antibody/antigen complex as is known in the art.

[0076] The term "K.sub.off", as used herein, is intended to refer to the off rate constant for dissociation of an antibody from the antibody/antigen complex as is known in the art.

[0077] The term "Kd", as used herein, is intended to refer to the dissociation constant of a particular antibody-antigen interaction as is known in the art.

[0078] The term "vector", as used herein, is intended to refer to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA loop into which additional DNA segments may be ligated. Another type of vector is a viral vector, wherein additional DNA segments may be ligated into the viral genome. Certain vectors are capable of autonomous replication in a host cell into which they are introduced (e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) can be integrated into the genome of a host cell upon introduction into the host cell, and thereby are replicated along with the host genome. Moreover, certain vectors are capable of directing the expression of genes to which they are operatively linked. Such vectors are referred to herein as "recombinant expression vectors" (or simply, "expression vectors"). In general, expression vectors of utility in recombinant DNA techniques are often in the form of plasmids. In the present specification, "plasmid" and "vector" may be used interchangeably as the plasmid is the most commonly used form of vector. However, the disclosure is intended to include such other forms of expression vectors, such as viral vectors (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which serve equivalent functions.

[0079] "Transformation", as defined herein, refers to any process by which exogenous DNA enters a host cell. Transformation may occur under natural or artificial conditions using various methods well known in the art. Transformation may rely on any known method for the insertion of foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method is selected based on the host cell being transformed and may include, but is not limited to, viral infection, electroporation, lipofection, and particle bombardment. Such "transformed" cells include stably transformed cells in which the inserted DNA is capable of replication either as an autonomously replicating plasmid or as part of the host chromosome. They also include cells which transiently express the inserted DNA or RNA for limited periods of time.

[0080] The term "recombinant host cell" (or simply "host cell"), as used herein, is intended to refer to a cell into which exogenous DNA has been introduced. It should be understood that such terms are intended to refer not only to the particular subject cell, but, to the progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term "host cell" as used herein. Preferably host cells include prokaryotic and eukaryotic cells selected from any of the Kingdoms of life. Preferred eukaryotic cells include protist, fungal, plant and animal cells. Most preferably host cells include but are not limited to the prokaryotic cell line E. Coli; mammalian cell lines CHO, HEK 293 and COS; the insect cell line Sf9; and the fungal cell Saccharomyces cerevisiae.

II. SINGLE-CHAIN MULTIVALENT BINDING PROTEINS

[0081] In one aspect, the disclosure provides single-chain multivalent binding proteins that can bind to two antigen simultaneously. In certain embodiments, the single-chain multivalent binding proteins generally comprise a polypeptide of the formula VH1-(X1)n-VH2-X2-VL1-(X3)n-VL2, where VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, where the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites.

[0082] In certain embodiments, the single chain binding protein has the formula CH1-X0-VH1-(X1)n-VH2-X2-CL1-X4-VL1-(X3)n-VL2, wherein CH1 is a heavy chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, CL1 is a light chain heavy domain, X4 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (h.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa..

[0083] In certain embodiments, X2 is a GS-rigid linker sequence. The GS rigid linker sequence can comprise an amino acid sequence selected from the group consisting of SEQ ID NOs:1-4.

[0084] In certain embodiments, the single-chain multivalent binding proteins generally comprise a polypeptide of the formula VL1-(X1)n-VL2-X2-VH1-(X3)n-VH2, where VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VHL2 is a second antibody heavy chain variable domain, and n is 0 or 1, where the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites

[0085] In certain embodiments, the single chain binding protein has the formula CL1-X0-VL1-(X1)n-VL2-X2-CH1-X4-VH1-(X3)n-VH2, wherein CL1 is a light chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, CH1 is a heavy chain constant domain, X4 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding site. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (h.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa..

[0086] In certain embodiments, X2 is a GS-rigid linker sequence. The GS rigid linker sequence can comprise an amino acid sequence selected from the group consisting of SEQ ID NOs:1-4.

[0087] In certain embodiments, the single-chain multivalent binding proteins are single-chain dual variable domain immunoglobulin molecules (scDVD). An exemplary scDVD is depicted in FIGS. 1A-1C herein. In other embodiments, the single-chain multivalent binding proteins are single-chain dual variable domain immunoglobulin Fab molecules (scDVDFab). An exemplary scDVDFab is depicted in FIG. 10A, herein.

[0088] In certain embodiments, the multivalent binding proteins comprise a cell surface anchoring moiety linked to the N and/or C terminus. Any molecule that can display the binding protein on the surface of a cell can be employed including, without limitation, cell surface protein and lipids. In certain embodiments, the anchoring moiety comprises the Aga2p polypeptide.

[0089] The antibody variable domains for the use in the single-chain multivalent binding proteins disclosed herein can be obtained using recombinant DNA techniques from a parent antibody (or DVD-Ig) generated by any method known in the art. In a certain embodiments, the variable domain is a murine heavy or light chain variable domain. In a certain embodiments, the variable domain is a CDR grafted or a humanized variable heavy or light chain domain. In a certain embodiments, the variable domain is a human heavy or light chain variable domain.

[0090] In certain embodiments, the first and second variable domains are linked directly to each other using recombinant DNA techniques. In certain embodiments, the variable domains are linked via a linker sequence. Preferably two variable domains are linked. Three or more variable domains may also be linked directly or via a linker sequence. The variable domains may bind the same antigen or may bind different antigens. Single-chain multivalent binding proteins molecules disclosed herein may include one immunoglobulin variable domain and one non-immunoglobulin variable domain such as ligand binding domain of a receptor, active domain of an enzyme. Single-chain multivalent binding proteins molecules may also comprise two or more non-Ig domains.

[0091] The linker sequence may be a single amino acid or a polypeptide sequence. Preferably the linker sequences are selected from the group consisting of consisting of the amino acid sequences set forth in FIG. 2 herein.

[0092] In certain embodiments, a heavy chain or light chain constant domain is linked to the single-chain multivalent binding proteins domains using recombinant DNA techniques. Additionally or alternatively, in certain embodiments, the DVD heavy chain is linked to an Fc region. The Fc region may be a native sequence Fc region, or a variant Fc region. In certain embodiments, the Fc region is a human Fc region. In one embodiment the Fc region includes an Fc region from IgG1, IgG2, IgG3, IgG4, IgA, IgM, IgE, or IgD.

III. LIBRARIES OF MULTIVALENT BINDING PROTEIN

[0093] In one aspect, the disclosure provides libraries of single-chain multivalent binding proteins (e.g., scDVD molecules). Such libraries are particularly useful for selecting multivalent binding proteins with improved properties relative to a reference binding molecule (e.g., improved binding kinetics or thermostability). Exemplary libraries and methods are set forth in FIGS. 8 and 9.

[0094] In certain embodiments, the library of binding proteins comprises a polypeptide chain having the general formula VH1-(X1)n-VH2-X2-VL1-(X3)n-VL2, wherein VH1 is a first heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second heavy chain variable domain, X2 is a linker, VL1 is a first light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second light chain variable domain, and n is 0 or 1, wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites, and wherein the amino acid sequences of VH1, X1, VH2, X2, VL1, X3, and/or VL2 independently vary within the library. In one embodiment, the polypeptide chain is a scDVD.

[0095] In certain embodiments, the diverse library of binding proteins comprises a polypeptide chain having the general formula CH1-X0-VH1-(X1)n-VH2-X2-CL1-X4-VL1-(X3)n-VL2, wherein CH1 is a heavy chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, X2 is a linker, CL1 is a light chain constant domain, X4 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, the VH2 and VL2 respectively combine to form two functional antigen binding sites, and wherein the amino acid sequences of VH1, X1, VH2, X2, VL1, X3, and/or VL2 independently vary within the library. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (hc.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa.. In one embodiment, the polypeptide chain is a scDVDFab.

[0096] In certain embodiments, X2 is a GS-rigid linker sequence. The GS rigid linker sequence can comprise an amino acid sequence selected from the group consisting of SEQ ID NOs:1-4.

[0097] In certain embodiments, the binding proteins further comprise a polypeptide chain having the general formula (VL1-(X1)n-VL2-X2-VH1-(X3)n-VH2, wherein VL1 is a first heavy chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second heavy chain variable domain, X2 is a linker, VH1 is a first light chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second light chain variable domain, and n is 0 or 1, wherein the VH1 and VL1, and the VH2 and VL2 respectively combine to form two functional antigen binding sites, and wherein the amino acid sequences of VL1, X1, VL2, X2, VH1, X3, and/or VH2 independently vary within the library. In one embodiment, the polypeptide chain is a scDVD.

[0098] In certain embodiments, the diverse library of binding proteins comprises a polypeptide chain having the general formula CL1-X0-VL1-(X1)n-VL2-X2-CH1-X4-VH1-(X3)n-VH2, wherein CL1 is a light chain constant domain, X0 is a linker with the proviso that it is not a constant domain, VL1 is a first antibody light chain variable domain, X1 is a linker with the proviso that it is not a constant domain, VL2 is a second antibody light chain variable domain, X2 is a linker, CH1 is a heavy chain constant domain, X4 is a linker with the proviso that it is not a constant domain, VH1 is a first antibody heavy chain variable domain, X3 is a linker with the proviso that it is not a constant domain, VH2 is a second antibody heavy chain variable domain, and n is 0 or 1, and wherein the VH1 and VL1, the VH2 and VL2 respectively combine to form two functional antigen binding site, and wherein the amino acid sequences of VH1, X1, VH2, X2, VL1, X3, and/or VL2 independently vary within the library. In certain embodiments, the CL1 light chain. Optionally, the CL1 domain can be a kappa (hc.kappa. or c.kappa.) or a lambda (hc.lamda. or c.lamda.) constant domain. In certain embodiments, CL1 is c.kappa.. In one embodiment, the polypeptide chain is a scDVDFab.

[0099] In certain embodiments, X2 is a GS-rigid linker sequence. The GS rigid linker sequence can comprise an amino acid sequence selected from the group consisting of SEQ ID NOs:1-4.

[0100] Any region of the polypeptide chains can be varied independently in the libraries disclosed herein. In certain embodiments, the amino acid sequences of at least one CDR of VH1, VH2, VL1 or VL2 independently varies within the library. In one embodiment, the amino acid sequences of HCDR3 of VH1, VH2 independently vary within the library. In one embodiment, the amino acid sequences of HCDR1 and HCDR2 of VH1 or VH2 independently vary within the library. In one embodiment, the amino acid sequences of HCDR1, HCDR2 and HCDR3 of VH1 or VH2 independently vary within the library. In one embodiment, the amino acid sequences of HCDR3 of VL1 or VL2 independently vary within the library. In one embodiment, the amino acid sequences of HCDR1 and HCDR2 of VL1 or VL2 independently vary within the library. In one embodiment, the amino acid sequences of HCDR1, HCDR2 and HCDR3 of VL1 or VL2 independently vary within the library.

[0101] The linker regions X1, X2 and/or X3 can be also be varied independently in the libraries disclosed herein. Any length and sequence of linkers can be employed. Suitable amino acid sequences for use in linker X1, X2 and/or X3 are set forth in FIG. 2 herein. In other embodiments, X2 is selected from the amino acid sequences set forth in FIG. 11B. In specific embodiments, X2 is selected from the amino acid sequences set forth in FIG. 11B when the polypeptide chain includes CH and CL domain.

[0102] In certain embodiments, the libraries disclosed herein are used in cell surface display techniques (e.g., yeast display as described in Wittrup, et al. U.S. Pat. No. 6,699,658, incorporated herein by reference). Accordingly, in certain embodiments, each binding protein in the library further comprises a cell surface anchoring moiety linked to the N and/or C terminus. Any molecule that can display the binding proteins on the surface of a cell can be employed including, without limitation, cell surface protein and lipids. In certain embodiments, the anchoring moiety comprise the Aga2p polypeptide.

[0103] In certain embodiments, each binding protein in the library further comprises an epitope tag that that can be recognized by binding protein (e.g., an antibody). Suitable tags include without limitation, include His, HA, c-myc, Flag, HSV, S, AcV5, E2, E, and StrepII tags.

[0104] In certain embodiments, the library of binding proteins are employed to affinity mature a reference binding protein (e.g., scDVD or scDVDFab). Accordingly, in certain embodiments, the library of binding proteins share at least 70, 75, 80, 85, 90, 95, 96, 97, 98, or 99 amino acid sequence identity with a reference binding protein (e.g., scDVD or scDVDFab). In certain embodiments, the VH1 and VH2 of the reference binding protein specifically bind to different antigens.

[0105] In another aspect, the disclosure provides libraries of polynucleotides encoding the diverse library of binding proteins. The libraries can be produced by any art recognized means. In certain embodiments, the libraries are produced by combining portions of other libraries by overlap PCR In certain embodiments, libraries are produced by combining portions of other libraries by gap repair transformation in yeast cells. In certain embodiments, the nucleic acids encoding the binding proteins are operably linked to one or more expression control elements (e.g., promoters or enhancer elements).

[0106] In another aspect, the disclosure provides libraries of expression vectors comprising the diverse library of polynucleotides disclosed herein. Any vectors suitable of expressing the binding proteins can be employed.

[0107] In another aspect, the disclosure provides a library of transformed host cells, expressing the diverse library of binding proteins disclosed herein. In certain embodiments, the individual transformed cells in the library of transformed host cells express only one species from the diverse library binding proteins.

[0108] Any cells, prokaryotic or eukaryotic, are suitable for use as host cells. In certain embodiments, the host cells are yeast including, without limitation, Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Candida albicans, Candida kefyr, Candida tropicalis, Cryptococcus laurentii, Cryptococcus neoformans, Hansenula anomala, Hansenula polymorpha, Kluyveromyces fragilis, Kluyveromyces lactis, Kluyveromyces marxianus, Pichia pastoris, Rhodotorula rubra, Schizosaccharomyces pombe and Yarrowia lipolytica.

[0109] In certain embodiments, the expressed binding proteins are anchored on the surface of the host cell. Any means for anchoring can be employed. In certain embodiments, the binding proteins are anchored on the cell surface through Aga1p. This is usually achieved by the fusion of the Aga2p protein the N and/or C terminus of the binding protein.

IV. SINGLE-CHAIN MULTIVALENT BINDING PROTEIN SCREENING METHODS

[0110] In another aspect, the disclosure provides a method of selecting a binding protein (e.g., scDVD or scDVDFab) that specifically binds to a target antigen. The method generally comprises: a) providing a diverse library of transformed host cells expressing a diverse library of binding proteins disclosed herein; b) contacting the host cells with the target antigen; and c) selecting a host cell that bind to the target antigen, thereby identifying a binding protein that specifically binds to a target antigen.

[0111] In another aspect, the disclosure provides a method of selecting a binding protein that specifically binds to a first and a second target antigen simultaneously. The method generally comprises: a) providing a diverse library of transformed host cells expressing a diverse library of binding proteins disclosed herein; b) contacting the host cells with the first and second target antigen; and c) selecting a host cell that bind to the first and second target antigen, thereby identifying a binding protein that specifically binds to a first and a second target antigen simultaneously.

[0112] In certain embodiments of the foregoing methods, host cells that bind to the first and/or second antigen are selected by Magnetic Activated Cell Sorting using magnetically labeled antigen. In certain embodiments of the foregoing methods, host cells that bind to the first and/or second antigen are selected by Fluorescence Activated Cell Sorting using fluorescently labeled antigen.

[0113] Any host cells, prokaryotic or eukaryotic, are suitable for use in the foregoing methods. In certain embodiments, the host cells are yeast including, without limitation, Saccharomyces cerevisiae, Saccharomyces carlsbergensis, Candida albicans, Candida kefyr, Candida tropicalis, Cryptococcus laurentii, Cryptococcus neoformans, Hansenula anomala, Hansenula polymorpha, Kluyveromyces fragilis, Kluyveromyces lactis, Kluyveromyces marxianus, Pichia pastoris, Rhodotorula rubra, Schizosaccharomyces pombe and Yarrowia lipolytica.

[0114] In certain embodiments, the expressed binding proteins are anchored on the surface of the host cell. Any means for anchoring can be employed. In certain embodiments, the binding proteins are anchored on the cell surface through Aga1p. This is usually achieved by the fusion of the Aga2p protein to one or more chain of the binding protein.

[0115] After selection of antigen-binding host cells, the polynucleotides encoding the binding proteins expressed by those cells can be isolated using any standard molecular biological means. These polynucleotides can be isolated and re-expressed in another cellular or acellular system as desired. Alternatively, these polynucleotides can be further modified and screened using the methods disclosed herein. In certain embodiments, the isolated polynucleotides are recombined with other polynucleotides (including libraries disclosed herein) to produce new, hybrid polynucleotides encoding novel binding proteins.

[0116] In certain embodiments, multiplex methods of screening libraries are employed. In such methods, each individual library is barcoded by one or more epitope tags that allows for differentiating one library or a subgroup or libraries from another library or a subgroup of libraries. Unique tag or tags are peptide sequences attached at the N-, C-, or both termini, or in the linker between VH and VL domains. The libraries are differentiated by binders (e.g., antibodies) to the epitope tags using flow cytometry or fluorescence activated cell sorting. The method of differentiation of libraries can be additive (a library having one or more tags distinct from the others) or subtractive (a library missing one ore more tags from the others). The libraries can be kept separately or combined (i.e. multiplexed) for analysis or cell sorting.

[0117] In the multiplex methods, the libraries are generally introduced to organisms that are amenable to magnetic and fluorescent activated cell sorting including, but not limited to, bacteria, yeast, and mammalian cells.

[0118] The libraries separated and distinguished by one or more tags can differ according to one or more of the following attributes: 1) antibody germline subgroups or sequences, light chain isotypes (kappa vs. lambda), or combinations thereof (e.g. specific VH/VL pairs); 2) natural or synthetic (or a combination thereof) antibody or TCR sequences; 3) cell type (B, T, plasma cells, etc); 4) tissues (peripheral blood, spleen, lymph node, bone marrow, tonsil, cord blood, etc); 5) species (human, mouse, rat, llama, rabbit, chicken, hamster, shark, etc); 6) protein scaffolds (antibodies, T cell receptors, etc); ormats (antibody and its fragments scFv, Fab, dAb, DVD-Ig, DVD-Fab, scDVD, scDVDFab, etc); 7) diversity and locations (framework vs. CDR diversity, HCDR3 size and diversity, HC vs. LC diversity, DVD-Ig linkers, domain orientation, etc; and/or 8) operation logistics (operators, lab locations, cell sorters, etc)

[0119] In certain embodiments, multiple diverse libraries are created, where each library contains clones that vary at a different discreet region of a reference binding protein. Each library is then screened separately for binding to the desired antigen(s) and the selected clones from each library are recombined to from a new library for screening. For example, to facilitate the affinity maturation of a reference binding protein, two distinct, diverse libraries can be created: a first diverse library in which only the HCDR1 and HCDR2 regions of a reference antibody are varied; and a second diverse library in which only the HCDR3 region of a reference antibody are varied. The first and the second library can be screened using the methods disclosed herein (e.g., using yeast display) to identify binding molecules with improved antigen binding characteristics. The polynucleotides encoding the selected binding proteins can then be recombined (e.g., by overlap PCR or yeast GAP repair) to form a third library comprising the HCDR1 and HCDR2 regions from the first library and the HCDR3 regions form second library. This third library can then be screened using the methods disclosed herein to identify binding proteins with further improved antigen binding characteristics. Exemplary libraries and methods are set forth in FIGS. 8 and 9.

[0120] Binding proteins selected using the methods disclosed herein can be isolated and re-expressed in another cellular or acellular system as desired.

V. ENGINEERED MULTIVALENT BINDING PROTEINS

[0121] In certain preferred embodiments, the single-chain multivalent binding proteins produced using the methods and compositions disclosed herein exhibit improved properties (e.g., affinity or stability) with respect to a corresponding parental reference binding protein. For example, the engineered binding protein may dissociate from its target antigen with a k.sub.off rate constant of about 0.1 s.sup.-1 or less, as determined by surface plasmon resonance, or inhibit the activity of the target antigen with an IC.sub.50 of about 1.times.10.sup.-6M or less. Alternatively, the binding protein may dissociate from the target antigen with a k.sub.off rate constant of about 1.times.10.sup.-2s.sup.-1 or less, as determined by surface plasmon resonance, or may inhibit activity of the target antigen with an IC.sub.50 of about 1.times.10.sup.-7M or less. Alternatively, the binding protein may dissociate from the target with a k.sub.off rate constant of about 1.times.10.sup.-3s.sup.-1 or less, as determined by surface plasmon resonance, or may inhibit the target with an IC.sub.50 of about 1.times.10.sup.-8M or less. Alternatively, binding protein may dissociate from the target with a k.sub.off rate constant of about 1.times.10.sup.-4s.sup.-1 or less, as determined by surface plasmon resonance, or may inhibit its activity with an IC.sub.50 of about 1.times.10.sup.-9M or less. Alternatively, binding protein may dissociate from the target with a k.sub.off rate constant of about 1.times.10.sup.-5s.sup.-1 or less, as determined by surface plasmon resonance, or inhibit its activity with an IC.sub.50 of about 1.times.10.sup.-1.degree. M or less. Alternatively, binding protein may dissociate from the target with a k.sub.off rate constant of about 1.times.10.sup.-5s.sup.-1 or less, as determined by surface plasmon resonance, or may inhibit its activity with an IC.sub.50 of about 1.times.10.sup.-11 M or less.

[0122] In certain embodiments, the engineered binding protein comprises a heavy chain constant region, such as an IgG1, IgG2, IgG3, IgG4, IgA, IgE, IgM or IgD constant region. Preferably, the heavy chain constant region is an IgG1 heavy chain constant region or an IgG4 heavy chain constant region. Furthermore, the binding protein can comprise a light chain constant region, either a kappa light chain constant region or a lambda light chain constant region. The binding protein comprises a kappa light chain constant region. In certain embodiments, the scDVD is reformatted into a DVD-Ig or a DVD-Fab molecule (scDVDFab).

[0123] In certain embodiments, the engineered binding protein comprises an engineered effector function known in the art (see, e.g., Winter, et al. U.S. Pat. Nos. 5,648,260; 5,624,821). The Fc portion of a binding protein mediates several important effector functions e.g. cytokine induction, ADCC, phagocytosis, complement dependent cytotoxicity (CDC) and half-life/clearance rate of binding protein and antigen-binding protein complexes. In some cases these effector functions are desirable for therapeutic binding protein but in other cases might be unnecessary or even deleterious, depending on the therapeutic objectives. Certain human IgG isotypes, particularly IgG1 and IgG3, mediate ADCC and CDC via binding to Fc.gamma.Rs and complement C1q, respectively. Neonatal Fc receptors (FcRn) are the critical components determining the circulating half-life of binding proteins. In still another embodiment at least one amino acid residue is replaced in the constant region of the binding protein, for example the Fc region of the binding protein, such that effector functions of the binding protein are altered.

[0124] In certain embodiments, the engineered binding protein is derivatized or linked to another functional molecule (e.g., another peptide or protein). For example, a labeled binding protein disclosed herein can be derived by functionally linking a binding protein or binding protein portion disclosed herein (by chemical coupling, genetic fusion, noncovalent association or otherwise) to one or more other molecular entities, such as another binding protein (e.g., a bispecific binding protein or a diabody), a detectable agent, a cytotoxic agent, a pharmaceutical agent, and/or a protein or peptide that can mediate associate of the binding protein with another molecule (such as a streptavidin core region or a polyhistidine tag).

[0125] Useful detectable agents with which a binding protein or binding protein portion disclosed herein may be derivatized include fluorescent compounds. Exemplary fluorescent detectable agents include fluorescein, fluorescein isothiocyanate, rhodamine, 5-dimethylamine-1-napthalenesulfonyl chloride, phycoerythrin and the like. A binding protein may also be derivatized with detectable enzymes, such as alkaline phosphatase, horseradish peroxidase, glucose oxidase and the like. When a binding protein is derivatized with a detectable enzyme, it is detected by adding additional reagents that the enzyme uses to produce a detectable reaction product. For example, when the detectable agent horseradish peroxidase is present, the addition of hydrogen peroxide and diaminobenzidine leads to a colored reaction product, which is detectable. A binding protein may also be derivatized with biotin, and detected through indirect measurement of avidin or streptavidin binding.

[0126] In other embodiment, the engineered binding protein is further modified to generate glycosylation site mutants in which the 0- or N-linked glycosylation site of the binding protein has been mutated. One skilled in the art can generate such mutants using standard well-known technologies. Glycosylation site mutants that retain the biological activity, but have increased or decreased binding activity, are another object of the present invention.

[0127] In still another embodiment, the glycosylation of the engineered binding protein or antigen-binding portion disclosed herein is modified. For example, an aglycoslated binding protein can be made (i.e., the binding protein lacks glycosylation). Glycosylation can be altered to, for example, increase the affinity of the binding protein for antigen. Such carbohydrate modifications can be accomplished by, for example, altering one or more sites of glycosylation within the binding protein sequence. For example, one or more amino acid substitutions can be made that result in elimination of one or more variable region glycosylation sites to thereby eliminate glycosylation at that site. Such aglycosylation may increase the affinity of the binding protein for antigen. Such an approach is described in further detail in PCT Publication WO2003016466A2, and U.S. Pat. Nos. 5,714,350 and 6,350,861, each of which is incorporated herein by reference in its entirety.

[0128] Additionally or alternatively, an engineered binding protein disclosed herein can be further modified with an altered type of glycosylation, such as a hypofucosylated binding protein having reduced amounts of fucosyl residues or a binding protein having increased bisecting GlcNAc structures. Such altered glycosylation patterns have been demonstrated to increase the ADCC ability of binding proteins. Such carbohydrate modifications can be accomplished by, for example, expressing the binding protein in a host cell with altered glycosylation machinery. Cells with altered glycosylation machinery have been described in the art and can be used as host cells in which to express recombinant binding proteins disclosed herein to thereby produce a binding protein with altered glycosylation. See, for example, Shields, R. L. et al. (2002) J. Biol. Chem. 277:26733-26740; Umana et al. (1999) Nat. Biotech. 17:176-1, as well as, European Patent No: EP 1,176,195; PCT Publications WO 03/035835; WO 99/54342 80, each of which is incorporated herein by reference in its entirety. Using techniques known in the art a practitioner may generate binding proteins exhibiting human protein glycosylation. For example, yeast strains have been genetically modified to express non-naturally occurring glycosylation enzymes such that glycosylated proteins (glycoproteins) produced in these yeast strains exhibit protein glycosylation identical to that of animal cells, especially human cells (U.S. patent Publication Nos. 20040018590 and 20020137134 and PCT publication WO2005100584 A2).

VI. PRODUCTION OF MULTIVALENT BINDING PROTEINS

[0129] Engineered binding proteins of the present disclosure may be produced by any of a number of techniques known in the art. For example, expression from host cells, wherein expression vector(s) encoding the heavy and light chains is (are) transfected into a host cell by standard techniques. The various forms of the term "transfection" are intended to encompass a wide variety of techniques commonly used for the introduction of exogenous DNA into a prokaryotic or eukaryotic host cell, e.g., electroporation, calcium-phosphate precipitation, DEAE-dextran transfection and the like. Although it is possible to express the binding proteins disclosed herein in either prokaryotic or eukaryotic host cells, expression of binding proteins in eukaryotic cells is preferable, and most preferable in mammalian host cells, because such eukaryotic cells (and in particular mammalian cells) are more likely than prokaryotic cells to assemble and secrete a properly folded and immunologically active binding protein.

[0130] Preferred mammalian host cells for expressing the recombinant binding proteins disclosed herein include Chinese Hamster Ovary (CHO cells) (including dhfr-CHO cells, described in Urlaub and Chasin, (1980) Proc. Natl. Acad. Sci. USA 77:4216-4220, used with a DHFR selectable marker, e.g., as described in R. J. Kaufman and P. A. Sharp (1982) Mol. Biol. 159:601-621), NS0 myeloma cells, COS cells and SP2 cells. When recombinant expression vectors encoding binding protein genes are introduced into mammalian host cells, the binding proteins are produced by culturing the host cells for a period of time sufficient to allow for expression of the binding protein in the host cells or, more preferably, secretion of the binding protein into the culture medium in which the host cells are grown. Binding proteins can be recovered from the culture medium using standard protein purification methods.

[0131] Host cells can also be used to produce functional binding protein fragments, such as Fab fragments or scFv molecules. It will be understood that variations on the above procedure are within the scope of the present disclosure. For example, it may be desirable to transfect a host cell with DNA encoding functional fragments of either the light chain and/or the heavy chain of a binding protein of this disclosure. Recombinant DNA technology may also be used to remove some, or all, of the DNA encoding either or both of the light and heavy chains that is not necessary for binding to the antigens of interest. The molecules expressed from such truncated DNA molecules are also encompassed by the binding proteins disclosed herein. In addition, bifunctional binding proteins may be produced in which one heavy and one light chain are a binding protein disclosed herein and the other heavy and light chain are specific for an antigen other than the antigens of interest by crosslinking a binding protein disclosed herein to a second binding protein by standard chemical crosslinking methods.

[0132] In a preferred system for recombinant expression of a binding protein, or antigen-binding portion thereof, disclosed herein, a recombinant expression vector encoding both the binding protein heavy chain and the binding protein light chain is introduced into dhfr-CHO cells by calcium phosphate-mediated transfection. Within the recombinant expression vector, the binding protein heavy and light chain genes are each operatively linked to CMV enhancer/AdMLP promoter regulatory elements to drive high levels of transcription of the genes. The recombinant expression vector also carries a DHFR gene, which allows for selection of CHO cells that have been transfected with the vector using methotrexate selection/amplification. The selected transformant host cells are cultured to allow for expression of the binding protein heavy and light chains and intact binding protein is recovered from the culture medium. Standard molecular biology techniques are used to prepare the recombinant expression vector, transfect the host cells, select for transformants, culture the host cells and recover the binding protein from the culture medium. Still further the disclosure provides a method of synthesizing a recombinant binding protein disclosed herein by culturing a host cell disclosed herein in a suitable culture medium until a recombinant binding protein disclosed herein is synthesized. The method can further comprise isolating the recombinant binding protein from the culture medium.

II. EXEMPLIFICATION

[0133] The present disclosure is further illustrated by the following examples which should not be construed as further limiting. The contents of Sequence Listing, figures and all references, patents and published patent applications cited throughout this application are expressly incorporated herein by reference.

Example 1. Generation of Single Chain Dual Variable Domain Molecules

[0134] The design of a scDVD molecule derived from a DVD-Ig is shown schematically in FIGS. 1A-1C. For comparison, the schematic diagrams of a DVD-Ig (FIG. 1B) and a scFv (FIG. 1C) have also been presented. The scDVD protein includes both the variable heavy and light chains of a DVD-Ig in their entirety with the carboxyl terminus of the VH domains tethered to the amino terminus of the VL domains through a Gly.sub.4Ser peptide linker (SEQ ID NO: 54) of 30, 35, 40 or 45 amino acids. VH1 and VH2 are paired connected with a specific linker sequence of 6 to 14 amino acids. VL1 and VL2 are paired connected with a specific linker sequence (SL) of 6 amino acids. Sequences encoding the variable regions were PCR amplified from DVD-Ig expression vectors. Primers were designed in such a way that amplified DNAs have the necessary overlap sequence to perform additional overlapping PCRs. The final fragment contains the VH domains, the long Gly.sub.4Ser linker (SEQ ID NO: 54), the VL domains and a peptide tag used to monitor expression of the scDVD on the surface of yeast. The construct is cloned by homologous recombination into a pYD yeast expression vector using DH5.alpha. chemically competent bacteria. Clones from the transformation were screened by bacteria colony PCR for the presence of the correct construct.

[0135] Several different linker sequences were evaluated for linking the VH domains or VL domains (see FIG. 2). The SL linkers correspond to the first 6 to 14 amino acids amino acids of the IgG1 constant region (ASTKGPSVFPLAPS (SEQ ID NO: 55)), or corresponding to the first 6 to 14 amino acids of the IgK constant region (RTVAAPSVFIFPPS (SEQ ID NO: 56)). The GS linkers correspond to 6 to 14 amino acids with repeats of Gly.sub.4Ser (SEQ ID NO: 54). The RL linkers correspond to sequences of 6 to 14 amino acids rich in Proline.

Example 2. scDVD Expression on the Surface of Yeast

[0136] The expression of scDVD on the surface of yeast and the suitability of the selected epitope tags for monitoring expression were evaluated. scDVD expression on the surface of yeast was monitored by flow cytometry analysis using antibodies against scDVD epitope tags. The expression of scDVD on the surface of yeast was found to be comparable to that observed for scFv molecules, with about 50% of the yeast cells expressing the scDVD construct (FIG. 3A). However, scDVD expression shows a lower mean fluorescence intensity compared to scFv, suggesting a lower number of scDVD molecules were expressed by single cell. FIG. 3A (right dot-plot) shows this difference when two different yeast cultures (one expressing scDVD and another expressing scFv) are labeled together in the same tube. Both constructs are expressed in about 50% of the cells (data not shown) but scFv clones have a higher mean fluorescence.

[0137] The length of the long Gly.sub.4Ser linker (SEQ ID NO: 54) did not greatly impact the ability of the cells to express the scDVD. A Gly.sub.4Ser linker (SEQ ID NO: 54) of 30 amino acids seemed to have a negative impact on the expression while there was no difference in expression when using Gly.sub.4Ser (SEQ ID NO: 54) of 35, 40 or 45 amino acids (FIG. 3B).

Example 3. scDVD Retains the Ability of DVD-Ig to Bind Both Targets

[0138] Two different DVD-Igs were expressed as scDVD on the surface of yeast using pYD vectors with three different tags (AcV5, E or StrepII peptide tags). Each construct was incubated with biotinylated antigens under the same conditions and concentrations. scDVD expression was monitored using epitope tags specific antibodies made in mouse, goat and rabbit, respectively. Fluorochrome labeled donkey anti-mouse, goat or rabbit antibodies were used as detection reagents. Mean fluorescence is shown in each individual dot-plot. DLL4/VEGF scDVD retains its ability to bind both DLL4 and/or VEGF (FIG. 4A). There is no difference in binding (mean fluorescence intensity) when the scDVD is incubated with DLL4, VEGF, or a mixture of the two antigens. The same findings were observed for TNF/SOST scDVD. This scDVD retains its ability to bind both TNF and/or Sclerostin (FIG. 4B). There is no difference in binding (mean fluorescence intensity) when the scDVD is incubated with TNF, SOST, or a mixture of the two antigens. Yeast cells express many copies of scDVD on the cell surface, accordingly, the simultaneous binding to both antigens could theoretically be due to some scDVD molecules on a cell binding to one antigen and other scDVD molecules on the same cell binding independently to the second antigen. However, the mean fluorescence do not change when the scDVD is incubated with one antigen, the other antigen or a mix of both antigens, suggesting that the scDVD molecules are binding both antigens simultaneously.

Example 4. scDVD Binds Both Antigens Regardless the Tag Used to Monitor its Expression on the Surface of Yeast

[0139] In yeast display, expression tags are used to monitor the antibody expression and to normalize the antigen-binding signal for expression, thus eliminating artifacts due to host expression bias. This allows for fine discrimination between mutants with different affinities towards their target. Experiments were performed to determine if any given functional DVD-Ig, when expressed as a scDVD, maintains its binding capabilities towards its two cognate targets regardless of the tag used to monitor its expression on the surface of yeast. Specifically, TNF/SOST DVD-Ig was expressed as scDVD on the surface of yeast using three different tags (AcV5, E or StrepII peptide tags). The three constructs were exposed to the same biotinylated antigens (TNF and Sclerostin) under the same conditions and concentrations. scDVD expression was monitored using tag-specific antibodies made in mouse (anti-AcV5; Abcam), goat (anti-E; Abcam) and rabbit (anti-StrepII; GeneScript). Fluorochrome labeled donkey anti-mouse (PerCP), goat (PE) or rabbit (DyLight488) antibodies were used as detection reagents (see Tables 1-3 herein). Antigen binding was monitored by APC conjugated streptavidin or Dylight633 conjugated neutravidin. All samples were analyzed by flow cytometry. FIG. 5 shows that it is feasible to use different peptide tags to monitor scDVD expression and binding on the surface of yeast.

Example 5. Binding Selection of a TNF/SOST scDVD Derived Library Demonstrate Expression and Binding Improvement Compare with the Parental scDVD

[0140] In order to test the ability of scDVD format expressed on the surface of yeast to enhance and affinity mature DVD-Ig, an affinity maturation of a TNF/SOST DVD-Ig was performed using different libraries. These libraries were constructed to contain limited mutations in different CDRs of SOST variable domains. The TNF/SOST scDVD protein sequence is set forth in FIG. 6A. To design these libraries hypermutated CDR residues were identified from other human antibody sequences. The corresponding SOST CDR residues were then subjected to limited mutagenesis by PCR with primers having low degeneracy (79% parental nucleotide and 21% all other three nucleotides) at these positions to create three antibody libraries in the scDVD format suitable for yeast surface display. The first library (H1+H2) contained mutations in HCDR1 and HCDR2 of SOST VH domain. The second library (H3) contained mutations in HCDR3 of SOST VH domain and the third library (LC) contained mutations in all CDRs of SOST VL domain. To further increase the identity of SOST variable domains to the human germline framework sequence, a binary degeneracy (50% parental 50% germline) at certain positions were introduced into the libraries and certain residues were germline (see FIG. 6B). The introduced changes were as follows:

H1+H2 Library:

[0141] Limited mutagenesis of residues: D30, D31, S52, H53, G54, D55, F56 and D58

[0142] Germlining 7 residues: G16R, T23A, S74A, T77S, G82bS, M87T, I89L

H3 Library:

[0143] Limited mutagenesis of residues: N95, N96, R97, G98, Y99, G100, G100a, L100b

[0144] Germlining 7 residues: G16R, T23A, S74A, T77S, G82bS, M87T, I89L

[0145] Binary degeneracy between SOST VH and germline at G94K

LC Library:

[0146] Limited mutagenesis of residues: S27, S30, T32, S40, S94

[0147] NNK randomization at residues N95a, G95b and S95c

[0148] Binary degeneracy between SOST VL and germline at G3V

[0149] These libraries (see FIG. 6B) were separately transformed and displayed on yeast cells and selected against low concentration of biotinylated Sclerostin and TNF by magnetic then fluorescence activated cell sorting. Each library was differently tagged by one of StrepII, FLAG or E peptide tags. scDVD expression and antigen binding were monitored by flow cytometry as described above using the antibodies described on Tables 2 and 3 herein.

[0150] After 2 and 4 rounds of selection, the binding towards Sclerostin was notably improved compared to the binding of the parental molecule. Parental TNF/SOST scDVD binds to 300 nM of Sclerostin after an incubation for 1 hour at 37.degree. C. No binding was observed when the parental molecule was incubated with 30 nM of Sclerostin. In contrast, after 2 rounds of selection the H3 library shows binding to 30 nM of Sclerostin, and after 4 round of selection the binding to 30 nM of Sclerostin is observed when the library output was incubated only for 20 minutes at room temperature (see FIG. 6C). Similar improvements were observed for the H1+H2 and LC libraries.

[0151] Once the diversity of each library is reduced to about 10.sup.3 the plasmid DNA from each output was isolated and the libraries are recombined by PCR into a new library (rHC+LC). This library was transformed into yeast cells and displayed on cell surfaces to be selected against biotinylated Sclerostin. After selection the improvement in affinity is very notorious. As pointed out the parental construct is able to bind Sclerostin at 300 nM when incubated for 1 hour at 37.degree. C. rHC+LC library output after 6 round of selection is able to bind 0.1 nM of Sclerostin when incubated only for 20 seconds at 4.degree. C. (FIG. 6D). Although, no formal quantification of the affinity is done, an improvement of more than 100 folds is expected based on this results. It is clear that scDVD based libraries could be selected and enriched for better binders.

Example 6. Binding Selection of TNF/SOST scDVD Libraries Shows Enrichment of SL Linkers Between VL Domains

[0152] As discussed above, there is a clear need for linker engineering during the construction and optimization of DVD-Ig antibodies. Steric hindrance due to the proximity of the outer variable domain to the ligand binding site of the inner VD could, at least partially, be responsible for a reduced affinity of a domain when engineered as the inner variable domain. Accordingly, experiments were performed to determine if the scDVD approach could be used to engineer linkers to pair VHs or VLs in a DVD-Ig. To this end, a TNF/SOST scDVD library was made by introducing 12 different linkers: four SL linkers corresponding to the first 6, 8, 10 and 12 amino acids amino acids of the IgK constant region; four GS linkers with repeats of Gly.sub.4Ser (SEQ ID NO: 54) of 6, 8, 10 and 12 amino acids; and four proline-rich RL linkers corresponding to 6, 8, 10 and 12 amino acids (see FIG. 7A). Additionally, residues S94, N95a, G95b and S95c of the LCDR3 of SOST VL were mutated by NNK randomization. After four rounds of selection using different concentrations of Sclerostin under different conditions, the library output showed enrichment in RL linkers especially of the longest size (12 and 10 amino acids; between 3 to 7 folds). Also, the GS linkers were significantly reduced (between 6 to 8 fold) (see FIG. 7B). This data clearly demonstrates that scDVD-based yeast surface display allows for the optimization and engineering of linkers to pair VHs or VLs.

TABLE-US-00001 TABLE 1 Peptide tags used on a panel of yeast expression vectors SEQ SEQ Peptide DNA ID Protein ID pYDsTEV Tag sequence NO: sequence NO: vectors HIS* CATCATCA 74 HHHHHH 85 CCATCACC AT V5 GGTAAGCC 75 GKPIPNPL 86 13767_pYDs_ TATCCCTA LGLDST TEV_total ACCCTCTC CTCGGTCT CGATTCTA CG c-MYC GAACAAAA 76 EQKLISEE 87 pYDsTEV_c-MYC ACTTATTT DL CTGAAGAA GATCTG HA TACCCATA 77 YPYDVPDY 88 pYDsTEV_HA CGATGTTC A CGGATTAC GCT HSV AGCCAGCC 78 SQPELAPE 89 pYDsTEV_HSV AGAACTCG DPED CTCCTGAA GACCCAGA GGAC FLAG GACTACAA 79 DYKDDDDK 90 pYDsTEV_FLAG GGACGACG ACGACAAG StrepII TGGAGCCA 80 WSHPQFEK 91 pYDsTEV_ TCCGCAGT StrepII TTGAGAAG E2 TCCAGCAC 81 SSTSSDFR 92 pYDsTEV_E2 CTCGAGTG DR ATTTTCGA GATCGC S AAGGAAAC 82 KETAAAKF 93 pYDsTEV_S CGCGGCTG ERQHMDS CCAAGTTT GAACGCCA GCATATGG ATAGC E GGAGCGCC 83 GAPVPYPD 94 pYDsTEV_E TGTACCAT PLEPR ATCCGGAT CCGCTGGA ACCGCGC AcV5 AGCTGGAA 84 SWKDASGW 95 pYDsTEV_AcV5 GGATGCGA S GCGGCTGG AGC *HIS tag is present in all pYDsTEV vectors downstream of all others tags.

TABLE-US-00002 TABLE 2 Commercially available anti-peptide tags antibodies used to monitor ScDVD antibody expression on yeast. Tag Ab Source Clone Source Catalog # S Mouse SBSTAGa Abcam ab24838 S Rabbit Polyclonal ab18588 AcV5 Mouse AcV5 Abcam. Rabbit S tag ab49581 antibody E2 Mouse 5E11 Abcam. AcV5 tag ab977 antibody E Rabbit Polyclonal Abcam T7 tag .RTM. ab3397 E Goat Polyclonal Abcam ab95868 E Chicken Polyclonal ab18695 StrepII Mouse Strep-tag Abcam. E tag antibody MCA2489 StrepII Rabbit Polyclonal Abcam. E tag antibody A00626 HA Mouse HA-7 Sigma H9658 HA Goat Polyclonal Abcam ab9134 HA Rat (IgG1) 3F10 Roche 11-867-423 c-myc Mouse 9E10 Sigma M4439 c-myc Rabbit Polyclonal Sigma C3956 Flag Mouse M2 Sigma F3165 Flag Rabbit Polyclonal Sigma F7425 HSV Rabbit Polyclonal Sigma H6030

TABLE-US-00003 TABLE 3 Commercially available secondary reagents used to monitor scFv antibody expression and binding on the surface of yeast Secondary reagent Fluorocrome Source Catalog # F(ab')2 Frag. Donkey Anti-Rat IgG PerCp Jackson 712-126-150 ImmunoResearch F(ab')2 Frag. Donkey Anti-Goat IgG R-PE Jackson ImmunoResearch F(ab')2 Frag. Donkey Anti-Rabbit IgG DyLight-488 Jackson 705-116-147 ImmunoResearch F(ab')2 Frag. Goat Anti-Rabbit IgG R-PE Jackson ImmunoResearch F(ab')2 Frag. Goat Anti-Rabbit IgG Alexafluor 488 Invitrogen 711-486-152 Chicken anti mouse IgG (H + L) PerCP Jackson 111-116-144 ImmunoResearch F(ab')2 Frag Donkey Anti-Mouse IgG Alexafluor 633 ThermoScientific 715-126-151

Example 7. Generation of a Single Chain Dual Variable Domain Fab (scDVDFab) Including Constant Regions

[0153] Another design of a scDVDFab antibody derived from a DVD-Ig is shown schematically in FIGS. 10A-10C. For comparison, the schematic diagrams of a DVD-Ig (FIG. 10B) and a scDVD (FIG. 10C) have also been presented. In this example, the scDVDFab protein includes the variable heavy (VH) and light (VL) chains of a DVD-Ig in their entirety with the CH1 region of the heavy chain and the kappa constant region (C.kappa.) of the light chain. As shown in FIG. 10A, The VL domains fused to the C.kappa. are tethered to the VH domains fused to the CH1 through a GS-rigid peptide linker of 41, 49, 57 or 65 amino acids from the carboxyl terminus of the Ck region to the amino terminus of the VH domains. These linkers are shown in greater detail below. VL1 and VL2 are paired connected with specific linkers already described and used in DVD-Igs and scDVD. The same is for VH1 and VH2 pair. FIG. 11A contains a schematic representation of a scDVDFab linear sequence.

[0154] Sequences encoding the variable regions were PCR amplified from the DVD-Ig expression vectors. Primers were designed in such a way that amplified DNAs had the necessary overlap sequence to perform additional overlapping PCRs. The final fragment contained the linear sequence represented in FIG. 11A plus a peptide tag used to monitor expression of the scDVDFab on the surface of yeast. The construct was cloned by homologous recombination into a pYD yeast expression vector using DH5a chemically competent bacteria. Clones from the transformation were screened by bacteria colony PCR for the presence of the right construct.

[0155] GS-Rigid Linkers

[0156] The GS-rigid linkers were made by combinations of different Gly/Ser segments and proline rich rigid segments. The sequences of the linkers are below and a GS-rigid linker scheme could be found in FIG. 11B. More specifically the GS-rigid linkers are composed as follows:

N-terminus-G.sub.3SG.sub.3-left rigid segment-G.sub.2SG.sub.2-right rigid segment-G.sub.3SG.sub.3-C-terminus ("G.sub.3SG.sub.3" disclosed as SEQ ID NO: 96 and "G.sub.2SG.sub.2" disclosed as SEQ ID NO: 97)

[0157] where the rigid segments vary in length and amino acid composition. The following rigid segments have been tested:

TABLE-US-00004 Right rigid segment in the linkers: (SEQ ID NO: 98) TPAPLPAPLPT 11 AA (SEQ ID NO: 99) TPAPTPAPLPAPLPT 15 AA (SEQ ID NO: 100) TPAPLPAPTPAPLPAPLPT 19 AA (SEQ ID NO: 101) TPAPLPAPLPAPTPAPLPAPLPT 23 AA Left rigid segments in the linkers: (SEQ ID NO: 5) TPLPAPLPAPT 11 AA (SEQ ID NO: 6) TPLPTPLPAPLPAPT 15 AA (SEQ ID NO: 7) TPLPAPLPTPLPAPLPAPT 19 AA (SEQ ID NO: 8) TPLPAPLPAPLPTPLPAPLPAPT 23 AA 41 aminoacids GS-rigid linker: (SEQ ID NO: 1) GGGSGGGTPLPAPLPAPTGGSGGTPAPLPAPLPTGGGSGGG 49 aminoacids GS-rigid linker: (SEQ ID NO: 2) GGGSGGGTPLPTPLPAPLPAPTGGSGGTPAPTPAPLPAPLP TGGGSGGG 57 aminoacids GS-rigid linker: (SEQ ID NO: 3) GGGSGGGTPLPAPLPTPLPAPLPAPTGGSGGTPAPTPAPTP APLPAPLPTGGGSGGG 65 aminoacids GS-rigid linker: (SEQ ID NO: 4) GGGSGGGTPLPAPLPAPLPTPLPAPLPAPTGGSGGTPAPTP APTPAPTPAPLPAPLPTGGGSGGG

Example 8. scDVDFab Expression on the Surface of Yeast

[0158] scDVDFab were expressed on the surface of yeast and the selected peptide tags were suitable for monitoring its expression. ScDVDFab expression on the surface of yeast was monitored by flow cytometry analysis and antibodies were used to detect peptide tags. A DVD-Ig was expressed as scDVDFab on the surface of yeast using pYD vectors and 4 different GS-rigid linkers. The expression of scDVDFab on the surface of yeast was comparable to that observed for scFv molecules reaching more than 50% of the yeast cells expressing the construct (FIG. 12). The length of the GS-rigid linker did not impact the ability of the cells to express the scDVDFab.

Example 9. ScDVDFab Retained the Ability of DVD-Ig to Bind Both Targets

[0159] Functional DVD-Ig expressed as scDVDFab maintained its binding capabilities towards its two targets on the surface of yeast. A DVD-Igs was expressed as scDVDFab on the surface of yeast using pYD vectors. Aliquots of the yeast culture were incubated with biotinylated antigens. scDVDFab expression was monitored by purified tag-specific antibodies. Fluorochrome labeled secondary antibodies were used as detection reagents. IL-1B/IL17 scDVDFab retains its ability to bind both IL1B and/or IL17 (FIG. 13).

Example 10. Binding to Both Targets is Comparable Between scDVDFab and DVD-Fab Formats Expressed on the Surface of Yeast

[0160] scDVDFab constructs bound both antigens in a similar way as the DVD-Fab bind them. A DVD-Ig was expressed as scDVDFab and DVD-Fab on the surface of yeast using pYD vectors. Aliquots of the yeast culture were incubated with biotinylated antigens. scDVDFab and DVD-Fab expression was monitored by purified tag-specific antibodies. Fluorochrome labeled secondary antibodies were used as detection reagents. The scDVDFab and DVD-Fab had similar binding profiles binding to both IL1B and IL17 on the surface of yeast. There is a small increase in the mean fluorescence of scDVDFab compared to DVD-Fab (FIG. 14).

Sequence CWU 1

1

101141PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 1Gly Gly Gly Ser Gly Gly Gly Thr Pro Leu Pro Ala Pro Leu Pro Ala 1 5 10 15 Pro Thr Gly Gly Ser Gly Gly Thr Pro Ala Pro Leu Pro Ala Pro Leu 20 25 30 Pro Thr Gly Gly Gly Ser Gly Gly Gly 35 40 249PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 2Gly Gly Gly Ser Gly Gly Gly Thr Pro Leu Pro Thr Pro Leu Pro Ala 1 5 10 15 Pro Leu Pro Ala Pro Thr Gly Gly Ser Gly Gly Thr Pro Ala Pro Thr 20 25 30 Pro Ala Pro Leu Pro Ala Pro Leu Pro Thr Gly Gly Gly Ser Gly Gly 35 40 45 Gly 357PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 3Gly Gly Gly Ser Gly Gly Gly Thr Pro Leu Pro Ala Pro Leu Pro Thr 1 5 10 15 Pro Leu Pro Ala Pro Leu Pro Ala Pro Thr Gly Gly Ser Gly Gly Thr 20 25 30 Pro Ala Pro Thr Pro Ala Pro Thr Pro Ala Pro Leu Pro Ala Pro Leu 35 40 45 Pro Thr Gly Gly Gly Ser Gly Gly Gly 50 55 465PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 4Gly Gly Gly Ser Gly Gly Gly Thr Pro Leu Pro Ala Pro Leu Pro Ala 1 5 10 15 Pro Leu Pro Thr Pro Leu Pro Ala Pro Leu Pro Ala Pro Thr Gly Gly 20 25 30 Ser Gly Gly Thr Pro Ala Pro Thr Pro Ala Pro Thr Pro Ala Pro Thr 35 40 45 Pro Ala Pro Leu Pro Ala Pro Leu Pro Thr Gly Gly Gly Ser Gly Gly 50 55 60 Gly 65 511PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 5Thr Pro Leu Pro Ala Pro Leu Pro Ala Pro Thr 1 5 10 615PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 6Thr Pro Leu Pro Thr Pro Leu Pro Ala Pro Leu Pro Ala Pro Thr 1 5 10 15 719PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 7Thr Pro Leu Pro Ala Pro Leu Pro Thr Pro Leu Pro Ala Pro Leu Pro 1 5 10 15 Ala Pro Thr 823PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 8Thr Pro Leu Pro Ala Pro Leu Pro Ala Pro Leu Pro Thr Pro Leu Pro 1 5 10 15 Ala Pro Leu Pro Ala Pro Thr 20 929PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 9Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Ser Gly Gly Gly Gly Glu Val Gln Leu Val Glu Ser Gly 20 25 1028PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 10Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Ser Gly Gly Gly Glu Val Gln Leu Val Glu Ser Gly 20 25 1127PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 11Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Ser Gly Gly Glu Val Gln Leu Val Glu Ser Gly 20 25 1226PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 12Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Ser Gly Glu Val Gln Leu Val Glu Ser Gly 20 25 1325PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 13Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Ser Glu Val Gln Leu Val Glu Ser Gly 20 25 1424PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 14Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Glu Val Gln Leu Val Glu Ser Gly 20 1523PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 15Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Gly Glu 1 5 10 15 Val Gln Leu Val Glu Ser Gly 20 1622PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 16Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly Glu Val 1 5 10 15 Gln Leu Val Glu Ser Gly 20 1721PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 17Thr Leu Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Glu Val Gln 1 5 10 15 Leu Val Glu Ser Gly 20 1828PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 18Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe 1 5 10 15 Pro Leu Ala Pro Glu Val Gln Leu Val Glu Ser Gly 20 25 1927PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 19Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe 1 5 10 15 Pro Leu Ala Glu Val Gln Leu Val Glu Ser Gly 20 25 2026PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 20Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe 1 5 10 15 Pro Leu Glu Val Gln Leu Val Glu Ser Gly 20 25 2125PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 21Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe 1 5 10 15 Pro Glu Val Gln Leu Val Glu Ser Gly 20 25 2224PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 22Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Phe 1 5 10 15 Glu Val Gln Leu Val Glu Ser Gly 20 2323PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 23Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Val Glu 1 5 10 15 Val Gln Leu Val Glu Ser Gly 20 2422PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 24Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser Glu Val 1 5 10 15 Gln Leu Val Glu Ser Gly 20 2521PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 25Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Glu Val Gln 1 5 10 15 Leu Val Glu Ser Gly 20 2629PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 26Thr Leu Val Thr Val Ser Ser Thr Pro Ala Pro Leu Pro Ala Pro Leu 1 5 10 15 Pro Ala Pro Thr Thr Glu Val Gln Leu Val Glu Ser Gly 20 25 2727PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 27Thr Leu Val Thr Val Ser Ser Thr Pro Ala Pro Leu Pro Ala Pro Ala 1 5 10 15 Pro Thr Thr Glu Val Gln Leu Val Glu Ser Gly 20 25 2825PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 28Thr Leu Val Thr Val Ser Ser Thr Pro Ala Pro Leu Pro Ala Pro Thr 1 5 10 15 Thr Glu Val Gln Leu Val Glu Ser Gly 20 25 2923PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 29Thr Leu Val Thr Val Ser Ser Thr Pro Ala Pro Leu Pro Thr Thr Glu 1 5 10 15 Val Gln Leu Val Glu Ser Gly 20 3021PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 30Thr Leu Val Thr Val Ser Ser Thr Pro Ala Pro Thr Thr Glu Val Gln 1 5 10 15 Leu Val Glu Ser Gly 20 3129PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 31Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly Ser 1 5 10 15 Gly Gly Gly Gly Ser Asp Ile Gln Met Thr Gln Ser Pro 20 25 3228PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 32Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly Ser 1 5 10 15 Gly Gly Gly Gly Asp Ile Gln Met Thr Gln Ser Pro 20 25 3327PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 33Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly Ser 1 5 10 15 Gly Gly Gly Asp Ile Gln Met Thr Gln Ser Pro 20 25 3426PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 34Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly Ser 1 5 10 15 Gly Gly Asp Ile Gln Met Thr Gln Ser Pro 20 25 3525PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 35Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly Ser 1 5 10 15 Gly Asp Ile Gln Met Thr Gln Ser Pro 20 25 3624PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 36Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly Ser 1 5 10 15 Asp Ile Gln Met Thr Gln Ser Pro 20 3723PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 37Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly Asp 1 5 10 15 Ile Gln Met Thr Gln Ser Pro 20 3822PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 38Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Asp Ile 1 5 10 15 Gln Met Thr Gln Ser Pro 20 3921PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 39Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Asp Ile Gln 1 5 10 15 Met Thr Gln Ser Pro 20 4028PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 40Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe 1 5 10 15 Ile Phe Pro Pro Asp Ile Gln Met Thr Gln Ser Pro 20 25 4127PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 41Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe 1 5 10 15 Ile Phe Pro Asp Ile Gln Met Thr Gln Ser Pro 20 25 4226PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 42Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe 1 5 10 15 Ile Phe Asp Ile Gln Met Thr Gln Ser Pro 20 25 4325PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 43Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe 1 5 10 15 Ile Asp Ile Gln Met Thr Gln Ser Pro 20 25 4424PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 44Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Phe 1 5 10 15 Asp Ile Gln Met Thr Gln Ser Pro 20 4523PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 45Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val Asp 1 5 10 15 Ile Gln Met Thr Gln Ser Pro 20 4622PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 46Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Asp Ile 1 5 10 15 Gln Met Thr Gln Ser Pro 20 4721PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 47Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Asp Ile Gln 1 5 10 15 Met Thr Gln Ser Pro 20 4829PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 48Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Leu Pro Ala Pro 1 5 10 15 Leu Pro Ala Pro Thr Asp Ile Gln Met Thr Gln Ser Pro 20 25 4927PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 49Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Leu Pro Ala Pro 1 5 10 15 Ala Pro Thr Asp Ile Gln Met Thr Gln Ser Pro 20 25 5025PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 50Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Leu Pro Ala Pro 1 5 10 15 Thr Asp Ile Gln Met Thr Gln Ser Pro 20 25 5123PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 51Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Leu Pro Thr Asp 1 5 10 15 Ile Gln Met Thr Gln Ser Pro 20 5221PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 52Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Thr Asp Ile Gln 1 5 10 15 Met Thr Gln Ser Pro 20 5350PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 53Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly 1 5 10 15 Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly 20 25 30 Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 35 40 45 Gly Ser 50 545PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 54Gly Gly Gly Gly Ser 1 5 5514PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 55Ala Ser Thr Lys Gly Pro Ser Val Phe Pro Leu Ala Pro Ser 1 5 10 5614PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 56Arg Thr Val Ala Ala Pro Ser Val Phe Ile Phe Pro Pro Ser 1 5 10 57508PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 57Glu Val Gln Leu Val Gln Ser Gly Ala Glu Val Lys Lys Pro Gly Ala 1 5 10 15 Ser Val Lys Val Ser Cys Lys Ala Ser Gly Tyr Thr Phe Ala Asn Tyr 20 25 30 Gly Ile Ile Trp Val Arg Gln Ala Pro Gly Gln Gly Leu Glu Trp Met 35 40 45 Gly Trp Ile Asn Thr Tyr Thr Gly Lys Pro Thr Tyr Ala Gln Lys Phe 50 55 60 Gln Gly Arg Val Thr Met Thr Thr Asp Thr Ser Thr Ser Thr Ala Tyr 65 70 75 80 Met Glu Leu Ser Ser Leu Arg Ser Glu Asp Thr Ala Val Tyr Tyr Cys 85 90 95 Ala Arg Lys Leu Phe Thr Thr Met Asp Val Thr Asp Asn Ala Met Asp 100 105 110 Tyr Trp Gly Gln Gly Thr Thr Val Thr Val Ser Ser Ala Ser Thr Lys 115 120 125 Gly Pro Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro 130 135 140 Gly Arg Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asp 145 150 155 160 Asp Tyr Ala Leu His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu 165 170 175 Trp Val Ser Gly Ile Ser Trp His Gly Asp Phe Ile Asp Tyr Ala Asp 180

185 190 Ser Val Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Ser 195 200 205 Leu Tyr Leu Gln Met Asn Ser Leu Arg Val Glu Asp Thr Ala Leu Tyr 210 215 220 Tyr Cys Ala Gly Asn Asn Arg Gly Tyr Gly Gly Leu Asp Val Trp Gly 225 230 235 240 Gln Gly Thr Thr Val Thr Val Ser Ser Gly Gly Gly Gly Ser Gly Gly 245 250 255 Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly 260 265 270 Gly Ser Gly Gly Gly Gly Ser Gly Gly Gly Gly Ser Asp Ile Gln Met 275 280 285 Thr Gln Ser Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr 290 295 300 Ile Thr Cys Arg Ala Ser Gln Asp Ile Ser Gln Tyr Leu Asn Trp Tyr 305 310 315 320 Gln Gln Lys Pro Gly Lys Ala Pro Lys Leu Leu Ile Tyr Tyr Thr Ser 325 330 335 Arg Leu Gln Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Gly Ser Gly 340 345 350 Thr Asp Phe Thr Leu Thr Ile Ser Ser Leu Gln Pro Glu Asp Phe Ala 355 360 365 Thr Tyr Phe Cys Gln Gln Gly Asn Thr Trp Pro Pro Thr Phe Gly Gln 370 375 380 Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Gln Ser Val 385 390 395 400 Leu Thr Gln Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln Arg Val Thr 405 410 415 Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ser Asn Thr Val Asn 420 425 430 Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu Leu Ile Tyr Ser 435 440 445 Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe Ser Gly Ser Lys 450 455 460 Ser Gly Thr Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln Ser Glu Asp 465 470 475 480 Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu Asn Gly Ser 485 490 495 Tyr Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu 500 505 58119PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 58Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Arg 1 5 10 15 Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Thr Phe Asp Asp Tyr 20 25 30 Ala Met His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val 35 40 45 Ser Gly Ile Ser Trp Asn Ser Gly Ser Ile Gly Tyr Ala Asp Ser Val 50 55 60 Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ala Lys Asn Ser Leu Tyr 65 70 75 80 Leu Gln Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Leu Tyr Tyr Cys 85 90 95 Ala Lys Asp Tyr Tyr Tyr Tyr Tyr Gly Met Asp Val Trp Gly Gln Gly 100 105 110 Thr Thr Val Thr Val Ser Ser 115 59119PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 59Glu Val Gln Leu Val Glu Ser Gly Gly Gly Leu Val Gln Pro Gly Gly 1 5 10 15 Ser Leu Arg Leu Ser Cys Thr Ala Ser Gly Phe Thr Phe Asp Asp Tyr 20 25 30 Ala Leu His Trp Val Arg Gln Ala Pro Gly Lys Gly Leu Glu Trp Val 35 40 45 Ser Gly Ile Ser Trp His Gly Asp Phe Ile Asp Tyr Ala Asp Ser Val 50 55 60 Lys Gly Arg Phe Thr Ile Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr 65 70 75 80 Leu Gln Met Asn Gly Leu Arg Val Glu Asp Met Ala Ile Tyr Tyr Cys 85 90 95 Ala Gly Asn Asn Arg Gly Tyr Gly Gly Leu Asp Val Trp Gly Gln Gly 100 105 110 Thr Thr Val Thr Val Ser Ser 115 60111PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 60Gln Ser Val Leu Thr Gln Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln 1 5 10 15 Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ser Asn 20 25 30 Thr Val Asn Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu Leu 35 40 45 Ile Tyr Ser Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe Ser 50 55 60 Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln 65 70 75 80 Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu 85 90 95 Asn Gly Pro Val Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu 100 105 110 61111PRTArtificial SequenceDescription of Artificial Sequence Synthetic polypeptide 61Gln Ser Gly Leu Thr Gln Pro Pro Ser Ala Ser Gly Thr Pro Gly Gln 1 5 10 15 Arg Val Thr Ile Ser Cys Ser Gly Ser Ser Ser Asn Ile Gly Ser Asn 20 25 30 Thr Val Asn Trp Tyr Gln Gln Leu Pro Gly Thr Ala Pro Lys Leu Leu 35 40 45 Ile Tyr Ser Asn Asn Gln Arg Pro Ser Gly Val Pro Asp Arg Phe Ser 50 55 60 Gly Ser Lys Ser Gly Thr Ser Ala Ser Leu Ala Ile Ser Gly Leu Gln 65 70 75 80 Ser Glu Asp Glu Ala Asp Tyr Tyr Cys Ala Ala Trp Asp Asp Ser Leu 85 90 95 Asn Gly Ser Tyr Val Phe Gly Gly Gly Thr Lys Leu Thr Val Leu 100 105 110 6228PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 62Gln Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val 1 5 10 15 Phe Ile Phe Pro Gln Ser Val Leu Thr Gln Pro Pro 20 25 6326PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 63Gln Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val 1 5 10 15 Phe Ile Gln Ser Val Leu Thr Gln Pro Pro 20 25 6424PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 64Gln Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Ser Val 1 5 10 15 Gln Ser Val Leu Thr Gln Pro Pro 20 6522PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 65Gln Gly Thr Lys Leu Glu Ile Lys Arg Thr Val Ala Ala Pro Gln Ser 1 5 10 15 Val Leu Thr Gln Pro Pro 20 6628PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 66Gln Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Ser Gly Gly Gly Gln Ser Val Leu Thr Gln Pro Pro 20 25 6726PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 67Gln Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Ser Gly Gln Ser Val Leu Thr Gln Pro Pro 20 25 6824PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 68Gln Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gly Gly 1 5 10 15 Gln Ser Val Leu Thr Gln Pro Pro 20 6922PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 69Gln Gly Thr Lys Leu Glu Ile Lys Arg Gly Gly Ser Gly Gly Gln Ser 1 5 10 15 Val Leu Thr Gln Pro Pro 20 7028PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 70Gln Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Leu Pro Ala 1 5 10 15 Pro Ala Pro Thr Gln Ser Val Leu Thr Gln Pro Pro 20 25 7126PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 71Gln Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Leu Pro Ala 1 5 10 15 Pro Thr Gln Ser Val Leu Thr Gln Pro Pro 20 25 7224PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 72Gln Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Leu Pro Thr 1 5 10 15 Gln Ser Val Leu Thr Gln Pro Pro 20 7322PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 73Gln Gly Thr Lys Leu Glu Ile Lys Arg Thr Pro Ala Pro Thr Gln Ser 1 5 10 15 Val Leu Thr Gln Pro Pro 20 7418DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 74catcatcacc atcaccat 187542DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 75ggtaagccta tccctaaccc tctcctcggt ctcgattcta cg 427630DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 76gaacaaaaac ttatttctga agaagatctg 307727DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 77tacccatacg atgttccgga ttacgct 277836DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 78agccagccag aactcgctcc tgaagaccca gaggac 367924DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 79gactacaagg acgacgacga caag 248024DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 80tggagccatc cgcagtttga gaag 248130DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 81tccagcacct cgagtgattt tcgagatcgc 308245DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 82aaggaaaccg cggctgccaa gtttgaacgc cagcatatgg atagc 458339DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 83ggagcgcctg taccatatcc ggatccgctg gaaccgcgc 398427DNAArtificial SequenceDescription of Artificial Sequence Synthetic oligonucleotide 84agctggaagg atgcgagcgg ctggagc 27856PRTArtificial SequenceDescription of Artificial Sequence Synthetic 6xHis tag 85His His His His His His 1 5 8614PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 86Gly Lys Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr 1 5 10 8710PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 87Glu Gln Lys Leu Ile Ser Glu Glu Asp Leu 1 5 10 889PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 88Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 1 5 8912PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 89Ser Gln Pro Glu Leu Ala Pro Glu Asp Pro Glu Asp 1 5 10 908PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 90Asp Tyr Lys Asp Asp Asp Asp Lys 1 5 918PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 91Trp Ser His Pro Gln Phe Glu Lys 1 5 9210PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 92Ser Ser Thr Ser Ser Asp Phe Arg Asp Arg 1 5 10 9315PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 93Lys Glu Thr Ala Ala Ala Lys Phe Glu Arg Gln His Met Asp Ser 1 5 10 15 9413PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 94Gly Ala Pro Val Pro Tyr Pro Asp Pro Leu Glu Pro Arg 1 5 10 959PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 95Ser Trp Lys Asp Ala Ser Gly Trp Ser 1 5 967PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 96Gly Gly Gly Ser Gly Gly Gly 1 5 975PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 97Gly Gly Ser Gly Gly 1 5 9811PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 98Thr Pro Ala Pro Leu Pro Ala Pro Leu Pro Thr 1 5 10 9915PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 99Thr Pro Ala Pro Thr Pro Ala Pro Leu Pro Ala Pro Leu Pro Thr 1 5 10 15 10019PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 100Thr Pro Ala Pro Leu Pro Ala Pro Thr Pro Ala Pro Leu Pro Ala Pro 1 5 10 15 Leu Pro Thr 10123PRTArtificial SequenceDescription of Artificial Sequence Synthetic peptide 101Thr Pro Ala Pro Leu Pro Ala Pro Leu Pro Ala Pro Thr Pro Ala Pro 1 5 10 15 Leu Pro Ala Pro Leu Pro Thr 20

* * * * *