Modified Gp140 Envelope Polypeptides Of Hiv-1 Isolates, Compositions, Stabilized Trimeric Complexes, And Uses Thereof Dey; Antu K. ; et al. [PROGENICS PHARMACEUTICAL: INC.: and CORNELL RESEARCH FOUNDATION INC.]

Modified Gp140 Envelope Polypeptides Of Hiv-1 Isolates, Compositions, Stabilized Trimeric Complexes, And Uses Thereof

Dey; Antu K. ; et al.

Patent Application Summary

U.S. patent application number 13/100941 was filed with the patent office on 2011-10-13 for modified gp140 envelope polypeptides of hiv-1 isolates, compositions, stabilized trimeric complexes, and uses thereof. This patent application is currently assigned to PROGENICS PHARMACEUTICAL: INC.: and CORNELL RESEARCH FOUNDATION INC.. Invention is credited to Antu K. Dey, Michael Franti, Sai Prasad N. Iyer, Yun (Kenneth) Kang, John P. Moore, William C. Olson.

Application Number	20110250220 13/100941
Document ID	/
Family ID	39430255
Filed Date	2011-10-13

United States Patent Application	20110250220
Kind Code	A1
Dey; Antu K. ; et al.	October 13, 2011

MODIFIED GP140 ENVELOPE POLYPEPTIDES OF HIV-1 ISOLATES, COMPOSITIONS, STABILIZED TRIMERIC COMPLEXES, AND USES THEREOF

Abstract

This invention provides a modified gp140 envelope polypeptide of an HIV-1 isolate comprising a gp120 polypeptide portion comprising consecutive amino acids and a gp41 ectodomain polypeptide portion comprising consecutive amino acids, said gp41 ectodomain polypeptide portion being modified to comprise isoleucine (I) at an amino acid position equivalent to amino acid position 535; glutamine (Q) at an amino acid position equivalent to amino acid position 543; serine (S) at an amino acid position equivalent to amino acid position 553; lysine (K) at an amino acid position equivalent to amino acid position 567; and arginine (R) at an amino acid position equivalent to amino acid position 588, the amino acid positions being numbered by reference to the HIV-1 isolate KNH1144. This invention also provides nucleic acids encoding such a polypeptide, vectors, host cells, trimeric complexes and compositions thereof. Also provided are antibodies generated against the modified polypeptides and trimeric complexes, and methods of using the modified polypeptides, compositions and trimeric complexes.

Inventors:	Dey; Antu K.; (Auburndale, MA) ; Moore; John P.; (New York, NY) ; Olson; William C.; (Yorktown Heights, NY) ; Iyer; Sai Prasad N.; (East Elmhurst, NY) ; Kang; Yun (Kenneth); (Livingston, NJ) ; Franti; Michael; (Cambridge, MA)
Assignee:	PROGENICS PHARMACEUTICAL: INC.: and CORNELL RESEARCH FOUNDATION INC.
Family ID:	39430255
Appl. No.:	13/100941
Filed:	May 4, 2011

Related U.S. Patent Documents


Application Number	Filing Date	Patent Number
12312016	Oct 7, 2009	7939083
PCT/US2007/022227	Oct 17, 2007
13100941
60855236	Oct 30, 2006
60854034	Oct 23, 2006

Current U.S. Class:	424/188.1 ; 435/69.3; 530/350
Current CPC Class:	A61K 39/21 20130101; C12N 2740/16134 20130101; A61K 39/12 20130101; C12N 2740/16122 20130101; A61K 39/00 20130101; A61P 31/18 20180101; A61P 37/04 20180101; C07K 14/005 20130101
Class at Publication:	424/188.1 ; 530/350; 435/69.3
International Class:	A61K 39/21 20060101 A61K039/21; A61P 31/18 20060101 A61P031/18; A61P 37/04 20060101 A61P037/04; C07K 14/16 20060101 C07K014/16; C12P 21/00 20060101 C12P021/00

Goverment Interests

[0002] This invention was made with support under Grant Nos. AI 36082 and AI 45463 and NIH contract N01 AI 30030 from the National Institutes of Health, U.S. Department of Health and Human Services. Accordingly, the United States Government has certain rights in the subject invention.

Claims

1-96. (canceled)

97. A purified, modified HIV-1 gp140 envelope polypeptide comprising (1) a gp120 polypeptide portion comprising consecutive amino acids and (2) a gp41 ectodomain polypeptide portion comprising consecutive amino acids, said gp41 ectodomain polypeptide portion being heterologous to the gp41 ectodomain polypeptide portion of a KNH1144 HIV-1 isolate and being modified to comprise methionine (M) at an amino acid position equivalent to amino acid position 535 (M535); glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the amino acid positions are numbered by reference to the KNH1144 HIV-1 isolate.

98. The purified, modified gp140 envelope polypeptide of claim 97, wherein (1) the gp120 polypeptide portion is further modified to comprise a cysteine (C) residue at an amino acid position equivalent to amino acid position 492, numbered by reference to the JR-FL HIV isolate, and (2) the gp41 ectodomain polypeptide portion is modified to comprise a cysteine (C) residue at an amino acid position equivalent to amino acid position 596, numbered by reference to the JR-FL HIV-1 isolate.

99. The purified, modified HIV-1 gp140 envelope polypeptide of claim 97 or claim 98, wherein the gp41 ectodomain polypeptide portion further comprises a proline (P) at an amino acid position equivalent to amino acid position 559, numbered by reference to the KNH1144 HIV-1 isolate.

100. The purified, modified HIV-1 gp140 envelope polypeptide of claim 97 wherein the gp120 or gp41 ectodomain is derived from a HIV-1 isolate of clade A, B, C, D, E, F, G, H, J or O.

101. The purified, modified HIV-1 gp140 envelope polypeptide of claim 97, wherein the HIV-1 isolate is an HIV-1.sub.JR-FL, HIV-1.sub.Ba-L, HIV-1.sub.5768, HIV-1.sub.DH123, HIV-1.sub.GUN-1, HIV-1.sub.89.6, or HIV-1.sub.HXB2 isolate.

102. A purified, modified gp160 polypeptide which comprises a consecutive amino acid sequence as set forth in SEQ ID NO:7 or SEQ ID NO:8.

103. A method of producing a stabilized trimeric complex of HIV gp120 and gp41 wherein said gp41 comprises the following mutations numbered by reference to the KNH1144 HIV-1 isolate: X535M, X543Q, X553S, X567K, and X588R, comprising the steps of: a) obtaining an expression vector comprising a nucleic acid which encodes a gp120 envelope polypeptide and a gp41 envelope polypeptide; b) introducing the following mutations into the portion of the nucleic acid which encodes the gp41 envelope polypeptide: X535M, X543Q, X553S, X567K, and X588R; c) transfecting a cell with the expression vector resulting from step b) under conditions permitting expression of the gp120 envelope polypeptide and the gp41 envelope polypeptide; and d) purifying the gp120 envelope polypeptides and gp41 envelope polypeptides expressed in step c) under conditions permitting association of gp120 envelope polypeptides and gp41 envelope polypeptides so as to form the stabilized trimeric complex.

104. A composition comprising the purified, modified polypeptide of any one of claims 97 to 99, and a pharmaceutically acceptable carrier, excipient, or diluent, and optionally, an adjuvant.

105. A composition comprising the purified, modified polypeptide of claim 102 and a pharmaceutically acceptable carrier, excipient, or diluent, and optionally, an adjuvant.

106. A trimeric complex which comprises a noncovalent oligomer of three identical purified, modified HIV-1 gp140 envelope polypeptides of any one of claims 97 to 99.

107. A trimeric complex which comprises a noncovalent oligomer of three identical purified, modified HIV-1 gp140 envelope polypeptides, each gp140 polypeptide comprising a gp120 polypeptide portion and a gp41 ectodomain polypeptide portion derived from the purified, modified gp160 polypeptide of claim 102.

108. A composition comprising the trimeric complex of claim 106, a pharmaceutically acceptable carrier, excipient, or diluent, and optionally, an adjuvant.

109. A composition comprising the trimeric complex of claim 107, a pharmaceutically acceptable carrier, excipient, or diluent, and optionally, an adjuvant.

110. The composition of claim 108, further comprising a non-ionic detergent.

111. The composition of claim 109, further comprising a non-ionic detergent.

Description

[0001] This application is a divisional of U.S. Ser. No. 12/312,016, filed Oct. 7, 2009, .sctn.371 national stage of PCT International Application No. PCT/US2007/022227, filed Oct. 17, 2007, which claims the benefit of U.S. Provisional Applications Nos. 60/655,236, filed Oct. 30, 2006 and 60/854,034, filed Oct. 23, 2006, the contents of each of which are hereby incorporated by reference into this application.

[0003] Throughout this application, certain publications are referenced. Full citations for these publications may be found immediately preceding the claims. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art to which this invention relates.

BACKGROUND OF THE INVENTION

[0004] The ability of human immunodeficiency virus type 1 (HIV-1) to enter its target cell and establish an infection is dependent on interactions between functional HIV envelope glycoprotein (Env) complexes on the virus and receptors on the host cell. The HIV-1 Env complex is initially synthesized as the polyprotein precursor gp160, which undergoes oligomerization, disulfide bond formation and extensive glycosylation in the endoplasmic reticulum (Earl, Moss, and Doms, 1991) and is then proteolytically cleaved into the surface (gp120) and transmembrane (gp41) subunits by furin-like endo-proteases in the Golgi network (Fields, 1996; Hunter and Swanstrom, 1990). The resulting Env complex is a trimer, with three gp120 proteins associated non-covalently with three gp41 subunits.

[0005] During the entry process, gp120 interacts with the CD4 receptor, which triggers conformational changes that facilitate gp120 binding to a coreceptor, CCR5 or CXCR4 (Berger, Murphy, and Farber, 1999; Rizzuto at al., 1998). These interactions promote extensive conformational changes in the gp41 subunit that drive the insertion of the hydrophobic gp41 N-terminal region (fusion peptide) into the host cell membrane. Subsequently, formation of the six-helix bundle configuration of the three gp41 ectodomains forces the juxtaposition of the viral and cell membranes, promoting their fusion (Doms and Moore, 2000; Jones, Korte, and Blumenthal, 1998; Melikyan et al., 2000; Moore and Doms, 2003; Sattentau and Moore, 1991; Sullivan et al., 1998; Wu et al., 1996; Zhang et al., 1999).

[0006] The trimeric nature of the Env complex has been confirmed by various lines of evidence (Blacklow, Lu, and Kim, 1995; Center et al., 2002; Center et al., 2001; Chan et al., 1997; Chan and Kim, 1998; Lu, Blacklow, and Kim, 1995; Zhu et al., 2003), most recently by cryo-electron microscopy (Zanetti et al., 2006; Zhu et al., 2006). The trimer is held together by labile, non-covalent inter-subunit interactions. The weak interactions between gp120 and gp41, and between individual gp41 subunits, are probably necessary to permit the conformational changes that are necessary for the process of virus-cell fusion to proceed efficiently, but such instability of the Env complexes complicates the generation of soluble forms of Env trimers that are suitable for vaccine research and structural studies. To obtain soluble Env trimers, the transmembrane (TM) region and the cytoplasmic tail (CT) are routinely deleted from gp41 to create gp140 proteins that contain gp120 and the gp41 ectodomain (gp41ECTO). Attempts to stabilize the non-covalent inter-subunit interactions have included mutating the cleavage site within gp140 to make uncleaved oligomers (Chakrabarti at al., 2002; Srivastava et al., 2002; Yang et al., 2000; Yang et al., 2002; Zhang et al., 2001) and engineering of an inter-subunit disulfide bond (Binley at al., 2000) between gp120 and gp41 and an isoleucine to proline substitution at position 559 (I559P) in the N-terminal heptad region of gp41 ectodomain (SOSIP) (Sanders et al., 2002) to promote gp41-gp41 association.

[0007] Despite the efforts to stabilize the naturally unstable Env complexes, problems with the stability of the complexes still exist. For example, gp120 rapidly dissociates from gp41 when soluble forms of gp140 proteins are expressed, and trimeric gp140 proteins can degrade into dimers and monomers, or associate into tetramers (dimers of dimers) and aggregates (Earl at al., 1994; Schulke et al., 2002; Staropoli et al., 2000). Similarly, monomeric and oligomeric gp120-gp41 structures are found to be present on cells that express Env proteins, as are both gp41 stumps from which gp120 has been shed and uncleaved proteins that have evaded the host cell proteases that typically process gp160 (Herrera et al., 2005; Kuznetsov at al., 2003; Moore et al., 2006; Thomas at al., 1991; Wyatt and Sodroski, 1998; Zhu et al., 2003).

[0008] Thus, stable, Env-based vaccines that mimic the native trimer conformation of the native Env structure and that remain stable when used as immunogens and vaccines are needed in the art to combat infection by HIV and its devastating consequences.

SUMMARY OF THE INVENTION

[0009] Described herein are the molecular determinants of enhanced trimer stability. These lie within the N-terminal region of gp41.sub.ECTO, an area with a well-documented role in gp41-gp41 interactions (Center, Kemp, and Poumbourios, 1997; Poumbourios et al., 1997; Shugars et al., 1996). Specifically, five amino acid changes based on the KNH1144 sequence have a trimer-stabilizing effect on heterologous gp140 proteins. The introduction of these changes does not impair the exposure of various neutralizing antibody epitopes on the resulting gp140 proteins, leaving the overall antigenic structure of the trimer not adversely affected.

[0010] The present invention provides a modified gp140 envelope polypeptide of an HIV-1 isolate comprising a gp120 polypeptide portion comprising consecutive amino acids and a gp41 ectodomain polypeptide portion comprising consecutive amino acids, the gp41 ectodomain polypeptide portion being modified to comprise glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); and lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and optionally being modified to comprise isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535) and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0011] This invention provides a modified gp140 envelope polypeptide of an HIV-1 isolate comprising a gp120 polypeptide portion comprising consecutive amino acids and a gp41 ectodomain polypeptide portion comprising consecutive amino acids, the gp41 ectodomain polypeptide portion being modified to comprise isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535); glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0012] The invention also provides a modified gp140 envelope polypeptide of an HIV-1 isolate comprising a gp120 polypeptide portion comprising consecutive amino acids and a gp41 ectodomain polypeptide portion comprising consecutive amino acids, the gp41 ectodomain polypeptide portion being modified to comprise isoleucine (I) at amino acid position 535; glutamine (Q) at amino acid position 543; serine (S) at amino acid position 553; lysine (K) at amino acid position 567; and arginine (R) at amino acid position 588, wherein the amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0013] This invention also provides a modified gp140 envelope polypeptide of an HIV-1 isolate, wherein a first portion of the gp140 polypeptide corresponds to a modified gp120 polypeptide and a second portion of the gp140 polypeptide corresponds to a modified gp41 ectodomain polypeptide, wherein the modified gp120 polypeptide comprises an A.fwdarw.C mutation at amino acid position 492, numbered by reference to the HIV-1 isolate JR-FL, and the modified gp41 ectodomain polypeptide comprises (i) a T.fwdarw.C mutation at amino acid position 596, numbered by reference to the HIV-1 isolate JR-FL; and (ii) isoleucine (I) at amino acid position 535; glutamine (Q) at amino acid position 543; serine (S) at amino acid position 553; lysine (K) at amino acid position 567; and arginine (R) at amino acid position 588, wherein amino acid positions 535, 543, 553, 567 and 588 are numbered by reference to the HIV-1 isolate KNH1144.

[0014] In an embodiment, the gp120 polypeptide portion of the above described modified gp140 envelope polypeptides is modified to comprise a cysteine (C) residue at an amino acid position equivalent to amino acid position 492, numbered by reference to the HIV isolate JR-FL. In an embodiment, the gp41 ectodomain polypeptide portion of the above described modified gp140 envelope polypeptide is modified to comprise a cysteine (C) residue at an amino acid position equivalent to amino acid position 596, numbered by reference to the HIV-1 isolate JR-FL. In an embodiment, the gp41 ectodomain polypeptide portion of the above described modified gp140 envelope polypeptide is modified to comprise a proline (P) residue at an amino acid position equivalent to amino acid position 559, numbered by reference to the HIV-1 isolate KNH1144. In an embodiment, in the above described modified gp140 polypeptides, the isoleucine (I) at the amino acid position equivalent to amino acid position 535 is the result of an M535I mutation; the glutamine (Q) at the amino acid position equivalent to amino acid position 543 is the result of an L543Q mutation; the serine (S) at the amino acid position equivalent to amino acid position 553 is the result of an N553S mutation; the lysine (K) at the amino acid position equivalent to amino acid position 567 is the result of a Q567K mutation; and the arginine (R) at the amino acid position equivalent to amino acid position 588 is the result of a G588R mutation, wherein the amino , acid positions 535, 543, 553, 567 and 588 are numbered by reference to the HIV-1 isolate KNH1144.

[0015] The invention provides a modified gp140 envelope polypeptide of an HIV-1 isolate, wherein a first portion of the gp140 polypeptide corresponds to a modified gp120 polypeptide and a second portion of the gp140 polypeptide corresponds to a modified gp41 ectodomain polypeptide, wherein the modified gp120 polypeptide comprises an cysteine (C) at an amino acid position equivalent to amino acid position 492 of the HIV-1 isolate JR-FL, and the modified gp41 ectodomain polypeptide comprises (i) a cysteine (C) at an amino acid position equivalent to amino acid position 596 of the HIV-1 isolate JR-FL; (ii) a proline (P) at an amino acid position equivalent to amino acid 559 of the HIV-1 isolate KNH1144; and (iii) isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535); glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the amino positions of (iii) are numbered by reference to the HIV-1 isolate KNH1144.

[0016] The invention further provides a modified gp140 envelope polypeptide of an HIV-1 isolate, wherein a first portion of the gp140 polypeptide corresponds to a modified gp120 polypeptide and a second portion of the gp140 polypeptide corresponds to a modified gp41 ectodomain polypeptide, wherein the modified gp120 polypeptide comprises an A.fwdarw.C mutation at amino acid position 492, numbered by reference to the HIV-1 isolate JR-FL, and the modified gp41 ectodomain polypeptide comprises (i) a T.fwdarw.C mutation at amino acid position 596, numbered by reference to the HIV-1 isolate JR-FL; and (ii) isoleucine (I) at amino acid position 535; glutamine (Q) at amino acid position 543; serine (S) at amino acid position 553; lysine (K) at amino acid position 567; and arginine (R) at amino acid position 588, wherein the 535, 543, 553, 567 and 588 amino acid positions are numbered by reference to the HIV-1 isolate KNH1144. In an embodiment, the modified gp140 envelope polypeptide also contains an I.fwdarw.P mutation at amino acid position 559, numbered by reference to the HIV-1 isolate KNH1144.

[0017] The present invention also provides a modified gp140 envelope polypeptide of an HIV-1 isolate, wherein a first portion of the gp140 polypeptide corresponds to a modified gp120 polypeptide and a second portion of the gp140 polypeptide corresponds to a modified gp41 ectodomain polypeptide, wherein the modified gp120 polypeptide comprises a cysteine (C) residue at an amino acid position equivalent to amino acid position 492 of the HIV-1 isolate JR-FL, and the modified gp41 ectodomain polypeptide comprises (i) a cysteine (C) residue at an amino acid position equivalent to amino acid position 596 of the HIV-1 isolate JR-FL; (ii) a proline (P) residue at an amino acid position equivalent to amino acid position 559 of the HIV-1 isolate KNH1144; and (iii) glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); and lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and optionally comprises isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535) and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the amino acid positions of (iii) are numbered by reference to the HIV-1 isolate KNH1144.

[0018] The invention provides an isolated nucleic acid encoding a modified form of an HIV-1 gp120 and gp41 polypeptide complex, wherein the modification in gp120 comprises a mutation of a non-cysteine amino acid to cysteine (C) at an amino acid position equivalent to amino acid position 492 of the HIV-1 isolate JR-FL; and the modifications in gp41 comprise a mutation of a non-cysteine amino acid to cysteine (C) at an amino acid position equivalent to amino acid position 596 of the HIV-1 isolate JR-FL, a mutation of a non-isoleucine amino acid to isoleucine (I) at an amino acid position equivalent to amino acid position 535 of the HIV-1 isolate KNH1144, a mutation of a non-glutamine amino acid to glutamine (Q) at an amino acid position equivalent to amino acid position 543 of the HIV-1 isolate KNH1144, a mutation of a non-serine amino acid to serine (S) at an amino acid position equivalent to amino acid position 553 of the HIV-1 isolate KNH1144, a mutation of a non-lysine amino acid to lysine (K) at an amino acid position equivalent to amino acid position 567 of the HIV-1 isolate KNH1144, and a mutation of a non-arginine amino acid to arginine (R) at an amino acid position equivalent to amino acid position 588 of the HIV-1 isolate KNH1144. In an embodiment, the modifications in gp41 encoded by the isolated nucleic acid further comprise a mutation of a non-proline amino acid to proline (P) at an amino acid position equivalent to amino acid position 559 of the HIV-1 isolate KNH1144. In an embodiment, the isolated nucleic acid is DNA, cDNA, or RNA. In an embodiment, an expression vector, which may contain an expression cassette, contains the above-described nucleic acid. In an embodiment, a eukaryotic or prokaryotic host cell contains the expression vector.

[0019] This invention further provides an isolated nucleic acid encoding a modified form of an HIV-1 gp120 and gp41 polypeptide complex, wherein the modification in gp120 comprises an A492C mutation and the modifications in gp41 comprise a T596C mutation, an M535/mutation; an L543Q mutation; an N553S mutation; a Q567K mutation and a G588R mutation, wherein the A492C and T596C mutations are numbered by reference to the HIV-1 isolate JR-FL, and the M535I, L543Q, N553S, Q567K and G588R mutations are numbered by reference to the HIV-1 isolate KNH1144.

[0020] This invention also provides a method for eliciting an immune response against HIV-1 or an HIV-1 infected cell in a subject comprising administering to the subject an amount of the composition of the invention effective to elicit the immune response in the subject.

[0021] This invention provides a method for eliciting an immune response against HIV-1 or an HIV-1 infected cell in a subject comprising administering to the subject an amount of the trimeric complex of the invention effective to elicit the immune response in the subject.

[0022] This invention also provides a method for preventing a subject from becoming infected with HIV-1, comprising administering to the subject an amount of the composition of the invention effective to prevent the subject from becoming infected with HIV-1.

[0023] This invention further provides a method for reducing the likelihood of a subject becoming infected with HIV-1, comprising administering to the subject an amount of the composition of the invention effective to reduce the likelihood of the subject becoming infected with HIV-1.

[0024] This invention also provides a method for delaying the onset of, or slowing the rate of progression of, an HIV-1-related disease in an HIV-1-infected subject, which comprises administering to the subject an amount of an isolated nucleic acid encoding a modified form of an HIV-1 gp120 and gp41 polypeptide complex, wherein the modification in gp120 comprises an A492C mutation and the modifications in gp41 comprise a T596C mutation, an M535I mutation; an L543Q mutation; an N553S mutation; a Q567K mutation and a G588R mutation, wherein the A492C and T596C mutations are numbered by reference to the HIV-1 isolate JR-FL, and the M535I, L543Q, N553S, Q567K and G588R mutations are numbered by reference to the HIV-1 isolate KNH1144 effective to delay the onset of, or slow the rate of progression of, the HIV-1-related disease in the subject.

[0025] This invention also provides a method of stabilizing HIV-1 trimer complexes which comprise non-covalently associated gp120 and gp41 envelope polypeptides, which polypeptides comprise consecutive amino acids, said method comprising: introducing into the gp41 ectodomain polypeptide an isoleucine (I) at an amino acid position equivalent to amino acid position 535; a glutamine (Q) at an amino acid position equivalent to amino acid position 543; a serine (S) at an amino acid position equivalent to amino acid position 553; a lysine (K) at an amino acid position equivalent to amino acid position 567; and an arginine (R) at an amino acid position equivalent to amino acid position 588, wherein the amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0026] The invention provides a chimeric gp140 polypeptide comprising (i) a gp120 envelope polypeptide of a clade B subtype of an HIV-1 isolate and (ii) a gp41 ectodomain polypeptide of the HIV-1 isolate KNH1144, said polypeptides comprising consecutive amino acids, wherein the KNH1144 gp41 ectodomain polypeptide comprises isoleucine (I) at amino acid position 535; glutamine (Q) at amino acid position 543; serine (S) at amino acid position 553; lysine (K) at amino acid position 567; and arginine (R) at amino acid position 588; and wherein the amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0027] The invention further provides a chimeric gp140 polypeptide comprising (i) a gp120 envelope polypeptide of a clade H subtype of an HIV-1 isolate and (ii) a gp41 ectodomain polypeptide of the HIV-1 isolate KNH1144, said polypeptides comprising consecutive amino acids, wherein the KNH1144 gp41 ectodomain polypeptide comprises the sequence as set forth in SEQ ID NO:1 or SEQ ID NO:18.

[0028] The invention further provides a chimeric gp140 polypeptide comprising (i) a gp120 envelope polypeptide of a clade 13 subtype of an HIV-1 isolate and (ii) a gp41 ectodomain polypeptide of the HIV-1 isolate KNH1144, said polypeptides comprising consecutive amino acids, wherein the KNH1144 gp41 ectodomain polypeptide comprises an amino acid sequence as set forth in SEQ ID NO:20 or SEQ ID NO:21, or the gp41 ectodomain polypeptide portion of the gp160 polypeptide as set forth in any one of SEQ ID NOS:5-8.

[0029] This invention provides a gp41 ectodomain polypeptide which comprises the consecutive amino acid sequence as set forth in any one of SEQ ID NO:1, SEQ ID NO:18, SEQ ID NO:2, SEQ ID NO:22, SEQ ID NO:3, SEQ ID NO:25, or SEQ ID NO:28, which sequences contain or are modified to contain one or more of the trimer stabilizing amino acid residues described herein. In an embodiment, the gp41 ectodomain polypeptide contains at least three of the trimer stabilizing amino acid residues.

[0030] The invention further provides a modified gp41 ectodomain polypeptide which comprises the consecutive amino acid sequence as set forth in any one of SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:24, SEQ ID NO:27, or SEQ ID NO:30.

[0031] The invention also provides a modified gp160 polypeptide, which comprises a consecutive amino acid sequence as set forth in any one of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14, or SEQ ID NO:17.

[0032] The invention further provides a gp160 polypeptide, which comprises the consecutive amino acid sequence as set forth in any one of SEQ ID NO:9, SEQ ID NO:12, or SEQ ID NO:15, which sequences are modified to contain one or more of the trimer stabilizing amino acid residues described herein. In an embodiment, the modified gp160 polypeptides contain at least three of the trimer stabilizing amino acid residues.

[0033] Also provided are the gp120 and gp41 portions of the modified gp160 polypeptides, which can complex to form stabilized trimers of the invention. In an embodiment, the trimers further comprise a non-ionic detergent as described herein.

[0034] The present invention further provides an antibody, or a portion of the antibody, generated by immunizing an animal with a modified gp140 polypeptide as described herein; an antibody, or a portion of the antibody, generated by immunizing an animal with a trimeric complex as described herein; an antibody, or a portion of the antibody, generated by immunizing an animal with a composition as described herein; an antibody, or a portion of the antibody, generated by immunizing an animal with a modified gp41 ectodomain polypeptide as described herein; or an antibody, or a portion of the antibody, generated by immunizing an animal with the modified gp160 polypeptide or a portion thereof, e.g., gp120 polypeptide and/or gp41 ectodomain polypeptide, as described herein. In an embodiment, the antibody is a monoclonal antibody, or a portion of the monoclonal antibody. In an embodiment, the antibody is a humanized antibody, or a portion of the humanized antibody.

[0035] This invention also provides trimeric complexes and compositions as described herein, further comprising a non-ionic detergent.

[0036] The invention further provides a use of a modified gp140 polypeptide, a trimeric complex, a composition, a modified gp41 ectodomain polypeptide,- or a modified gp160 polypeptide, or portion thereof, e.g., gp120 polypeptide and/or gp41 ectodomain polypeptide, for the preparation of a medicament for the treatment or prevention of infection by human immunodeficiency virus (HIV).

BRIEF DESCRIPTIONS OF THE FIGURES

[0037] FIGS. 1A and 1B: (A) Schematic view of gp41 region showing the location of the fusion peptide (FP), heptad repeat regions 1 and 2 (HR1 and HR2), the transmembrane region (TM) and the cytoplasmic tail (CT). The intramolecular disulfide bond is also shown. (B) Alignment of the N-terminus regions of KNH1144, JR-FL and Ba-L gp41, highlighting the 5 amino acids (bold and shaded) in and near the HR1 region (underlined) that differ in JR-FL and B-aL when compared to KNH1144.

[0038] FIGS. 2A-2C: Trimer formation by cleaved, wild-type and mutant KNH1144 gp140 proteins. (A) SOS and SOSIP versions of KNH1144 gp140 proteins. (B) KNH1144 SOSIP gp140 mutants containing the indicated single residue substitutions in the gp41 N-terminal region, compared with the wild-type KNH1144 SOSIP gp140. (C) KNH1144 SOSIP and SOS mutant gp140s, as indicated. Each panel shows a BN-PAGE analysis, followed by western blotting using MAb CA13.

[0039] FIGS. 3A-3C: Trimer formation by cleaved, wild-type and mutant JR-FL SOS gp140 proteins. (A) Design of various chimeric and mutant JR-FL gp140s. The intermolecular disulfide bond (SOS) and the Ile to Pro substitution at position 559 (I559P; SOSIP) are shown. (B) The indicated wild-type and mutant/chimeric gp140 proteins were analyzed using BN-PAGE and western blotting with MAb CA13. The designation NT 1-5 refers to substitution of the 5 amino acids M535I, L543Q, N553S, Q567K and G588R, in the gp41 N-terminus region. (C) The JR-FLgp120-1144gp41 (ECTO) SOS gp140 chimera and the JR-FL gp41 NT 1-5 SOS gp140 mutant were analyzed by SDS-PAGE and western blotting, followed by detection with MAb B13. The - and + symbols indicate the absence and presence of DTT.

[0040] FIGS. 4A and 4B: (A) The wild-type JR-FL SOS gp140 and (B) the JR-FL gp41 NT 1-5 SOS gp140 mutant were analyzed by size-exclusion chromatography followed by BN-PAGE and western blotting with MAb CA13. The mutant protein is predominantly trimeric, the wild-type protein mostly monomeric.

[0041] FIGS. 5A and 5B: (A) Representative SPR analysis of the binding of MAbs to the JR-FL SOS gp140 and the gp41 NT 1-5 SOS gp140 mutant to the following test agents were: (I) CD4IgG2, (II) b12, (III) 2G12, (IV) 2F5, (V) 4E10, (VI) PA-1, (VII) b6 and (VIII) 17b -/+D1D2-CD4. The y-axis shows the SPR response unit (RU), the x-axis the time in seconds (s). (B) Injected samples from the BIAcore machine were manually collected after the ligand binding analysis, then analyzed by BN-PAGE. The wild-type JR-FL SOS gp140 and the gp41 NT 1-5 SOS gp140 mutant proteins are shown, from a representative experiment, one using the PA-1 mAb.

[0042] FIGS. 6A and 6B: Stabilizing cleaved Ba-L SOS gp140 trimers. (A) The wild-type Ba-L SOS gp140 and the mutant Ba-L gp41 NT 1-4 SOS gp140 proteins were analyzed by BN-PAGE and western blotting with MAb CA13. (B) The same proteins were analyzed by SDS-PAGE and western blotting, followed by detection with MAb B13. The - and + symbols indicate the absence and presence of DTT.

[0043] FIG. 7: Effect of gp41 N-terminus substitutions on Env incorporation into pseudovirions. The JR-FL WT and gp41 NT 1-5 mutant viruses were produced by transfection of HEK 293T cells and pelleted from clarified supernatants. The gp120, gp41 and p24 proteins were resolved by SDS-PAGE and analyzed by Western blotting with the appropriate antibodies.

[0044] FIGS. 8A and 8B: Effect of gp41 N-terminal changes on the Env forms present on pseudovirions. FIG. 5A: Virions, normalized for p24 content and expressing either the JR-FL WT Env glycoprotein, or the gp41 NT 1-5 mutant Env glycoprotein, were solubilized and analyzed under native conditions on a 4-12% Bis-Tris NuPAGE gel and Western blotted with the anti-gp12-MAb ARP43119. Env tetramers and dimers are highlighted with black arrows; trimers are indicated with a gray arrow. FIG. 8B: The histogram shows the relative proportions of the different Env forms present on the WT (black bars) and mutant (gray bars) pseudovirions. The densitometric data represents the Mean.+-.Standard Deviation of values from four independent experiments.

[0045] FIGS. 9A and 9B: Effect of gp41 N-terminal substitutions on soluble CD4- and temperature-induced gp120 shedding from pseudovirions. FIG. 9A: Pseudovirions expressing the JR-FL WT Env or gp41 NT 1-5 mutant Env were incubated for 2 hours with sCD4 at the concentrations indicated, at either 4.degree. C. or 37.degree. C. FIG. 9B: The pseudovirions were incubated for 2 hours at the temperatures indicated in the absence of sCD4. In both experiments, the HxB2 Env-pseudotyped virus served as a reference standard. The amount of virion-bound Env is expressed relative to that present on each virus in the absence of 5CD4 at 4.degree. C. (=100%).

[0046] FIG. 10: Effect of gp41 N-terminal substitutions on Env-pseudotyped virus infectivity. Pseudovirions containing normalized amounts of p24 antigen and bearing the WT or mutant forms of JR-FL Env were serially diluted and used to infect U87.CD4.CCR5 cells. Infectivity was quantified by measuring luciferase activity four days post infection.

[0047] FIGS. 11A and 11B: Effect of gp41 N-terminal substitutions on Env-mediated cell-cell fusion. The kinetics of fusion mediated by the WT (black squares) and mutant (gray triangles) forms of JR-FL Env were determined in a .beta.-lactamase reporter assay using HeLa-CD4/CCR5 (RC49) cells. The extent of fusion is expressed as the percentage of the maximal fusion mediated by each Env (FIG. 11A), or the maximal fusion mediated by the WT Env (FIG. 11B). The data represent the Mean.+-.Standard Errors of three independent experiments. The various kinetic parameters are described in Table 3.

[0048] FIGS. 12A and 12B: Effect of gp41 N-terminal substitutions on the binding of MAbs to pseudovirions. Equal amounts (judged by p24 antigen content) of virions expressing either the WT (black bars) or mutant (white bars) forms of JR-FL Env were tested in a virus capture assay. The amount of p24 antigen captured by each of the indicated MAbs is recorded.

[0049] FIGS. 13A and 13B: Cell-surface expression of wild-type and gp41 mutant Env glycoproteins and their reactivity with CD4-IgG2 and MAbs. FIG. 13A: Cell surface-expressed Envs were biotinylated, avidin-precipitated and detected using MAb ARP3119. Cell surface expressed CD47 served as a loading control (lower panel). FIG. 13B: The WT and gp41 NT mutant Env glycoproteins were stained with 10 .mu.g/ml of biotinylated MAbs, followed by streptavidin-PE. Background fluorescence due to the secondary antibody was determined using isotype-matched controls; background values were subtracted from experimental values. The MFI (mean fluorescence intensity) values are shown as Mean.+-.Standard Deviation from a representative experiment performed in triplicate.

[0050] FIG. 14: Analysis of purified KNH1144 SOSIP R6 gp140 trimer and gp120 monomer. Purified KNH1144 gp120 monomer (left panel, gp120) and SOSIP R6 gp140 trimer were analyzed by reducing (left panel, SOSIP R6, Red) and non-reducing SDS-PAGE (left panel, SOSIP R6, NR). Proteins were visualized by Coomassie G-250 stain. Purified trimer was also analyzed via ARP3119 western blot on non-reducing SDS-PAGE to examine presence of SDS-insoluble aggregates (middle panel, Anti-Env blot). The numbers on the left represent the migratory positions of the molecular weight standard proteins. The right panel shows BN-PAGE analysis of purified trimer, either untreated or treated with Tween.RTM. 20 (SOSIPR6, -/+ lanes) and purified gp120 monomer in absence or presence of Tween.RTM. 20 treatment (gp120, -/+ lanes). Arrows indicate high molecular weight (HMW) aggregate, trimer and gp120 monomer species. M stands for the 669k thyroglobulin and 440k ferritin molecular weight protein standards.

[0051] FIGS. 15A-15D: Tween.RTM. 20 conversion experiments. (A) Dose response: Purified KNH1144 SOSIP R6 gp140 trimer was incubated with 0 (no detergent control), or 0.1, 0.05, 0.01, 0.001, or 0.0001% Tween.RTM. 20 and analyzed by BN-PAGE and Coomassie G-250 stain. Arrows point to HMW aggregate and trimer species. M stands for the 669k thyroglobulin and 440k ferritin molecular weight protein standards. (B) Time course: Purified KNH1144 SOSIP R6 gp140 trimer was incubated with Tween.RTM. 20 for 5 min (left panel) or 10 min (right panel). Trimer was either untreated (- lane) or Tween.RTM. 20 treated (+ lane). Arrows indicate trimer and HMW aggregate bands. (C) Temperature effect: Purified KNH1144 SOSIP R6 gp140 trimer was either untreated (- lane) or treated with Tween.RTM. 20 at on ice (0), room temperature (RT) or 37.degree. C. Reactions were analyzed by BN-PAGE and Coomassie G-250 stain. Arrows indicate HMW aggregate and trimer proteins. (D) Tween.RTM. 20 effect on HMW aggregate and dimer fractions: A preparation composed predominantly of HMW aggregate (>80%) was untreated (left panel, - lane), or incubated with Tween.RTM. 20 (left panel, + lane), and analyzed by BN-PAGE and Coomassie G-250 stain. Solid arrows indicate HMW aggregate and trimer proteins. Preparations composed of HMW aggregate, dimers and monomers were untreated (right panel, - lane) or incubated with Tween.RTM. 20 (right panel, + lane) and analyzed by BN-PAGE and Coomassie G-250 stain. Arrows on the right hand side point to aggregate, trimer, dimer and monomer species.

[0052] FIG. 16: Size Exchange Chromatography (SEC) analysis of KNH1144 SOSIP R6 gp140 trimer. KNH1144 SOSIP R6 gp140 trimer was resolved on a Superdex 200 10/300 GL column in TN-500 buffer containing 0.05% Tween.RTM. 20 (TNT-500). The A.sub.200 protein profile of the run is shown in the middle panel. Fractions B7-C3 from the run were analyzed by BN-PAGE, followed by silver stain (bottom panel). Arrows to the side of the BN-PAGE image point to the trimer. The vertical arrow in the BN-PAGE indicates the peak signal of the trimer in fraction B12. The arrow in the middle chromatograph corresponds to fraction B12.

[0053] FIGS. 17A and 17B: Effect of Tween.RTM. 20 treatment on KNH1144 SOSIP R6 HMW aggregate antigenicity. (A) Lectin ELISA of untreated and Tween.RTM. 20 treated KNH1144 SOSIP R6 HMW aggregate: Untreated or Tween.RTM. 20-treated HMW aggregate were bound to GNA lectin coated ELISA plates and probed with 2G12, b6, b12, CD4-IgG2, and HIVIg. The panels represent their respective binding curves. Antibody affinity to the untreated HMW aggregate is represented by the curve having diamond lines. Affinity to the Tween.RTM. 20 treated HMW aggregate is represented by curve having square lines. The Y-axis represents the colorimetric signal at OD492 and the X-axis represents antibody concentration in [ug/ml]. (B) Lectin ELISA of untreated and Tween.RTM. 20-treated KNH1144 SOSIP R6 gp140 trimer: Untreated or Tween.RTM. 20 treated trimer (containing 10-15% HMW aggregate) were bound to GNA lectin coated ELISA plates and probed with 2G12, b6, b12, and CD4-IgG2. The panels represent their respective binding curves. Antibody affinity to the untreated trimer is represented by the curve having diamond lines. Affinity to the Tween.RTM. 20 treated trimer is represented by the curve having square lines. The Y-axis represents the colorimetric signal at OD492 and the X-axis represents antibody concentration in [ug/ml].

[0054] FIG. 18: Effect of Tween.RTM. 20 treatment on KNH1144 SOSIP R6 gp140 trimer binding to DEAE anion exchange column. Purified KNH1144 SOSIP R6 gp140 trimer, spiked with alpha-2 macroglobulin (a.sub.2M) contaminant, was either untreated or treated with Tween.RTM. 20. Following treatment, sample was applied over an anion exchange column (DEAE HiTrap FF 1 ml column) (Load). Flow through (FT) fractions were collected and the column was washed (Wash). The column was eluted (Elution) and fractions were analyzed over BN-PAGE, followed by Coomassie G-250 stain. The top panel shows fractions analyzed from the untreated control trimer DEAE application. The bottom panel shows fractions analyzed from the Tween.RTM. 20 treated trimer DEAE application. Arrows point to trimer and a.sub.2M contaminant proteins. M stands for the 669k thyroglobulin and 440k ferritin molecular weight protein standards. Asterisks highlight the fraction where the trimer is found.

[0055] FIG. 19: Negative stain electron micrographs of KNH1144 SOSIP R6 gp140 trimers. KNH1144 SOSIP R6 gp140 trimers were analyzed by negative stain electron microscopy. A gallery of 19 selected trimeric proteins in deeper stain is shown. Bar=50 nm.

[0056] FIG. 20: SEC analysis of KNH1144 gp120 monomer: KNH1144 gp120 monomer was resolved on a Superdex 200 10/300 GL column in TN-500 buffer. The top chromatograph shows its A.sub.280 protein profile of the run. As a control, JR-FL gp120 monomer was resolved in a similar manner and its A.sub.280 protein profile is displayed in the bottom chromatograph. The observed retention times for both monomers and their apparent calculated molecular weights are indicated.

[0057] FIG. 21: Tween.RTM. 20 effect on a.sub.2M: Purified a.sub.2M was incubated with Tween.RTM. 20 (+ lane) or was untreated (- lane). Reactions were analyzed by BN-PAGE and Coomassie stain. Arrow indicates a.sub.2M band.

DETAILED DESCRIPTION OF THE INVENTION

Definitions

[0058] As used in this application, except as otherwise expressly provided herein, each of the following terms shall have the meaning set forth below.

[0059] The following standard abbreviations are used throughout the specification to indicate specific amino acids: A=ala=alanine; R=arg=arginine; N=asn=asparagine; D=asp=aspartic acid; C=cys=cysteine; Q=gln=glutamine; E=glu=glutamic acid; G=gly=glycine; H=his=histidine; I=ile=isoleucine; L=leu=leucine; K=lys=lysine; M=met=methionine; F=phe=phenylalanine; P=pro=proline; S=ser=serine; T=thr=threonine; W=trp=tryptophan; Y=tyr=tyrosine; V=val=valine; B=asx=asparagine or aspartic acid; Z=glx=glutamine or glutamic acid.

[0060] An "A492C mutation" refers to a point mutation of amino acid 492, for example, in the HIV-1.sub.JRFL isolate gp120 protein, from alanine to cysteine. Because of sequence and sequence numbering variability among different HIV strains and isolates, it will be appreciated that the same amino acid may not reside at position 492 in all other HIV isolates. For example, in the HIV-1 KNH1144 isolate, the corresponding amino acid, or the amino acid position that is equivalent to amino acid position A492 in the JR-FL isolate, is A511; in HIV-1.sub.HXB2 the corresponding or equivalent amino acid is A501 (Genbank Accession No. AAB50262); and in HIV-1., such amino acid is A499 (Genbank Accession No. AAA44992). The amino acid may also be an amino acid other than alanine or cysteine which has similar polarity or charge characteristics, for example. This invention encompasses the replacement of such amino acids by cysteine, as may be readily identified in other HIV isolates by those skilled in the art. Thus, the invention encompasses an HIV-1 isolate in which a cysteine residue replaces, or is substituted for, (e.g., by mutation), a non-cysteine amino acid at an amino acid position equivalent to position 492 in the HIV-1 isolate JR-FL. Illustratively, e.g., equivalent amino acid position(s) in other HIV-1 strains or clades may be determined by reference to SEQ ID NO:9, SEQ ID NO:2 and/or SEQ ID NO:22.

[0061] "I559P" refers to a point mutation wherein the isoleucine residue at position 559 of a polypeptide chain is replaced by a proline residue. Thus, the invention encompasses an HIV-1 isolate in which a proline residue replaces, or is substituted for, a non-proline (e.g., isoleucine) amino acid at an amino acid position equivalent to position 559 in the HIV-1 isolate KNH1144, for example. Illustratively, e.g., equivalent amino acid position(s) in other HIV-1 strains or clades may be determined by reference to SEQ ID NO:1, SEQ ID NO:5 and/or SEQ ID NO:18.

[0062] A "T596C mutation" refers to a point mutation of an amino acid at amino acid position 596 in the HIV-1.sub.JRFL isolate gp41 ectodomain from threonine to cysteine. Because of sequence and sequence numbering variability among different HIV strains and isolates, it will be appreciated that this amino acid will not be at position 596 in all other HIV isolates. For example, in HIV-1 KNH1144 isolate, the corresponding amino acid is T605; in HIV-1.sub.HXB2 the corresponding amino acid is T605 (Genbank Accession No. AAB50262); and in HIV-1.sub.NL4-3 the corresponding amino acid is T603 (Genbank Accession No. AAA44992). The amino acid may also be an amino acid other than threonine or cysteine which has similar polarity or charge characteristics, for example. This invention encompasses cysteine mutations in such amino acids, which can be readily identified in other HIV isolates by those skilled in the art. This invention encompasses the replacement, or substitution, of such amino acids by cysteine, as may be readily identified in other HIV isolates by those skilled in the art. Thus, the invention further encompasses an HIV-1 isolate in which a cysteine residue replaces, or is substituted for, a non-cysteine amino acid at an amino acid position equivalent to position 596 in the HIV-1 isolate JR-FL. Similarly, the invention encompasses an HIV-1 isolate in which a cysteine residue replaces, or is substituted for, a non-cysteine amino acid at an amino acid position equivalent to position 492 in the HIV-1 isolate JR-FL.

[0063] "HIV" refers to the human immunodeficiency virus. HIV includes, without limitation, HIV-1. HIV may be either of the two known types of HIV, i.e., HIV-1 or HIV-2. The HIV-1 virus may represent any of the known major subtypes or clades (e.g., Classes A, B, C, D, E, F, G, J, and H) or outlying subtype (Group 0). Also encompassed are other HIV-1 subtypes or clades that may be isolated.

[0064] "gp140 envelope" refers to a protein having two disulfide-linked polypeptide chains, the first chain comprising the amino acid sequence of the HIV gp120 glycoprotein and the second chain comprising the amino acid sequence of the water-soluble portion of HIV gp41 glycoprotein ("gp41 portion"). HIV gp140 protein includes, without limitation, proteins wherein the gp41 portion comprises a point mutation such as I559P. gp140 envelope comprising such mutation is encompassed by the terms "HIV SOS gp140", as well as "HIV gp140 monomer" or "SOSIP gp140".

[0065] "gp41" includes, without limitation, (a) the entire gp41 polypeptide including the transmembrane and cytoplasmic domains; (b) gp41 ectodomain (gp41.sub.ECTO); (c) gp41 modified by deletion or insertion of one or more glycosylation sites; (d) gp41 modified so as to eliminate or mask the well-known immunodominant epitope; (e) a gp41 fusion protein; and (f) gp41 labeled with an affinity ligand or other detectable marker. As used herein, "ectodomain" means the extracellular region of a transmembrane protein exclusive of the transmembrane spanning and cytoplasmic regions.

[0066] "Host cells" include, but are not limited to, prokaryotic cells, e.g., bacterial cells (including gram-positive cells), yeast cells, fungal cells, insect cells and animal cells. Suitable animal cells include, but are not limited to HeLa cells, COS cells, CV1 cells and various primary mammalian cells. Numerous mammalian cells can be used as hosts, including, but not limited to, mouse embryonic fibroblast NIH-3T3 cells, CHO cells, HeLa cells, L(tk-) cells and COS cells. Mammalian cells can be transfected by methods well known in the art, such as calcium phosphate precipitation, electroporation and microinjection. Electroporation can also be performed in viva as described previously (see, e.g., U.S. Pat. Nos. 6,110,161; 6,262,281; and 6,610,044).

[0067] "Immunizing" means generating an immune response to an antigen in a subject. This can be accomplished, for example, by administering a primary dose of an antigen, e.g., a vaccine, to a subject, followed after a suitable period of time by one or more subsequent administrations of the antigen or vaccine, so as to generate in the subject an immune response against the antigen or vaccine. A suitable period of time between administrations of the antigen or vaccine may readily be determined by one skilled in the art, and is usually on the order of several weeks to months. Adjuvant may or may not be co-administered.

[0068] "Nucleic acid" refers to any nucleic acid or polynucleotide, including, without limitation, DNA, RNA and hybrids thereof. The nucleic acid bases that form nucleic acid molecules can be the bases A, C, T, G and U, as well as derivatives thereof. Derivatives of these bases are well known in the art and are exemplified in PCR Systems, Reagents and Consumables (Perkin-Elmer Catalogue 1996-1997, Roche Molecular Systems, Inc., Branchburg, N.J., USA).

[0069] A "vector" refers to any nucleic acid vector known in the art. Such vectors include, but are not limited to, plasmid vectors, cosmid vectors and bacteriophage vectors. For example, one class of vectors utilizes DNA elements which are derived from animal viruses such as animal papilloma virus, polyoma virus, adenovirus, vaccinia virus, baculovirus, retroviruses (RSV, MMTC or MoMLV), Semliki Forest virus or SV40 virus. The eukaryotic expression plasmid PPI4 and its derivatives are widely used in constructs described herein. However, the invention is not limited to derivatives of the PPI4 plasmid and may include other plasmids known to those skilled in the art.

[0070] In accordance with the invention, numerous vector systems for expression of recombinant proteins may be employed. For example, one class of vectors utilizes DNA elements which are derived from animal viruses such as bovine papilloma virus, polyoma virus, adenovirus, vaccinia virus, baculovirus, retroviruses (RSV, MMTV or MoMLV), Semliki Forest virus or SV40 virus. Additionally, cells which have stably integrated the DNA into their chromosomes may be selected by introducing one or more markers which allow for the selection of transfected host cells. The marker may provide, for example, prototropy to an auxotrophic host, biocide (e.g., antibiotic) resistance, or resistance to heavy metals such as copper or the like. The selectable marker gene can be either directly linked to the DNA sequences to be expressed, or introduced into the same cell by cotransformation. Additional elements may also be needed for optimal synthesis of mRNA. These elements may include splice signals, as well as transcriptional promoters, enhancers, and termination signals. The cDNA expression vectors incorporating such elements include those described by (Okayama and Berg, 1983).

[0071] "Pharmaceutically acceptable carriers" are well known to those skilled in the art and include, but are not limited to, 0.01-0.1M and preferably 0.05M phosphate buffer, phosphate-buffered saline (PBS), or 0.9% saline. Additionally, such pharmaceutically acceptable carriers may include, but are not limited to, aqueous or non-aqueous solutions, suspensions, and emulsions. Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. Aqueous carriers, diluents and excipients include water, alcoholic/aqueous solutions, emulsions or suspensions, saline and buffered media. Parenteral vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, lactated Ringer's and fixed oils. Intravenous vehicles include fluid and nutrient replenishers, electrolyte replenishers such as those based on Ringer's dextrose, and the like. Solid compositions may comprise nontoxic solid carriers such as, for example, glucose, sucrose, mannitol, sorbitol, lactose, starch, magnesium stearate, cellulose or cellulose derivatives, sodium carbonate and magnesium carbonate. For administration in an aerosol, such as for pulmonary and/or intranasal delivery, an agent or composition is preferably formulated with a nontoxic surfactant, for example, esters or partial esters of C6 to C22 fatty acids or natural glycerides, and a propellant. Additional carriers such as lecithin may be included to facilitate intranasal delivery. Preservatives and other additives, such as, for example, antimicrobials, antioxidants, chelating agents, inert gases, and the like may also be included with all the above carriers.

[0072] Adjuvants are formulations and/or additives that are routinely combined with antigens to boost immune responses. Suitable adjuvants for nucleic acid based vaccines include, but are not limited to, saponins, Quil A, imiquimod, resiquimod, interleukin-12 delivered in purified protein or nucleic acid form, short bacterial immunostimulatory nucleotide sequences such as CpG-containing motifs, interleukin-2/Ig fusion proteins delivered in purified protein or nucleic acid form, oil in water micro-emulsions such as MF59, polymeric microparticles, cationic liposomes, monophosphoryl lipid A, immunomodulators such as Ubenimex, and genetically detoxified toxins such as E. coli heat labile toxin and cholera toxin from Vibrio. Such adjuvants and methods of combining adjuvants with antigens are well known to those skilled in the art.

[0073] Adjuvants suitable for use with protein immunization include, but are not limited to, alum; Freund's incomplete adjuvant (FIA); saponin; Quil A; QS-21; Ribi Detox; monophosphoryl lipid A (MPL) adjuvants such as Enhanzyn.TM.; nonionic block copolymers such as L-121 (Pluronic; Syntex SAF); TiterMax Classic adjuvant (block copolymer, CRL89-41, squalene and microparticulate stabilizer; Sigma-Aldrich); TiterMax Gold Adjuvant (new block copolymer, CRL-8300, squalene and a sorbitan monooleate; Sigma-Aldrich); Ribi adjuvant system using one or more of the following: monophosphoryl lipid A, synthetic trehalose, dicorynomycolate, mycobacterial cell wall skeleton incorporated into squalene and polysorbate-80; Corixa); RC-552 (a small molecule synthetic adjuvant; Corixa); Montanide adjuvants (including Montanide IMS111X, Montanide IMS131x, Montanide IMS221x, Montanide IMS301x, Montanide ISA 26A, Montanide ISA206, Montanide ISA 207, Montanide ISA25, Montanide ISA27, Montanide ISA28, Montanide ISA35, Montanide ISA50V, Montanide ISA563, Montanide ISA70, Montanide ISA 708, Montanide ISA740, Montanide ISA763A, and Montanide ISA773; Seppic Inc., Fairfield, N.J.); and N-Acetylmuramyl-L-alanyl-D-isoglutamine hydrate (Sigma-Aldrich). Methods of combining adjuvants with antigens are well known to those skilled in the art.

[0074] Because current vaccines depend on generating antibody responses to injected antigens, commercially available adjuvants have been developed largely to enhance these antibody responses. To date, the only FDA-approved adjuvant for use with human vaccines is alum. However, although alum helps boost antibody responses to vaccine antigens, it does not enhance T cell immune responses. Thus, adjuvants that are able to boost T cell immune responses after a vaccine is administered are also contemplated for use.

[0075] It is also known to those skilled in the art that cytotoxic T lymphocyte and other cellular immune responses are elicited when protein-based immunogens are formulated and administered with appropriate adjuvants, such as ISCOMs and micron-sized polymeric or metal oxide particles. Certain microbial products also act as adjuvants by activating macrophages, lymphocytes and other cells within the immune system, and thereby stimulating a cascade of cytokines that regulate immune responses. One such adjuvant is monophosphoryl lipid A (MPL) which is a derivative of the gram-negative bacterial lipid A molecule, one of the most potent immunostimulants known. The Enhanzyn.TM. adjuvant (Corixa Corporation, Hamilton, Mont.) consists of MPL, mycobacterial cell wall skeleton and squalene.

[0076] Adjuvants may be in particulate form. The antigen may be incorporated into biodegradable particles composed of poly-lactide-co-glycolide (PLG) or similar polymeric material. Such biodegradable particles are known to provide sustained release of the immunogen and thereby stimulate long-lasting immune responses to the immunogen. Other particulate adjuvants include, but are not limited to, micellular particles comprising Quillaia saponins, cholesterol and phospholipids known as immunostimulating complexes (ISCOMs; CSL Limited, Victoria AU), and superparamagnetic particles. Superparamagnetic microbeads include, but are not limited to, .mu.MACS.TM. Protein G and .mu.MACS.TM. Protein A microbeads (Miltenyi Biotec), Dynabeads.RTM. Protein G and Dynabeads.RTM. Protein A (Dynal Biotech). In addition to their adjuvant effect, superparamagnetic particles such as .mu.MACS.TM. Protein G and Dynabeads.RTM. Protein G have the important advantage of enabling immunopurification of proteins.

[0077] A "prophylactically effective amount" is any amount of an agent which, when administered to a subject prone to suffer from a disease or disorder, inhibits or prevents the onset of the disorder. The prophylactically effective amount will vary with the subject being treated, the condition to be treated, the agent delivered and the route of delivery. A person of ordinary skill in the art can perform routine titration experiments to determine such an amount. Depending upon the agent delivered, the prophylactically effective amount of agent can be delivered continuously, such as by continuous pump, or at periodic intervals (for example, on one or more separate occasions). Desired time intervals of multiple amounts of a particular agent can be determined without undue experimentation by one skilled in the art.

[0078] "Inhibiting" the onset of a disorder means either lessening the likelihood of the disorder's onset, preventing the onset of the disorder entirely, or in some cases, reducing the severity of the disease or disorder after onset. In the preferred embodiment, inhibiting the onset of a disorder means preventing its onset entirely.

[0079] "Reducing the likelihood of a subject's becoming infected with HIV-1" means reducing the likelihood of the subject's becoming infected with HIV-1 by at least two-fold. For example, if a subject has a 1% chance of becoming infected with HIV-1, a two-fold reduction in the likelihood of the subject becoming infected with HIV-1 would result in the subject having a 0.5% chance of becoming infected with HIV-1. In the preferred embodiment of this invention, reducing the likelihood of the subject's becoming infected with HIV-1 means reducing the likelihood of the subject's becoming infected with the virus by at least ten-fold.

[0080] "Subject" means any animal or artificially modified animal. Animals include, but are not limited to, humans, non-human primates, cows, horses, sheep, goats, pigs, dogs, cats, rabbits, ferrets, rodents such as mice, rats and guinea pigs, and birds and fowl, such as chickens and turkeys. Artificially modified animals include, but are not limited to, transgenic animals or SCID mice with human immune systems. In the preferred embodiment, the subject is a human.

[0081] "Exposed" to HIV-1 means contact or association with HIV-1 such that infection could result.

[0082] A "therapeutically effective amount" is any amount of an agent which, when administered to a subject afflicted with a disorder against which the agent is effective, causes the subject to be treated. "Treating" a subject afflicted with a disorder shall mean causing the subject to experience a reduction, diminution, remission, suppression, or regression of the disorder and/or its symptoms. In one embodiment, recurrence of the disorder and/or its symptoms is prevented. Most preferably, the subject is cured of the disorder and/or its symptoms.

[0083] "HIV-1 infected" means the introduction of viral components, virus particles, or viral genetic information into a cell, such as by fusion of cell membrane with HIV-1. The cell may be a cell of a subject. In the preferred embodiment, the cell is a cell in a human subject.

Embodiments of the Invention

[0084] The present invention encompasses HIV envelope (Env) glycoprotein complexes, which comprise non-covalently-associated surface gp120 and transmembrane gp41 glycoprotein subunits, and soluble forms thereof. The HIV envelope (Env) glycoprotein complexes of the invention are more structurally stable than native Env complexes, which are characteristically more labile or unstable in order to be capable of efficiently undergoing conformational changes during the process of virus-cell fusion.

[0085] In accordance with the present invention, the structural instability of the native HIV Env complex, or soluble forms thereof, is overcome by the introduction of amino acid sequence changes designed to stabilize inter-subunit interactions between gp120 and gp41, or between the gp41 components of a trimer. Such changes according to this invention include not only the introduction of a disulfide bond between gp120 and gp41; an additional change in gp41 that promotes trimer stability after gp120 and gp41 are cleaved into separate subunits during Env processing, and additional changes at the cleavage site between gp120 and gp41 to promote proteolytic processing, but also include amino acid changes, namely, five amino acid changes, in the highly conserved Leucine-zipper (LZ)-like motif near the N-terminus (NT) of gp41. The five amino acid changes, as described herein, were found to contribute to trimer stability by reducing the prevalence of monomeric, dimeric, or aggregated forms of gp140. Consequently, the present invention provides trimer stability enhancing amino acids which, when present in the NT of gp41 in an HIV isolate, allow the generation of more stable trimer complexes comprised of gp120 and gp41 envelope polypeptides. The invention thus provides a reduction in the qualitative heterogeneity of the Env glycoprotein, which is beneficial for the production of anti-HIV vaccines and immunogens designed to mimic the native trimeric form of viral Env.

[0086] In an embodiment, the invention encompasses envelope trimers for the production of virus like particles (VPLs) and pseudoparticles for use as VLP-based immunogens, to generate neutralizing antibodies, for example, and VLP-based vaccines against which a subject can mount a potent immune response against HIV. In accordance with the invention, gp120/gp41 trimers comprising the stabilizing N-terminal gp41 mutations of the invention, as well as gp120/gp41 trimers comprising other stabilizing mutations in gp120 and gp41 and the N-terminal gp41 mutations as described herein, are used to generate VPLs and pseudovirions having reduced monomer, diner and tetramer forms and enhanced trimer forms of gp120/gp41 Env. The N-terminal stabilizing mutations in the context of HIV-1 virus as described herein can yield trimer forms of Env (gp120/gp41) on VLP and pseudovirions, to the virtual exclusion of monomer, dimer and tetramer forms, thus allowing for an immunogen that more closely resembles native HIV envelope trimers.

[0087] This invention provides a modified gp140 envelope polypeptide of an HIV-1 isolate comprising a gp120 polypeptide portion comprising consecutive amino acids, and a gp41 ectodomain polypeptide portion comprising consecutive amino acids, said gp41 ectodomain polypeptide portion being modified to comprise isoleucine (I) at amino acid position 535 (I535); glutamine (Q) at amino acid position 543 (Q543); serine (S) at amino acid position 553 (S553); lysine (K) at amino acid position 567 (K567); and arginine (R) at amino acid position 588 (R588), wherein the amino acid positions are numbered by reference to the HIV-1 isolate KNH1144. (e.g., SEQ ID NO:1, SEQ ID NO:5 and/or SEQ ID NO:18). In one embodiment, the isoleucine (I) at amino acid position 535 is the result of an M535I mutation. In another embodiment, the glutamine (Q) at amino acid position 543 is the result of an L543Q mutation. In another embodiment, the serine (S) at amino acid position 553 is the result of an N553S mutation. In yet another embodiment, the lysine (K) at amino acid position 567 is the result of a Q567K mutation. In another embodiment, the arginine (R) at amino acid position 588 is the result of a G588R mutation.

[0088] Because the amino acid positions of different HIV-1 isolates may not be identical with those of the HIV-1 isolate KNH1144, the invention further provides a modified gp140 envelope polypeptide of an HIV-1 isolate comprising a gp120 polypeptide portion comprising consecutive amino acids, and a gp41 ectodomain polypeptide portion comprising consecutive amino acids, said gp41 ectodomain polypeptide portion being modified to comprise isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535); glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); lysine (K) at an amino acid position equivalent to amino acid position 567 K567); and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0089] The invention also provides a modified gp140 envelope polypeptide of an HIV-1 isolate comprising a gp120 polypeptide portion comprising consecutive amino acids and a gp41 ectodomain polypeptide portion comprising consecutive amino acids, said gp41 ectodomain polypeptide portion being modified to comprise glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); and lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and optionally being modified to comprise isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535) and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588); wherein the amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0090] This invention further provides a modified gp140 envelope polypeptide of an HIV-1 isolate, wherein a first portion of the gp140 polypeptide corresponds to a modified gp120 polypeptide and a second portion of the gp140 polypeptide corresponds to a modified gp41 ectodomain polypeptide, wherein the modified gp120 polypeptide comprises a cysteine (C) at an amino acid position equivalent to amino acid position 492 of the HIV-1 isolate JR-FL (e.g., SEQ ID NO:9), and the modified gp41 ectodomain polypeptide comprises (i) a cysteine (C) at an amino acid position equivalent to amino acid position 596 of the HIV-1 isolate JR-FL (e.g., SEQ ID NOS:2 and 22); and (ii) at least one of isoleucine (I) at an amino amino acid position equivalent to amino acid position 535 (I535); glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the amino positions of (ii) are numbered by reference to the HIV-1 isolate KNH1144. In one embodiment, the modified gp140 envelope polypeptide further comprises proline (P) at an amino acid position equivalent to amino acid position 559, numbered by reference to the HIV-1 isolate KNH1144. In another embodiment, the gp41 ectodomain polypeptide portion of the modified gp140 envelope polypeptide is modified to comprise glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); and lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and is optionally modified to comprise isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535) and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the 543, 553, 567, 535 and 588 amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0091] In another embodiment, the gp41 ectodomain polypeptide portion of the modified gp140 envelope polypeptide is modified to comprise glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535); and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the 543, 553, 567, 535 and 588 amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0092] This invention provides a modified gp140 envelope polypeptide of an HIV-1 isolate, wherein a first portion of the gp140 polypeptide corresponds to a modified gp120 polypeptide and a second portion of the gp140 polypeptide corresponds to a modified gp41 ectodomain polypeptide, wherein the modified gp120 polypeptide comprises a cysteine (C) at an amino acid position equivalent to amino acid position 492 of the HIV-1 isolate JR-FL, and the modified gp41 ectodomain polypeptide comprises (i) a cysteine (C) at an amino acid position equivalent to amino acid position 596 of the HIV-1 isolate JR-FL; (ii) a proline (P) at an amino acid position equivalent to amino acid position 559 of the HIV-1 isolate KNH1144 (e.g., SEQ ID NO:6; SEQ ID NO:19); and (iii) one or more of isoleucine (I) at an amino amino acid position equivalent to amino acid position 535 (I535); glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the amino positions of (iii) are numbered by reference to the HIV-1 isolate KNH1144.

[0093] In an embodiment, the gp41 ectodomain polypeptide portion of the modified gp140 envelope polypeptide is modified to comprise glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); and lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); and is optionally modified to comprise isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535) and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the 543, 553, 567, 535 and 588 amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0094] In another embodiment, the gp41 ectodomain polypeptide portion of the modified gp140 envelope polypeptide is modified to comprise glutamine (Q) at an amino acid position equivalent to amino acid position 543 (Q543); serine (S) at an amino acid position equivalent to amino acid position 553 (S553); lysine (K) at an amino acid position equivalent to amino acid position 567 (K567); isoleucine (I) at an amino acid position equivalent to amino acid position 535 (I535); and arginine (R) at an amino acid position equivalent to amino acid position 588 (R588), wherein the 543, 553, 567, 535 and 588 amino acid positions are numbered by reference to the HIV-1 isolate KNH1144. In another embodiment, the isoleucine (I) at an amino acid position equivalent to amino acid position 535 is the result of an M535I mutation; the glutamine (Q) at an amino acid position equivalent to amino acid position 543 is the result of an L543Q mutation; the serine (S) at an amino acid position equivalent to amino acid position 553 is the result of an N5535 mutation; the lysine (K) at an amino acid position equivalent to amino acid position 567 is the result of a Q567K mutation; and the arginine (R) at an amino acid position equivalent to amino acid position 588 is the result of a G588R mutation, wherein the 543, 553, 567, 535 and 588 amino acid positions are numbered by reference to the HIV-1 isolate KNH1144.

[0095] In another embodiment, the invention provides a modified gp140 envelope polypeptide of an HIV-1 isolate, wherein a first portion of the gp140 polypeptide corresponds to a modified gp120 polypeptide and a second portion of the gp140 polypeptide corresponds to a modified gp41 ectodomain polypeptide, wherein the modified gp120 polypeptide comprises an A.fwdarw.C mutation at amino acid position 492, numbered by reference to the HIV-1 isolate JR-FL, and the modified gp41 ectodomain polypeptide comprises (i) a T.fwdarw.C mutation at amino acid position 596, numbered by reference to the HIV-1 isolate JR-FL; and (ii) isoleucine (I) at amino acid position 535; glutamine (Q) at amino acid position 543; serine (S) at amino acid position 553; lysine (K) at amino acid position 567; and arginine (R) at amino acid position 588, wherein the 535, 543, 553, 567 and 588 amino acid positions are numbered by reference to the HIV-1 isolate KNH1144. In an embodiment this modified gp140 envelope polypeptide further comprises an I.fwdarw.P mutation at amino acid position 559, numbered by reference to the HIV-1 isolate KNH1144.

[0096] This invention provides a modified gp41 ectodomain polypeptide which comprises the consecutive amino acid sequence as set forth in any one of SEQ ID NO:1, SEQ ID NO:18, SEQ ID NO:2, SEQ ID NO:22, SEQ ID NO:3, SEQ ID NO:25, or SEQ ID NO:28. The invention further provides a modified gp41 ectodomain polypeptide which comprises the consecutive amino acid sequence as set forth in any one of SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:24, SEQ ID NO:27, or SEQ ID NO:30.

[0097] This invention provides a modified gp160 polypeptide which comprises the consecutive amino acid sequence as set forth in SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:12, or SEQ ID NO:15. The invention further provides a modified gp160 polypeptide which comprises the consecutive amino acid sequence as set forth in any one of SEQ ID NO:5, SEQ ID NO:11, SEQ ID NO:14, or SEQ ID NO:17. Also embraced by the invention are the sequences of the gp120 and gp41 portions of the gp160 polypeptides described herein.

[0098] In one embodiment, the HIV-1 isolate represents a subtype selected from the group consisting of clades A, B, C, D, E, F, G, H, J and O. In another embodiment, the HIV-1 isolate is a clade A subtype. In another embodiment, the HIV-1 isolate is a clade B subtype. Additionally, the HIV isolate that is modified to contain the trimer stabilizing amino acid residues of the invention may be a strain or a clade other than those particularly specified.

[0099] This invention provides a trimeric complex which comprises a noncovalent oligomer of three identical modified HIV-1 gp140 envelope polypeptides of the invention. The invention further provides a trimeric complex which comprises a noncovalent oligomer of three identical modified gp41 ectodomain polypeptides of the invention.

[0100] This invention provides a composition comprising the modified polypeptide of the invention and a pharmaceutically acceptable carrier, excipient, or diluent.

[0101] This invention also provides a composition comprising the trimeric complex of the invention and a pharmaceutically acceptable carrier, excipient, or diluent. In one embodiment, the composition further comprises an adjuvant. In one embodiment, the composition further comprises an antiretroviral agent.

[0102] This invention provides an isolated nucleic acid encoding a modified form of an HIV-1 gp120 and gp41 polypeptide complex, wherein the modification in gp120 comprises a mutation of the amino acid at a position equivalent to amino acid position 492 of the HIV-1 isolate JR-FL to cysteine (C); and the modifications in gp41 comprise (i) a mutation of the amino acid at a position equivalent to amino acid position 596 of the HIV-1 isolate JR-FL to cysteine (C); (ii) a mutation of the amino acid at a position equivalent to amino acid position 543 of the HIV-1 isolate KNH1144 to glutamine (Q); (iii) a mutation of the amino acid at a position equivalent to amino acid position 553 of the HIV-1 isolate KNH1144 to serine (S); (iv) a mutation of the amino acid at a position equivalent to amino acid position 567 of the HIV-1 isolate KNH1144 to lysine (K); and optionally, (v) a mutation of the amino acid at a position equivalent to amino acid position 535 of the HIV-1 isolate KNH1144 to isoleucine and (vi) a mutation of the amino acid at a position equivalent to amino acid position 588 of the HIV-1 isolate KNH1144 to arginine (R). In an embodiment, the modifications in gp41 further comprise a mutation to proline of a non-proline amino acid at a position equivalent to amino acid position 559, as numbered by reference to the HIV-1 isolate KNH1144 (e.g., SEQ ID NOS:1, 18 and/or 19). In an embodiment, the modifications in gp41 further comprise a mutation to isoleucine of a non-isoleucine amino acid at a position equivalent to amino acid position 535, as numbered by reference to the HIV-1 isolate KNH1144. In an embodiment, the modifications in gp41 further comprise a mutation to methionine of a non-methionine amino acid at a position equivalent to amino acid position 535, as numbered by reference to the HIV-1 isolate KNH1144 (e.g., SEQ ID NO:20; SEQ ID NO:21). In one embodiment, the isolated nucleic acid is DNA. In another embodiment, the isolated nucleic acid is cDNA. In another embodiment, the isolated nucleic acid is RNA.

[0103] This invention provides a vector comprising the isolated nucleic of the invention. This invention also provides a host cell comprising the vector or expression cassette of the invention. The host cell may be a eukaryotic cell or a prokaryotic cell.

[0104] This invention further provides a method for eliciting an immune response against HIV-1 or an HIV-1 infected cell in a subject comprising administering to the subject an amount of the compositions of the invention effective to elicit the immune response in the subject. In some embodiments, the composition is administered in a single dose or in multiple doses. In another embodiment, the composition is administered as part of a heterologous prime-boost regimen.

[0105] This invention provides a method for preventing a subject from becoming infected with HIV-1, comprising administering to the subject an amount of the compositions of the invention effective to prevent the subject from becoming infected with HIV-1.

[0106] This invention provides a method for reducing the likelihood of a subject becoming infected with HIV-1, comprising administering to the subject an amount of the compositions of the invention effective to reduce the likelihood of the subject becoming infected with HIV-1. In one embodiment, the subject has been exposed to HIV-1.

[0107] This invention also provides a method for delaying the onset of, or slowing the rate of progression of, an HIV-1-related disease in an HIV-1-infected subject, which comprises administering to the subject an amount of the compositions of the invention effective to delay the onset of, or slow the rate of progression of, the HIV-1-related disease in the subject.

[0108] This invention provides the trimeric complexes of the invention, or the composition of the invention, further comprising a non-ionic detergent. In one embodiment, the non-ionic detergent is a polyethylene type detergent. In another embodiment, the non-ionic detergent is a polyethylene type detergent. In another embodiment, the polyethylene type detergent is poly(oxyethylene) sorbitan monolaureate. In another embodiment, the poly(oxyethylene) sorbitan monolaureate is poly(oxyethylene) (20) sorbitan monolaureate. In another embodiment, the polyethylene type detergent is poly(oxyethylene) sorbitan monooleate.

[0109] In one embodiment, the non-ionic detergent is present in an amount of from 0.01% to 1%. In another embodiment, the non-ionic detergent is present in an amount of from 0.01% to 0.05%.

[0110] This invention further provides a method of stabilizing HIV-1 trimer complexes which comprise non-covalently associated gp120 and gp41 envelope polypeptides, which polypeptides comprise consecutive amino acids, said method comprising: introducing into the gp41 ectodomain polypeptide a glutamine (Q) at an amino acid position equivalent to amino acid position 543 of the HIV-1 isolate KNH1144; a serine (S) at an amino acid position equivalent to amino acid position 553 of the HIV-1 isolate KNH1144; a lysine (K) at an amino acid position equivalent to amino acid position 567 of the HIV-1 isolate KNH1144; and optionally, an isoleucine (I) at an amino acid position equivalent to amino acid position 535 of the HIV-1 isolate KNH1144 and an arginine (R) at an amino acid position equivalent to amino acid position 588 of the HIV-1 isolate KNH1144. In one embodiment, the method further comprises introducing a cysteine (C) at an amino acid position equivalent to amino acid position 492 of the gp120 polypeptide of the HIV-1 isolate JR-FL, and a cysteine (C) at an amino acid position equivalent to amino acid position 596 of the gp41 ectodomain polypeptide of the HIV-1 isolate JR-FL. In another embodiment, the method further comprises introducing a proline (P) at an amino acid position equivalent to amino acid position 559 of the gp41 ectodomain polypeptide of the HIV-1 isolate KNH1144.

[0111] This invention further provides a method of stabilizing HIV-1 trimer complexes which comprise non-covalently associated gp120 and gp41 envelope polypeptides, which polypeptides comprise consecutive amino acids, said method comprising: introducing into the gp41 ectodomain polypeptide a glutamine (Q) at an amino acid position equivalent to amino acid position 543 of the HIV-1 isolate KNH1144; a serine (S) at an amino acid position equivalent to amino acid position 553 of the HIV-1 isolate KNH1144; a lysine (K) at an amino acid position equivalent to amino acid position 567 of the HIV-1 isolate KNH1144; an isoleucine (I) at an amino acid position equivalent to amino acid position 535 of the HIV-1 isolate KNH1144; and an arginine (R) at an amino acid position equivalent to amino acid position 588 of the HIV-1 isolate KNH1144. In one embodiment, the method further comprises introducing a cysteine (C) residue at an amino acid position equivalent to amino acid position 492 in the gp120 polypeptide of the HIV-1 isolate JR-FL, and a cysteine (C) at an amino acid position equivalent to amino acid position 596 in the gp41 ectodomain polypeptide of the HIV-1 isolate JR-FL. In another embodiment, the method further comprises introducing a proline (P) at an amino acid position equivalent to amino acid position 559 of the gp41 ectodomain polypeptide of the HIV-1 isolate KNH1144.

[0112] This invention provides a chimeric gp140 polypeptide comprising (i) a gp120 envelope polypeptide of a clade B subtype of an HIV-1 isolate and (ii) a gp41 ectodomain polypeptide of the HIV-1 isolate KNH1144, said polypeptides comprising consecutive amino acids, wherein the KNH1144 gp41 ectodomain polypeptide comprises isoleucine (I) at amino acid position 535; glutamine (Q) at amino acid position 543; serine (S) at amino acid position 553; lysine (K) at amino acid position 567; and arginine (R) at amino acid position 588.

[0113] This invention further provides a chimeric gp140 polypeptide comprising (i) a gp120 envelope polypeptide of a clade B subtype of an HIV-1 isolate and (ii) a gp41 ectodomain polypeptide of the HIV-1 isolate KNH1144, said polypeptides comprising consecutive amino acids, wherein the KNH1144 gp41 ectodomain polypeptide comprises an amino acid sequence as set forth in SEQ ID NO:1, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:21, or the gp41 polypeptide portion of the gp160 polypeptide as set forth in any one of SEQ ID NOS:5-8.

[0114] In embodiments of the invention, the HIV-1 isolate is an HIV-1.sub.JR-FL, HIV-1.sub.Ba-L, HIV-1.sub.5768, HIV-1.sub.DW123, HIV-1.sub.GUN-1, HIV-1.sub.89.6, or HIV-1.sub.HXB2 isolate.

[0115] In an embodiment, the present invention encompasses a method for treating or preventing human immunodeficiency viral (HIV) infection in a subject by administering to the subject a therapeutically or prophylactically effective amount of a pharmaceutical composition that includes one or more gp160, gp120, gp41 polypeptides or a combination of gp160, gp120, gp41 polypeptides. In some embodiments, the composition contains a trimeric complex of three gp120 proteins and three gp41 subunits, which have been modified for enhanced stability in accordance with the invention.

[0116] In another embodiment, the present invention provides a method for treating or preventing human immunodeficiency viral infection (HIV) in a subject by administering an amount of a pharmaceutical composition that includes one or more gp160, gp120, gp41 polypeptides, or a combination of gp160, gp120, gp41 polypeptides, using a dosing and resting regimen to effectively treat or prevent at least 70% of subjects in a population of at least ten subjects. Cure or prevention rates of the present invention include, but are not limited to, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% and 100% of subjects having human immunodeficiency viral infection effectively treated, e.g., by reducing viral load, reducing or eliminating viral nucleic acid, or increasing CD4+ cells, in a population of at least 100 subjects.

[0117] Compositions and immunogenic preparations, including vaccine compositions, comprising the polypeptides of the present invention capable of inducing an immunological reaction (including protective immunity) in a suitably treated animal or human, and a suitable carrier therefore, are provided. Immunogenic compositions are those which result in specific antibody production or in cellular immunity when injected into a human or an animal. Such immunogenic compositions or vaccines are useful, for example, in immunizing an animal, including a human, against infection and/or damage caused by HIV.

[0118] The vaccine preparations comprise an immunogenic amount of one or more of the polypeptides of the invention. By "immunogenic amount" is meant an amount capable of eliciting the production of antibodies directed against the retrovirus in a mammal into which the vaccine has been administered. The route of administration and the immunogenic composition may be designed to optimize the immune response on mucosal surfaces, for example, using nasal administration (via an aerosol) of the immunogenic composition.

[0119] In some embodiments, the methods and compositions of the invention also include use of another antiviral agent in addition to the one or more of the present gp160, gp120, gp41 polypeptides, or a combination of gp160, gp120, gp41 polypeptides as described herein. Thus, other antiretroviral agents or compounds, which can be administered in addition to the polypeptides and compositions of the invention include, without limitation, protease inhibitors, retroviral polymerase inhibitors, azidothymidine (AZT), didanoside (DDI), soluble CD4, a polysaccharide sulfates, T22, bicyclam, suramin, antisense oliogonulceotides, ribozymes, rev inhibitors, protease inhibitors, glycolation inhibitors, interferon and the like. Examples include acyclovir, 3-aminopyridine-2-carboxyaldehyde thiosemicarbazone (3-AP, Triapine.TM.) and 3-amino-4-methylpyridine-2-carboxaldehyde thiosemicarbazone (3-AMP), thiamine disulfide, thiamine disulfide nitrate, thiamine disulfide phosphate, bisbentiamine, bisbutyliamine, bisibutiamine, alitiamine, fursultiamine and octotiamine.

Recombinant Production of Polypeptides

[0120] Polypeptides of the invention can be made recombinantly using convenient vectors, expression systems and host cells. The invention therefore provides expression cassettes, vectors and host cells useful for expressing a peptide of the invention, for example, any of the gp160, gp120 and/or gp41 polypeptides as described herein.

[0121] The expression cassettes of the invention include a promoter. Any promoter able to direct transcription of an encoded peptide or polypeptide may be used. Accordingly, many promoters may be included within the expression cassette of the invention. Some useful promoters include constitutive promoters, inducible promoters, regulated promoters, cell specific promoters, viral promoters, and synthetic promoters. A promoter is a nucleotide sequence that controls expression of an operably linked nucleic acid sequence by providing a recognition site for RNA polymerase, and possibly other factors, required for proper transcription. A promoter includes a minimal promoter, consisting only of all basal elements needed for transcription initiation, such as a TATA-box and/or other sequences that serve to specify the site of transcription initiation. A promoter may be obtained from a variety of different sources. For example, a promoter may be derived entirely from a native gene, be composed of different elements derived from different promoters found in nature, or be composed of nucleic acid sequences that are entirely synthetic. A promoter may be derived from many different types of organisms and tailored for use within a given cell.

[0122] For expression of a polypeptide in a bacterium, an expression cassette having a bacterial promoter is used. A bacterial promoter is any DNA sequence capable of binding bacterial RNA polymerase and initiating the downstream (3') transcription of a coding sequence into mRNA. A promoter will have a transcription initiation region that is usually placed proximal to the 5' end of the coding sequence. This transcription initiation region usually includes an RNA polymerase binding site and a transcription initiation site. A second domain called an operator may be present and overlap an adjacent RNA polymerase binding site at which RNA synthesis begins. The operator permits negatively regulated (inducible) transcription, as a gene repressor protein may bind the operator and thereby inhibit transcription of a specific gene. Constitutive expression may occur in the absence of negative regulatory elements, such as the operator. In addition, positive regulation may be achieved by a gene activator protein binding sequence, which, if present is usually proximal (5') to the RNA polymerase binding sequence. An example of a gene activator protein is the catabolite activator protein (CAP), which helps initiate transcription of the lac operon in E. coli (Raibaud et al., Ann. Rev. Genet., 18:173 (1984)). Regulated expression may therefore be positive or negative, thereby either enhancing or reducing transcription.

[0123] Sequences encoding metabolic pathway enzymes provide particularly useful promoter sequences. Illustrative examples include promoter sequences derived from sugar metabolizing enzymes, such as galactose, lactose (lac) (Chang et al., Nature, 198:1056 (1977) and maltose. Additional examples include promoter sequences derived from biosynthetic enzymes such as tryptophan (Trp) (Goeddel et al., Nuc. Acids Res., 8:4057 (1980); Yelverton et al., Nuc. Acids Res., 9:731 (1981); U.S. Pat. No. 4,738,921; and EPO Publ. Nos. 036 776 and 121 775). The .beta.-lactamase (bla) promoter system (Weissmann, "The cloning of interferon and other mistakes", in: Interferon 3 (ed. I. Gresser), 1981). Bacteriophage lambda P.sub.L (Shimatake et al., Nature, 292:128 (1981)) and T5 (U.S. Pat. No. 4,689,406) promoter systems also provide useful promoter sequences. Another promoter is the Chlorella virus promoter (U.S. Pat. No. 6,316,224).

[0124] Synthetic promoters that do not occur in nature also function as bacterial promoters. For example, transcription activation sequences of one bacterial or bacteriophage promoter may be joined with the operon sequences of another bacterial or bacteriophage promoter, creating a synthetic hybrid promoter (U.S. Pat. No. 4,551,433). For example, the tac promoter is a hybrid trp-lac promoter comprised of both trp promoter and lac operon sequences that is regulated by the lac repressor (Amann et al., Gene, 25:167 (1983); de Boer et al., Proc. Natl. Acad. Sci. USA, 80:21 (1983)). Furthermore, a bacterial promoter can include naturally occurring promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and initiate transcription. A naturally occurring promoter of non-bacterial origin can also be coupled with a compatible RNA polymerase to produce high levels of expression of some genes in prokaryotes. The bacteriophage T7 RNA polymerase/promoter system is an example of a coupled promoter system (Studier et al., J. Mol. Biol., 189:113 (1986); Tabor et al., Proc. Natl. Acad. Sci. USA, 82:1074 (1985)). In addition, a hybrid promoter can also be comprised of a bacteriophage promoter and an E. coli operator region (EPO Publ. No. 267 851).

[0125] An expression cassette having a baculovirus promoter can be used for expression of a polypeptide in an insect cell. A baculovirus promoter is any DNA sequence capable of binding a baculovirus RNA polymerase and initiating transcription of a coding sequence into mRNA. A promoter will have a transcription initiation region that is usually placed proximal to the 5' end of the coding sequence. This transcription initiation region usually includes an RNA polymerase binding site and a transcription initiation site. A second domain called an enhancer may be present and is usually distal to the structural gene. A baculovirus promoter may be a regulated promoter or a constitutive promoter. Useful promoter sequences may be obtained from structural genes that are transcribed at times late in a viral infection cycle. Examples include sequences derived from the gene encoding the baculoviral polyhedron protein (Friesen et al., "The Regulation of Baculovirus Gene Expression", in: The Molecular Biology of Baculoviruses (ed. Walter Doerfler), 1986; and EPO Publ. Nos. 127 839 and 155 476) and the gene encoding the baculoviral p10 protein (Vlak et al., J. Gen. Virol., 69:765 (1988)).

[0126] Promoters that are functional in yeast are known to those of ordinary skill in the art. In addition to an RNA polymerase binding site and a transcription initiation site, a yeast promoter may also have a second region called an upstream activator sequence. The upstream activator sequence permits regulated expression that may be induced. Constitutive expression occurs in the absence of an upstream activator sequence. Regulated expression may be either positive or negative, thereby either enhancing or reducing transcription.

[0127] Promoters for use in yeast may be obtained from yeast genes that encode enzymes active in metabolic pathways. Examples of such genes include alcohol dehydrogenase (ADH) (EPO Publ. No. 284 044), enolase, glucokinase, glucose-6-phosphate isomerase, glyceraldehyde-3-phosphatedehydrogenase (GAP or GAPDH), hexokinase, phosphofructokinase, 3-phosphoglyceratemutase, and pyruvate kinase (PyK). (EPO Publ. No. 329 203). The yeast PHO5 gene, encoding acid phosphatase, also provides useful promoter sequences. (Myanohara et al., Proc. Natl. Acad. Sci. USA, 80:1 (1983)).

[0128] Synthetic promoters that do not occur in nature may also be used for expression in yeast. For example, upstream activator sequences from one yeast promoter may be joined with the transcription activation region of another yeast promoter, creating a synthetic hybrid promoter. Examples of such hybrid promoters include the ADH regulatory sequence linked to the GAP transcription activation region (U.S. Pat. Nos. 4,876,197 and 4,880,734). Other examples of hybrid promoters include promoters which consist of the regulatory sequences of either the ADH2, GAL4, GAL10, or PHO5 genes, combined with the transcriptional activation region of a glycolytic enzyme gene such as GAP or PyK (EPO Publ. No. 164 556). Furthermore, a yeast promoter can include naturally occurring promoters of non-yeast origin that have the ability to bind yeast RNA polymerase and initiate transcription. Examples of such promoters are known in the art. (Cohen at al., Proc. Natl. Acad. Sci. USA, 77:1078 (1980); Henikoff et al., Nature, 283:835 (1981); Hollenberg et al., Curr. Topics Microbiol. Immunol., 96:119 (1981)); Hollenberg et al., "The Expression of Bacterial Antibiotic Resistance Genes in the Yeast Saccharomyces cerevisiae", in: Plasmids of Medical, Environmental and Commercial Importance (eds. K. N. Timmis and A. Puhler), 1979; (Mercerau-Puigalon at al., Gene, 11:163 (1980); Panthier et al., Curr. Genet., 2:109 (1980)).

[0129] Many mammalian promoters as known in the art that may be used in conjunction with the expression cassette of the invention. Mammalian promoters often have a transcription initiating region, which is usually placed proximal to the 5' end of the coding sequence, and a TATA box, usually located 25-30 base pairs (bp) upstream of the transcription initiation site. The TATA box is thought to direct RNA polymerase II to begin RNA synthesis at the correct site. A mammalian promoter may also contain an upstream promoter element, usually located within 100 to 200 bp upstream of the TATA box. An upstream promoter element determines the rate, at which transcription is initiated and can act in either orientation (Sambrook et al., "Expression of Cloned Genes in Mammalian Cells", in: Molecular Cloning: A Laboratory Manual, 2nd ed., 1989).

[0130] Mammalian viral genes are often highly expressed and have a broad host range; therefore sequences encoding mammalian viral genes often provide useful promoter sequences. Nonlimiting examples include the SV40 early promoter, mouse mammary tumour virus LTR promoter, adenovirus major late promoter (Ad MLP), and Herpes Simplex Virus promoter. In addition, sequences derived from non-viral genes, such as the murine metallothionein gene, also provide useful promoter sequences. Expression may be either constitutive or regulated.

[0131] A mammalian promoter may also be associated with an enhancer. The presence of an enhancer will usually increase transcription from an associated promoter. An enhancer is a regulatory DNA sequence that can stimulate transcription up to 1000-fold when linked to homologous or heterologous promoters, with synthesis beginning at the normal RNA start site. Enhancers are active when they are placed upstream or downstream from the transcription initiation site, in either normal or flipped orientation, or at a distance of more than 1000 nucleotides from the promoter. (Maniatis et al., Science, 236:1237 (1987); Alberts et al., Molecular Biology of the Cell, 2nd ed., 1989). Enhancer elements derived from viruses are often times useful, because they usually have a broad host range. Nonlimiting examples include the SV40 early gene enhancer (Dijkema et al., EMBO J., 4:761 (1985)) and the enhancer/promoters derived from the long terminal repeat (LTR) of the Rous Sarcoma Virus (Gorman at al., Proc. Natl. Acad. Sci. USA, 79:6777 (1982b)) and from human cytomegalovirus (Boshart at al., Cell, 41:521 (1985)). Additionally, some enhancers are regulatable and become active only in the presence of an inducer, such as a hormone or metal ion (Sassone-Corsi and Borelli, Trends Genet., 2:215 (1986); Maniatis et al., Science, 236:1237 (1987)).

[0132] It is understood that many promoters and associated regulatory elements may be used within the expression cassette of the invention to transcribe an encoded polypeptide. The promoters described above are provided merely provided as examples and are not to be considered as a complete list of promoters that are included within the scope of the invention.

[0133] The expression cassette of the invention may contain a nucleic acid sequence for increasing the translation efficiency of an mRNA encoding a polypeptide of the invention. Such increased translation serves to increase production of the polypeptide. The presence of an efficient ribosome binding site is useful for gene expression in prokaryotes. In bacterial mRNA, a conserved stretch of six nucleotides, the Shine-Dalgarno sequence, is usually found upstream of the initiating AUG codon. (Shine et al., Nature, 254:34 (1975)). This sequence is thought to promote ribosome binding to the mRNA by base pairing between the ribosome binding site and the 3' end of Escherichia coli 16S rRNA. (Steitz et al., "Genetic signals and nucleotide sequences in messenger RNA", in: Biological Regulation and Development: Gene Expression (ed. R. F. Goldberger), 1979)). Such a ribosome binding site, or operable derivatives thereof, are included within the expression cassette of the invention.

[0134] A translation initiation sequence can be derived from any expressed Escherichia coli gene and can be used within an expression cassette of the invention. Preferably the gene is a highly expressed gene. A translation initiation sequence can be obtained via standard recombinant methods, synthetic techniques, purification techniques, or combinations thereof, which are all well known. (Ausubel et al., Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, NY. (1989); Beaucage and Caruthers, Tetra. Letts., 22:1859 (1981); VanDevanter et al., Nucleic Acids Res., 12:6159 (1984). Alternatively, translational start sequences can be obtained from numerous commercial vendors. (Operon Technologies; Life Technologies Inc, Gaithersburg, Md.). In some embodiments, the T7 translation initiation sequence is used. The T7 translation initiation sequence is derived from the highly expressed T7 Gene 10 cistron and can have a sequence that includes tctagaaataattttgtttaactttaagaaggagatata (SEQ ID NO:4). Other examples of translation initiation sequences include, but are not limited to, the maltose-binding protein (Mal E gene) start sequence (Guan et al., Gene, 67:21 (1997)) present in the pMalc2 expression vector (New England Biolabs, Beverly, Mass.) and the translation initiation sequence for the following genes: thioredoxin gene (Novagen, Madison, Wis.), Glutathione-S-transferase gene (Pharmacia, Piscataway, N.J.), .beta.-galactosidase gene, chloramphenicol acetyltransferase gene and E. coli Trp E gene (Ausubel et al., 1989, Current Protocols in Molecular Biology, Chapter 16, Green Publishing Associates and Wiley Interscience, NY).

[0135] Eucaryotic mRNA does not contain a Shine-Dalgarno sequence. Instead, the selection of the translational start codon is usually determined by its proximity to the cap at the 5' end of an mRNA. The nucleotides immediately surrounding the start codon in eucaryotic mRNA influence the efficiency of translation. Accordingly, one skilled in the art can determine what nucleic acid sequences will increase translation of a polypeptide encoded by the expression cassette of the invention. Such nucleic acid sequences are within the scope of the invention.

[0136] The invention therefore provides an expression cassette that includes a promoter operable in a selected host and a nucleic acid encoding a polypeptide having a sequence of the invention. In embodiments of the invention, the encoded polypeptide comprises SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:9, SEQ ID NO:12, SEQ ID NO:15, SEQ ID NO:18, SEQ ID NO:22, SEQ ID NO:25, or SEQ ID NO:28, modified to contain HIV trimer stabilizing amino acids as described herein. In other embodiments, the encoded polypeptide comprises SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:11, SEQ ID NO:14, SEQ ID NO:17, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:24, SEQ ID NO:27, or SEQ ID NO:30, comprising HIV trimer stabilizing amino acid modifications as described herein. The expression cassette can also have other elements, for example, termination signals, origins of replication, enhancers, and the like as described herein. The expression cassette can also be placed in a vector for easy replication and maintenance.

[0137] Recombinant expression of the peptides and polypeptides of the invention avoids degradation frequently observed for short peptides within a cell in which they are expressed when the peptides and polypeptides are expressed and stored within inclusion bodies present within the host cells. Hence, the peptides can readily be purified from inclusion bodies. In an embodiment, recombinant peptides are expressed in E. coli strain BL21(DE3)/pLysS (Novagen). Cells were grown at 37.degree. C. in LB medium to an optical density of 0.8 at 600 nm and were induced with isopropylthio-.beta.-D-galactoside for 3-4 hr at 37.degree. C. The cells are centrifuged, frozen at -80.degree. C., resuspended in 50 mM Tris-HCl (pH 8.0) and 1 mM EDTA plus 25% sucrose, and disrupted by sonication. Inclusion bodies of the cell lysate are isolated and washed three times with Triton buffer (20 mM Tris-HCl [pH 8.0], 1 mM EDTA, and 1% Triton X-100). The inclusion bodies are then solubilized in 50 mM Tris-HCl (pH 8.5) plus 8 M urea. Insoluble debris is removed by centrifugation (18,000g, 1 hr, 4.degree. C.); the supernatant is loaded on a DEAE Sepharose column (Amersham Pharmacia Biotech) equilibrated with buffer A (50 mM Tris-HCl [pH 8.5] plus 3 M urea). The soluble peptide is eluted with a linear salt gradient (0-500 mM NaCl in buffer A). The peptide solution is dialyzed into 5% acetic acid overnight at 4.degree. C. Peptides from the soluble fraction are purified to homogeneity by reverse-phase high-performance liquid chromatography (Waters, Inc.) on a Vydac C-18 preparative column (Hesperia, Calif.), using a water-acetonitrile gradient in the presence of 0.1% trifluoroacetic acid, and lyophilized.

[0138] The isolation of the peptides and polypeptides is enhanced because they are present in inclusion bodies that can readily be separated from other cellular components. Such inclusion bodies are more or less soluble under defined conditions that include, but are not limited to, pH, temperature, salt concentration, and protein concentration. Thus, an inclusion body can be insoluble in water but soluble in the presence of urea, acid, guanidinium chloride, and other agents. Hence, after recombinant expression of the present peptides and polypeptides in a host cell, the host cells can be isolated and lysed, and inclusion bodies can be collected, for example, by centrifugation. The inclusion bodies can be rinsed with dilute buffer and then solubilized in urea or other agent. Insoluble debris can be removed by centrifugation and the solubilized peptides can be further purified, for example, by ion exchange chromatography or reverse-phase HPLC.

Antibodies and Binding Entities

[0139] The invention is also directed to binding entities and antibodies that can bind to a trimeric gp120/gp41 polypeptide complex stabilized as described herein. The binding domains of such antibodies, for example, the CDR regions of these antibodies, can also be transferred into or utilized with any convenient binding entity backbone.

[0140] The HIV-1 envelope glycoprotein is the major target for neutralizing antibodies during the course of natural infection and has been extensively employed as an immunogen in vaccine studies (Burton et al., Nature Med. 4, 495-498 (1998); Letvin, Science 280, 1875-1880 (1998); Burton, Proc. Natl. Acad. Sci. USA 94, 10018-10023 (1997); Burton et al., J. Acquir. Immune Defic. Syndr. 11 (Suppl A), 587-598 (1997); Montefiori at al., AIDS Res. Hum. Retroviruses 15, 689-698 (1999); Wyatt et al., Science. 280, 1884-1888 (1998); Parren et al., AIDS. 13 (Suppl A), S137-S162 (1999)). Because of the chronic nature of HIV-1 infection, the envelope glycoprotein has evolved to minimize the potential impact of neutralizing antibodies on viral infection. Broad-spectrum neutralization epitopes on the envelope glycoprotein complex appear to be rare and poorly immunogenic.

[0141] Notwithstanding, all the monoclonal antibodies (MAbs) that neutralize HIV-1 are able to bind the trimeric envelope glycoprotein spike (Sattentau et al., J. Exp. Med. 182, 185-196 (1995); Sullivan et al., J. Virol. 69, 4413-4422 (1995); Moore et al., J. Virol. 69, 101-109 (1995); Fouts et al., J. Virol. 71, 2779-2785 (1997); Parren et al., J. Virol. 72, 3512-3519 (1998)). Because the native, trimeric envelope glycoprotein complex is unstable, a major challenge in vaccine research has been to preserve the envelope trimer conformation in vaccine preparations (see, e.g., Sanders et al., J. Virol. 76, 8875-8889 (2002)). Thus, by providing a stabilized gp41/gp120 trimeric conformation, the present invention affords a solution to the problem(s) of reproducibly providing stable HIV immunogens that can be used to generate an anti-HIV immune response and potent, neutralizing anti-HIV antibodies.

[0142] Antibody molecules belong to a family of plasma proteins called immunoglobulins. The heavy and light chains of an antibody consist of different domains. Each light chain has one variable domain (VL) and one constant domain (CL), while each heavy chain has one variable domain (VH) and three or four constant domains (CH). See, e.g., Alzari, P. N. et al., (1988). Three-dimensional structure of antibodies. Annu. Rev. Immunol. 6:555-580. Each domain, consisting of about 110 amino acid residues, is folded into a characteristic .beta.-sandwich structure formed from two .beta.-sheets packed against each other, the immunoglobulin fold. The VH and VL domains each have three complementarity determining regions (CDR1-3) that are loops, or turns, connecting .beta.-strands at one end of the domains. The variable regions of both the light and heavy chains generally contribute to antigen specificity, although the contribution of the individual chains to specificity is not always equal. Antibody molecules have evolved to bind to a large number of molecules by using six randomized loops (CDRs).

[0143] Immunoglobulins can be assigned to different classes depending on the amino acid sequences of the constant domain of their heavy chains. There are at least five (5) major classes of immunoglobulins: IgA, IgD, IgE, IgG and IgM. Several of these may be further divided into subclasses (isotypes), for example, IgG1, IgG2 (IgG2a and IgG2b), IgG3 and IgG4; IgA1 and IgA2. The heavy chain constant domains that correspond to the IgA, IgD, IgE, IgG and IgM classes of immunoglobulins are called alpha (.alpha.), delta (.delta.), epsilon (.epsilon.), gamma (.gamma.) and mu (.mu.), respectively. The light chains of antibodies can be assigned to one of two clearly distinct types, called kappa (.kappa.) and lambda (.lamda.), based on the amino sequences of their constant domain. The subunit structures and three-dimensional configurations of different classes of immunoglobulins are well known.

[0144] Variability in antibody variable domains is concentrated in three segments called complementarity determining regions (CDRs), also known as hypervariable regions in both the light chain and the heavy chain variable domains. The more highly conserved portions of variable domains are called framework (FR) regions. The variable domains of native heavy and light chains each comprise four FR regions, largely adopting a .beta.-sheet configuration, connected by three CDRs, which form loops connecting, and in some cases forming part of, the .beta.-sheet structure. The CDRs in each chain are held together in close proximity by the FR regions and, with the CDRs from another chain, contribute to the formation of the antigen-binding site of antibodies. The constant domains are not involved directly in binding an antibody to an antigen, but exhibit various effector functions, such as participation of the antibody in antibody-dependent cellular toxicity.

[0145] An antibody that is contemplated for use in the present invention thus can be in any of a variety of forms, including a whole immunoglobulin, an antibody portion or fragment, such as Fv, Fab, Fab'2, and similar fragments, a single chain antibody which includes the variable domain complementarity determining regions (CDR), and the like forms, all of which fall under the broad term "antibody", as used herein. The present invention contemplates the use of any specificity of an antibody, polyclonal or monoclonal, and is not limited to antibodies that recognize and immunoreact with a specific peptide sequence described herein or a derivative thereof. However, in some embodiments, the antibody binds with specificity to a polypeptide with any of the polypeptide sequences disclosed herein, or a combination or complex thereof.

[0146] Moreover, the binding regions, or CDRs, of antibodies can be placed within the backbone of any convenient binding entity polypeptide. In some embodiments, in the context of methods described herein, an antibody, binding entity, or portion or fragment thereof is used that is immunospecific for any of the polypeptides described herein, as well as the derivatives thereof, including crosslinked derivatives thereof.

[0147] The term "antibody fragment" refers to a portion of a full-length antibody, generally the antigen binding or variable region. Examples of antibody fragments include Fab, Fab', F(ab').sub.2 and Fv fragments. Fv is the minimum antibody fragment that contains a complete antigen recognition and binding site. This region consists of a dimer of one heavy and one light chain variable domain in a tight, non-covalent association (V.sub.H-V.sub.L dimer). It is in this configuration that the three CDRs of each variable domain interact to define an antigen binding site on the surface of the V.sub.H-V.sub.L dimer. Collectively, the six CDRs confer antigen binding specificity to the antibody. However, even a single variable domain (or half of an Fv comprising only three CDRs specific for an antigen) has the ability to recognize and bind antigen, although at a lower affinity than the entire binding site. As used herein, "functional fragment" with respect to antibodies, refers to Fv, F(ab) and F(ab').sub.2 fragments.

[0148] Additional fragments can include diabodies, linear antibodies, single-chain antibody molecules and multispecific antibodies formed from antibody fragments. Single chain antibodies are genetically engineered molecules containing the variable region of the light chain, the variable region of the heavy chain, linked by a suitable polypeptide linker as a genetically fused single chain molecule. Such single chain antibodies are also referred to as "single-chain Fv" or "sFv" antibody fragments. Generally, the Fv polypeptide further comprises a polypeptide linker between the VH and VL domains that enables the sFv to form the desired structure for antigen binding. For a review of sFv see Pluckthun, in The Pharmacology of Monoclonal Antibodies, vol. 113, Rosenburg and Moore eds. Springer-Verlag, N.Y., pp. 269-315 (1994).

[0149] The term "diabodies" refers to a small antibody fragments with two antigen-binding sites, where the fragments comprise a heavy chain variable domain (VH) connected to a light chain variable domain (VL) in the same polypeptide chain (VH-VL). By using a linker that is too short to allow pairing between the two domains on the same chain, the domains are forced to pair with the complementary domains of another chain and create two antigen-binding sites. Diabodies are described more fully in, for example, EP 404,097; WO 93/11161, and Hollinger et al., Proc. Natl. Acad Sci. USA 90:6444-6448 (1993).

[0150] Antibody portions or fragments contemplated by the invention are therefore not full-length antibodies. However, such antibody fragments can have similar or improved immunological properties relative to a full-length antibody. Such antibody fragments may be as small as about 3-4 amino acids, 5 amino acids, 6 amino acids, 7 amino acids, 9 amino acids, about 12 amino acids, about 15 amino acids, about 17 amino acids, about 18 amino acids, about 20 amino acids, about 25 amino acids, about 30 amino acids or more.

[0151] In general, an antibody fragment of the invention can have any upper size limit as long as it has similar or improved immunological properties relative to an antibody that binds with specificity to a polypeptide described herein. For example, smaller binding entities and light chain antibody fragments can have less than about 200 amino acids, less than about 175 amino acids, less than about 150 amino acids, or less than about 120 amino acids if the antibody fragment is related to a light chain antibody subunit. Moreover, larger binding entities and heavy chain antibody fragments can have less than about 425 amino acids, less than about 400 amino acids, less than about 375 amino acids, less than about 350 amino acids, less than about 325 amino acids or less than about 300 amino acids if the antibody fragment is related to a heavy chain antibody subunit.

[0152] Antibodies directed against various immunogens or disease markers can be made by a number of known procedures. Methods for preparing polyclonal antibodies are practiced by those skilled in the art. See, for example, Green, et al., Production of Polyclonal Antisera, in: Immunochemical Protocols (Manson, ed.), pages 1-5 (Humana Press); Coligan, et al., Production of Polyclonal Antisera in Rabbits, Rats Mice and Hamsters, in: Current Protocols in Immunology, section 2.4.1 (1992), which are hereby incorporated by reference.

[0153] Monoclonal antibodies, which are highly specific and directed against a single epitopic site or determinant on an antigen (or immunogen), are also embraced by this invention. As used herein, monoclonal antibodies herein specifically include "chimeric" antibodies in which a portion of the heavy and/or light chain is identical or homologous to corresponding sequences in antibodies derived from a particular species or belonging to a particular antibody class or subclass, while the remainder of the chain(s) is identical or homologous to corresponding sequences in antibodies derived from another species or belonging to another antibody class or subclass. Fragments of such antibodies can also be used, so long as they exhibit the desired biological activity. See U.S. Pat. No. 4,816,567; Morrison et al. Proc. Natl. Acad Sci. USA. 81, 6851-55 (1984). The monoclonal antibodies herein also specifically include those made from different animal species, including mouse, rat, human and rabbit.

[0154] The preparation of monoclonal antibodies is conventional in the art. (See, for example, Kohler & Milstein, Nature, 256:495 (1975); Coligan, at al., sections 2.5.1-2.6.7; and Harlow, et al., in: Antibodies: A Laboratory Manual, page 726 (Cold Spring Harbor Pub. (1988)), which are hereby incorporated by reference. Monoclonal antibodies can be isolated and purified from hybridoma cultures by a variety of well-established techniques. Such isolation techniques include affinity chromatography with Protein-A Sepharose, size-exclusion chromatography, and ion-exchange chromatography. (See, e.g., Coligan, at al., sections 2.7.1-2.7.12 and sections 2.9.1-2.9.3; Barnes, et al., Purification of Immunoglobulin G (IgG), in: Methods in Molecular Biology, Vol. 10, pages 79-104 (Humana Press (1992)).

[0155] Methods of in vitro and in vivo manipulation of antibodies are understood by those skilled in the art. For example, the monoclonal antibodies to be used in accordance with the present invention may be made by the hybridoma method as described above, or they may be made by recombinant methods, e.g., as described in U.S. Pat. No. 4,816,567. Monoclonal antibodies may also be isolated from phage antibody libraries using the techniques described, for example, in Clackson et al. Nature. 352:624-628 (1991), as well as in Marks et al., J. Mol Biol. 222:581-597 (1991).

[0156] Methods of making antibody fragments are also known in the art (see for example, Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, New York, (1988), incorporated herein by reference). Antibody fragments of the present invention can be prepared by proteolytic hydrolysis of the antibody or by expression of nucleic acids encoding the antibody fragment in a suitable host. Antibody fragments can be obtained by pepsin or papain digestion of whole antibodies conventional methods. For example, antibody fragments can be produced by enzymatic cleavage of antibodies with pepsin to provide a 5S fragment described as F(ab').sub.2. This fragment can be further cleaved using a thiol reducing agent, and optionally using a blocking group for the sulfhydryl groups resulting from cleavage of disulfide linkages, to produce 3.5S Fab' monovalent fragments. Alternatively, enzymatic cleavage using pepsin produces two monovalent Fab' fragments and an Fc fragment directly. These methods are described, for example, in U.S. Pat. No. 4,036,945 and No. 4,331,647, and references contained therein. These patents are hereby incorporated by reference in their entireties.

[0157] Other methods of cleaving antibodies, such as separation of heavy chains to form monovalent light-heavy chain fragments, further cleavage of fragments, or other enzymatic, chemical, or genetic techniques may also be used, so long as the fragments bind to the antigen that is recognized by the intact antibody. For example, Fv fragments comprise an association of V.sub.H and V.sub.L chains. This association may be noncovalent, or the variable chains can be linked by an intermolecular disulfide bond, or cross-linked by chemicals such as glutaraldehyde. Preferably, the Fv fragments comprise V. and V. chains connected by a peptide linker. These single-chain antigen binding proteins (sFv) are prepared by constructing a structural gene comprising DNA sequences encoding the V.sub.H and V.sub.L domains connected by an oligonucleotide. The structural gene is inserted into an expression vector, which is subsequently introduced into a host cell such as E. coli. The recombinant host cells synthesize a single polypeptide chain with a linker peptide bridging the two V domains. Methods for producing sFvs are described, for example, by Whitlow, et al., Methods: a Companion to Methods in Enzymology, Vol. 2, page 97 (1991); Bird, et al., Science. 242:423-426 (1988); Ladner, et al, U.S. Pat. No. 4,946,778; and Pack, et al., Bio/Technology. 11:1271-77 (1993).

[0158] Another form of an antibody fragment is a peptide coding for a single complementarity-determining region (CDR). CDR peptides ("minimal recognition units") are often involved in antigen recognition and binding. CDR peptides can be obtained by cloning or constructing genes encoding the CDR of an antibody of interest. Such genes are prepared, for example, by using the polymerase chain reaction to synthesize the variable region from RNA of antibody-producing cells. See, for example, Larrick, et al., Methods: a Companion to Methods in Enzymology. Vol. 2, page 106 (1991).

[0159] The invention also encompasses human and humanized forms of non-human (e.g., murine) antibodies (monoclonal antibodies). Such humanized antibodies are chimeric immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab').sub.2 or other antigen-binding subsequences of antibodies) that contain minimal sequence derived from non-human immunoglobulin. For the most part, humanized antibodies are human immunoglobulins (recipient antibody) in which residues from a complementary determining region (CDR) of the human recipient antibody are replaced by residues from the CDRs of a nonhuman species (donor antibody) such as mouse, rat or rabbit having the desired specificity, affinity and capacity.

[0160] In some instances, Fv framework residues of the human immunoglobulin are replaced by corresponding non-human residues. Furthermore, humanized antibodies may comprise residues that are found neither in the recipient antibody nor in the imported CDR or framework sequences. These modifications are made to further refine and optimize antibody performance. In general, humanized antibodies will comprise substantially all of at least one, and typically two, variable domains, in which all or substantially all of the CDR regions correspond to those of a non-human immunoglobulin and all or substantially all of the FR regions are those of a human immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin. For further details, see: Jones at al., Nature. 321, 522-525 (1986); Reichmann et al., Nature. 332, 323-329 (1988); Presta, Curr. Op. Struct. Biol. 2, 593-596 (1992); Holmes, et al., J. Immunol., 158:2192-2201 (1997) and Vaswani, et al., Annals Allergy, Asthma & Immunol., 81:105-115 (1998).

[0161] While standardized procedures are available and useful to generate antibodies, the size of antibodies, the multi-stranded structure of antibodies and the complexity of six binding loops present in antibodies constitute a hurdle to the improvement and the manufacture of large quantities of antibodies. Hence, the invention further encompasses the use of binding entities, which comprise polypeptides that can recognize and bind to gp41 and/or gp120 polypeptides having the three dimensional structures provided herein.

[0162] A number of proteins can serve as protein scaffolds to which binding domains can be attached and thereby form a suitable binding entity. The binding domains bind or interact with the polypeptide sequences of the invention while the protein scaffold merely holds and stabilizes the binding domains so that they can bind. A number of protein scaffolds can be used, for example, phage capsid proteins. See Review in Clackson & wells, Trends Biotechnol. 12:173-184 (1994). Phage capsid proteins have been used as scaffolds for displaying random peptide sequences, including bovine pancreatic trypsin inhibitor (Roberts et al., PNAS USA. 89:2429-2433 (1992)), human growth hormone (Lowman et al., Biochemistry. 30:10832-10838 (1991)), Venturini et al., Protein Peptide Letters. 1:70-75 (1994)), and the IgG binding domain of Streptococcus (O'Neil at al., Techniques in Protein Chemistry V (Crabb, L., ed.) pp. 517-524, Academic Press, San Diego (1994)). These scaffolds have displayed a single randomized loop or region that can be modified to include binding domains for gp41 and/or gp120 polypeptides with the structures described herein.

[0163] The small 74 amino acid .alpha.-amylase inhibitor Tendamistat has also been employed as a presentation scaffold on the filamentous phage M13. (McConnell, S. J., & Hoess, R. H., J. Mol. Biol. 250:460-470 (1995)). Tendamistat is a .beta.-sheet protein derived from Streptomyces tendae. It has a number of features that make it an attractive scaffold for binding entities, including its small size, stability, and the availability of high resolution NMR and X-ray structural data. The overall topology of Tendamistat is similar to that of an immunoglobulin domain, with two .beta.-sheets connected by a series of loops. In contrast to immunoglobulin domains, the .beta.-sheets of Tendamistat are held together with two rather than one disulfide bond, accounting for the considerable stability of the protein. The loops of Tendamistat can serve a function similar to the CDR loops of immunoglobulins and can be easily randomized by in vitro mutagenesis. Tendamistat may be antigenic in humans. Hence, binding entities that employ Tendamistat are preferably employed in vitro.

[0164] Fibronectin type III domain has also been used as a protein scaffold to which binding entities can be attached. Fibronectin type III is part of a large subfamily (Fn3 family or s-type Ig family) of the immunoglobulin superfamily. Sequences, vectors and cloning procedures for using such a fibronectin type III domain as a protein scaffold for binding entities (e.g. CDR peptides) are provided, for example, in U.S. Patent Application Publication 20020019517. See also, Bork, P. & Doolittle, R. F. (1992) Proc. Natl. Acad. Sci. USA. 89, 8990-8994; Jones, E. Y. (1993) The immunoglobulin superfamily. Curr. Opinion Struct. Biol. 3, 846-852; Bork, P. at al., (1994) J. Mol. Biol. 242, 309-320; and Campbell, I.D. & Spitzfaden, C. (1994) Structure. 2, 233-337.

[0165] In the immune system, specific antibodies are selected and amplified from a large library (affinity maturation). The combinatorial techniques employed in immune cells can be mimicked by mutagenesis and the generation of combinatorial libraries of binding entities. Variant binding entities, antibody fragments and antibodies therefore can also be generated through display-type technologies. Such display-type technologies include, for example, phage display, retroviral display, ribosomal display, and other techniques. Techniques available in the art can be used for generating libraries of binding entities and for screening those libraries; the selected binding entities can be subjected to additional maturation, such as affinity maturation. Wright and Harris, supra., Hanes and Plucthau PNAS USA 94:4937-4942 (1997) (ribosomal display), Parmley and Smith, Gene. 73:305-318 (1988) (phage display), Scott TIBS. 17:241-245 (1992), Cwirla et al. PNAS USA. 87:6378-6382 (1990), Russel et al. Nucl. Acids Res. 21:1081-1085 (1993), Hoganboom et al. Immunol. Reviews. 130:43-68 (1992), Chiswell and McCafferty TIBTECH. 10:80-84 (1992), and U.S. Pat. No. 5,733,743.

[0166] The invention therefore also provides methods of mutating antibodies, CDRs, or binding domains to optimize their affinity, selectivity, binding strength and/or other desirable properties. A mutant binding domain refers to an amino acid sequence variant of a selected binding domain (e.g., a CDR). In general, one or more of the amino acid residues in the mutant binding domain is different from what is present in the reference binding domain. Such mutant antibodies necessarily have less than 100% sequence identity or similarity with the reference amino acid sequence. In general, mutant binding domains have at least 75% amino acid sequence identity or similarity with the amino acid sequence of the reference binding domain. Preferably, mutant binding domains have at least 80%, more preferably at least 85%, even more preferably at least 90%, and most preferably at least 95% amino acid sequence identity or similarity with the amino acid sequence of the reference binding domain.

[0167] For example, affinity maturation using phage display can be utilized as one method for generating mutant binding domains. Affinity maturation using phage display refers to a process, such as is described in Lowman et al., Biochemistry. 30(45): 10832-10838 (1991) and in Hawkins et al., J. Mol. Biol. 254: 889-896 (1992). While not strictly limited to the following description, this process can be described briefly as involving mutation of several binding domains or antibody hypervariable regions at a number of different sites with the goal of generating all possible amino acid substitutions at each site. The binding domain mutants thus generated are displayed in a monovalent fashion from filamentous phage particles as fusion proteins. Fusions are generally made to the gene III product of M13. The phage expressing the various mutants can be cycled through several rounds of selection for the trait of interest, e.g. binding affinity or selectivity. The mutants of interest are isolated and sequenced. Such methods are described in more detail in U.S. Pat. No. 5,750,373, U.S. Pat. No. 6,290,957 and in Cunningham, B. C. et al., EMBO J. 13(11), 2508-2515 (1994).

[0168] In one embodiment, the invention provides methods of manipulating binding entity or antibody polypeptides or the nucleic acids encoding them to generate binding entities, antibodies and antibody fragments with improved binding properties that recognize and bind to gp41, gp120 and/or gp41/gp120 stabilized trimer complexes.

[0169] Such methods of mutating portions of an existing binding entity or antibody involve fusing a nucleic acid encoding a polypeptide that encodes a binding domain for an antigen, immunogen, or disease marker to a nucleic acid encoding a phage coat protein to generate a recombinant nucleic acid encoding a fusion protein, mutating the recombinant nucleic acid encoding the fusion protein to generate a mutant nucleic acid encoding a mutant fusion protein, expressing the mutant fusion protein on the surface of a phage, and selecting phage that bind to the gp41 and/or gp120 polypeptides comprising a stabilized trimer.

[0170] Accordingly, the invention provides antibodies, antibody fragments, and binding entity polypeptides that can recognize and bind to a gp140 or a gp41-gp120 stabilized trimer complex (e.g., polypeptides having any of the sequences provided herein or combinations thereof). The invention further provides methods of manipulating those antibodies, antibody fragments, and binding entity polypeptides to optimize their binding properties or other desirable properties (e.g., stability, size, ease of use).

Administration

[0171] The polypeptides, binding entities and antibodies of the invention, including their salts, are administered so as to achieve a reduction in at least one symptom associated with an infection, indication or disease, or a decrease in the amount of antibody associated with the indication or disease.

[0172] To achieve the desired effect(s), the binding entities, antibodies, polypeptides (e.g. having any of the sequences disclosed here, or combinations thereof), variants thereof, a combination thereof, or compositions comprising any of these may be administered as single or divided dosages, for example, of at least about 0.01 mg/kg to about 500 to 750 mg/kg, of at least about 0.01 mg/kg to about 300 to 500 mg/kg, at least about 0.1 mg/kg to about 100 to 300 mg/kg or at least about 1 mg/kg to about 50 to 100 mg/kg of body weight, although other dosages may provide beneficial results. The amount administered will vary depending on various factors including, but not limited to, the polypeptide, binding entity or antibody chosen, the disease, the weight, the physical condition, the health, the age of the mammal, whether prevention or treatment is to be achieved, and if the polypeptide, binding entity or antibody is chemically modified. Such factors can be readily determined by the clinician employing animal models or other test systems that are available in the art.

[0173] Administration of the therapeutic agents in accordance with the present invention may be in a single dose, in multiple doses, in a continuous or intermittent manner, depending, for example, upon the recipient's physiological condition, whether the purpose of the administration is therapeutic or prophylactic, and other factors known to skilled practitioners. The administration of the therapeutic agents of the invention may be essentially continuous over a pre-selected period of time or may be in a series of spaced doses. Both local and systemic administration is contemplated.

[0174] To prepare a composition for administration to a subject, polypeptides, binding entities or antibodies are synthesized or otherwise obtained, purified as necessary or desired and then lyophilized and stabilized. Such therapeutic agents can then be adjusted to the appropriate concentration, and optionally combined with other agents. The absolute weight of a given therapeutic agent included in a unit dose can vary widely. For example, about 0.01 to about 2g, or about 0.1 to about 500 mg, of at least one therapeutic agent of the invention, or a plurality of therapeutic agents can be administered. Alternatively, the unit dosage can vary from about 0.01 g to about 50 g, from about 0.01 g to about 35g, from about 0.1 g to about 25 g, from about 0.5 g to about 12 g, from about 0.5 g to about 8 g, from about 0.5 g to about 4 g, or from about 0.5 g to about 2 g. Daily doses of the therapeutic agents of the invention can vary as well. Such daily doses can range, for example, from about 0.1 g/day to about 50 g/day, from about 0.1 g/day to about 25 g/day, from about 0.1 g/day to about 12 g/day, from about 0.5 g/day to about 8 g/day, from about 0.5 g/day to about 4 g/day, and from about 0.5 g/day to about 2 g/day.

[0175] In the treatment or prevention of viral infections, an appropriate dosage level will generally be about 0.001 to 100 mg per kg patient body weight per day, which can be administered in single or multiple doses. Preferably, the dosage level will be about 0.01 to about 25 mg/kg per day; more preferably about 0.05 to about 10 mg/kg per day. A suitable dosage level may be about 0.01 to 25 mg/kg per day, about 0.05 to 10 mg/kg per day, or about 0.1 to 5 mg/kg per day. Within this range the dosage may be about 0.005 to about 0.05, 0.05 to 0.5 or 0.5 to 5 mg/kg per day. For oral administration, the compositions are preferably provided in the form of tablets containing about 1 to 1000 milligrams of the active ingredient, particularly about 1, 5, 10, 15, 20, 25, 50, 75, 100, 150, 200, 250, 300, 400, 500, 600, 750, 800, 900, and 1000 milligrams of the active ingredient for the symptomatic adjustment of the dosage to the patient to be treated. The compounds may be administered on a regimen of 1 to 4 times per day, preferably once or twice per day.

[0176] Thus, one or more suitable unit dosage forms comprising the therapeutic agents of the invention can be administered by a variety of routes including oral, parenteral (including subcutaneous, intravenous, intramuscular and intraperitoneal), rectal, vaginal, dermal, transdermal, intrathoracic, intrapulmonary and intranasal (respiratory) routes. The therapeutic agents may also be formulated for sustained release (for example, using microencapsulation, see WO 94/07529, and U.S. Pat. No. 4,962,091). The formulations may, where appropriate, be conveniently presented in discrete unit dosage forms and may be prepared by any of the methods well known to the pharmaceutical arts. Such methods may include the step of mixing the therapeutic agent with liquid carriers, solid matrices, semi-solid carriers, finely divided solid carriers or combinations thereof, and then, if necessary, introducing or shaping the product into the desired delivery system.

[0177] When the therapeutic agents of the invention are prepared for oral administration, they are generally combined with a pharmaceutically acceptable carrier, diluent or excipient to form a pharmaceutical formulation, or unit dosage form. For oral administration, the therapeutic agents may be present as a powder, a granular formulation, a solution, a suspension, an emulsion or in a natural or synthetic polymer or resin for ingestion of the active ingredients from a chewing gum. The active therapeutic agents may also be presented as a bolus, electuary or paste. Orally administered therapeutic agents of the invention can also be formulated for sustained release, e.g., the therapeutic agents can be coated, micro-encapsulated, or otherwise placed within a sustained delivery device. The total active ingredients in such formulations comprise from 0.1 to 99.9% by weight of the formulation.

[0178] By "pharmaceutically acceptable" it is meant a carrier, diluent, excipient, and/or salt that is compatible with the other ingredients of the formulation and that is not deleterious to the recipient thereof. Pharmaceutically acceptable formulations containing the therapeutic agents of the invention can be prepared by procedures known in the art using well-known and readily available ingredients. For example, the therapeutic agents can be formulated with common excipients, diluents, or carriers, and formed into tablets, capsules, solutions, suspensions, powders, aerosols and the like. Examples of excipients, diluents, and carriers that are suitable for such formulations include buffers, as well as fillers and extenders such as starch, cellulose, sugars, mannitol, and silicic derivatives. Binding agents can also be included such as carboxymethyl cellulose, hydroxymethylcellulose, hydroxypropyl methylcellulose and other cellulose derivatives, alginates, gelatin, and polyvinyl-pyrrolidone. Moisturizing agents can be included such as glycerol, disintegrating agents such as calcium carbonate and sodium bicarbonate. Agents for retarding dissolution can also be included such as paraffin. Resorption accelerators such as quaternary ammonium compounds can also be included. Surface active agents such as cetyl alcohol and glycerol monostearate can be included. Adsorptive carriers such as kaolin and bentonite can be added. Lubricants such as talc, calcium and magnesium stearate, and solid polyethyl glycols can also be included. Preservatives may also be added. The compositions of the invention can also contain thickening agents such as cellulose and/or cellulose derivatives. They can also contain gums such as xanthan, guar or carbo gum or gum arabic, or alternatively polyethylene glycols, bentones and montmorillonites, and the like.

[0179] Illustratively, tablets or caplets containing the therapeutic agents of the invention can include buffering agents, such as calcium carbonate, magnesium oxide and magnesium carbonate. Caplets and tablets can also include inactive ingredients such as cellulose, pre-gelatinized starch, silicon dioxide, hydroxy propyl methyl cellulose, magnesium stearate, microcrystalline cellulose, starch, talc, titanium dioxide, benzoic acid, citric acid, corn starch, mineral oil, polypropylene glycol, sodium phosphate, zinc stearate, and the like. Hard or soft gelatin capsules containing at least one therapeutic agent of the invention can contain inactive ingredients such as gelatin, microcrystalline cellulose, sodium lauryl sulfate, starch, talc, and titanium dioxide, and the like, as well as liquid vehicles such as polyethylene glycols (PEGS) and vegetable oil. Moreover, enteric-coated caplets or tablets containing one or more therapeutic agents of the invention are designed to resist disintegration in the stomach and dissolve in the more neutral to alkaline environment of the duodenum.

[0180] The therapeutic agents of the invention can also be formulated as elixirs or solutions for convenient oral administration or as solutions appropriate for parenteral administration, for instance by intramuscular, subcutaneous, intraperitoneal or intravenous routes. The pharmaceutical formulations of the therapeutic agents of the invention can also take the form of an aqueous or anhydrous solution or dispersion, or alternatively the form of an emulsion or suspension or salve.

[0181] Thus, the therapeutic agents may be formulated for parenteral administration (e.g., by injection, for example, bolus injection or continuous infusion) and may be presented in unit dose form in ampoules, pre-filled syringes, small volume infusion containers or in multi-dose containers. As noted above, preservatives can be added to help maintain the shelve life of the dosage form. The therapeutic agents and other ingredients may form suspensions, solutions, or emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents. Alternatively, the therapeutic agents and other ingredients may be in powder form, obtained by aseptic isolation of sterile solid or by lyophilization from solution, for constitution with a suitable vehicle, e.g., sterile, pyrogen-free water, before use.

[0182] These formulations can contain pharmaceutically acceptable carriers, vehicles and adjuvants that are well known in the art. It is possible, for example, to prepare solutions using one or more organic solvent(s) that is/are acceptable from the physiological standpoint, chosen, in addition to water, from solvents such as acetone, ethanol, isopropyl alcohol, glycol ethers such as the products sold under the name "Dowanol," polyglycols and polyethylene glycols, C.sub.1-C.sub.4 alkyl esters of short-chain acids, ethyl or isopropyl lactate, fatty acid triglycerides such as the products marketed under the name "Miglyol," isopropyl myristate, animal, mineral and vegetable oils and polysiloxanes.

[0183] It is possible to add, if necessary, an adjuvant selected from antioxidants, surfactants, other preservatives, film-forming, keratolytic or comedolytic agents, perfumes, flavorings and colorings. Antioxidants such as t-butylhydroquinone, butylated hydroxyanisole, butylated hydroxytoluene and .alpha.-tocopherol and its derivatives can be added.

[0184] Additionally, the therapeutic agents are well suited to formulation as sustained release dosage forms and the like. The formulations can be so constituted that they release the active therapeutic agents, for example, in a particular part of the intestinal or respiratory tract or within the vagina or rectum, possibly over a period of time. Coatings, envelopes, and protective matrices may be made, for example, from polymeric substances, such as polylactide-glycolates, liposomes, microemulsions, microparticles, nanoparticles, or waxes.

[0185] For topical, vaginal or rectal administration, the therapeutic agents may be formulated as is known in the art for direct application to a target area. Forms chiefly conditioned for topical application take the form, for example, of creams, milks, gels, foams, dispersion or microemulsions, lotions thickened to a greater or lesser extent, impregnated pads of tampons, ointments or sticks, aerosol formulations (e.g., sprays or foams), soaps, detergents, lotions or cakes of soap. Other conventional forms for this purpose include wound dressings, coated bandages or other polymer coverings, ointments, creams, foams, lotions, pastes, jellies, sprays, and aerosols. Thus, the therapeutic agents of the invention can be delivered via patches or bandages for dermal administration. Alternatively, the therapeutic agents can be formulated to be part of an adhesive polymer, such as polyacrylate or acrylate/vinyl acetate copolymer. For long-term applications it might be desirable to use microporous and/or breathable backing laminates, so hydration or maceration of the skin can be minimized. The backing layer can be any appropriate thickness that will provide the desired protective and support functions. A suitable thickness will generally be from about 10 to about 200 microns.

[0186] Ointments and creams may, for example, be formulated with an aqueous or oily base with the addition of suitable thickening and/or gelling agents. Lotions may be formulated with an aqueous or oily base and will in general also contain one or more emulsifying agents, stabilizing agents, dispersing agents, suspending agents, thickening agents, or coloring agents. The active polypeptides can also be delivered via iontophoresis, e.g., as disclosed in U.S. Pat. No. 4,140,122; 4,383,529; or 4,051,842. The percent by weight of a therapeutic agent of the invention present in a topical formulation will depend on various factors, but generally will be from 0.01% to 95% of the total weight of the formulation, and typically 0.1-85% by weight.

[0187] Drops, such as eye drops or nose drops, may be formulated with one or more of the therapeutic agents in an aqueous or non-aqueous base also comprising one or more dispersing agents, solubilizing agents or suspending agents. Liquid sprays are conveniently delivered from pressurized packs. Drops can be delivered via a simple eye dropper-capped bottle, or via a plastic bottle adapted to deliver liquid contents dropwise, via a specially shaped closure.

[0188] The therapeutic agents may further be formulated for topical administration in the mouth or throat. For example, the active ingredients may be formulated as a lozenge further comprising a flavored base, usually sucrose and acacia or tragacanth; pastilles comprising the composition in an inert base such as gelatin and glycerin or sucrose and acacia; and mouthwashes comprising the composition of the present invention in a suitable liquid carrier.

[0189] The pharmaceutical formulations of the present invention may include, as optional ingredients, pharmaceutically acceptable carriers, diluents, solubilizing or emulsifying agents, and salts of the type that are available in the art. examples of such substances include normal saline solutions such as physiologically buffered saline solutions and water. Specific non-limiting examples of the carriers and/or diluents that are useful in the pharmaceutical formulations of the present invention include water and physiologically acceptable buffered saline solutions such as phosphate buffered saline solutions pH 7.0-8.0.

[0190] In general, the dosage forms of the invention comprise an amount of at least one of the agents of the invention effective to treat, reduce the severity of, or prevent the clinical symptoms of a specific infection, indication, condition, or disease. Any statistically significant attenuation of one or more symptoms of an infection, indication or disease that has been treated pursuant to the method of the present invention is considered to be a treatment of such infection, indication or disease within the scope of the invention.

[0191] Alternatively, for administration by inhalation or insufflation, the composition may take the form of a dry powder, for example, a powder mix of the therapeutic agent and a suitable powder base such as lactose or starch. The powder composition may be presented in unit dosage form in, for example, capsules or cartridges, or, e.g., gelatin or blister packs from which the powder may be administered with the aid of an inhalator, insufflator, or a metered-dose inhaler (see, for example, the pressurized metered dose inhaler (MDI) and the dry powder inhaler disclosed in Newman, S. P. in Aerosols and the Lung, Clarke, S. W. and Davia, D. eds., pp. 197-224, Butterworths, London, England, 1994).

[0192] Therapeutic agents of the present invention can be administered as a dry powder or in an aqueous solution when administered in an aerosol or inhaled form. Other aerosol pharmaceutical formulations may comprise, for example, a physiologically acceptable buffered saline solution containing between about 0.1 mg/ml and about 100 mg/ml of one or more of the polypeptides of the present invention specific for the indication or disease to be treated. Dry aerosol in the form of finely divided solid compound, polypeptide or polypeptide particles that are not dissolved or suspended in a liquid are also useful in the practice of the present invention. Therapeutic agents of the present invention may be formulated as dusting powders and comprise finely divided particles having an average particle size of between about 1 and 5 .mu.m, alternatively between 2 and 3 .mu.m. Finely divided particles may be prepared by pulverization and screen filtration using techniques well known in the art. The particles may be administered by inhaling a predetermined quantity of the finely divided material, which can be in the form of a powder. It will be appreciated that the unit content of active ingredient or ingredients contained in an individual aerosol dose of each dosage form need not in itself constitute an effective amount for treating the particular infection, indication or disease since the necessary effective amount can be reached by administration of a plurality of dosage units. Moreover, the effective amount may be achieved using less than the dose in the dosage form, either individually, or in a series of administrations.

[0193] The therapeutic agents of the invention can also be administered to the respiratory tract. Thus, the present invention also provides aerosol pharmaceutical formulations and dosage forms for use in the methods of the invention. For administration to the upper (nasal) or lower respiratory tract by inhalation, the therapeutic polypeptides of the invention are conveniently delivered from a nebulizer or a pressurized pack or other convenient means of delivering an aerosol spray. Pressurized packs may comprise a suitable propellant such as dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or other suitable gas.

[0194] In the case of a pressurized aerosol, the dosage unit may be determined by providing a valve to deliver a metered amount. Nebulizers include, but are not limited to, those described in U.S. Pat. Nos. 4,624,251; 3,703,173; 3,561,444; and 4,635,627. Aerosol delivery systems of the type disclosed herein are available from numerous commercial sources including Fisons Corporation (Bedford, Mass.), Schering Corp. (Kenilworth, N.J.) and American Pharmoseal Co., (Valencia, Calif.). For intra-nasal administration, the therapeutic agents may be administered via nose drops, a liquid spray, such as via a plastic bottle atomizer or metered-dose inhaler. Typical of atomizers are the Mistometer (Wintrop) and the Medihaler (Riker).

[0195] Also contemplated are combination products that include one or more of the therapeutic agents as active agents, e.g., antibodies and binding proteins, of the present invention and one or more other therapeutic agents, e.g., anti-viral agents, anti-microbial agents, pain relievers, anti-inflammatory agents, anti-bacterial agents, antihistamines, bronchodilators and the like, whether for the condition(s) described or some other condition. Accordingly, other anti-retroviral agents can be included in the compositions of the invention such as protease inhibitors, retroviral polymerase inhibitors, azidothymidine (AZT), didanoside (DDI), soluble CD4, a polysaccharide sulfates, T22, bicyclam, suramin, antisense oliogonulceotides, ribozymes, rev inhibitors, protease inhibitors, glycolation inhibitors, interferon and the like.

[0196] The present invention further pertains to a packaged pharmaceutical composition for treating and/or preventing viral (e.g. HIV) infections, such as a kit or other container. The kit or container holds a therapeutically effective amount of a pharmaceutical composition for treating and preventing viral infections and instructions for using the pharmaceutical composition for treating and preventing the viral infection. The pharmaceutical composition can include at least one polypeptide of the present invention, in a therapeutically effective amount such that viral infection is treated or prevented.

[0197] In an alternative embodiment, the pharmaceutical composition can include at least one binding entity or antibody of the present invention in a therapeutically effective amount such that the viral infection is treated, reduced, ameliorated, or prevented.

Experimental Details

[0198] This invention is illustrated in the Experimental Details sections which follow. The Experimental Details section is set forth to aid in an understanding of the invention but is not intended to, and should not be construed to, limit in any way the invention as set forth in the claims which follow thereafter.

Experimental Details I

Introduction

[0199] The HIV-1 envelope glycoprotein is expressed on the viral membrane as a trimeric complex, formed by three gp120 surface glycoproteins non-covalently associated with three membrane-anchored gp41 subunits. The labile nature of the association between gp120 and gp41 hinders the expression of soluble, fully cleaved, trimeric gp140 proteins for structural and immunization studies. Disruption of the primary cleavage site within gp160 allows the production of stable gp140 trimers, but cleavage-defective trimers are antigenically dissimilar from their cleaved counterparts. Soluble, stabilized, proteolytically cleaved, trimeric gp41 proteins can be generated by engineering an intermolecular disulphide bond between gp120 and gp41 (SOS), combined with a single residue change, 1559P, within gp41 (SOSIP). SOSIP gp140 proteins based on the subtype A HIV-1 strain KNH1144 form particularly homogenous trimers compared to a prototypic strain (JR-FL, subtype B). Described herein are the determinants of this enhanced stability which are located in the N-terminal region of KNH11144 gp41 and that, when substituted into heterologous Env sequences (e.g., JR-FL and Ba-L) they have a similarly beneficial effect on trimer stability. These stabilized trimers retain the epitopes for several neutralizing antibodies and related agents (CD4-IgG2, b12, 2G12, 2F5 and 4E10) and the CD4-IgG2 molecule, so that the overall antigenic structure of the gp140 protein has not been adversely impaired by the trimer-stabilizing substitutions.

Materials and Methods

[0200] Reagents: CD4-IgG2 (PRO 542) (Allaway et al., 1995) and monoclonal antibody (MAb) PA-1 were provided by Dr. William Olson (Progenics Pharmaceuticals, Inc.) Soluble D1D2-CD4 (sCD4-183, 2 domain) (Garlick et al., 1990) was obtained from the NIH AIDS Research and Reference Program. MAb CA13 (ARP3119), from Ms C. Arnold, was provided by the EU Programme EVA Centralized Facility for AIDS Reagents, NIBSC, UK (AVIP Contract Number LSHP-CT-2004-503457). MAbs 2G12 (Calarese et al., 2003; Trkola et al., 1996), 2F5 (Parker et al., 2001; Zwick et al., 2001), 4E10 (Cardoso et al., 2005; Zwick et al., 2001) were obtained from Hermann Katinger, MAb 17b (Thali at al., 1993) from James Robinson and MAb b12 (Burton et al., 1994) from Dennis Burton. The hybridoma for the production of MAb B13 (HIV-1 gp160 Hyb, Chessie 13-39.1) (Abacioglu et al., 1994) was obtained from NIH AIDS Research and Reference Program (donated by George K. Lewis).

Plasmids and Construction of Chimeric and Mutant Env Genes:

[0201] Various HIV-1 env genes, cloned into the high-level mammalian expression vector pPPI4, were used for expression of soluble gp140 glycoproteins as previously described. Furin was expressed from pcDNA3.1-Furin (Binley et al., 2000; Sanders et al., 2000). The HIV-1 Env subtype A clone KNH1144 (accession number AF457066) (Beddows et al., 2006) and the subtype B clones JR-FL and Ba-L have been described previously (Binley et al., 2000). In domain-swap experiments, the JR-FL gp41 ectodomain was replaced with the corresponding region of KNH1144 gp41, using EcoRI and HindIII restriction enzymes, followed by repair of the restriction sites and verification of the sequences. Specific amino acid substitutions were made using the QuikChange.RTM. II XL site-directed mutagenesis kit (Stratagene Inc., La Jolla, Calif.) and the appropriate primers. The introduced mutations were verified by sequencing.

Transfection and Expression of Soluble qp140 Envelope Glycoproteins:

[0202] The human Embryonic Kidney cell line HEK293T was used for expression of the various envelope glycoproteins by transient transfection, as previously described (Binley at al., 2000; Sanders et al., 2000; Sanders et al., 2002). HEK293T cells were grown in Dulbecco's modified Eagle's medium (DMEM, Gibco) supplemented with 10% fetal calf serum, penicillin, streptomycin and L-glutamine. Transient transfections were performed using Polyethylenimine (PEI) (Polysciences Inc., Warrington, Pa.) (Boussif et al., 995; Kirschner et al., 2006). For each small-scale transfection, 7 .mu.g of env DNA and 3.5 .mu.g of Turin DNA were used. Five hours post-transfection, the 293T cells were washed and the media replaced with DMEM containing 0.05% bovine serum albumin (BSA), antibiotics (penicillin, streptomycin) and L-glutamine. Forty-eight hours post-transfection, the supernatant was collected and filtered using a 0.45 .mu.m filter. A cocktail of protease inhibitors (Roche Diagnostics, Indianapolis, Ind.) was added before concentration of the supernatant by >20-fold using the Amicon ultracentrifugal filter system (Millipore, Billerica, Mass.). Aliquots of concentrated supernatant were analyzed by sodium-dodecyl sulfate polyacrylamide electrophoresis (SDS-PAGE), or stored at -80.degree. C.

Purification of Soluble Envelope Glycoproteins:

[0203] Supernatants (1 L) from transfected HEK293T cells were concentrated by >20-fold then processed by Lectinaffinity chromatography. The column eluate was then size-fractionated using an analytical Superose.TM. 6 column (GE Amersham Pharmacia, Piscataway, N.J.) equilibrated with phosphate-buffered saline (PBS; 100 mM NaCl, 50 mM sodium phosphate, pH 7.0). The column was calibrated with protein standards of known molecular weights (HMW Gel Filtration Calibration Kit; Amersham Pharmacia, Piscataway, N.J.). Fractions (200 .mu.l) were collected and analyzed using Blue-native polyacrylamide electrophoresis (BN-PAGE) and SDS-PAGE. Quantification of proteins was carried out using the BCA quantification kit (Pierce) with known BSA standards.

BN-PAGE, SDS-PAGE and Western Blot Analysis:

[0204] BN-PAGE was performed as described previously by Schulke et al. (2002). Concentrated culture supernatants or purified protein samples were diluted with an equal volume of a loading buffer containing 100 mM 4-(N-morpholino) propane sulfonic acid (MOPS), 100 mM Tris-HCl (pH 7.7), 40% glycerol, 0.1% Coomassie blue, and loaded onto a 4-12% Bis-Tris NuPAGE gel (Invitrogen). Gel electrophoresis was performed at 100 V for 3 h using 50 mM MOPS, 50 mM Tris (pH 7.7) as electrophoresis buffer. SOS-PAGE was performed as described previously by Schulke et al. (2002). Reduced and non-reduced samples were prepared in Laemmli sample buffer (62.5 mM Tris-HCl, pH6.8, 2% SDS, 25% glycerol, 0.01% DTT) and boiled for 5 min in the presence or absence of 50 mM dithiothreitol (DTT), respectively. Western blot analyses were performed as described elsewhere (Schulke et al., 2002). Following transfer, the polyvinylidene difluoride (PVDF) membrane was destained, then probed using anti-Env MAbs CA13 (ARP3119) or 613, followed by horseradish peroxidase-labeled anti-mouse immunoglobulin G (IgG) (Kirkegaard s Perry Labs), at a final concentration of 0.2 .mu.g/ml. The bound MAbs were detected using the Western Blot Chemiluminescence Reagent Plus system (Perkin-Elmer Life Sciences, Boston, Mass.). Protein mixtures containing Thyroglobulin (669 kDa), Ferritin (440 kDa), Catalase (232 kDa), Lactate dehydrogenase (140 kDa) and BSA (66 kDa) (Amersham Biosciences) were used as standard markers for native gels. For denaturing electrophoresis, the MultiMark.RTM. multi-colored standard (Invitrogen) was used.

BIAcore Surface Plasmon Resonance (SPR):

[0205] The BIAcore X system (BIAcore Inc., Uppsala, Sweden) was used for comparison of the JR-FL WT versus mutant gp140 env binding to various monoclonal antibodies. All assays were performed at 25.degree. C. using HBS-EP buffer (10 mM HEPES, pH 7.4, 150 mM NaCl, 3 mM EDTA, 0.005% [v/v] Surfactant P20; BIAcore, Uppsala, Sweden), which was degassed for 1 h before use. The flow-rate was maintained at 10 .mu.l/minute. A BIAcore streptavidin (SA) chip was used for capturing .about.1000 response units (RU) of biotinylated protein G (Pierce) in both the experimental and the control flow-cells. Biotin was then used to block the uncoated streptavidin surface on both flow-cells. In the experimental cell, .about.1000 RU of various MAbs were directionally captured onto the surface-attached biotinylated Protein G via their Fc regions. Purified envelope glycoproteins (5 nM) were then injected for analysis of their ligand-binding properties. For study of the CD4-induced binding of MAbs directed to the CD41-epitope, D1D2-CD4 (at a 6-molar excess concentration) was incubated with the envelope glycoprotein for 1 hour at room temperature before injection. Following each run, the sensor surface was regenerated using two 10 .mu.l injections of 10 mM Glycine-HCl, pH 3.0. For each analyte, association was measured for 180 s, dissociation for a further >500 s. All binding studies were performed three times (n=3) with good reproducibility. The data were analyzed using BIAevaluation software 3.2 (BIAcore Inc). To correct for refractive index changes and instrument noise, the response data from the control flow-cell were subtracted from those obtained from the experimental flow-cell. For comparison of the antigenicity profiles of the wild-type and mutant gp140 proteins, the end-of-injection RU values +/-SD (n=3) are reported.

Results

[0206] Specific amino acids in the N-terminal region of gp41.sub.ECTO contribute to enhanced oligomerization of cleaved gp140 from KNH1144. Cleaved, SOSIP gp140 proteins from the subtype A strain KNH1144 form unusually stable and homogenous trimers compared to JR-FL SOSIP gp140, which is expressed as both dimers and trimers (Sanders et al., 2002). The SOS gp140 protein from KNH1144 is also more stable than the corresponding JR-FL construct, the latter being expressed as a mixture of trimers, dimers and, predominantly, monomers (Beddows et al., 2006; Binley et al., 2000). On purification, JR-FL SOS gp140 yields mostly monomeric gp140 proteins as a result of the instability of the gp41-gp41 interactions (Binley et al., 2000). The N-terminal region of gp41, particularly around the Heptad Repeat 1 (HR1) region, plays a role in oligomerization of gp140 proteins (Center, Kemp, and Poumbourios, 1997; Center et al., 2004; Poumbourios at al., 1997). When the N-terminal regions of gp41 from KNH144 and JR-FL were aligned, five amino acids were seen to differ at amino acid positions 535, 543, 553, 567 and 588 (FIG. 1). While KNH1144 has isoleucine (I) at amino acid position 535, JR-FL has methionine (M); while KNH1144 has glutamine (Q) at amino acid position 543, JR-FL has leucine (L); while KNH1144 has serine (S) at amino acid position 553, JR-FL has asparagine (N); while KNH1144 has lysine (K) at amino acid position 567, JR-FL has glutamine (Q); and while KNH1144 has arginine (R) at amino acid position 588, JR-FL has glycine (G). To determine, which, if any, of these five differences contributed to the enhanced stability of KNH1144 trimers, each residue in KNH11144 SOSIP gp140 was substituted with the corresponding one from JR-FL; the mutant Env proteins were then expressed and studied on BN-PAGE gels. The wild-type forms of KNH1144 SOS and SOSIP gp140 proteins were also analyzed to allow a comparison with the trimer-stabilizing effect of the I559P substitution in the SOSIP version (FIG. 2A). In general, the amino acid substitutions described below had similar effects whether they were made in the SOS or the SOSIP gp140 background, so only a subset of the results is depicted. The S553N and 85880 changes had little or no effect on trimer formation by KNH1144 SOSIP gp140 (FIG. 2B, lanes 3 and 5), whereas the 1535M substitution enhanced trimer formation (FIG. 2B, lane 1), an observation confirmed in a larger scale expression and purification study. In contrast, substitutions of glutamine and lysine at positions 543 and 567 (Q543L and K567Q) destabilized the KNH1144 SOSIP gp140 trimers (FIG. 28, lanes 2 and 4). When all five amino acids were substituted in the KNH1144 SOS and SOSIP gp140 templates, the destabilizing effect on trimer formation was pronounced. The extent of the increase in monomer formation, compared to wild-type, was estimated to be -45% and -60% for the KNH1144 SOSIP and SOS gp140 mutants, respectively (FIG. 2C, lane 1, SOSIP; lane 2, SOS; compare with FIG. 2A). Hence, the five amino acid differences between the N-terminal regions of KNH1144 and JR-FL gp41 do influence the stability of cleaved gp140 trimers.

[0207] Substitution of five amino acids from the N-terminal region of KNH1144 gp41.sub.ECTO promotes JR-FL gp140 trimer formation. Both the SOS and SOSIP versions of JR-FL gp140 were used as templates on which to explore the effects of the five amino acid differences in the KNH1144 gp41 ectodomain, to take into account the additional, possibly complicating, influence of the I559P change. In the first construct, a chimera, the JR-FL gp120 subunit was combined with the KNH1144 gp41 ectodomain (JR-FLgp120-1144gp41 ECTO); the second construct was a mutant JR-FL SOS gp140 in which the five varying amino acids (positions 535, 543, 553, 567 and 588) were substituted by the corresponding residues from KNH1144 (JR-FL gp41 NT 1-5); the third was another chimera in which the C-terminal region of gp41.sub.ECTO from JR-FL was replaced by the corresponding segment of KNH1144 gp41 (JR-FL-1144 gp41 CT) (FIG. 3A). The various chimeric and mutant envelope glycoproteins were expressed in HEK293T cells and analyzed on BN-PAGE gels (FIG. 3B). JR-FL SOS gp140 was predominantly monomeric, while by contrast, the SOSIP gp140 formed dimers and trimers (FIG. 3B, lanes 1 and 2). The insertion of gp41.sub.ECTO from KNH1144 into the JR-FL SOS gp140 template stabilized the trimeric form, with a reduction in the amount of monomers present (FIG. 38, lane 3). The same change, but made in the JR-FL SOSIP gp140 context, had a lesser effect (FIG. 3B, compare lanes 2 and 4). Swapping the C-terminal region of gp41.sub.ECTO in either SOS or SOSIP JR-FL gp140 had no visible effect on oligomerization (FIG. 3B, compare lane 1 to lane 5 and lane 2 to lane 6). In contrast, oligomer formation by JR-FL SOS or SOSIP gp140 was increased by the substitution of the five varying amino acids in gp41.sub.ECTO with the corresponding residues from KNH1144 (M535I, L543Q, N553S, Q567K and G588R) (FIG. 3B, compare lane 7 to lane 1 and lane 8 to lane 2). These results confirm that the trimer-promoting determinants of KNH1144 are located in the N-terminal region of gp41. Moreover, the greater stability of KNH1144 gp140 trimers can be conferred upon a heterologous gp140, JR-FL, by altering the specific residues that differ between the two proteins.

[0208] Further studies showed that all five changes were necessary for creating an optimally stable and homogenous JR-FL gp140 trimer; various combinations of the five changes had negligible or partial effects. To ascertain whether the chimeric/mutant proteins were fully cleaved, the JR-FLgp120-KNH1144gp41(ECTO) SOS gp140 chimera and the JR-FL gp41 NT 1-5 SOS gp140 mutant were analyzed using SDS-PAGE (FIG. 3C). Under denaturing conditions, both gp140 proteins were resolved as monomers (lanes 1 and 3). When DTT was added to reduce the SOS disulphide bond, release of the gp120 subunit was complete (lanes 2 and 4), along with gp41.sub.ECTO (which is not detectable by the b13 MAb used for blotting). Hence aberrantly linked, uncleaved products do not contribute to the enhanced oligomerization of JR-FL gp140 conferred by substitution of the residues from the KNH1144 gp41 N-terminal region. The stabilized, mutant proteins are fully cleaved.

[0209] To further assess the formation and stability of JR-FL gp41 NT 1-5 SOS gp140 trimers, this protein and JR-FL SOS gp140 were purified using lectin-affinity and size exclusion chromatography (SEC) techniques. The SEC-fractionated aliquots were then resolved by BN-PAGE (FIG. 4). The JR-FL SOS gp140 protein was predominantly a monomer (FIG. 4A), while a much greater proportion of the Env species present in JR-FL gp41 NT 1-5 SOS migrated as well-resolved trimers (FIG. 4B; compare lanes 11-16 with the corresponding lanes in FIG. 4A). A densitometric analysis of the resolved gp140 trimer, dimer and monomer fractions on BN-PAGE was combined with BCA quantification of the pooled SEC fractions (trimer, dimer and monomer), to estimate the change in each gp140 species. Compared to the wild-type JR-FL SOS gp140 protein, trimer formation by JR-FL gp41 NT 1-5 SOS gp140 was increased by -20% and dimmer formation by .about.10%, whereas the monomer content was reduced by .about.50%.

Antigenic Properties of the Wild-Type JR-FL SOS and Stabilized JR-FL gp41 NT 1-5 SOS Mutant gp140s:

[0210] To determine whether the antigenicity of the mutant JR-FL gp140 had been altered by the introduction of the trimer-stabilizing substitutions, SPR methods were used to study the binding of various antibodies to the mutant JR-FL gp140, in comparison to the wild-type gp140. In these studies, biotinylated Protein G was immobilized onto a Streptavidin (SA)-coated chip, which was then used to capture various agents via their Fc regions (FIG. 5A). The CD4-IgG2 protein (used as a surrogate for CD4) and the following MAbs were all studied: b12 (neutralizing, anti-CD4BS), 2G12 (neutralizing, high-mannose epitope on the `silent face`), 2F5, 4E10 (both neutralizing, anti-gp41), PA-1 (nonneutralizing, anti-V3), b6 (non-neutralizing, anti-CD4BS) and 17b (non-neutralizing, CD4-induced epitope). Equal molar amounts of purified wild-type and mutant gp140 trimers (>90% purity) were then injected at 10 .mu.l/min, to react with the immobilized MAbs. Both the wild-type SOS gp140 and the mutant gp41 NT 1-5 SOS gp140 bound CD4-IgG2 similarly (FIG. 5A and Table 1). The reactivities of wild-type SOS gp140 and the mutant gp41 NT 1-5 SOS gp140 with b12 and 2G12 were also similar, with similar response unit (RU) values at the end-of-injection time (t=180 s). The two neutralizing anti-gp41 MAbs, 2F5 and 4E10, also bound similarly to the two gp140 proteins (FIG. 5A and Table 1). In the absence of sCD4 (D1D2-004), neither gp140 protein bound efficiently to MAb 17b, but when D1D2-CD4 was added, the 17b epitope was induced on both proteins. The extent of the induction was greater for the stabilized trimer than for the wild-type protein (25-fold compared to 5-fold respectively; FIG. 5A and Table 1). The non-neutralizing MAbs PA-1 and b6 bound less efficiently to the stabilized trimer than to its wild-type counterpart (FIG. 5A and Table 1). To ensure that the MAb binding properties of two trimer variants (wild-type and stabilized) were compared, the injected gp140 samples used in the ligand-binding assays were manually collected from the BIAcore X system and analyzed using BNPAGE. Both gp140 proteins were substantially trimeric, even after passage through the BIAcore system (FIG. 5B).

TABLE-US-00001 TABLE 1 SPR binding of various monoclonal antibodies or CD4-IgG2 to WT and mutant forms of trimeric JR-FL gp140 proteins Gp41 NT 1-5 SOS WT SOS gp140 gp140 mutant Mean RU .+-. SD Mean RU .+-. SD Test Agent (t = 180 s.sup.a) (t = 180 s.sup.a) CD4-IgG2 .sup. 212 .+-. 8.sup.b .sup. 224 .+-. 13 .sup.b b12 223 .+-. 6 195 .+-. 10 2G12 269 .+-. 7 278 .+-. 15 2F5 191 .+-. 8 216 .+-. 6 4E10 83 .+-. 4 94 .+-. 5 PA-1 242 .+-. 11 123 .+-. 10 b6 391 .+-. 14 233 .+-. 11 17b 39 .+-. 5 14 .+-. 2 17b (+D1D2 CD4) 237 .+-. 8 399 .+-. 18 .sup.aEnd-of-injection time (t) in seconds (s). .sup.bMean RU .+-. SD based on three experiments, all using 5 nM of analyte.

Substitution of Four Amino Acids in N-Terminal Region of gp41 ECTO also Increases the Stability of SOS gp140 from HIV-1 Ba-L:

[0211] To test whether the trimer-stabilizing effect of the above five gp41 amino acids was a generalized phenomenon, another subtype B Env protein, Ba-L was studied. Like JR-FL, Ba-L contains Met, Leu, Asn and Gln residues at positions 535, 543, 553 and 567, respectively. However, at position 588, Ba-L contains Arg, as does KNH1144 (FIG. 1). The four non-cognate amino acids from KNH1144 were introduced into the N-terminal region of Ba-L (M535I, L543Q, N553S, Q567K) to construct a mutant Ba-L gp41 NT 1-4 SOS gp140 protein. When expressed in HEK293T cells, the wild-type Ba-L SOS gp140, like JR-FL, was a mixture of monomers, dimers and trimers (FIG. 6A, lane 1). However, the mutant containing the above four amino acid substitutions was predominantly trimeric (FIG. 6A, lane 2), with >40% reduction in monomer formation. No individual substitution had as pronounced an effect as the quadruple combination. The enhanced trimerization of the mutant Ba-L gp41 NT 1-4 SOS gp140 was not attributable to the presence of aberrantly cross-linked proteins, as shown by SDS-PAGE under reducing and non-reducing conditions (FIG. 6B). Thus, under denaturing conditions, in the absence of the reducing agent, the mutant protein resolved as monomeric gp140; in the presence of DTT, reduction of the disulphide bond dissociated the gp140 into its constituent subunits (as in FIG. 3C, only the gp120 component is detected in this analysis). Taken together, these results suggest that modifications of a few selected amino acids in the N-terminal region of gp41 can improve the stability of gp140 trimers and that the finding might be generalizable to diverse HIV-1 genotypes.

Discussion

[0212] Described herein are residues in the N-terminal region of the gp41 ectodomain that influence the stability of trimeric forms of the HIV-1 gp140 glycoprotein, particularly the trimers that most, but of course incompletely, resemble the native form of the Env complex. The residues were found by inspection of the sequence of gP41.sub.ECTO from a subtype A SOSIP gp140 (KNH1144) that formed stable, cleaved trimers with unusual efficiency. Comparison of this sequence with that of JR-FL, a strain from which homogenous trimers are less easily made, identified five variable residues in a plausibly relevant region of gp41.sub.ECTO that lay in and around HR1. Substitution of those five residues in KNH1144 gp140 by the corresponding ones from JR-FL destabilized the resulting gp140 trimers. Conversely, and of more relevance, formation of JR-FL gp140 trimers could be considerably improved when the variable residues from KNH1144 were introduced in place of the JR-FL residues. The same approach also improved trimer formation in the context of the Ba-L sequence, suggesting that the observation is generally relevant for making stable, cleaved gp140 trimers. Substituting naturally variable amino acids may be a less invasive way to promote trimer stabilization than previously described alternatives, such as the use of heterologous trimerization domains (Yang et al., 2000; Yang et al., 2002), or the insertion of the SIV gp41 N-terminal region to make a HIV-SIV chimeric envelope glycoprotein (Center at al., 2004). The effect of substituting the KNH1144 gp41 residues into JR-FL and Ba-L is to reduce the heterogeneity of the oligomeric forms of SOS gp140 proteins when they are expressed as unpurified culture supernatants. Thus, there was a marked decrease in the amount of monomers present, lesser but sill notable decreases in dimers, tetramers and high-molecular weight aggregates and, of most relevance, an increase in the proportion of trimers.

[0213] When the stabilized JR-FL SOS gp140 protein was purified by lectin-affinity and size-exclusion chromatography, the amounts of monomers, tetramers and aggregates were reduced, whereas trimers were markedly more abundant and a small increase in the amount of dimers was also apparent. The dimers are likely to be dissociation products of trimers that arise during the purification process. This would not be too surprising, since the increase in trimer-stability is presumably only relative, not absolute, compared to the wild-type protein. Some of the amino acids in the KNH1144 N-terminal region have longer side chains than their JR-FL counterparts (KNH1144 vs. JR-FL: .quadrature.543L, K567Q and R588G).

[0214] It is also noteworthy that the S553, K567 and 8588 residues in KNH1144 have greater .alpha.-helix-stabilizing propensities than the corresponding residues, N553, Q567 and G588, in JR-FL. Hence, alterations in the size or the nature of the side chains strengthen localized helix-to-helix packing interactions in a way that stabilizes gp140 oligomers. Both the wild-type JR-FL SOS gp140 and the stabilized JR-FL gp41 NT 1-5 SOS gp140 mutant bound similarly to neutralizing antibodies and proteins (b12, 2G12, 2F5, 4E10 and CD4-IgG2). As shown herein, the overall antigenic structure of the stabilized gp140 trimers was not adversely influenced by the sequence changes introduced into gp41. The stabilized JR-FL trimers bound non-neutralizing antibodies (PA-1, b6 and 17b) to a lesser extent than the corresponding wild type trimers. Stabilizing the conformation of gp140 trimers is advantageous for use of these proteins as vaccine immunogens.

Experimental Details II:

[0215] To overcome the structural instability of the native Env complex, or soluble forms thereof, various amino acid sequence changes have been designed and introduced into the Env polypeptide to stabilize inter-subunit interactions between gp120 and gp41, or between the gp41 components of a trimer (Binley et al., 2000; Sanders et al., 2002). A disulfide bond was introduced between gp120 and gp41, together with an additional change in gp41 that promotes trimer stability after gp120 and gp41 are cleaved into separate subunits during Env processing (Binley et al., 2000; Sanders et al., 2002). The resulting gp140 proteins are designated SOSIP. Additional changes at the cleavage site between gp120 and gp41 promote proteolytic processing (Binley et al., 2002). As described herein, five amino acid changes in the highly conserved Leucine zipper (LZ)-like motif near the N-terminus of gp41 (i.e., I535, Q543, S553, K567 and R588) have been shown to contribute to trimer stability by reducing the prevalence of monomeric, dimeric, or aggregated forms of gp140. The resulting reduction in the qualitative heterogeneity of Env may be useful for the production of vaccines designed to mimic native trimers. Accordingly, the invention provides less heterogeneous envelope trimers for the production of virus like particles (VPLs) and pseudoparticles for use as VLP-based immunogens and vaccines. In accordance with the invention, gp120/gp41 trimers comprising the stabilizing N-terminal gp41 mutations of the invention, as well as gp120/gp41 trimers comprising other stabilizing mutations in gp120 and gp41 and the N-terminal gp41 mutations as described herein, can be used to generate VPLs and pseudovirions having reduced monomer, dimer and tetramer forms and enhanced trimer forms of gp120/gp41 Env. The N-terminal stabilizing mutations in the context of HIV-1 virus as described herein can serve to restrict VLP and pseudovirion immunogens to the expression of Env trimers and to yield trimer forms of Env (gp120/gp41) on VLP and pseudovirions to the virtual exclusion of monomer, dimer, tetramer, or aggregate forms, thus providing an immunogen and/or vaccine that more closely resembles native HIV envelope trimers.

[0216] The beneficial effect of the described amino acid changes in gp41 could potentially be countered if they were found to substantially compromise Env structure by creating a non-native configuration. Even though the sequence changes are in gp41, there is ample precedent that amino acid variation in this subunit may affect the conformation of gp120 and the overall topology and function of the entire Env complex. For example, mutations in or near the LZ region of gp41 are known to affect the binding and action of anti-gp120 antibodies, creating neutralization-resistant viruses. (Back et al., 1993; Klasse et al., 1993; Park et al., 2000; Park and Quinnan, 1999; Park et al., 1998; Thali et al., 1994). Other changes in gp41 affect the sensitivity of HIV-1 to small molecules that bind near the CD4 binding site on gp120 (Guo et al., 2003; Lin et al., 2003). Thus, this set of experimental details examines whether the five amino acid changes in gp41 according to the invention, which promote the stability of gp140 trimers, affect Env expression, antigenic structure, neutralization sensitivity and fusion function when made in the context of fusion-competent proteins from HIV-1 JR-FL. As described further herein, it was found that the altered Env proteins retained their function, albeit with a modest reduction in the rate of fusion.

[0217] It was also found that the five amino acid changes reduced the proportion of aberrant, non-trimeric Env forms present on the surfaces of virions. These non-functional Env proteins serve as targets for the binding of non-neutralizing antibodies, thereby complicating any analysis of the relationships between the antibody binding and virus neutralization. (Broder et al., 1994; Cavacini and Posner, 2004; Fouts et al., 1997 and 1998; Herrara et al., 2003; Moore et al., 1995 and 2006; Poignard et al., 2003; Sattentau and Moore, 1995; York et al., 2001). Accordingly, the binding of various non-neutralizing antibodies to virions and Env-expressing cells was reduced for the gp41 mutant compared with wild-type JR-FL, without adversely affecting the binding of neutralizing antibodies. The use of the form of Env gp41 containing the five mutations, as well as SOS and SOSIP mutations, may also simplify the analyses of antibody binding and neutralization.

Materials and Methods

Plasmids and DNA Mutagenesis:

[0218] The pCI plasmid was used to express full-length WT JR-FL (gp160) Env (JR-FL WT) (Herrera et al., 2005). The JR-FL gp41 NT 1-5 mutant was created by site-directed DNA mutagenesis; five amino acid substitutions (M535I, L543Q, N553S, Q567K and G588R) near the N-terminus (NT) of gp41 were made using the QuikChange.RTM. II XL site-directed mutagenesis kit (Stratagene Inc., CA) and the appropriate primers according to the manufacturer's instructions. The introduced mutations were verified by sequencing. The pcDNA3.1-Furin plasmid was used for expressing Furin (Binley et al., 2000).

Antibodies and Cell Lines:

[0219] Soluble CD4, CD4-IgG2 (PRO-542) (Allaway et al., 1995) and the anti-V3 (JR-FL) MAb PA1 (Trkola et al., 1996a) were provided by Dr. William Olson (Progenics Pharmaceuticals, Inc., NY). MAb CA13 (ARP3119) directed to a linear epitope in the gp120 C1 region, was provided by the EU Programme EVA Centralized Facility for AIDS Reagents, NIBSC, UK (AVIP Contract Number LSHP-CT-2004-503487). MAbs 2G12 (Buchacher et al., 1994; Scanlan et al., 2002; Trkola et al., 1996b), 2F5 (Buchacher et al., 1992; Muster et al., 1993) and 4E10 (Buchacher et al., 1992; Stiegler et al., 2001; Zwick et al., 2001) were obtained from Hermann Katinger, MAb 17b (Thali et al., 1993) was obtained from James Robinson and MAb b12 (Burton et al., 1991) was obtained from Dennis Burton. MAbs A32 (Moore et al., 1994; Wyatt et al., 1995) and 15e (Robinson et al., 1990) were obtained from the Neutralizing Antibody Consortium (NAC) repository. F425-B4e8 (Cavacini et al., 2003) was obtained through the NIH AIDS Research and Reference Reagent Program, Division of AIDS, NIAID, NIH, from Dr. Marshall Posner and Dr. Lisa Cavacini. The anti-V3 MAb 447-520 was also obtained from the NIH AIDS Research and Reference Reagent Program, Division of AIDS, NIAID, NIH, contributed by S. Zolla-Pazner (Gorny et al., 1992 and 1993). T-20 (Enfuvirtide), (Wild et al., 1994), was a gift from Roche Laboratories, Inc., NJ.

[0220] All cell lines were maintained at 37.degree. C. in an atmosphere containing 5% CO.sub.2. Human Embryonic Kidney (HEK) 293T cells were grown in Dulbecco's minimal essential medium (DMEM, GIBCO), supplemented to contain 10% fetal calf serum (FCS), 2 mM L-glutamine, antibiotics (100 U/ml penicillin, 100 .mu.g/ml streptomycin) and 0.5 mg/ml of the neomycin analog G-418. U87.CD4.CCR5 and U87.CD4.CXCR4 cells (provided by Dan Littman and available through the NIH AIDS Research and Reference Reagent Program, Division of AIDS, NIAID, NIH, Cat. No. 4035 and 4036, respectively) were cultured under conditions similar to those of the HEK 293T cells, but under selection by 0.3 mg/ml of G-418 and 0.5 mg/ml of puromycin.

Transfection Conditions for Pseudovirus Production and Purification:

[0221] To produce luciferase-expressing, Env-pseudotyped viruses, 1.times.10.sup.8 HEK 293T cells cultured in growth medium lacking antibiotics were co-transfected with plasmid DNA expressing gp160 Envs (WT or gp41 NT 1-5 mutant) and the pNL4-3Env(-)Luc(+) reporter plasmid (Connor et al., 1995 and 1996) using Polyethylenimine (PEI), (Polysciences, Inc., Warrington, Pa.), (Boussif et al., 1995; Kirschner et al., 2006). After sixteen hours, the cells were washed and the medium was replaced with DMEM containing 10% FCS, antibiotics and L-glutamine. Forty-eight hours post-transfection, the virion-containing supernatants were clarified by low speed centrifugation and were filtered through a 0.45-.mu.m membrane. The clarified, filtered supernatants were layered over a 20% sucrose cushion in phosphate buffered saline (PBS) and were centrifuged for 2 hours at 100,000.times.g. The viral pellet (also referred to as pseudovirions or pseudoviruses herein) was then resuspended in either PBS for biochemical analysis or DMEM for virus infectivity assays and neutralization assays.

[0222] For studying cell-surface expression of JR-FL Env, 5.times.10.sup.6 HEK 293T cells were transiently transfected with plasmid DNA essentially as described above. pcDNA3.1-Furin was used for Furin co-transfection at a Furin:Env plasmid DNA ratio of 2:1 (Binley et al., 2000). After 24 hours, the cells were washed, and fresh culture medium was added. Forty-eight hours post-transfection, the cell-surface expressed Envs were biotin labeled for polyacrylamide gel electrophoresis (PAGE), or were stained with anti-Env antibodies for FACS analysis as described below.

Biotinylation of Cell Surface-Expressed Env:

[0223] Forty-eight hours post-transfection, the cell surface-expressed envelope glycoproteins were biotinylated as described previously (Daniels and Amara, 1998) with minor modifications. Briefly, the Env-expressing cells were washed extensively with ice-cold PBS containing 1.2 mM CaCl.sub.2, 1 mM MgCl.sub.2 and were incubated with 0.5 mg/ml of EZ-link sulfo-NHS-SS-Biotin (Pierce Biotechnology, Rockford, Ill.) for 1 hour at 4.degree. C. The biotin reaction was quenched using 50 mM ammonium chloride. The cells were then washed extensively and lysed in a buffer containing 25 mM Tris-HCl (pH 8.0), 150 mM NaCl, 1% Triton X-100, 1 mM phenylmethylsulfonyl fluoride (PMSF) and 1.times. Protease Inhibitors cocktail (Roche Diagnostic GmbH, Mannheim, DE). The homogenates were centrifuged at 14,000.times.g for 15 minutes at 4.degree. C. A 50 .mu.l aliquot of the supernatants was removed to measure total protein levels and to analyze the Env content; the remaining supernatant was incubated with 50 .mu.l of NeutrAvidin agarose resin (Pierce Biotechnology, Rockford, Ill.) for 2 hours at 4.degree. C. to precipitate biotin-labeled proteins. The pellets were then washed three times in TSA buffer, one time in 20 mM Tris-HCl (pH 8.0), 500 mM NaCl, and finally in 50 mM Tris-HCl (pH 8.0). Bound proteins were resuspended in 50 .mu.l of 2.times.SDS-PAGE sample buffer, boiled and resolved on a reduced SDS-PAGE gradient (4-12%) Tris-glycine gel (Invitrogen, Carlsbad, Calif.). After transfer to a PVDF membrane, the samples were immunoblotted with anti-gp120 antibody ARP3119, with anti-CD47 antibody (Santa Cruz Biotechnology) at 0.31 .mu.g/ml, or with anti-GAPDH antibody (Meridian Life Science, Inc.) at 0.1 .mu.g/ml. GAPDH was used as a control for equal loading of proteins in the total cell lysate; cell-surface CD47 expression served a similar purpose for studying cell-surface-expressed Env.

Fluorescence-Activated Cell Sorting (FACS) Assay for MAb Binding, to Cell Surface Env:

[0224] CD4-IgG2 and MAbs were biotinylated using the EZ-link Sulfo-NHS-LC-Biotinylation kit (Pierce) according to the manufacturer's instructions. Env-expressing, transiently-transfected HEK 293T cells (0.5.times.10.sup.6 cells per analysis) were harvested, washed extensively with PBS and incubated with 10 .mu.g/ml of biotinylated CD4-IgG2 or MAbs in FACS buffer (PBS containing 5% FCS) for 1 hour at 4.degree. C. The cells were washed repeatedly in FACS buffer and then were incubated with 100 .mu.l of Streptavidin-phycoerthyrin (PE), (BD Biosciences), at a 1:250 dilution for 30 minutes at 4.degree. C. The stained cells were then washed, fixed using 2% paraformaldehyde and analyzed. Each binding assay was performed in triplicate. Mean Fluorescence Intensity (MFI) values were derived using the appropriate isotype-matched control MAb. The resulting background signal was subtracted from the experimental results and presented as Mean.+-.Standard Deviation.

BN-PAGE, SDS-PAGE and Western Blotting:

[0225] Env that was expressed on the surface of pseudovirions was analyzed under non-denaturing conditions by the use of BN-PAGE (Schulke et al., 2002), with modifications as described elsewhere (Moore et al., 2006; Schagger et al., 1994). To release Env glycoproteins from the pseudovirion surface, an equal volume of solubilization buffer (0.12% Triton X.RTM.-100 in 1 mM EDTA/1.5 M aminocaproic acid) and 5 .mu.l of a protease inhibitor cocktail (Sigma-Aldrich) were added, followed by the addition of an equal volume of double-strength sample buffer (100 mM morpholinepropanesulfonic acid (MOPS), 100 mM Tris-HCl, pH 7.7, 40% glycerol, 0.1% Commassie Blue). The extracts were then loaded onto a 4-12% Bis-Tris NuPAGE gel (Invitrogen) and electrophoresed at 4.degree. C. for 3 hours at 100 V. The cathode buffer was 50 mM MOPS/50 mM Tris, pH 7.7, containing 0.002% Coommassie Blue, and the same buffer without Coommassie Blue served as the anode buffer. The gel was then blotted onto a polyvinylidene difluoride (PVDF) membrane, which was then washed with 30% methanol/10% acetic acid, followed by 100% methanol, to remove excess Coommassie Blue dye. Thyroglobulin (669 kDa), Ferritin (440 kDa), Catalase (232 kDa), Lactate dehydrogenase (140 kDa) and BSA (66 kDa) were used as molecular weight markers (Amersham Biosciences, PA). Densitometric analyses were performed using ImageJ software (NIH).

[0226] SDS-PAGE analysis of denatured Env glycoproteins was performed as described previously (Schulke et al., 2002). The pseudovirions were lysed by boiling in Laemmli sample buffer (62.5 mM Tris-HCl, pH 6.8, 25% glycerol, 2% SDS, 0.1% bromophenol blue, 10% 2-mercaptoethanol) in the presence of 50 mM DTT, and then were fractionated using either a 4-12% or an 8-16% gel. Overnight blotting onto a PVDF membrane was performed as described previously (Schulke et al., 2002). The membrane was then destained, treated with blocking buffer (4% nonfat milk in PBS) for 30 minutes and probed using 0.5 .mu.g/ml of the anti-gp120 MAb CA13 or 20 .mu.g/ml each of the 4E10 and 2F5 antibodies (anti-gp41 MAb cocktail). The mouse MAb 39/6.14 (Abcam Inc., MA) was used to detect p24. Goat anti-human and/or mouse Fc and Fab'2 alkaline phosphatase conjugates (Jackson Labs) were used at a dilution of 1:3,000, as appropriate, to detect the primary MAbs, followed by the Western blot Chemiluminescence Reagent Plus System (Perkin-Elmer Life Sciences, MA). The MultiMark.RTM. multi-colored standard kit (Invitrogen, CA) was used as a molecular weight marker.

Assay for gp120 Dissociation from Pseudovirions:

[0227] Pseudovirions, in 200 .mu.l of PBS containing 0.5% BSA, were incubated with or without sCD4 for 2 hours at 4.degree. C. or 37.degree. C. (or without sCD4, but at various temperatures), and then were layered over a 1 ml cushion of 20% sucrose and ultracentrifuged at 100,000 g for 1.2 hours. The purified virions were then resuspended in 200 .mu.l of dilution buffer (TMSS: 2% milk powder, 20% sheep serum in Tris-Buffered Saline (TBS)) for analysis of their gp120 content by capture ELISA.

gp120 Capture ELISA:

[0228] The gp120 capture ELISA was carried out as previously described (Moore et al., 1992). Briefly, gp120 was captured onto microtiter plate wells by the absorbed sheep antibody D7324 to the C-terminus (Cliniqa Corp.) and detected using either polyclonal HIVIg or CD4-IgG2 (0.1 .mu.g/ml).

Virus Capture Assay:

[0229] The virus capture assay was performed as previously described (Poignard et al., 2003). Briefly, ELISA plates were coated with goat anti-mouse IgG (Fc-specific) antibody (Sigma-Aldrich, MO), blocked with 3% BSA in PBS, and then incubated with anti-gp120 MAbs at 10 .mu.g/ml in 100 .mu.l of PBS. After washing thoroughly to remove unbound MAbs, 100 .mu.l of medium containing pseudoviruses (0.5-1.5 ng of p24 antigen) was added for 4 hours at room temperature. After washing thoroughly, the captured pseudoviruses were lysed in 200 .mu.l of lysis buffer and their p24 content was determined using an HIV-1 p24 ELISA kit (ZeptoMetrix Corp., NY). Wells that lacked anti-gp120 antibody served as negative controls for background binding of the added pseudoviruses.

Infectivity and Neutralization Assays:

[0230] Pseudotyped virions bearing JR-FL envelope glycoproteins (WT and mutant) were produced by cotransfection of selected env clones with a luciferase-expressing reporter vector, pNL4-3Env(-)Luc(+), as described above. To measure infectivity, luciferase-expressing Env-pseudotyped viruses (50 .mu.l) containing normalized amounts of p24 antigen were added to 3.times.10.sup.3 U87.CD4.CCR5 cells/well. U87.CD4.CXCR4 cells were used as a negative control. After 4 days, th cells were lysed with 75 .mu.l of 1.times.Glo lysis buffer (Promega, CA) for subsequent quantification of luciferase, which was expressed by the Env-pseudotyped virions that contain the gene for firefly luciferase inserted into the nef gene of HIV-1, using the Bright-Glo.TM. Luciferase Assay Substrate (Promega, CA) and a VICTOR31420 multilabel counter (Perkin Elmer Life Sciences, MA).

[0231] Neutralization of infectivity was performed as described previously (Trkola et al., 1998). The pseudovirions were incubated with an equal volume of various MAbs, CD4-IgG2, or T-20 for 1 hour at 37.degree. C. before the residual infectivity was determined using U87.CD4.CCR5 cells as described above. The concentration of each inhibitor was expressed as the amount present after the inhibitor-virus mixture was added to the cells. The data were analyzed by non-linear regression (variable-slope sigmoidal dose response) to calculate the inhibitor concentrations that caused 50% reductions in luciferase expression (IC50), using the GraphPad Prism 4 software (maximal viral production in the absence of inhibitor was designated as 100%).

Cell-Cell Fusion Assay:

[0232] The .beta.-lactamase cell-cell fusion assay was performed in HeLa-CD4/CCR5 cells (RC49) as described previously. (Lineberger et al., 2002; Rucker et al., 1997).

Results

[0233] As described in Experimental Details I above, the five amino acids I535, Q543, S553, K567 and R588 located near the N-terminus of HIV-1 gp41 are associated with the formation and/or stability of soluble, trimeric gp140 proteins based on the subtype A strain KNH1144 of HIV-1. Moreover, introducing these residues into HIV-1 subtype B gp140 proteins with different amino acids at the same positions had a beneficial impact on trimer stability. Of the five residues, Q543, S553 and K567 had the greatest effect when introduced in combination, with I535 and R588 perhaps making an additional minor contribution.

[0234] An analysis of gp41 sequences in the Los Alamos HIV-1 Sequence Database shows that I535, Q543, S553 and K567, but not R588, were individually and collectively far more prevalent in subtype A viruses than in ones from subtype B. See Table 2. Their frequency in subtype C viruses was similar to subtype A, with the notable exception of K567, which was completely absent. Subtype D frequencies were similar, but not identical, to subtype B. Too few sequences from the individual subtypes F, G, H, J and K are available to warrant a similar analysis, but treating these "minor" subtypes en masse showed that their gp41 residue frequencies were more similar to subtype A than to subtype B. Overall, with respect to these five positions in gp41, subtype B viruses stand out as being different from the other subtypes. (See Table 2 for statistical significance).

TABLE-US-00002 TABLE 2 No. of All 5 Subtypes Isolates Amino & CRFs (n) I535 Q543 S553 K567 R588 Acids A 78 72 94 90 81 33 22% B 200 26 59 10 0.5 35 0% C 201 87 95 90 0 12 0% D 54 4 89 4 0 22 0% F-H, J, K 35 91 97 69 86 6 3% P values (subtype B vs. rest): <<0.001 <<0.001 <<0.001 <<0.001 <<0.001 <0.001

[0235] Table 2 lists the prevalence (expressed as a percentage) of the five trimer-promoting acids (I535, Q543, S553, K567 and R588), singly or in combination, in gp41 sequences from viruses of subtypes A, B, C, D and from subtypes F+G+H+J+K treated collectively (too few sequences from subtypes F, G, H, J and K were available to warrant a separate analysis). Note that Env sequences from what was formerly called subtype E are included within subtype A as the "subtype E" any gene is actually from subtype A. The comparatively high collective prevalence of the five amino acids in subtype A sequences is highlighted in gray. The statistical significance (P value) of the prevalence of the five amino acids in subtype B viruses, singly or in combination, relative to the rest of the subtypes is calculated using Fisher's Exact Test.

Effect of gp41 Substitutions of the Quantity and Quality of Env Incorporated into Pseudovirus:

[0236] In view of the potential general utility of the gp41 mutation strategy in accordance with this invention for making stabilized gp140 trimers, the effects of the five amino acid substitutions on Env function when these mutations were introduced into full-length, virion-associated envelope glycoproteins were assessed. The subtype 8 isolate HIV-1 JR-FL was used for infectivity and related studies, since the effect of the five amino acid substitutions on gp140 trimer formation had been studied using this isolate, as described in Experimental Details I.

[0237] The full-length, wild type (WT) JR-FL gp160 env gene (JR-FL WT) was mutated at the same five positions to generate the JR-FL gp41 NT 1-5 mutant. Env-pseudotyped virions (pseudovirions) based on the WT and mutant sequences were generated by co-transfecting HEK 293T cell with each individual full-length Env-encoding plasmid and the pNL4-3.Luc.R-E- reporter plasmid (Connor et al., 1996). Pseudovirions from the two virus preparations were pelleted by ultracentrifugation onto a 20% sucrose cushion and were found to contain similar amounts of the p24 antigen (107 pg/ml for WT; 112 pg/ml for the NT 1-5 mutant).

[0238] The gp120 and gp41 content of the WT and the NT 1-5 mutant viruses were then determined by SDS-PAGE and Western blotting, followed by densitometric analysis using ImageJ software (FIG. 7). The normalized gp120:p24 ratios for the WT and mutant viruses were 1 and 0.28, respectively. The corresponding gp41:p24 ratios were 1 and 0.4, respectively. Thus, on average, the mutant pseudovirions contained .about.3.5-fold less gp120 and 2.5-fold less gp41 than the WT viruses per unit of particulate p24 antigen. Although similar values were obtained in four replicate experiments, the imprecision of this type of analysis makes it hard to judge whether the modestly greater reduction in gp120 content, compared with gp41, for the mutant viruses is genuine. Additional analyses of gp120-shedding from the WT and mutant viruses suggest that the difference may not be real (see below). Overall, it can be concluded that the mutant viruses incorporate and retain-30-40% of the total Env content of the WT viruses.

[0239] The Env content of the purified pseudovirions was studied in more detail by using BN-PAGE to assess the presence of dimeric, trimeric and tetrameric Env forms (FIG. 8A). Consistent with the gel analysis under denaturing conditions, the total Env content of the mutants was .about.2.5-fold lower than that of the WT viruses. However, a densitometric analysis showed that this reduction was entirely attributable to a decrease in the amounts of Env tetramers and dimers that were present on the mutant particles (no monomers were visible in either of the preparations); the trimer contents of the two sets of virions were identical (FIGS. 8A and 8B). Overall, tetramers comprised 58%; trimers comprised 34%, and dimers comprised 8% of the total Env content of the WT viruses, while the corresponding values, respectively, were 35%, 62% and 3% for the mutant viruses. Thus, it can be concluded that the majority of the Env proteins incorporated into JR-FL WT pseudovirions produced on 293T cells are non-trimeric and that the presence of these aberrant Env forms on the viral surface can be significantly reduced (P<0.05, Mann-Whitney U Test, one-tailed) by substituting the five amino acids in the N-terminal region of gp41.

Effect of gp41 Substitutions on Soluble CD4- and Temperature-Induced gp120 Shedding from Pseudovirions:

[0240] As noted above, there appeared to be a slight reduction in the gp120 content of the mutant pseudoviruses relative to gp41 that could be attributable to an increase in the shedding of gp120 from functional or non-functional spikes. This scenario is plausible, because changes in residues 528 to 562 have been associated with a destabilization of the gp120-gp41 interaction (Cao et al., 1993). To determine whether the WT and mutant pseudoviruses behaved differently in this respect, soluble CD4 (sCD4) and mild heat were used as inducers of gp120 shedding. To provide an external frame of reference, the HxB2 isolate was also studied under the same conditions, as this type of T-cell line-adapted virus is particularly prone to shedding its gp120 content. (Moore et al., 1992).

[0241] As a result of this analysis, it was found that neither the WT nor the mutant JR-FL pseudovirus shed gp120 significantly in response to sCD4 concentrations up to 10 .mu.g/ml, at either 4.degree. C. or 37.degree. C. In contrast, HxB2 Env-bearing pseudoviruses lost over half of their gp120 content following treatment with 10 .mu.g/ml of sCD4 at 37.degree. C. (FIG. 9A). Moreover, the two versions of JR-FL pseudoviruses (WT and mutant) behaved similarly in their reactions to elevated temperatures; both lost significant amounts of gp120 only at 50.degree. C. and 65.degree. C., but even lesser amounts than did the HxB2 reference virus (FIG. 9B). Thus, the five amino acid substitutions in the N-terminal region of gp41 in JR-FL do not detectably destabilize the interactions between gp120-gp41.

Effect of gp41 Substitutions on Pseudovirion Infectivity:

[0242] Specific substitutions in the highly conserved N-terminal heptad repeat region can impair the function of gp41 and the infectivity of mutant viruses (Cao et al., 1993; Chen et al., 1993; Weng and Weiss, 1998). Therefore, the infectivity of the WT and mutant JR-FL Env-pseudotyped viruses (normalized for p24 content) was assessed in a single cycle assay using U87.CD4.CCR5 target cells and a luciferase reporter gene read-out four days post-infection.

[0243] The two viruses were found to be equally infectious (FIG. 10). Thus, the five amino acid changes in the gp41 N-terminal region have no impact on JR-FL Env-pseudovirus infectivity. The modestly reduced total Env content of the mutant pseudoviruses is therefore not relevant to the infection process, presumably because the trimer content remains unchanged.

Effect of gp41 Substitutions on the Efficiency and Rate of Env-Mediated Fusion:

[0244] Despite the results from the above infectivity experiment, it still could have been possible that the five amino acid substitutions in the NT of gp41 could affect Env function, as assessed in a more direct fusion assay. Some amino acid changes in the gp41 HR1 region can impair fusion efficiency and slow the kinetics of fusion. (Reeves et al., 2005). Therefore, a study of the kinetics of fusion mediated by the WT and mutant Env glycoproteins was undertaken using a cell-cell fusion assay. To this end, HeLa-CD4/CCR5 target cells were loaded with the fluorescent dye CCF2-AM and mixed with effector cells expressing Env glycoproteins and .beta.-lactamase. Cell-cell fusion that leads to cytoplasm mixing allows cleavage of CCF2 by .beta.-lactamase, which causes a change in fluorescence that can be accurately quantified. Using this assay, it was observed that membrane fusion mediated by the mutant Env glycoprotein occurred slightly, but detectably, more slowly than did fusion mediated by the WT Env glycoprotein (FIG. 11A and Table 3). From these results, it could be seen that the changes in gp41 NT do have a modest adverse effect on the fusion function of Env, but not to a degree that impairs pseudovirus infectivity.

TABLE-US-00003 TABLE 3 Kinetic parameters of cell-cell fusion mediated by the WT and NT mutant JR-FL Env-pseudotyped virions Envelope Y.sub.max (% WT)* t1/2 (min)** B*** JR-FL WT 97.8 .+-. 2.9 48 .+-. 1 12 .+-. 1 JR-FL gp41 NT 67.6 .+-. 5.8 59 .+-. 3 14.1 .+-. 2.1 1-5 The kinetic parameters were derived using a .beta.-lactamase reporter assay (See, FIGS. 11A-B). Data derived from three independent experiments were fitted to the equation Y = Y.sub.max/{1 + exp[-(t - t1/2)/b]}. The coefficients extracted from these curves .+-. standard errors of the mean are shown. *Fusion expressed as percentage of the maximal fusion mediated by the WT JR-FL Env. **Time to half-maximal fusion (in minutes). ***Exponential rate constant.

Binding of Neutralizing and Non-Neutralizing Antibodies to the WT and gp41 Mutant Env Glycoproteins on Pseudoviruses, and Correlations with Infection-Inhibition:

[0245] Measurements of antibody binding to Env glycoproteins on the surface of virions and Env-expressing cells are compromised by the presence of non-functional forms of Env intermingled with functional, native trimers (Moore at al., 2006; Poignard et al., 2003). This heterogeneity of Env binding sites for antibodies renders it impossible to be sure that a binding event involves a functional spike, and, since only functional spikes are relevant to infectivity neutralization, it has been difficult to draw meaningful conclusions between binding and neutralization events. (Cavacini et al., 1999; Fouts et al., 1997 and 1998; Moore et al., 2006; Nyambi at al., 1998; Poignard at al., 2003). The observation that the five amino acid substitutions in the N-terminus of gp41 reduce the abundance of non-native Env forms (e.g., dimers and tetramers) present on virions, without affecting trimeric forms, led to conducting experiments to determine how various antibodies reacted with the mutant Env proteins. Another reason to conduct such experiments was to determine whether the gp41 substitutions could affect the antigenic structure of gp120, in view of reports that other changes in gp41 can have such an effect. (Back et. al., 1993; Klasse et al.; 1993; Park and Quinnan, 1999; Park et al., 1998; Thali et al., 1994).

[0246] To measure Mab-Env interactions, a widely-used virion-binding assay was first employed. This type of assay typically generates results that are highly misleading for judging neutralization mechanisms. (Moore et al., 2006; Poignard et al., 2003). Upon completion of the assay, it was found that there was no difference between the WT and the mutant JR-FL Env-pseudotyped virions in the extent to which they bound the neutralizing MAbs b12, 2G12, 2F5 and 4E10, or the CD4-IgG2 protein (FIG. 12A). However, three non-neutralizing MAbs, i.e., b6 and 15e directed to the CD4BS on gp120, and MAb PA1 directed to the V3 region, captured significantly fewer mutant pseudovirions than WT, compared with the neutralizing anti-V3 MAb F425-B4e8 (P.ltoreq.0.05 for each, Mann-Whitney U test, one-tailed), (FIGS. 12A and 12B). A modest but not significant decrease in capture of the mutant pseudovirions was also observed with another non-neutralizing, anti-V3 MAb, 447-52D (FIG. 12B). An additional non-neutralizing Mab, A32, directed to the C1-C4 region of gp120, bound minimally, but comparably, to both pseudovirion preparations (FIG. 12B). The non-neutralizing MAb, 17b, captured both pseudovirion preparations weakly in the absence of sCD4, but its binding was increased when sCD4 was also present, which is consistent with the known ability of CD4 to induce the exposure of the 17b epitope on gp120 (Thali et al., 1993). It is noted that the sCD4-induction of the 17b epitope was significantly greater (P=0.04 Mann-Whitney U test, one-tailed) for the mutant pseudovirions than for the WT pseudovirions (FIG. 12A).

[0247] For comparison with the Env-binding data and to further assess whether the changes to gp41 affected the native structure of the Env complex, experiments were conducted to measure the sensitivities of the WT and mutant JR-FL Env-pseudovirions to inhibition by MAbs (and to the CD4-IgG2 protein). U87.CD4.CCR5 cells served as targets for infection. The four broadly neutralizing MAbs (b12, 2G12, 2F5 and 4E10) and CD4-IgG2 all inhibited infection of the two pseudoviruses with comparable potencies, as did the V3 MAb F425-B4e8 (Table 4). By contrast, five MAbs, b6, 15e, A32, PA1 and 17b, that lack neutralizing activity against HIV-1 JR-FL failed to inhibit infection by either pseudovirus. (Table 4). The V3 MAb 447-52D was unable to be tested for neutralization, as it was not available in sufficient quantities; however, this MAb has been reported to lack strong activity against JR-FL Env pseudotyped viruses in a similar assay (Binley et al., 2004).

[0248] It was observed that in the presence of sCD4, MAb 17b neutralized the mutant Env-pseudotyped virions .about.2-fold more efficiently than the WT virus (IC50 values of 15 and 26 .mu.g/ml, respectively), (Table 4). This finding is consistent with the modestly increased binding of 17b to the mutant pseudoviruses in the presence of sCD4 (FIG. 12A) and suggests that the gp41 substitutions do have a detectable impact on either the conformation of the gp120 component of the native Env complex, or on the way in which that complex changes its configuration after sCD4 binding. The effect of the gp41 NT 1-5 amino acid changes must be modest, however, as there was no difference in the neutralization of the two pseudovirus preparations by any of the other test MAbs. (Table 4)

[0249] Sensitivity to T-20 was also studied in the same assay system, because the five amino acid changes are located close to the gp41 HR1 region, which is associated with T-20 resistance (Carmona at al., 2005; Greenberg and Cammack, 2004). The IC50 for T-20 against the WT Env-pseudotyped virus was 2-fold greater than against the mutant, suggesting that one or more of the five amino acid substitutions does modestly affect the binding or antiviral activity of the T-20 peptide against the gp41 NT 1-5 mutant. (Table 4).

TABLE-US-00004 TABLE 4 Neutralization of pseudovirus activity by MAbs, CD4-IgG2 and T-20 Reagent JR-FL WT JR-FL gp41 NT 1-5 CD4-IgG2 0.82 0.74 b12 0.15 0.17 2G12 1.6 1.7 b6 >50 >50 15e >50 >50 A32 >50 >50 PA1 >50 >50 F425-B4e8 2.94 2.64 447-52D ND* ND* 17b >50 >50 17b + sCD4 25.9 15.2 2F5 6.2 5.9 4E10 13.4 14.1 T-20 24.2 11.5

[0250] The numbers recorded in Table 4 are mean IC50 values in .mu.g/ml for the reagents indicated (in nM for T-20). IC50 values that differ between the WT and mutant viruses are highlighted in bold.

[0251] *: ND=not done. Not enough of this reagent was able to be procurred for use in neutralization assays. However, MAb 447-52D has been reported to lack potent neutralization activity (IC50=32.6 .mu.g/ml) against JR-FL Env-pseudotyped viruses in an assay of comparable design. (Binley et al., 2004).

Binding of Neutralizing and Non-Neutralizing Antibodies to Cells Expressing the WT and gp41 Mutant Env Glycoproteins:

[0252] Various studies have shown that structural forms of Env and antibody-binding profiles are different on transfected cells than on infectious virions, probably because over-expression affects Env processing pathways. (Herrera at al., 2003 and 2005; Sattentau and Moore, 1995; Si et al., 2001; York at al., 2001). To address this in view of the WT and gp41 NT 1-5 Env mutant viruses, full-length WT and gp41 mutant Envs were expressed in HEK 293T cells and the binding of various neutralizing and non-neutralizing antibodies was investigated. As seen with the pseudovirions, WT Env was expressed at -3-fold higher levels than the gp41 mutant Env, both within the transfected cells and on the cell surface (FIG. 13A).

[0253] Flow cytometry was used to analyze antibody binding to the surface of Env-expressing cells (FIG. 13B). Each Mab was tested at a concentration of 10 .mu.g/ml. In contrast to what was observed with the pseudovirions, few, if any, differences were detected between the WT and gp41 mutant Envs. A slight decrease in PA1 binding to the mutant Env was marginally significant (P=0.05, Mann-Whitney U test, one-tailed). The differences between what was observed with the cell surface binding assay compared with the pseudovirion capture assay may reflect the greater diversity of Env forms that are present on cells. (Herrera et al., 2003 and 2005; Pancera and Wyatt, 2005).

Discussion:

[0254] When the HIV-1 envelope glycoproteins are expressed as recombinant proteins for use as vaccine antigens, for structural studies, or for analysis of neutralization mechanisms, their structural heterogeneity creates problems. Thus, preparations of soluble gp140 proteins can, and often do, contain monomers, dimers, trimers, tetramers and aggregates (Center et al., 2004; Earl et al., 1994; Schulke et al., 2002; Staropoli et al., 2000), and multiple forms of membrane-bound Env are present on pseudovirions and on Env-expressing cells. (Herrera et al., 2005; Moore et al., 2006; Poignard et al., 2003). The degree to which these problems arise from the over-expression of Env, or from the use of non-lymphoid cells, is hard to determine; the Env content of naturally-produced viruses may be less diverse than what arises in transfection-based systems. Nonetheless, transfection systems are widely used experimentally, and the practical production of Env vaccine candidates usually requires the use of non-lymphoid cells. The development of ways to reduce the extent of Env heterogeneity prior to purification of trimers is therefore useful, since it is generally assumed that trimers best mimic the native, virion-associated form of Env.

[0255] In accordance with the present invention, the identity of selected residues near the N-terminus of gp41 provides one of the genetic influences on the formation of aberrant forms of Env. The residues associated with increased trimer formation/stabilization are much rarer in HIV-1 subtype B viruses compared with those from other subtypes, particularly HIV-1 subtype A, for reasons that are not completely understood. However, when the relevant residues are inserted into subtype B viruses, they increase the formation and/or stability of trimers. Considering that most vaccine-related studies with soluble gp140 proteins have been carried out using subtype B sequences as templates, it seems possible that the commonly observed Env instability might not be as pronounced with proteins from other subtypes as it is with subtype B. (Jeffs et al., 2004).

[0256] The effect of the N-terminal gp41 residues has been studied with the prototypic subtype B primary isolate JR-FL, initially in the context of soluble gp140 proteins and now with full-length gp160 proteins that are the basis of infectious Env-pseudoviruses. These studies were performed to determine whether the gp41 NT substitutions adversely affect the overall structure of Env, which could be problematic for vaccine production and to learn whether the reduction in the formation of aberrant forms of Env could beneficially influence previously problematic analyses of the relationships between antibody binding to Env and the neutralization of virus infectivity.

[0257] The introduction of the five amino acid changes into full-length gp160 proteins reduced Env expression overall by .about.2-4 fold in different assays, thus implying that they have a modest affect on Env production or degradation. However, all of that reduction was accounted for by the presence of lesser amounts of Env dimers and tetramers; the trimer content was unchanged. Without wishing to be limited by theory, the most likely explanation of this effect is that the trimers are more stable and do not dissociate as readily into dimers and monomers (the tetramers are probably dimers of dimers).

[0258] The gp41 amino acid changes in the N-terminal region had no effect on pseudovirion infectivity, and they did not cause any destabilization of the gp120-gp41 linkage. Thus, they seem to be benign from the perspective of the overall topology of the Env complex. Their lack of effect on the overall structure of functional, native Env complexes is further shown by the similar binding of various neutralizing MAbs to both the WT and mutant pseudovirions and their identical neutralization sensitivities. An exception was the slightly greater sensitivity of the mutant viruses to the CD41 MAb 17b in the presence of sCD4, which was associated with a comparable increase in 17b binding in a pseudovirion-capture assay in the presence of sCD4. Presumably, the gp41 NT substitutions do have a modest impact on the exposure of the CD41 epitope post CD4 binding. Broadly similar effects of selected gp41 sequence changes on gp120 topology have been described. (Back et al., 1993; Klasse et al., 1993; Reitz et al., 1988; Thali at al., 1994).

[0259] The gp41 changes did have a modest effect on fusion kinetics in a cell-cell fusion assay, but seemingly not enough to affect infectivity when the same Env proteins were present on pseudovirions. Mutational studies of the gp41 HR1 segment have shown that mutations in the "a" and "d" positions, particularly helix-disrupting mutations, impair fusion. (Cao et al., 1993; Chen et al., 1993; Chen, 1994; Dubay at al., 1992). The S553, K567 and R588 residues that were identified are not helix disrupting and occupy the "b" position on the coiled-coil helix, which may explain why their adverse effect is so modest. The NT substitutions also had a slight affect on T-20 sensitivity, decreasing the IC50 by 2-fold. Although the substitutions lie outside the .sub.547GIV.sub.549 `hot spot` associated with T-20 reactivity (Rimsky et al., 1998), natural polymorphisms at position 553 (e.g., N553S) most commonly observed in non-subtype B isolates, have been associated with increased susceptibility to T-20. (Carmona et al., 2005; Whitcomb et al., 2003).

[0260] Studies of antibody binding to pseudovirions showed that several non-neutralizing MAbs (i.e., b6, 15e, F425-4-e8 and PA1) directed to the CD4BS and V3 epitopes bound to the mutant Env significantly (2-3 fold) less than to the WT Env. The differential binding was not seen, however, with two other MAbs that also lack neutralizing capacity against JR-FL, namely, 447-52D and A32. The decreased binding of the non-neutralizing MAbs to pseudoviruses bearing the mutant Env could be due to the lesser content of non-trimeric (i.e., non-functional) Env forms present. This explanation would be consistent with the findings of Poignard et al., 2003.

[0261] It was notable that when the pseudovirion binding (FIGS. 12A and 12B) and neutralization data (Table 4) were compared, there was no clear relationship between the ability of a MAb to capture pseudoviruses and impair their infectivity, as reported previously. (Moore et al., 2006; Poignard et al., 2003). Thus, some neutralizing MAbs captured pseudoviruses strongly, e.g., b12, 2G12, F425-B4e8, but others did not, e.g., 2F5, 4E10. Additionally, all three V3 MAbs, e.g., PA1, 447-52D and F425-84e8, captured pseudoviruses efficiently, but only F425-B4e8 was neutralizing. Nonetheless, the introduction of the five NT substitutions in gp41 that reduce (but that do not eliminate) the presence of non-native forms of Env on pseudoviruses did improve the performance of the virion capture assay. For example, the difference between the closely related neutralizing and non-neutralizing CD4BS MAbs b12 and b6 was substantially increased in the virion capture assay when the gp41 NT 1-5 Env mutant pseudoviruses were used (FIG. 12A).

[0262] Binding assays involving Env expressed on the surface of transfected cells were less informative, probably because of the multiplicity of Env configurations present on the cells. This further reinforces the problems associated with these types of assays when they are used to assess the relationship between antibody binding and virus neutralization (Gorse et al., 1999; Herrera et al., 2003; Sattenau and Moore, 1995; Si et al., 2001; York et al., 2001).

[0263] Thus, from the results presented in Experimental Details II, it can be concluded that the five amino acid changes in gp41 have a generally beneficial effect on the overall configuration of the subtype B gp140 and gp160 Env proteins (as well as on those of subtype A Env proteins as described in Experimental Details I) by reducing the presence of non-native Env forms, i.e., non-trimer forms, without compromising the function of trimers. Accordingly, the present invention encompasses a facilitation of the production of Env trimers for HIV vaccine development and production and for HIV structural and immunogenicity studies.

REFERENCES FOR EXPERIMENTAL DETAILS I AND II

[0264] Abacioglu, Y. H., Fouts, T. R., Laman, J. D., Claassen, E., Pincus, S. H., Moore, J. P., Roby, C. A., Kamin-Lewis, R., and Lewis, G. K. (1994). Epitope mapping and topology of baculovirus-expressed HIV-1 gp160 determined with a panel of murine monoclonal antibodies. AIDS Res Hum Retroviruses 10(4), 371-81. [0265] Allaway, G. P., Davis-Bruno, K. L., Beaudry, G. A., Garcia, E. B., Wong, E. L., Ryder, A. M., Hasel, K. W., Gauduin, M. C., Soup, R. A., McDougal, J. S., and et al. (1995). Expression and characterization of CD4-IgG2, a novel heterotetramer that neutralizes primary HIV type 1 isolates. AIDS Res HumRetroviruses 11(5), 533-9. [0266] Back, N. K., Smit, L., Schutten, M., Nara, P. L., Tersmette, M. and Goudsmit, J. (1993). Mutations in human immunodeficiency virus type 1 gp41 affect sensitivity to neutralization by gp120 antibodies. J. Virol. 67(11), 6897-902. [0267] Beddows, S., Kirschner, M., Campbell-Gardener, L., Franti, M., Dey, A. K., Iyer, S. P., Maddon, P. J., Paluch, M., Master, A., Overbaugh, J., Vancott, T., Olson, W. C., and Moore, J. P. (2006). Construction and Characterization of Soluble, Cleaved, and Stabilized Trimeric Env Proteins Based on HIV Type 1 Env Subtype A. AIDS Res Hum Retroviruses 22(6), 569-79. [0268] Berger, E. A., Murphy, P. M., and Farber, J. M. (1999). Chemokine receptors as HIV-1 coreceptors: roles in viral entry, tropism, and disease. Annu Rev Immunol 17, 657-700. [0269] Binley, J. M., Sanders, R. W., Clas, B., Schuelke, N., Master, A., Guo, Y., Kajumo, F., Anselma, D. J., Maddon, P. J., Olson, W. C., and Moore, J. P. (2000). A recombinant human immunodeficiency virus type 1 envelope glycoprotein complex stabilized by an intermolecular disulfide bond between the gp120 and gp41 subunits is an antigenic mimic of the trimeric virion-associated structure. J Virol 74(2), 627-43. [0270] Binley, J. M., Sanders, R. W., Master, A., Cayanan, C. S., Wiley, C. L., Schiffner, L., Travis, B., Kuhmann, S., Burton, D. R., Hu, S. L., Olson, W. C., and Moore, J. P. (2002). Enhancing the proteolytic maturation of human immunodeficiency virus type 1 envelope glycoproteins. J Virol 76(6), 2606-16. [0271] Binley, J. M., Wrin, T., Korger, B., Zwick, M. B., Wang, M., Chappey, C., Stiegler, G., Kunert, R., Zolla-Pazner, S., Katinger, H., Petropoulos, C. J. and Burton, D. R. (2004). Comprehensive cross-clade neutralization analysis of a panel of anti-human immunodeficiency virus type 1 monoclonal antibodies. J Virol 78(23), 13232-52. [0272] Blacklow, S. C., Lu, M., and Kim, P. S. (1995). A trimeric subdomain of the simian immunodeficiency virus envelope glycoprotein. Biochemistry 34(46), 14955-62. [0273] Boussif, O., Lezoualc'h, F., Zanta, M. A., Mergny, M. D., Scherman, D., Demeneix, B., and Behr, J. P. (1995). A versatile vector for gene and oligonucleotide transfer into cells in culture and in vivo: polyethylenimine. Proc Natl Acad Sci USA 92(16), 7297-301. [0274] Broder, C. C., Earl, P. L., Long, D., Abedon, S. T., Moss, B. and Domms, R. W. (1994). Antigenic implications of human immunodeficiency virus type 1 envelope quaternary structure: oligomer-specific and -sensitive monoclonal antibodies. Proc. Natl. Acad. Sci. USA. 91(24), 11699-703. [0275] Buchacher, A., Predl, R., Strutzenberger, K., Steinfellner, W., Trkola, A., Purtscher, M., Gruber, G., Tauer, C., Steindl, F., Jungbauer, A. and Katinger, H. (1994). Generation of human monoclonal antibodies against HIV-1 proteins; electrofusion and Epstein-Barr virus transformation for peripheral blood lymphocyte immortalization. AIDS Res. Hum. Retroviruses. 10, 359-369. [0276] Buchacher, A., Predl, R., Tauer, C., Purtscher, M., Gruber, G., Heider, R., Steindl, F., Trkola, A., Jungbauer, A. and Katinger, H. (1992). Human monoclonal antibodies against gp41 and gp120 as potential agents for passive immunization. Vaccines. 92, 191-195. [0277] Burton, D. R., Barbas, C. F., 3.sup.rd, Persson, M. A., Koenig, S., Chanock, R. M. and Lerner, R. A. (1991). A large array of human monoclonal antibodies to type 1 human immunodeficiency virus from combinatorial libraries of asymptomatic seropositive individuals. Proc. Natl. Acad. Sci. USA. 88(22), 10134-7. [0278] Burton, D. R., Pyati, J., Koduri, R., Sharp, S. J., Thornton, G. B., Parren, P. W., Sawyer, L. S., Hendry, R. M., Dunlop, N., Nara, P. L., and at al. (1994).Efficient neutralization of primary isolates of HIV-1 by a recombinant human monoclonal antibody. Science 266(5187), 1024-7. [0279] Calarese, D. A., Scanlan, C. N., Zwick, M. B., Deechongkit, S., Mimura, Y., Kunert, R., Zhu, P., Wormald, M. R., Stanfield, R. L., Roux, K. H., Kelly, J. W., Rudd, P. M., Dwek, R. A., Katinger, H., Burton, D. R., and Wilson, I. A. (2003). Antibody domain exchange is an immunological solution to carbohydrate cluster recognition. Science 300(5628), 2065-71. [0280] Cao, J., Bergerson, L., Helseth, E., Thali, M., Repke, H and Sodroski, J. (1993). Effects of amino acid changes in the extracellular domain of the human immunodeficiency virus type 1 gp41 envelope glycoprotein. J. Virol. 67(5), 2747-55. [0281] Cardoso, R. M., Zwick, M. B., Stanfield, R. L., Kunert, R., Binley, J. M., Katinger, H., Burton, D. R., and Wilson, I. A. (2005). Broadly neutralizing anti-HIV antibody 4E10 recognizes a helical conformation of a highly conserved fusion-associated motif in gp41. Immunity 22(2), 163-73. [0282] Carmona, R. et al. (2005). Natural resistance-associated mutations to Enfuvirtide (T20) and polymorphisms in the gp41 region of different HIV-1 genetic forms from T-20 naive patients. J. Clin. Virol. 32(3), 248-53. [0283] Cavacini, L. at al. (2003). Conformational changes in env oligomer induced by an antibody dependent on the V3 loop base. AIDS. 17(5), 685-9. [0284] Cavacini, L. and Posner, M. (2004). Native HIV type 1 virion surface structures: relationships between antibody binding and neutralization or lessons from the viral capture assay. AIDS Res. Hum. Retroviruses. 20(4), 435-41. [0285] Cavacini, L. et al., (1999). Minimal incidence of serum antibodies reactive with intact primary isolate virions in human immunodeficiency virus type 1-infected individuals. J. Virol. 73(11), 9638-41. [0286] Center, R. J., Kemp, B. E., and Poumbourios, P. (1997). Human immunodeficiency virus type 1 and 2 envelope glycoproteins oligomerize through conserved sequences. J Virol 71(7), 5706-11. [0287] Center, R. J., Leapman, R. D., Lebowitz, J., Arthur, L. O., Earl, P. L., and Moss, B. (2002). Oligomeric structure of the human immunodeficiency virus type 1 envelope protein on the virion surface. J Virol 76(15), 7863-7. [0288] Center, R. J., Lebowitz, J., Leapman, R. D., and Moss, B. (2004). Promoting trimerization of soluble human immunodeficiency virus type 1 (HIV-1) Env through the use of HIV-1/simian immunodeficiency virus chimeras. J Virol 78(5), 2265-76. [0289] Center, R. J., Schuck, P., Leapman, R. D., Arthur, L. O., Earl, P. L., Moss, B., and Lebowitz, J. (2001). Oligomeric structure of virion-associated and soluble forms of the simian immunodeficiency virus envelope protein in the prefusion activated conformation. Proc Natl Acad Sci USA 98(26), 14877-82. [0290] Chakrabarti, B. K., Kong, W. P., Wu, B. Y., Yang, Z. Y., Friborg, J., Ling, X., King, S. R., Montefiori, D. C., and Nabel, G. J. (2002). Modifications of the human immunodeficiency virus envelope glycoprotein enhance immunogenicity for genetic immunization. J Virol 76(11), 5357-68. [0291] Chan, D. C., Fess, D., Berger, J. M., and Kim, P. S. (1997). Core structure of gp41 from the HIV envelope glycoprotein. Cell 89(2), 263-73. [0292] Chan, D.C., and Kim, P. S. (1998). HIV entry and its inhibition. Cell 93(5), 681-4. [0293] Chen, S. S. (1994). Functional role of the zipper motif region of human immunodeficiency virus type 1 transmembrane protein gp41. J. Virol. 68(3), 2002-10. [0294] Chen, S. S. et al. (1993). Mutational analysis of the leucine zipper-like motif of the human immunodeficiency virus type 1 envelope transmembrane glycoprotein. J. Virol. 67(6), 3615-9. [0295] Connor, R. I. et al. (1995). Vpr is required for efficient replication of human immunodeficiency virus type 1 in human mononuclear phagocytes. Virology. 206(2), 935-44. [0296] Connor, R. I. et al. (1996). Characterization of the functional properties of env genes from long-term survivors of human immunodeficiency virus type 1 infection. J. Virol. 70(8), 5306-11. [0297] Daniels, G. M. and Amara, S. G. (1998). Selective labeling of neurotransmitter transporters at the cell surface. Methods Enzymol. 296, 307-18. [0298] Doms, R. W., and Moore, J. P. (2000). HIV-1 membrane fusion: targets of opportunity. J Cell Biol 151(2), F9-14. [0299] Dubay, J. W. et al. (1992). Mutations in the leucine zipper of the human immunodeficiency virus type 1 transmembrane glycoprotein affect fusion and infectivity. J. Virol. 66(8), 4748-56. [0300] Earl, P. L. et al. (1994). Native oligomeric human immunodeficiency virus type 1 envelope glycoprotein elicits diverse monoclonal antibody reactivities. J. Virol. 68(5), 3015-26. [0301] Earl, P. L., moss, B., and Doms, R. W. (1991). Folding, interaction with GRP78-BiP, assembly, and transport of the human immunodeficiency virus type 1 envelope protein. J Virol 65(4), 2047-55. [0302] Fields, B. N., Knipe, D. M., Hawley, P. M., Chanock, R. M., Melinick, J. L., Monath, T. P., Roizman, B., and Straus, S. E. (1996). "Fields Virology." (P. A. Luciw, Ed.) Lippincott-Raven Publishers, Philadelphia. [0303] Fouts, T. R. et al. (1997). Neutralization of the human immunodeficiency virus type 1 primary isolate JR-FL by human monoclonal antibodies correlates with antibody binding to the oligomeric form of the envelope glycoprotein complex. J. Virol. 71(4), 2779-85. [0304] Fouts, T. R. et al. (1998). Interactions of polyclonal and monoclonal anti-glycoprotein gp120 antibodies with oligomeric glycoprotein gp120-glycoprotein 41 complexes of a primary HIV type 1 isolate: relationship to neutralization. AIDS Res. Hum. Retroviruses. 14(7), 591-597. [0305] Garlick, R. L., Kirschner, R. J., Eckenrode, F. M., Tarpley, W. G., and Tomich, C. S. (1990). Escherichia coli expression, purification, and biological activity of a truncated soluble CD4. AIDS Res Hum Retroviruses 6(4), 465-79. [0306] Gorny, M. K. et al. (1992). Neutralization of diverse human immunodeficiency virus type 1 variants by an anti-V3 human monoclonal antibody. J. Virol. 66(12), 7538-42. [0307] Gorny, M. K. et al. (1993). Repertoire of human monoclonal antibodies specific for the V3 domain of HIV-1 gp120. J. Immunol., 150(2), 635-43.

[0308] Gorse, G. J. at al. (1999) MN and IIIB recombinant glycoprotein 120 vaccine-induced binding antibodies to native envelope glycoprotein of human immunodeficiency virus type 1 primary isolates. National Institute of Allergy and Infectious Disease AIDS Vaccine Evaluation Group, AIDS Res. Hum. Retroviruses. 15(10), 921-930. [0309] Greenberg, M. L. and Cammack, N. (2004). Resistance to enfuvirtide, the first HIV fusion inhibitor. J. Antimicrob. Chemother., 54(2), 333-340. [0310] Guo, Q. et al. (2003). Biochemical and genetic characterizations of a novel human immunodeficiency virus type 1 inhibitor that blocks gp120-CD4 interactions. J. Virol., 77(19), 10528-36. [0311] Herrera, C. et al. (2005). The impact of envelope glycoprotein cleavage on the antigenicity, infectivity, and neutralization sensitivity of Env-pseudotyped human immunodeficiency virus type 1 particles. Virology. 338(1), 154-72. [0312] Herrera, C. at al. (2003). Normeutralizing antibodies to the CD4-binding site on the gp120 subunit of human immunodeficiency virus type 1 do not interfere with the activity of a neutralizing antibody against the same site. J. Virol. 77(2), 1084-91. [0313] Hunter, E., and Swanstrom, R. (1990). Retrovirus envelope glycoproteins. Curr Top Microbiol Immunol 157, 187-253. [0314] Jeffs, S. A. et al. (2004). Expression and characterisation of recombinant oligomeric envelope glycoproteins derived from primary isolates of HIV-1. Vaccine. 22(8), 1032-46. [0315] Jones, P. L., Korte, T., and Blumenthal, R. (1998). Conformational changes in cell surface HIV-1 envelope glycoproteins are triggered by cooperation between cell surface CD4 and co-receptors. J Biol Chem 273(1), 404-9. [0316] Kirschner, M., Monrose, V., Paluch, M., Techodamrongsin, N., Rethwilm, A., and Moore, J. P. (2006). The production of cleaved, trimeric human immunodeficiency virus type 1 (HIV-1) envelope glycoprotein vaccine antigens and infectious pseudoviruses using linear polyethylenimine as a transfection reagent. Protein Expr Purif 48(1), 61-8. [0317] Klasse, P. J. at al., (1993). An immune-selected point mutation in the transmembrane protein of human immunodeficiency virus type 1 (HxB2-Env:Ala582(-->Thr) decreases viral neutralization by monoclonal antibodies to the CD4-binding site. Virology. 196(1), 332-7. [0318] Kuznetsov, Y. G. et al. (2003). Atomic force microscopy investigation of human immunodeficiency virus (HIV) and HIV-infected lymphocytes. J. Virol. 77(22), 11896-909. [0319] Lin, P. F. et al. (2003). A small molecule HIV-1 inhibitor that targets the HIV-1 envelope and inhibits CD4 receptor binding. Proc. Natl. Acad. Sci. USA., 100(19), 11013-8. [0320] Lineberger, J. E. et al., (2002). Altering expression levels of human immunodeficiency virus type 1 gp120-gp41 affects efficiency but not kinetics of cell-cell fusion. J. Virol. 76(7), 3522-33.

[0321] Lu, M., Blacklow, S. C., and Kim, P. S. (1995). A trimeric structural domain of the HIV-1 transmembrane glycoprotein. Nat Struct Biol 2(12), 1075-82. [0322] Melikyan, G. B., Markosyan, R. M., Hemmati, H., Delmedico, M. K., Lambert, D. M., and Cohen, F. S. (2000). Evidence that the transition of HIV-1 gp41 into a six-helix bundle, not the bundle configuration, induces membrane fusion. J Cell Biol 151(2), 413-23. [0323] Moore, J. P., and Doms, R. W. (2003). The entry of entry inhibitors: a fusion of science and medicine. Proc Natl Acad Sci USA 100(19), 10598-602. [0324] Moore, J. P. et al. (1995). Primary isolates of human immunodeficiency virus type 1 are relatively resistant to neutralization by monoclonal antibodies to gp120, and their neutralization is not predicted by studies with monomeric gp120. J. Virol. 69(1), 101-9. [0325] Moore, J. P. et al. (1994). Exploration of antigenic variation in gp120 from clades A through F of human immunodeficiency virus type 1 by using monoclonal antibodies. J. Virol. 68(12), 8350-64. [0326] Moore, J. P. et al., (1992). Virions of primary human immunodeficiency virus type 1 isolates resistant to soluble CD4 (sCD4) neutralization differ in sCD4 binding and glycoprotein gp120 retention from sCD4-sensitive isolates. J. Virol. 66(1), 235-43. [0327] Moore, J. P. et al., (2006). Nature of nonfunctional envelope proteins on the surface of human immunodeficiency virus type 1. J. Virol. 80(5), 2515-28. [0328] Muster, T. et al., (1993). A conserved neutralizing epitope on gp41 of human immunodeficiency virus type 1. J. Virol. 67(11), 6642-7. [0329] Nyambi, P. N. et al., (1998). Mapping of epitopes exposed on intact human immunodeficiency virus type 1 (HIV-1) virions: a new strategy for studying the immunologic relatedness of HIV-1. J. Virol. 72(11), 9384-91. [0330] Pancera, M. and Wyatt, R. (2005). Selective recognition of oligomeric HIV-1 primary isolate envelope glycoproteins by potently neutralizing ligands requires efficient precursor cleavage. Virology. 332(1), 145-56. [0331] Pantophlet, R., Ollmann Saphire, E., Poignard, P., Parren, P. W., Wilson, I. A., and Burton, D. R. (2003). Fine mapping of the interaction of neutralizing and normeutralizing monoclonal antibodies with the CD4 binding site of human immunodeficiency virus type 1 gp120. J Virol 77(1), 642-58. [0332] Park, E. J. et al. (2000). A global neutralization resistance phenotype of human immunodeficiency virus type 1 is determined by distinct mechanisms mediating enhanced infectivity and conformational change of the envelope complex. J. Virol. 74(9), 4183-91. [0333] Park, E. J., and Quinnan, G. V., Jr. (1999). Both neutralization resistance and high infectivity phenotypes are caused by mutations of interacting residues in the human immunodeficiency virus type 1 gp41 leucine zipper and the gp120 receptor- and coreceptor-binding domains. J. Virol. 73(7), 5707-13 [0334] Park, E. J., Vujcic, L. K., Anand, R., Theodore, T. S., and Quinnan, G. V., Jr. (1998). Mutations in both gp120 and gp41 are responsible for the broad neutralization resistance of variant human immunodeficiency virus type 1 MN to antibodies directed at V3 and non-V3 epitopes. J. Virol. 72(9), 7099-107. [0335] Parker, C. E., Deterding, L. J., Hager-Braun, C., Binley, J. M., Schulke, N., Katinger, H., Moore, J. P., and Tomer, K. B. (2001). Fine definition of the epitope on the gp41 glycoprotein of human immunodeficiency virus type 1 for the neutralizing monoclonal antibody 2F5. J Virol 75(22), 10906-11. [0336] Poignard, P., Moulard, M., Golez, E., Vivona, V., Franti, M., Venturini, S., Wang, M., Parren, P. W., and Burton, D. R. (2003). Heterogeneity of envelope molecules expressed on primary human immunodeficiency virus type 1 particles as probed by the binding of neutralizing and normeutralizing antibodies. J. Virol. 77(1)353-65. [0337] Poumbourios, P., Wilson, K. A., Center, R. J., El Ahmar, W., and Kemp, B. E. (1997). Human immunodeficiency virus type 1 envelope glycoprotein oligomerization requires the gp41 amphipathic alpha-helical/leucine zipperlike sequence. J Virol 71(3), 2041-9. [0338] Reeves, J. D., Lee, F. H., Miamidian, J. L., Jabara, C. B., Juntilla, M. M., and Doms, R. W. (2005). Enfuvirtide resistance mutations; impact on human immunodeficiency virus envelope function, entry inhibitor sensitivity, and virus neutralization. J. Virol. 79(8), 4991-9. [0339] Reitz, M. S., Jr., Wilson, C., Naugle, C., Gallo, R. C., and Robert-Guroff, M. (1988). Generation of a neutralization-resistant variant of HIV-1 is due to selection for a point mutation in the envelope gene. Cell, 54(1), 57-63. [0340] Rimsky, L. T., Shugars, D. C., and Matthews, T. J. (1998). Determinants of human immunodeficiency virus type 1 resistance to gp41-derived inhibitory peptides. J. Virol. 72(2), 986-93. [0341] Robinson, J. E., Holton, D., Pacheco-Morell, S., Liu, J., and McMurdo, H. (1990). Identification of conserved and variant epitopes of human immunodeficiency virus type 1 (HIV-1) gp120 by human monoclonal antibodies produced by EBV transformed cell lines. AIDS Res. Hum. Retroviruses. 6(5), 567-79. [0342] Rizzuto, C. D., Wyatt, R., Hernandez-Ramos, N., Sun, Y., Kwong, P. D., Hendrickson, W. A., and Sodroski, J. (1998). A conserved HIV gp120 glycoprotein structure involved in chemokine receptor binding. Science 280(5371), 1949-53.

[0343] Rucker, J., B J., D., Edinger, A., Long, D., Berson, J., and Doms, R. (1997). Cell-cell fusion assay to study role of chemokine receptors in human immunodeficiency virus type 1 entry. "Methods Enzymology", Vol. 288, pp. 118-133. [0344] Sanders, R. W., Schiffner, L., Master, A., Kajumo, F., Guo, Y., Dragic, T., Moore, J. P., and Binley, J. M. (2000). Variable-loop-deleted variants of the human immunodeficiency virus type 1 envelope glycoprotein can be stabilized by an intermolecular disulfide bond between the gp120 and gp41 subunits. J Virol 74(11), 5091-100. [0345] Sanders, R. W., Vesanen, M., Schuelke, N., Master, A., Schiffner, L., Kalyanaraman, R., Paluch, M., Berkhout, B., Maddon, P. J., Olson, W. C., Lu, M., and Moore, J. P. (2002). Stabilization of the soluble, cleaved, trimeric form of the envelope glycoprotein complex of human immunodeficiency virus type 1. J Virol 76(17), 8875-89. [0346] Sattentau, Q. J., and Moore, J. P. (1991). Conformational changes induced in the human immunodeficiency virus envelope glycoprotein by soluble CD4 binding. J Exp Med 174(2), 407-15. [0347] Sattentau, Q. J., and Moore, J. P. (1995). Human immunodeficiency virus type 1 neutralization is determined by epitope exposure on the gp120 oligomer. J Exp Med. 182(1), 185-96. [0348] Scanlan, C. N., Pantophlet, R., Wormald, M. R., Ollmann Saphire, E., Stanfield, R., Wilson, I. A., Katinger, H., Dwek, R. A., Rudd, P. M., and Burton, D. R. (2002). The broadly neutralizing anti-human immunodeficiency virus type 1 antibody 2G12 recognizes a cluster of alpha1-->2 mannose residues on the outer face of gp120. J. Virol. 76(14), 7306-21. [0349] Schagger, H., Cramer, W. A., and von Jagow, G. (1994). Analysis of molecular masses and oligomeric states of protein complexes by blue native electrophoresis and isolation of membrane protein complexes by two-dimensional native electrophoresis. Anal. Biochem. 217(2), 220-30. [0350] Schulke, N., Vesanen, M. S., Sanders, R. W., Zhu, P., Lu, M., Anselma, D. J., Villa, A. R., Parren, P. W., Binley, J. M., Roux, K. H., Maddon, P. J., Moore, J. P., and Olson, W. C. (2002). Oligomeric and conformational properties of a proteolytically mature, disulfide-stabilized human immunodeficiency virus type 1 gp140 envelope glycoprotein. J Virol 76(15), 7760-76. [0351] Shugars, D. C., Wild, C. T., Greenwell, T. K., and Matthews, T. J. (1996). Biophysical characterization of recombinant proteins expressing the leucine zipper-like domain of the human immunodeficiency virus type 1 transmembrane protein gp41. J Virol 70(5), 2982-91. [0352] Si, Z., Cayabyab, M., and Sodroski, J. (2001). Envelope glycoprotein determinants of neutralization resistance in a simian-human immunodeficiency virus (SHIV HXBc2P 3.2) derived by passage in monkeys. J. Virol. 75(9), 4208-18. [0353] Srivastava, I. K., Stamatatos, L., Legg, H., Kan, E., Fong, A., Coates, S. R., Leung, L., Wininger, M., Donnelly, J. J., Ulmer, J. B., and Barnett, S. W. (2002). Purification and characterization of oligomeric envelope glycoprotein from a primary R5 subtype B human immunodeficiency virus. J Virol 76(6), 2835-47. [0354] Staropoli, I., Chanel, C., Girard, M., and Altmeyer, R. (2000). Processing, stability, and receptor binding properties of oligomeric envelope glycoprotein from a primary HIV-1 isolate. J. Biol. Chem. 275(45), 35137-45. [0355] Stiegler, G., Kunert, R., Purtscher, M., Wolbank, S., Voglauer, R., Steindl, F., and Katinger, H. (2001). A potent cross-clade neutralizing human monoclonal antibody against a novel epitope on gp41 of human immunodeficiency virus type 1. AIDS Res. Hum. Retroviruses. 17(18), 1757-65. [0356] Sullivan, N., Sun, Y., Sattentau, Q., Thali, M., Wu, D., Denisova, G., Gershoni, J., Robinson, J., Moore, J., and Sodroski, J. (1998). CD4-Induced conformational changes in the human immunodeficiency virus type 1 gp120 glycoprotein: consequences for virus entry and neutralization. J Virol 72(6), 4694-703. [0357] Thali, M., Charles, M., Furman, C., Cavacini, L., Posner, M., Robinson, J. and Sodroski, J. (1994). Resistance to neutralization by broadly reactive antibodies to the human immunodeficiency virus type 1 gp120 glycoprotein conferred by a gp41 amino acid change. J. Virol., 68(2), 674-80. [0358] Thali, M., Moore, J. P., Furman, C., Charles, M., Ho, D. D., Robinson, J., and Sodroski, J. (1993). Characterization of conserved human immunodeficiency virus type 1 gp120 neutralization epitopes exposed upon gp120-CD4 binding. J Virol 67(7), 3978-88. [0359] Thomas, D. J., Wall, J. S., Hainfeld, J. F., Kaczorek, M., Booy, F. P., Trus, B. L., Eiserling, F. A., and Steven, A. C. (1991). gp160, the envelope glycoprotein of human immunodeficiency virus type 1, is a dimer of 125-kilodalton subunits stabilized through interactions between their gp41 domains. J. Virol. 65(7), 3797-803. [0360] Trkola, A., Purtscher, M., Muster, T., Ballaun, C., Buchacher, A., Sullivan, N., Srinivasan, K., Sodroski, J., Moore, J. P., and Katinger, H. (1996). Human monoclonal antibody 2G12 defines a distinctive neutralization epitope on the gp120 glycoprotein of human immunodeficiency virus type 1. J Virol 70(2), 1100-8. [0361] Trkola, A., Dragic, T., Arthos, J., Binley, J. M., Olson, W. C., Allaway, G. P., Cheng-Mayer, C., Robinson, J., Maddon, P. J., and Moore, J. P. (1996a). CD4-dependent, antibody-sensitive interactions between HIV-1 and its co-receptor CCR-5. Nature. 384(6605), 184-7. [0362] Trkola, A., Ketas, T., Kewalramani, V. N., Endorf, F., Binley, J. M., Katinger, H., Robinson, J., Littman, D. R., and Moore, J. P. (1998). Neutralization sensitivity of human immunodeficiency virus type 1 primary isolates to antibodies and CD4-based reagents is independent of coreceptor usage. J. Virol. 72(3), 1876-85. [0363] Weng, Y., and Weiss, C. D. (1998). Mutational analysis of residues in the coiled-coil domain of human immunodeficiency virus type 1 transmembrane protein gp41. J. Virol. 72(12), 9676-82. [0364] Whitcomb, J. M., Huang, S., Fransen, S., Wrin, T., Paxinos, E., Toma, J., Greenberg, M., Sista, P., Melby, T., Matthews, T., DeMasi, R., Heilek-Snyder, G., Cammack, N., Hellmann, N., and Petropoulos, C. (2003). 10.sup.th Conference on Retroviruses and Opportunistic Infection, Boston, Mass. [0365] Wild, C. T., Shugars, D. C., Greenwell, T. K., McDanal, C. B., and Matthews, T. J. (1994). Peptides corresponding to a predictive alpha-helical domain of human immunodeficiency virus type 1 gp41 are potent inhibitors of virus infection. Proc. Natl. Acad. Sci. USA., 100(19), 91(21), 9770-4. [0366] Wu, L., Gerard, N. P., Wyatt, R., Choe, H., Parolin, C., Ruffing, N., Borsetti, A., Cardoso, A. A., Desjardin, E., Newman, W., Gerard, C., and Sodroski, J. (1996). CD4-induced interaction of primary HIV-1 gp120 glycoproteins with the chemokine receptor CCR-5. Nature 384(6605), 179-83. [0367] Wyatt, R., Moore, J., Accola, M., Desjardin, E., Robinson, J., and Sodroski, J. (1995). Involvement of the V1/V2 variable loop structure in the exposure of human immunodeficiency virus type 1 gp120 epitopes induced by receptor binding. J. Virol. 69(9), 5723-33. [0368] Wyatt, R., and Sodroski, J. (1998). The HIV-1 envelope glycoproteins: fusogens, antigens, and immunogens. Science. 280(5371), 1884-8. [0369] Yang, X., Florin, L., Farzan, M., Kolchinsky, P., Kwong, P. D., Sodroski, J., and Wyatt, R. (2000). Modifications that stabilize human immunodeficiency virus envelope glycoprotein trimers in solution. J Virol 74(10), 4746-54. [0370] Yang, X., Lee, J., Mahony, E. M., Kwong, P. D., Wyatt, R., and Sodroski, J. (2002). Highly stable trimers formed by human immunodeficiency virus type 1 envelope glycoproteins fused with the trimeric motif of T4 bacteriophage fibritin. J Virol 76(9), 4634-42. [0371] York, J., Follis, K. E., Trahey, M., Nyambi, P. N., Zolla-Pazner, S., and Nunberg, J. H. (2001). Antibody binding and neutralization of primary and T-cell line-adapted isolates of human immunodeficiency virus type 1. J. Virol. 75(6), 2741-52. [0372] Zanetti, G., Briggs, J. A., Grunewald, K., Sattentau, Q. J., and Fuller, S. D. (2006). Cryo-Electron Tomographic Structure of an Immunodeficiency Virus Envelope Complex In Situ. PLoS Pathog 2(8). [0373] Zhang, C. W., Chishti, Y., Hussey, R. E., and Reinherz, E. L. (2001). Expression, purification, and characterization of recombinant HIV gp140. The gp41 ectodomain of HIV or simian immunodeficiency virus is sufficient to maintain the retroviral envelope glycoprotein as a trimer. J Biol Chem 276(43), 39577-85. [0374] Zhang, W., Canziani, G., Plugariu, C., Wyatt, R., Sodroski, J., Sweet, R., Kwong, P., Hendrickson, W., and Chaiken, I. (1999). Conformational changes of gp120 in epitopes near the CCR5 binding site are induced by CD4 and a CD4 miniprotein mimetic. Biochemistry 38(29), 9405-16. [0375] Zhu, P., Chertova, E., Bess, J., Jr., Lifson, J. D., Arthur, L. O., Liu, J., Taylor, K. A., and Roux, K. H. (2003). Electron tomography analysis of envelope glycoprotein trimers on HIV and simian immunodeficiency virus virions. Proc Natl Acad Sci USA 100(26), 15812-7. [0376] Zhu, P., Liu, J., Bess, J., Jr., Chertova, E., Lifson, J. D., Grise, H., Ofek, G. A., Taylor, K. A., and Roux, K. H. (2006). Distribution and three-dimensional structure of AIDS virus envelope spikes. Nature 441(7095), 847-52. [0377] Zwick, M. B., Labrijn, A. F., Wang, M., Spenlehauer, C., Saphire, E. O., Binley, J. M., Moore, J. P., Stiegler, G., Katinger, H., Burton, D. R., and Parren, P. W. (2001). Broadly neutralizing antibodies targeted to the membrane-proximal external region of human immunodeficiency virus type 1 glycoprotein gp41. J Virol 75(22), 10892-905.

Experimental Details III:

[0378] According to the 2005 World Health Organization AIDS epidemic update, there are over 40 million people infected with the HIV' virus worldwide, with close to 5 million newly infected cases just last year (1). Among the hardest hit areas is sub-Saharan Africa, with over 25 million people living with HIV and about 10% dying of AIDS-related illnesses. It has been widely recognized and accepted that prophylactic measures in the form of an HIV vaccine, in addition to therapeutic medicines, need to be implemented to curtail the spread of AIDS globally.

[0379] An effective HIV vaccine needs to demonstrate an ability to elicit neutralizing antibodies (NAb) that would be capable of blocking the fusogenic interaction and entry of HIV with the CD4 receptor on CD4.sup.+ helper T cells, mediated by the cell surface viral env glycoproteins, gp120 and gp41. Since the genetic polymorphism of the HIV-1 gag and env genes are diverse and constantly evolving due to rapid mutation within individuals (2), the NAbs targeting the gp120 and gp41 envelope proteins on the viral surface need to be capable of blocking the viral interaction with the CD4 receptor and thereby neutralize viruses from a broad range of subtypes, without discrimination.

[0380] One logical design of recombinant env vaccine candidates is to base the vaccine sequence on currently existing HIV-1 isolates that are prevalent in the infected population. To this end, several oligomeric any proteins from several different subtypes or "clades" have been described, with subtype B sequences serving as a basis for the majority of those that have been reported (3-11, 29, 31). The oligomeric any protein complex on the surface of the virus is comprised of a gp120-gp41 heterodimer present in a homotrimer configuration (held together via non-covalent interactions), resembling a "spike" structure. These glycoproteins are derived from a gp160 precursor protein, which undergoes processing and cleavage in the cell to result in gp120 and gp41 heterodimers that are then targeted to the surface of the HIV viral envelope (12, 13). Fusion of the virus with the CD4.sup.+ cell membrane and oligomerization of the trimer spike is mediated by the gp41 glycoprotein, which is tethered to the virion surface via its transmembrane domain (12, 13).

[0381] It has been reasoned that design of a recombinant vaccine should mimic the native trimer spike of the HIV envelope against which NAbs would naturally be generated. Since the native Env trimer is technically challenging to produce in a recombinant form, modified versions of the trimer that could serve as potential vaccine templates have been reported. One typical modification is truncation of the gp41 transmembrane domain from the precursor gp160 to yield gp140 proteins in a soluble form. However, following processing and cleavage, the resulting gp120 and gp41 ectodomain or gp41.sub.ECTO (lacking the transmembrane domain) have been shown to form unstable associations and tend to dissociate into their respective monomeric subunits (13, 14).

[0382] To address these issues, subtype B HIV.sub.JR-FL Env was used as a template and a disulfide bond was introduced between gp120-gp41.sub.ECTO subunits (SOS gp140), followed by a further modification to gp41.sub.ECTO (I559P mutation), which successfully allowed for the expression of stable, cleaved and fully processed oligomeric gp140 proteins in a trimeric conformation (SOSIP gp140) (8-11, 15-17, and WO 2003/022869). While immunization of rabbits performed with the engineered HIV-1.sub.JR-FL SOSIP gp140 elicited antibodies capable of neutralization, the activity was limited primarily to the homologous strain, with only a modest and limited ability to neutralize across different HIV-1 primary isolates (11).

[0383] While the SOSIP technology addresses stability and expression, another issue that has limited production and purification of the recombinant trimers has been the spontaneous association of the oligomeric gp140 proteins into aberrant "aggregate" species (3, 9, 11, 18). These aggregate species, typically identified by their reduced mobility on blue native PAGE (SN-PAGE) and non-reduced SDS-PAGE have been difficult to purify from the SOSIP gp140 trimer without compromising yield and/or stability of the trimer. Attempts to fully characterize the aggregate have been limited and their true nature remains elusive.

[0384] To explore a wider variety of oligomeric env proteins that could elicit higher breadths of cross-neutralization activity and serve as potential vaccine immunogens, a panel of subtype A sequences from HIV-1 primary isolates in sub-Saharan Africa were studied (19). The env proteins from these sequences were expressed as SOSIP gp140 proteins, with a further engineered mutation at the gp120-gp41.sub.ECTO cleavage site (R6) for enhanced furin cleavage (>95% efficiency) to yield soluble, stable and fully processed gp140 trimers. Described herein is the purification and biochemical characterization of KNH1144 SOSIP R6 gp140, derived from a contemporary East African subtype A HIV-1 primary isolate, using methodologies that improve on currently implemented purification procedures. The purified KNH1144 SOSIP R6 gp140 is a trimer based on BN-PAGE and size exclusion chromatography (SEC). In addition, described herein are novel findings of the effects of non-ionic detergents such as Tween 20 on the KNH1144 SOSIP R6 aggregates (19). These findings reveal new insights into the nature of the aggregate species. The effects of non-ionic detergent, e.g., Tween.RTM. 20, treatment on the antigenic properties of KNH1144 SOSIP R6 gp140 aggregates and trimers were examined. Finally, digital imaging based on negative stain electron microscopy was performed and revealed the structure of purified KNH1144 SOSIP R6 gp140 as trimeric oligomers.

Materials and Methods

Subtype A KNH1144 SOSIP R6 Transfection and Expression:

[0385] The KNH1144 SOSIP R6 envelope and furin DNA plasmids were as described. For a typical 8 L preparation, HEX 293T cells were seeded in triple flasks at a density of 2.5.times.10.sup.7 cells/flask and cultured in DMEM/10% FBS/1% pen-strep with 1% L-glutamine 24 hours prior to transfection. On the day of transfection, 270 ug of KNH1144 SOSIP R6 envelope DNA was mixed with 90 ug of Furin protease DNA plasmid (per flask) in Opti-MEM. Polyethyleneimine (PEI) was added stepwise (2 mg PEI: 1 mg total DNA) and vortexed immediately in between each addition. The PEI/DNA complex solutions were incubated for 20 minutes at room temperature. Complexes were then added to the flasks and incubated for 6 hours at 32.degree. C., 5% CO.sub.2. The cells were then washed with warmed PBS and then incubated in exchange media (DMEM/0.05% BSA/1% pen-strep) for 48 hours at 32.degree. C., 5% CO.sub.2. After the 48 hour incubation, the supernatants were collected and a cocktail of protease inhibitors was added to minimize protein degradation. Harvested supernatants were then clarified by filtration through a 0.45 um filter and concentrated to 53.times.. Expression of KNH1144 gp120 monomer has been previously described (1) and typically, 1-2 L of cell culture supernatants from transfected cells were harvested. Supernatants were clarified by filtration and stored at -80.degree. C. without any concentration prior to purification.

Purification of KNH1144 SOSIP R6 40140 and 80120:

[0386] KNH1144 SOSIP R6 gp140 trimer was purified via a four step process starting with an ammonium sulfate precipitation followed by lectin affinity, size exclusion and ion-exchange chromatography. 53.times. concentrated cell culture supernatant was precipitated with an equal volume of 3.8 M ammonium sulfate to remove contaminant proteins (with the major contaminant being -2-macroglobulin). The ammonium sulfate was added with constant stirring with a stir bar and then was immediately centrifuged at 4000 rpm, 4.degree. C. for 45 minutes. The resulting supernatant was diluted 4-fold with PBS, pH 7.25, and was filtered using a 0.45 um vacuum filter. The sample was then loaded at 0.5-0.8 ml/min onto a Galanthus nivalis (GNA) lectin (Vector Laboratories, Burlingame, Calif.) column equilibrated with PBS--pH 7.25. Once the load was finished, the column was washed with PBS pH 7.25 until OD.sub.280 reached baseline, followed by a second wash with 0.5 M NaCl PBS pH 7.25 at 1 ml/min in order to remove contaminant proteins (mainly BSA). The column was then eluted with 1 M MMP PBS pH 7.25 starting with flowing, one half CV through the column at 0.3 ml/min and pausing the purification for a 1 hour incubation in MMP elution buffer. Following the incubation, the flow was restarted at 0.3 ml/min and 0.5-1 ml fractions were collected. All peak fractions were then pooled and concentrated to a final volume of 1 ml using a Vivaspin 100,000 MWCO concentrator (Vivascience, Edgewood, N.Y.) centrifuged at 1000.times.g. The concentrated lectin elution was applied over a Superdex 200 SEC column (GE Healthcare, Piscataway, N.J.) equilibrated in 20 mM Tris pH 8, 200 mM NaCl (TN-200), injecting 0.5 ml of sample per run and was resolved at 0.4 ml/min, collecting 0.4 ml fractions. The fractions were analyzed by BN-PAGE using a 4-12% Bis-Tris NuPAGE gel (Invitrogen, Carlsbad, Calif.) (10). All trimer containing fractions were pooled and diluted to 75 mM NaCl with 20 mM Tris pH 8. The diluted SEC pool was then applied over a 1 ml HiTrap DEAE FF column (GE Healthcare), equilibrated in 20 mM Iris pH 8, 75 mM NaCl (TN-75). The diluted SEC pool was loaded at 0.5 ml/min. The column was washed with TN-75 at 1 ml/min until the OD.sub.280 reached baseline. The column was then eluted with 20 mM Tris, 300 mM NaCl pH 8 at 1 ml/min, collecting 0.5 ml fractions.

[0387] To maximize trimer yield, the flow-through fraction from the DEAE column was re-applied over the column (equilibrated in TN-75) and typically 20-30% or 30-40% more trimer was recovered in this manner. The fractions were analyzed by BN-PAGE and by reducing and non-reducing SDS-PAGE. Western blot analysis on non-reduced SDS-PAGE gel was performed with the ARP3119 monoclonal antibody. The trimer containing fractions were pooled and trimer concentration was determined through densitometry on a reducing SDS-PAGE gel using JR-FL gp120 as a standard.

KNH1144 gp120 Monomer:

[0388] Unconcentrated cell culture supernatants containing secreted gp120 monomer were applied directly over a GNA lectin column equilibrated in 20 mM imidazole pH 7.1 at 1-2 ml/min. Following adsorption, the column was washed with a high salt (PBS containing 1 M NaCl, pH 7.1) wash, followed by a low salt (20 mM imidazole pH 7.1) wash. The column was eluted with 1 M MMP in 20 mM imidazole, 0.2 M NaCl pH 7.1. Peak fractions were pooled and diluted with 20 mM imidazole, pH 7.1, thirteen-fold to a final buffer concentration of 20 mM imidazole, pH 7.1, 15 mM NaCl. The diluted GNA elution was applied over 1 ml HiTrap Q Sepharose FF (GE Healthcare) equilibrated in 20 mM imidazole, pH 7.1. Following binding, the column was washed with 20 mM imidazole, pH 7.1, and was eluted with 20 mM imidazole, 0.2 M NaCl, pH 7.1. The Q elutions were pooled and concentrated and applied over a Superdex 200 column equilibrated in PBS in 0.5 ml volumes and resolved at 0.4 ml/min. Peak fractions were analyzed by 4-12% Bis-tris gels (Invitrogen), followed by Coomassie staining. Fractions containing gp120 were pooled and quantified as described above for the SOSIP R6 gp140 trimers and stored at -80.degree. C.

Tween.RTM. 20 Aggregate "Conversion"/"Collapse" Experiments:

[0389] Tween.RTM. 20 Dose effect: 1 ug of purified KNH1144 SOSIP R6 trimer was incubated with varying concentrations of Tween.RTM. 20 (polyoxyethylene sorbitan monolaurate) ranging from 0 to 0.0001 (v/v) and incubated for 1 hour at room temperature. Following incubation, samples were analyzed by BN-PAGE as described above.

[0390] Kinetics of Tween.RTM. 20 effect: To ascertain the early kinetics of the Tween.RTM. 20 effect on aggregate, 1 ug of purified KNH1144 SOSIP R6 trimer was incubated with Tween.RTM. 20 at a final concentration of 0.05% (v/v) for 5 minutes and for 10 minutes. A no-detergent control was included separately for each timepoint.

[0391] Temperature dependence on Tween.RTM. 20 effect: To determine if temperature affected the ability of Tween.RTM. 20 to recover trimers from aggregates (i.e., collapse aggregate into trimer), 1 ug of purified KNH1144 SOSIP R6 trimer was incubated with Tween.RTM. 20 to a final concentration of 0.05% (v/v) at 0.degree. C. (on ice), room temperature (22-23.degree. C.) at 37.degree. C., or left untreated for 10 minutes. Following the incubation, samples were analyzed by ON-PAGE and Coomassie staining.

[0392] Tween.RTM. 20 effect on KNH1144 gp120: To test if Tween.RTM. 20 had a similar effect on KNH1144 gp120, 1 ug of purified gp120 monomer was either untreated or incubated with Tween.RTM. 20 at a final concentration of 0.05% for 10 minutes at room temperature. Following the treatment, samples were analyzed by BN-PAGE and Coomassie staining.

[0393] Tween.RTM. 20 effect on -2-macroglogulin (a.sub.2M): 0.5 ug of purified -2-macroglobulin was either untreated or treated with Tween.RTM. 20 at a final concentration of 0.05% for 10 minutes at room temperature. Reactions were analyzed via BN-PAGE, followed by Coomassie staining.

Size Exclusion Chromatography (SEC) Analysis:

[0394] All runs were performed at 4.degree. C. on the AKTA FPLC system (GE Healthcare). Each run was performed at least twice.

[0395] Molecular weight standards SEC: A Superdex 200 10/300 GL column was equilibrated in 20 mM This pH 8, 0.5 M NaCl (TN-500) and calibrated with the following molecular weight standard proteins: thyroglobulin 669,000 Da; ferritin 440,000 Da; BSA 67,000 Da; and RNAse A 13,700 Da. A standard curve was generated by plotting the observed retention volumes of the standard proteins against the log values of their predicted molecular weights.

[0396] KNH1144 gp120 SEC analysis: 14 ug of purified KNH1144 gp120 (either untreated or Tween.RTM. 20-treated as described above) was applied over the Superdex 200 column equilibrated in TN-500 and resolved at a flow rate of 0.4 ml/min. As a control, 10-14 ug of JR-FL gp120 was also analyzed in a similar manner.

[0397] KNH1144 SOSIP R6 gp140 SEC analysis: 8-10 ug of purified KNH1144 SOSIP R6 gp140 was treated with Tween.RTM. 20 at a final concentration of 0.05% for 10-30 minutes at room temperature. Treated samples were then applied over the Superdex 200 column equilibrated with TN-500 containing 0.05% Tween.RTM. 20 (TNT-500) and resolved at 0.4 ml/min, collecting 0.4 ml fractions. Trimer-containing fractions were then analyzed by BN-PAGE, followed by silver staining. Fractions were also separated by BN-PAGE, followed by Western blot analysis with ARP 3119 antibody.

Blue Native PAGE (BN-PAGE) and SDS-PAGE Analysis:

[0398] All SDS-PAGE analysis (reduced and non-reduced) were performed using 4-12% Bis-Tris NuPage gels (Invitrogen). BN-PAGE analysis was performed as described (10). Silver stain analysis was performed with the SilverQuest kit (Invitrogen). Coomassie G-250 stain was performed using either the SimplyBlue SafeStain or Easy-to-Use Coomassie.RTM. G-250 Stain (Invitrogen).

Antiqenicity Experiments--Lectin ELISA:

[0399] Human mAbs b6 (32), b12 (33) and 2G12 (26), HIVIg (40) were obtained from Dr. Dennis Burton (The Scripps Research Institute, La Jolla, Calif.) or Dr. Herman Katinger (University of Natural Resources and Applied Life Sciences, Austria, Vienna). For the lectin based ELISA, anti-Env antibodies 2G12, b6, b12 and HIVIg were used. In addition, the CD4-IgG2 antibody conjugate PRO 542 (39) was also used.

[0400] ELISA plates were coated overnight at 4.degree. C. with lentil lectin powder from Lens culinaris (L9267, Sigma) at 10 ug/ml concentration. Plates were washed with PBS twice and blocked with SuperBlock (Pierce) (warmed to RT). Excess blocking agent was washed off with PBS. SEC fractions containing HMW aggregate were either untreated or treated with 0.05% Tween.RTM. 20 (v/v, final concentration) for 30 minutes at room temperature (RT) and were added at 0.3 ug/ml (diluted in PBS) and bound to the plates (via the lectin) for 4 hours at RT. Following binding, plates were washed 4 times with PBS and incubated with primary anti-Env antibodies starting at 10 ug/ml in PBS/5% milk. 4.times. serial dilutions were performed and incubations were performed for 3 hours at RT. Following antibody incubation, plates were washed 6 times and goat anti-human IgG (H+L) alkaline phosphatase conjugate secondary antibody (Jackson ImmunoResearch) was added at 1/4000 concentration in PBS/5% milk. Plates were washed 4 times and ELISAs were developed using the Ampak detection system (Dako Cytomation, Carpinteria, Calif.) as per the manufacturer's instructions.

DEAE Anion Exchange Chromatography of Tween.RTM. 20-Treated KNH1144 SOSIP R60p140 Trimers:

[0401] Purified KNH1144 SOSIP R6 gp140 trimers, treated either with or without 0.05% Tween.RTM. 20 (final), containing a.sub.2M contaminant in TN-75 buffer was applied over 1 ml DEAE HiTrap FF column (equilibrated in TN-75) at 0.25 ml/min at RT and flow-through (FT) fractions were collected. Following sample loading, the column was washed with TN-75 at 0.5 ml/min and wash fractions were collected. Finally, the column was eluted with TN-300 and equal amounts from each fraction were analyzed via BN-PAGE, followed by Coomassie G-250 staining.

Electron Microscopy:

[0402] EM analysis of the SOSIP trimers was performed by negative stain as previously described (34, 35). Because this technique is incompatible with detergent, 20 1 of the original sample (0.5 mg/ml in TN-300) was dialyzed against BSB (0.1 M H.sub.3BO.sub.3, 0.025 M Na.sub.2B.sub.4O.sub.7, 0.075 M NaCl, pH 8.3) and subsequently depleted of detergent using the Mini Detergent-OUT.TM. detergent removal kit (Calbiochem, La Jolla, Calif.) as described by the manufacturer. Two microliters of the resulting protein solution, diluted in 200 1 BSB, was affixed to carbon support membrane, stained with 1% uranyl formate, and mounted on 600 mesh copper grids for analysis. EMs were recorded at X100,000 at 100 kV on a JOEL JEM 1200 electron microscope. Measurements were made using the Image-Pro Plus software program. Fifty or more trimers were measured and analyzed statistically. The average diameter of the compact trimers formed by the SOSIP gp140 (e.g., KNH1144.R6 SOSIP) proteins was about 12-13 nm.

Results

[0403] Expression and Purification of Trimeric KNH1144 SOSIP R6 gp140:

[0404] The purification of KNH1144 SOSIP R6 gp140 trimers typically involved three chromatography steps: GNA lectin affinity, Superdex 200 size exclusion and DEAE weak anion exchange. 53.times. concentrated cell culture supernatant precipitated with ammonium sulfate was clarified by centrifugation, diluted and applied over the GNA lectin affinity column to capture gp140 proteins via (-1, 3) mannose residues. Analysis of the ammonium sulfate precipitation using different starting concentrations of harvested cell culture supernatant (100.times. to 40.times.) revealed that 53.times. was the optimum condition at which maximum -2-macroglobulin precipitated out, with minimal envelope protein loss. While the GNA lectin column was highly efficient in capture of the gp140 trimer, elution of the protein under even extremely mild conditions, with the competing MMP eluant, caused significant de-stabilization of the trimer and resulted in marked dissociation of the trimer into dimer and monomer species. Attempts to separate the different oligomeric gp140 species via Superdex SEC resulted in efficient separation of the monomer from the dimer and trimer. Superdex 200 SEC of the GNA eluate yielded trimers that were free of monomers, but not of dimers. To resolve trimers away from dimers (and residually co-migrating monomers), a DEAE anion exchange step was incorporated, which led to very efficient separation of diner from trimer, thereby yielding pure trimers at the end of the purification protocol.

[0405] SDS-PAGE analysis under reducing conditions showed that the final preparation was of high purity (at least 90%), with only the gp120 moeity visible on the reduced gel (FIG. 14, left panel, center lane). Common serum contaminants that were detectable by reducing SOS-PAGE were-2-macroglobulin (a.sub.2M) and BSA, which typically comprised up to .about.10% of the final preparation. The non-reduced gel shows intact gp140 protein on SOS-PAGE (FIG. 14, left panel, right lane). In addition, little to no disulfide-linked aggregate (typically revealed as migrating much slower on a non-reducing gel) was detected. This was confirmed by anti-envelope Western blot analysis on the non-reduced gel (FIG. 14, Anti-Env blot, middle panel). BN-PAGE analysis of the purified trimer revealed the purified trimer to migrate between the 669k thyroglobulin and 440k ferritin marker proteins (FIG. 14, right panel, SOSIP R6). This is consistent with the migration patterns for JR-FL SOSIP gp140 which has been observed to migrate in the lower range of 669k and 440 kDa (9, 10, 11). An additional slower migrating band, typically classified as high molecular weight (HMW) SOSIP aggregates and comprising about 30% of the preparation, was also detected (FIG. 14, right panel, SOSIP R6, - lane). Typical HMW aggregate content ranged from 10 to 40% of the final preparation prior to non-ionic detergent treatment. Treatment of the purified preparation with Tween.RTM. 20 at a final concentration of 0.05% converted the HMW aggregate species to trimers, yielding a homogenous trimer preparation (FIG. 14, right panel, SOSIP R6, + lane) (19). It should be noted that treatment with Tween.RTM. 20 also caused the treated trimer to migrate slightly more rapidly than the untreated trimer (notice faster mobility of trimer in the + lane).

[0406] Purification of the monomeric protein yielded a homogenous preparation as evident by a single band when analyzed by reducing SDS-PAGE (FIG. 14, left panel, left lane) and Superdex 200 SEC. BN-PAGE analysis of the purified monomer, either in the presence or absence of Tween.RTM. 20 revealed a single migrating monomeric gp120 species, devoid of any higher order oligomers, consistent with its purity on SDS-PAGE (FIG. 14, right panel, gp120-/+ lanes).

[0407] Since Tween.RTM. 20 provided a simple and mild means to obtain homogenous trimers, further characterization of the non-ionic detergent effect was performed. A purified trimer preparation containing .about.30% aggregates (e.g., monomer, dimmer and trimer) was treated with Tween.RTM. 20 at final concentrations of 0.0001% to 0.1% (v/v) (FIG. 15A). The SOSIP R6 aggregates were converted to trimers at concentrations of 0.1% to 0.01% (FIG. 15A, lanes 3-5). No conversion was observed at Tween.RTM. 20 concentrations of 0.001 and 0.0001% (FIG. 15A, lanes 6 and 7). Close examination of the 0.01% reaction (lane 5) revealed that traces of aggregate were present, thus indicating that 0.01% Tween.RTM. 20 is probably the threshold concentration. To study the kinetics of the conversion, trimer preparations containing .about.30% aggregate were incubated with Tween.RTM. 20 for 0, 5 and 10 minutes prior to analysis by BN-PAGE. As shown in FIG. 15B, both the 5 minute and 10 minute incubations completely eliminated the aggregate, indicating that the kinetics of the reaction was rapid and within a 5 minute time span.

[0408] The effect of temperature on aggregate rearrangement was also examined. Aggregate/trimer preparations were incubated with Tween.RTM. 20 either at 0.degree. C. (on ice), room temperature (22-23.degree. C.), or 37.degree. C. As shown in FIG. 15C, conversion of aggregate to trimer occurred at all 3 temperatures, indicating that the Tween.RTM. 20 effect on aggregate was independent of temperature over this range. Similar results were obtained when Tween.RTM. 80 was used instead of Tween.RTM. 20.

[0409] Similar Tween.RTM. 20 treatment of the gp120 monomer showed that there was no difference observed in its migratory pattern either in the presence or absence of Tween.RTM. 20, indicating that Tween.RTM. 20 did not affect the gp120 monomer (FIG. 14, right panel, gp120, -/+ lanes). In some cases, a mild increase in the staining intensity of the gp120 monomer occurred.

[0410] To test if the detergent had a collapsive effect on another large multi-subunit protein, -2-macroglobulin (.sub.2M), which is an acidic 726 kDa tetrameric glycoprotein comprised of four identical 185 kDa subunits, was incubated with Tween.RTM. 20. No change was observed in the migratory pattern of .sub.2M in the presence of Tween.RTM. 20, although there was a slight increase in the staining intensity of the protein. (See FIG. 21)

[0411] To examine whether Tween.RTM. 20 could convert preparations containing predominantly aggregate as the major oligomeric species to resulting trimers, a KNH1144 SOSIP R6 preparation containing >70% HMW aggregate was incubated with Tween.RTM. 20 and analyzed by BN-PAGE. As shown in FIG. 15D, Tween.RTM. 20 was effective in converting the aggregate rich fraction to trimer (FIG. 19D, left panel). Fractions of less purity containing HMW aggregate, dimers and monomers (FIG. 15D, right panel, - lane, each species denoted by arrows), when treated with Tween.RTM. 20 also resulted in collapse of HMW aggregate to resulting trimer (FIG. 15D, right panel, + lane). However, no effect on dimer or monomer migration was observed (FIG. 15D, right panel, + lane, arrows), indicating that the Tween.RTM. 20 action was specific to KNH1144 SOSIP R6HMW aggregate and trimer. Consistent with previous observations, some increase in monomer staining was observed. Thus, these results indicate that Tween.RTM. 20 efficiently converts the KNH1144 SOSIP HMW aggregate into trimeric form. According to this invention, Tween.RTM. 20 efficiently converted into trimers HMW preparations having greater than 10%, (e.g., greater than 10-40%), aggregate. Greater than 90-99%, or 100%, trimers were able to be recovered from non-ionic detergent-, e.g., Tween.RTM. 20, treated HMW aggregates.

SEC Analysis of KNH1144 gp120 Monomer and SOSIP R6 gp140 Trimer:

[0412] Size exclusion chromatography (SEC) analysis was performed as a second means to characterize the molecular sizes of KNH1144 gp120 monomer and SOSIP R6 gp140 trimer proteins. A Superdex 200 size exclusion column was calibrated with thyroglobulin (669 kDa), ferritin (440 kDa), BSA (67 kDa) and RNAse A (13.7 kDa) as molecular weight standards. In, addition, monomeric JR-FL gp120 was also analyzed as a control. KNH1144 gp120 and JR-FL gp120 were each found to migrate at an apparent molecular weight of 210 kDa (see FIG. 20). These values are consistent with those found for JR-FL gp120 (10).

[0413] To further study the oligomeric nature of the KNH1144 SOSIP R6 gp140 trimer, final purified preparations were treated with Tween.RTM. 20 prior to analysis on Superdex 200 SEC to yield homogenous and unambiguous trimer samples devoid of HMW aggregate. Initial studies showed re-formation of HMW aggregate when treated trimer samples were resolved in non-detergent TN-500 buffer on the SEC column. The resulting mixed trimer-aggregate fractions, presumably re-formed upon separation of the Tween.RTM. 20 from the gp140 oligomers in non-detergent buffer, was considered unsuitable for SEC analysis due to its heterogeneous nature.

[0414] In order to maintain homogenous trimers, treated trimer was resolved in the presence of TN-500 containing 0.05% Tween.RTM. 20 (TNT-500). As shown in FIG. 16, (bottom panel BN-PAGE), the trimer (thick arrow) migrated from fractions B10 through C2, represented in the major peak, with its peak signal at fraction B12 (vertical arrow). The retention time at this fraction corresponds to an apparent calculated molecular weight of .about.518 kDa. The reported apparent molecular weight (MW) of JR-FL SOSIP gp140 trimer calculated via Superdex 200 SEC analysis is .about.520 kDa (9); and thus, the calculated apparent MW value for KNH1144 SOSIP R6 gp140 trimer is consistent with MW values of other SOSIP envelope trimers.

Effect of Tween.RTM. 20 Treatment on KNN1144 SOSIP R6 Antigenicity:

[0415] Studies of the antigenic properties of unpurified KNH1144 SOSIP R6 gp140 (19) showed that it was immunoprecipitated by the neutralizing molecules 2G12, b12, CD4-IgG2, as well as the non-neutralizing mAb b6. The experiments described herein further assessed the effect of the Tween.RTM. 20 aggregate collapse on the antigenic properties of KNH1144 SOSIP HMW aggregates to determine if conversion of HMW aggregate into trimer favorably enhanced antigenicity.

[0416] SEC fractions containing 80% KNH1144 SOSIP R6 HMW aggregate content (as shown in FIG. 15D, - lane) were either untreated or Tween.RTM. 20 treated (typical reaction is represented in FIG. 15D). The antigenicity of the proteins in the presence and absence of Tween.RTM. 20 was examined using a lectin based ELISA. These results are shown in FIG. 18A. All the anti-env antibodies and CD4-IgG2, displayed increased binding to the Tween.RTM. 20 treated aggregate. The above experiments were performed on Tween.RTM. 20 converted trimer, using preps containing >80% HMW aggregate.

[0417] To demonstrate that Tween.RTM. 20 treatment did not unfavorably disrupt the above antibody epitopes on trimers, similar lectin ELISAs were performed using 2G12, b6, b12 and CD4-IgG2 on SOSIP R6 gp140 trimers that contained low amounts of HMW aggregate (.about.10-15% content) that were either untreated or treated with Tween.RTM. 20. As shown in FIG. 18B, no significant differences were observed in the antigenicity of trimer in presence or absence of Tween.RTM. 20. Unfortunately, since the HMW aggregate species is present in very limiting quantities, the Tween.RTM. 20 effect was assessed using only the above mentioned mAbs. These results show that Tween.RTM. 20 treatment and consequential conversion of HMW aggregate to resulting trimer enhances epitope exposure for Env binding antibodies. Thus Tween.RTM. 20 treatment and presence may offer favorable consequences in the context of KNH1144 SOSIP R6 gp140 trimer stability and antibody epitope exposure.

Effect of Tween.RTM. 20 Treatment on the Ionic Properties of KNH1144 SOSIP R6 gp140 Trimer:

[0418] DEAE anion exchange chromatography was used to examine the effect of Tween.RTM. 20 on the ionic properties of SOSIP R6 gp140 and control proteins. Untreated or Tween.RTM. 20 treated KNH1144 SOSIP R6 gp140 trimer spiked with a.sub.2M contaminating protein (which is unaffected by Tween.RTM. 20 and binds to anion exchange resins) were applied over DEAE anion exchange column (FIG. 18, Load). The column was washed and eluted and fractions were analyzed via BN-PAGE and Coomassie staining and is shown in FIG. 18. As expected, untreated SOSIP R6 gp140 trimer and the a.sub.2M contaminant bound to the DEAE column and were recovered in the elution fraction (FIG. 18, Untreated control, top panel, denoted by asterisks). However, upon treatment with Tween.RTM. 20, the KNH1144 SOSIP R6 gp140 trimer was found in the flow-through (FT) fractions of the column (FIG. 18, Tween.RTM. 20 treated, bottom panel, FT, denoted by asterisks), indicating that it did not bind to the DEAE, unlike the untreated trimer. Residual trimer is further recovered in the wash fraction (FIG. 18, wash). In contrast, the a.sub.2M contaminant, which was used as the internal control, bound to the DEAE column and was recovered in the elution, indicating that it was unaffected by the presence of Tween.RTM. 20 (FIG. 18, Tween.RTM. 20 treated, bottom panel, Elution).

[0419] In other similar experiments, in which BSA, another acidic protein was substituted as the contaminant, similar results were obtained. This indicates that Tween.RTM. 20 treatment may exert its action on KNH1144 SOSIP R6 HMW aggregate and trimer through a combination of hydrophobic interactions that possibly involve perturbations in inter- and/or intra-subunit charge-charge interactions, as examined by DEAE anion exchange chromatography.

Electron Microscopy and Digital Imaging of KNH1144 SOSIP R6 gp140 Trimers:

[0420] Electron microscopy was performed on purified SOSIP R6 preparations employing negative stain EM analysis. The results, shown in FIG. 19, reveal that the majority of the observed structures displayed a regular compact morphology with approximate three-fold symmetry. This tri-lobed configuration is most apparent in preparations with deeper stain (FIG. 19; panel of trimers) that are less subject to the flattening that can occur in thinner staining preparations.

[0421] Initially, for the EM studies, it was found that the uranyl formate negative straining technique was not compatible with detergent-containing buffers. However, some trimeric structures of the anticipated dimensions were observed in the poorly staining preparations. Thereafter, the KNH1144 SOSIP preparation was subjected to a detergent removal protocol, which yielded improved staining. Following detergent removal, the majority of the observed structures displayed a regular compact morphology with approximate three-fold symmetry (e.g., FIG. 19). This configuration is most apparent in preparations with deeper stain that are lees subject to the flattening that can occur in thinner staining preparations.

[0422] In order to calculate diameters of the trimers, 70 spikes in the shallow stain samples were scored and a diameter of 13.5.+-.1.73 nm was calculated. Seventy eight (78) trimers from the deep stain were scored and resulted in a diameter of 11.6 nm.+-.1.75 nm. The shallow stain preparation likely gives a slight overestimation of the size and the deep stain preparation gives a slightly underestimated size. Therefore, the true size is likely to be 12.6.+-.1.74 nm (i.e., and in line with authentic Env spikes measured in situ on both negatively stained, as well as unstained, cryo-EM preparations of SIV (36, 37). Thus the biophysical EM analysis of KNH1144 SOSIP R6 gp140 is in good agreement with the above biochemical data and confirms the oligomeric status of the purified KNH1144 env complex as being trimeric.

Discussion

[0423] In the context of identifying and pursuing a variety of HIV-1 Env-based protein vaccines, described herein is the purification and characterization of a novel subtype A KNH1144 trimeric envelope spike protein and its properties. Several novel insights were gained as a result of these studies, which revealed the biochemical effects of Tween.RTM. 20 on the oligomeric conformations of the KNH1144 SOSIP R6 proteins. Until the present invention, only one subtype B envelope, HIV-1 JR-FL has been manipulated to a purified form to mimic as closely as possible the native trimeric structure of the HIV-1 viral surface envelope complex via the SOSIP technology (8-11, 15-17). The present invention provides another clade, clade A KNH1144, for which the SOSIP technology results in purified trimeric envelopes that are stable, soluble, and fully cleaved.

[0424] The purification process implemented according to the present invention for the KNH1144 SOSIP trimers provides a marked improvement over that utilized for JR-FL SOSIP gp140 trimers. For the KNH1144 SOSIP, the GNA lectin column provided a significant enrichment of gp140 proteins, but elution off the column significantly destabilized the gp140 trimers, resulting in a compromise of trimer fidelity on the column. As a result, significant dissociation of the trimer to resulting dimer and monomer was noticed. This destabilization could be brought about from Galanthus Nivalis lectin binding to 1-3 and 1-6 mannose linkages on the gp140 high mannose chains, which are internal linkages and not terminal linkages (20). During elution, the affinity of the lectin for the mannan is likely much higher than the intersubunit protein-protein affinities of the 3 gp120-gp41.sub.ECTO monomers contributing to trimer formation, resulting in destabilization and dissociation into component dimers and monomers. To alleviate some measure of the destabilization that could be caused due to resulting sheer stresses during elution, a one hour incubation in MMP eluting buffer was included. So while a highly enriching step, the lectin affinity column also decreased the final yield of trimer significantly, due to its dissociation during the elution phase.

[0425] The next step in the purification, Superdex 200 SEC, while somewhat efficient in resolving away monomer, was not very effective in resolution of dimer from trimer. The incorporation of a DEAE weak anion exchange chromatography step was very efficient in resolving dimer (and residual monomer) away from trimer, resulting in trimeric KNH1144 SOSIP R6 gp140 of high purity. Notably, binding (and retention) of the trimer occurred under a relatively polar environment (vis-a-vis ion exchange) at 75 mM NaCl, while dimer and monomer flowed through the DEAE column under these conditions.

[0426] It is relevant to extrapolate from its behavior on anion exchange chromatography that the nature of the KNH1144 SOSIP R6 g140 trimer is that of an acidic protein, which would be contrary to its predicted basic isoelectric point (pI) of 8.73 calculated for the protein backbone. However, the likely presence of the predicted acidic sialylated complex oligosaccharide chains on the gp140 (21, 22) would contribute to a decrease in the overall charge of the glycoprotein and thus confer on it properties of an acidic protein. Indeed, analysis of purified KNH1144 SOSIP R6 gp140 trimers on isoelectric focusing gels reveal it to migrate at a pI range of 5.9 to 6.1, consistent with the above observations.

[0427] The purified trimer was shown to contain variable amounts of HMW aggregate (FIG. 14, right panel, BN-PAGE), which could not be attributed to being formed at any one particular step of the purification, although one possibility might be at the lectin elution step. As mentioned before, one of the key improvements made in this purification protocol is absence of SDS-insoluble aggregates in the final prep, which are formed by abberantly formed disulfide bonds and are visualized by their slow migration on a non-reduced SDS-PAGE. As detected by Coomassie staining and confirmed by anti-envelope Western blot, little to no SDS-insoluble aggregates were observed (FIG. 14, left and middle panels, Non-Red SDS-PAGE and Anti-Env blot). This is in contrast to what was observed with JR-FL SOSIP gp140 (R6 and non-R6 versions), where SDS-insoluble aggregates comprised a significant percentage of the final preparations (9, 10, 11).

[0428] Based on observations regarding non-ionic detergent treatments of KNH1144 SOSIP R6 gp140 trimers (19), Tween.RTM. 20 was used to address the co-purifying HMW aggregate present in the final trimer preparations. Tween.RTM. 20 was chosen because initial observations had shown that Tween.RTM. 20 treatment was mild and did not result in any detectable monomer formation, unlike treatment with the other non-ionic detergents NP-40 and Triton X-100, where diners and monomers were observed upon treatment (19). Tween.RTM. 20 treatment of the final purified KNH1144 SOSIP R6 trimer preparation was highly reproducible and resulted in the "conversion" of the HMW aggregate species, as shown in FIG. 14 (right panel, BN-PAGE). Since this resulted in a single, homogenous, oligomeric species of KNH1144 SOSIP R6 gp140 trimers, we routinely incorporated it as the final step in our preparations. Further analysis using reduced SDS-PAGE gels showed that the purified trimer was fully cleaved, with practically undetectable uncleaved protein (as visualized by both Coomassie staining and Western blot analysis) (FIG. 14, left panel, Red SDS-PAGE). The initial purifications were performed using a non-R6 version of KNH1144 SOSIP gp140, which resulted in .about.40-50% of uncleaved protein in the final preparation, prompting the development of the R6 version. This also represents another improvement over JR-FL SOSIP R6 gp140 trimers, where cleavage of gp120-gp41.sub.ECTO was not as efficient (9, 11).

[0429] In order to expand the initial Tween.RTM. 20 observations to the stability of HMW aggregates, a variety of experiments were performed to characterize the effect of Tween.RTM. 20 and to better understand its mechanism of action. As shown in FIG. 15, the effect of Tween.RTM. 20 is dose dependent, time dependent and temperature independent within the parameters that were examined. Its effect is remarkably specific to KNH1144 SOSIP R6 HMW aggregate and trimers and has no effect on gp120 monomers, or KNH1144 SOSIP R6 dimers. In addition, other similar large, macromolecular, acidic proteins such as a.sub.2M are not affected by the detergent. Initially, the hypothesis was that the Tween.RTM. 20 specifically interacted with points of gp120-gp41.sub.ECTO intersubunit contact within the HMW aggregate, presumably in a hydrophobic manner. In this context, the HMW aggregate would have to be comprised of some multiple of trimer (most likely a dimer of trimers), since detergent treatment specifically results in a "rearrangement" to a trimeric configuration. The specificity of this reaction can further be defined by the observation that dimeric KNH1144 SOSIP R6 gp140 proteins are unaffected and do not undergo the collapse (FIG. 15D). In addition, Tween.RTM. 20 treatment would also seem to cause the trimer to assume a more compact configuration, as evident by its slightly more rapid mobility on BN-PAGE (FIG. 14).

[0430] While the anti-flocculatory effects of non-ionic detergents on aggregates of macromolecular proteins such as antibodies (immunoglobulins, for example) are well known and documented, the mechanisms of their actions have been realized to be largely by pre-emption of unfavorable hydrophobic interactions by detergent intercalation. Tween.RTM. 20, however, would seem to exert its action in a somewhat paradoxical mechanism, since treatment of the KNH1144 SOSIP R6 gp140 trimer with the detergent renders it unable to interact with anion exchange resins such as DEAE (FIG. 18, bottom panel, Tween.RTM. 20 treated), indicating that the overall charge of the trimer was being affected by the detergent.

[0431] Since the nature of non-ionic detergents is exactly that, i.e., non-ionic, it is difficult to realize how an uncharged molecule such as Tween.RTM. 20 would affect the charge status of a large, macromolecular oligomer such as the KNH1144 SOSIP R6 trimer. Furthermore, this effect is highly specific to the trimer, as other such large, highly charged (acidic) oligomeric proteins such as a.sub.2M and even smaller ones such as BSA are unaffected by the detergent. One hypothesis that has emerged from this invention is that perhaps the Tween.RTM. 20 was "coating" the trimer in a manner that may cause perturbations in its conformation, resulting in its "compactness". These perturbations would be of a subtle nature which involve the various points of contact between the individual component gp140 monomers, causing disruption and destabilization of interactions that favor the HMW aggregate conformation. A consequence of these perturbations would be "shielding" of ionic charges that would normally be exposed (and contribute to binding to ion exchange resins). It is reasonable to speculate that perhaps the charges that are "shielded" are those on the sialic acid residues of the complex carbohydrate chains, since these would be most likely to be highly exposed at the surface (21, 22). Tween.RTM. 20 and Tween.RTM. 80 are polyoxyethylene sorbitan esters of fatty acids and thus may likely interact with the sialic acids, causing a charge "neutralization" effect. The involvement of the sialic acid residues can be investigated by mild sialidase treatment (21, 22) and removal of these residues, followed by Tween.RTM. 20 treatment, followed by monitoring of binding on ion exchange resins.

[0432] To further biochemically characterize the purified KNH1144 monomeric and trimeric envelope proteins, size exclusion chromatography analyses were performed in order to ascertain their apparent molecular masses. These were performed on Tween.RTM. 20 treated trimers that were devoid of any HMW aggregates and thus consisted of only one homogeneously oligomeric species, i.e., the trimer, and therefore would yield unambiguous results. The retention times of the KNH1144 SOSIP R6 gp140 trimer resulted in a calculated apparent molecular weight of .about.518 kDa. This is consistent with the reported calculated apparent molecular weight of 520 kDa for the other SOSIP gp140 trimer, JR-FL SOSIP gp140 (9). The predicted molecular weight for a trimer such as KNH1144 (and JR-FL) would be .about.420 kDa (3.times.140 kDa monomers). Thus, similar to JR-FL SOSIP gp140, the KNH1144 SOSIP R6 gp140 trimer also exhibits an abberant migration on SEC, presumably due to interactions of its N-linked glycans with the dextran- (agarose polymer) based matrix of Superdex 200, resulting in a higher than expected apparent molecular mass. In addition, envelope proteins have been shown to be non-globular in shape (10, 23, 24); therefore, gel filtration may not be optimal for determination of their precise molecular masses. This also extends to the KNH1144 gp120 monomer as well. Values of .about.210 kDa were obtained for KNH1144 gp120 and the control JR-FL gp120 (see FIG. 20). The reported value for JR-FL gp120 is 200 kDa (10); accordingly, the obtained values are well within the expected range (given that molecular weight determination via SEC is not extremely accurate, unlike other methodologies such as mass spectrometry). Thus, gp120, whose predicted molecular weight ranges from .about.95 to .about.120 kDa, results in an abberant migratory pattern on SEC, presumably due to its glycan interactions with the sizing column matrix. It should be noted that unlike the KNH1144 SOSIP R6 gp140 trimer, migration of KNH1144 gp120 (and JR-FL gp120) were not affected by the presence or absence of Tween.RTM. 20, consistent with the initial BN-PAGE observations (FIG. 14, right panel, gp120).

[0433] While it would seem that the presence of Tween.RTM. 20 for KNH1144 SOSIP R6 gp140 proteins would be advantageous, possible Tween.RTM. 20 effects on the antigenicity of the HMW aggregate and trimer were examined. Effects on antigenicity was examined by performing lectin ELISAs with the NAbs 2G12, b12, HIVIg, the CD4-IgG2 antibody conjugate PRO 542, as well as the non-neutralizing mAb b6, to gain information on neutralizing/non-neutralizing epitope exposure and accessibility. It was reasoned that trimer preparations containing 10-30% HMW aggregate may not undergo significant enough changes that would be detectable in a non-quantitative assay such as IPs, i.e., subtle changes (20-30% changes) may go undetected in such an assay due to sensitivity. However, samples representing extremes may undergo significantly high changes that should be detectable in an assay format such as ELISA. Therefore, SEC fractions that contained 80% HMW aggregate were used, which would reflect one extreme prior to Tween.RTM. 20 treatment and the resulting trimer, which would reflect the other extreme poet treatment. A representative reaction of this is illustrated in FIG. 15D.

[0434] As shown in FIG. 17A, significant epitope exposures were observed upon Tween.RTM. 20 rearrangement of the HMW aggregate to trimer, and these changes were noticed for all of the anti-env agents. These changes indeed were not as apparent in trimer preparations that were predominantly trimer, with low aggregate content (10-15%) (FIG. 21H). Thus the treated, purified trimer displays antigenic properties similar to that which was previously observed with crude, unpurified trimer supernatants, i.e., binding to 2G12, b6, b12 and PRO 542 (19). In the context of HIVIg, which is a low neutralizing polyclonal human antisera directed against gp120 hypervariable loop (40), it can be inferred that this epitope is accessible on the surface of the HMW aggregate, based on its ability to bind the antibody in absence of Tween.RTM. 20. Consistent with the other anti-Env agents examined here, HIVIg epitope exposure also significantly increased on the rearranged trimer, upon treatment with Tween.RTM. 20. The likely explanation to these increases in epitope exposure is that "disruption/rearrangement" of the aggregate and its subsequent conversion to trimer unshields the above mentioned surfaces and thus, upon conversion, these surfaces are now exposed on their individual trimers and are accessible to the antibodies. From the context of a single HMW aggregate which is likely to be a multimer of trimers, only a small portion of these epitopes are accessible, most probably due to steric hindrance from adjacently "clumped" SOSIP R6 trimers/oligomers. When the single HMW aggregate is then Tween.RTM. 20 converted to resulting trimers, antibody epitopes are now exposed on every one of the resulting individual component trimers, resulting in an increase in antibody accessibility and binding. Thus Tween.RTM. 20 treatment and its conversion of the, aggregate to trimer do not seem to have detrimental effects on antigenicity and may be favorable to the structural properties of the KNH1144 SOSIP R6 gp140 proteins.

[0435] Analysis of KNH1144 SOSIP R6 gp140 proteins by negative stain EM further confirmed the biochemical observations that these gp140 proteins were indeed trimeric in nature (FIG. 19). A distinguishing feature of the KNH1144 SOSIP R6 construct, in comparison to other similar constructs of trimerized gp120 and gp140, is its compact nature. Most other constructs show either predominantly loosely associated subunits or a mix of loosely and tightly associated subunits (5, 18, 38). The observation that the KNH1144 SOSIP R6 trimer is compact is associated with anti-Env antibody epitope availability. EM on Tween.RTM.-treated trimer which has favorable anti-Env epitope exposure was performed. It is somewhat incongruous from a purely steric standpoint that a "compact" trimer would also have improved epitope exposure, a consequence expected from a "loose" or "elongated" structure. Immunoelectron microscopy analyses with the above mentioned antibodies will further address the exposure of epitopes on trimeric forms.

[0436] The present invention expands the panel of trimeric HIV-1 envelope proteins that may be used as protein-based HIV-1 vaccine candidates or serve as a template for future design of Env based protein vaccine candidates, using the SOSIP technology. Immunological studies in rabbits with JR-FL SOSIP R6 gp140 trimers, while effective in eliciting NAbs, were limited in their breadth of neutralization of primary HIV-1 isolates (11). Factors associated with the biochemical nature of the JR-FL SOSIP gp140 and other oligomeric Env proteins that are thought to limit their observed immunological response in animals, such as inefficient furin cleavage of the gp120-gp41.sub.ECTO cleavage site giving rise to heterogenous trimers (containing both cleaved and uncleaved trimers), presence of SDS-insoluble aggregates and presence of undesirable gp140 oligomers such as dimers and monomers (5, 6, 9, 10, 11, 27-30) have been issues needing resolution.

[0437] The description of the KNH1144 SOSIP R6 gp140 trimers of the present invention addresses moat of these issues. Furthermore, the description of the Tween.RTM. 20 affects on converting HMW aggregates to trimeric forms further expands on current knowledge of the aggregate species in HIV-1 biology. Of significance, it was shown for the first time, that oligomeric Env protein complexes designed using the SOSIP technology platform are indeed trimeric from EM images and that the trimers are of a similar diameter as native spikes on the HIV-1 virion (36, 37). Expansion of the panel of potential HIV-1 SOSIP protein vaccine candidates by development of a clade A envelope according to this invention now allows for immunological evaluation of the KNH1144 SOSIP R6 gp140 trimer in small animals, for example. Such evaluations will assist in determining the efficacy of KNH1144 SOSIP R6 gp140 trimers as immunogens capable of eliciting broadly neutralizing immune responses directed against HIV-1.

REFERENCES

[0438] 1. UNAIDS: AIDS Epidemic Update: December 2005, ISBN 92 9 173439X. [0439] 2. Kijak, G. H., and McCutchan, F. E. (2005) Curr Infect Dis Rep. 7, 480-488. [0440] 3. Jeffs, S. A., Goriup, S., Kebble, B., Crane, D., Bolgiano, B., Sattentau, Q., Jones, S., and Holmes, H. (2004) Vaccine. 22, 1032-1046. [0441] 4. Srivastava, I. K., Stamatatos, L., Legg, H., Kan, E., Fong, A., Coates, S. R., Leung, L., Wininger, M., Donnelly, J. J., Ulmer, J. B., and Barnett, S. W. (2002) J Virol. 76, 2835-2847. [0442] 5. Srivastava, I. K., Stamatatos, L., Kan, E., Vajdy, M., Lian, Y., Hilt, S., Martin, L., Vita, C., Zhu, P., Roux, K. H., Vojtech, L., Montefiori, D. C., Donnelly, J., Ulmer, J. B., and Barnett, S. W. (2003) J Virol. 77, 11244-11259. [0443] 6. Yang, X., Florin, L., Farzan, M., Kolchinsky, P., Kwong, P. D., Sodroski, J., and Wyatt, R. (2000) J Virol. 74, 4746-4754. [0444] 7. Grundner, C., Li, Y., Louder, M., Mascola, J., Yang, X., Sodroski, J., and Wyatt R. (2005) Virology 331, 33-46. [0445] 8. Binley, J. M., Sanders, R. W., Clas, B., Schuelke, N., Master, A., Guo, Y., Kajumo, F., Anselma, D. J., Maddon, P. J., Olson, W. C., and Moore, J. P. (2000) J Virol. 74, 627-643. [0446] 9. Sanders, R. W., Vesanen, M., Schuelke, N., Master, A., Schiffner, L., Kalyanaraman, R., Berkhout, B., Maddon, P. J., Olson, W. C., Lu, M., and Moore, J. P. (2002) J Virol. 76, 8875-8889. [0447] 10. Schulke, N., Vesanen, M. S., Sanders, R. W., Zhu, P., Lu, M., Anaemia, D. J., Villa, A. R., Parren, P. W., Binley, J. M., Roux, K. H., Maddon, P. J., Moore, J. P., and Olson, W. C. (2002) J. Virol. 76, 7760-7776. [0448] 11. Beddows, S., Schulke, N., Kirschner, M., Barnes, K., Franti, M., Michael, E., Ketas, T., Sanders, R. W., Maddon, P. J., Olson, W. C., and Moore, J. P. (2005) J Virol. 79, 8812-8827. [0449] 12. Emini, E. A. and Koff, W. C. (2004) Science 304, 1913-1914. [0450] 13. Poignard, P., Saphire, E. O., Parren, P. W., and Burton, D. R. (2001) Annu Rev Immunol. 19, 253-274. [0451] 14. Wyatt, R., and Sodroski, J. (1998) Science 280, 1884-1888. [0452] 15. Sanders, R. W., Schiffner, L., Master, A., Kajumo, F., Guo, Y., Dragic, T., Moore, J. P., and Binley, J. M. (2000) J Virol. 74, 5091-5100. [0453] 16. Binley, J. M., Sanders, R. W., Master, A., Cayanan, C. S., Wiley, C. L., Schiffner, L., Travis, B., Kuhmann, S., Burton, D. R., Hu, S. L., Olson, W. C., and Moore, J. P. (2002) J Virol. 76, 2606-2616. [0454] 17. Binley, J. M., Cayanan, C. S., Wiley, C., Schulke, N., Olson, W. C., and Burton, D. R. (2003) J Virol. 77, 5678-5684. [0455] 18. Pancera, M., Lebowitz, J., Schon, A., Zhu, P., Freire, E., Kwong, P. D., Roux, K. H., Sodroski, J., and Wyatt, R. (2005) J Virol. 79, 9954-9969. [0456] 19. Beddows, S., Kirschner, M., Campbell-Gardener, L., Franti, M., Dey, A, K., Iyer, S, N., Paluch, M., Master, A., Overbaugh, J., VanCOtt, T., Olson, W. C., and Moore, J. P. (2006) AIDS Res Hum Retroviruses (in press). [0457] 20. Botos, I., and Wlodawer, A. (2005) Prog Biophys Mol Biol. 88, 233-282. [0458] 21. Scanlan, C. N., Pantophlet, R., Wormald, M. R., Ollmann, Saphire, E., Stanfield, R., Wilson, I. A., Katinger, H., Dwek, R. A., Rudd, P. M., and Burton, D. R. (2002) J. Virol. 76, 7306-7321. [0459] 22. Sanders, R. W., Venturi, M., Schiffner, L., Kalyanaraman, R., Katinger, H., Lloyd, K. O., Kwong, P. D., and Moore, J. P. (2002) J Virol. 76, 7293-7305. [0460] 23. Center, R. J., Earl, P. L., Lebowitz, J., Schuck, P., and Moss, B. (2000) J Virol. 74, 4448-4455. [0461] 24. Center, R. J., Schuck, P., Leapman, R. D., Arthur, L. O., Earl, P. L., Moss, B., and Lebowitz, J. (2001) Proc Natl Aced Sci USA. 98, 14877-14882. [0462] 25. Trkola, A., Pomales, A. B., Yuan, H., Korber, B., Maddon, P. J., Allaway, G. P., Katinger, H., Barbas, C. F. 3rd, Burton, D. R., Ho, D. D., and Moore, J. P. (1995) J Virol. 69, 6609-6617. [0463] 26, Trkola, A., Purtscher, M., Muster, T., Ballaun, C., Buchacher, A., Sullivan, N., Srinivasan, K., Sodroski, J., Moore, J. P., and Katinger, H. (1996) J Virol. 70, 1100-1108. [0464] 27. Center, R. J., Lebowitz, J., Leapman, R. D., and Moss, B. (2004) J Virol. 78, 2265-2276. [0465] 28. Chakrabarti, B. K., Kong, W. P., Wu, B. Y., Yang, Z. Y., Friborg, J., Ling, X., King, S. R., Montefiori, D. C., and Nabel, G. J. (2002) J Virol. 76, 5357-5368. [0466] 29. Zhang, C. W., Chishti, Y., Hussey, R. E., and Reinherz, E. L. (2001) J Biol Chem. 276, 39577-39585. [0467] 30. Yang, X., Lee, J., Mahony, E. M., Kwong, P. D., Wyatt, R., and Sodroski, J. (2002) J Virol. 76, 4634-4642. [0468] 31. Lien, Y., Srivastava, I., Gomez-Roman, V. R., Zur Megede, J., Sun, Y., Kan, E., Hilt, S., Engelbrecht, S., Himathongkham, S., Luciw, P. A., Otten, G., Ulmer, J. B., Donnelly, J. J., Rabussay, D., Montefiori, D., van Rensburg, E. J., and Barnett, S. W. (2005) J Virol. 79, 13338-13349. [0469] 32. Roben, P., Moore, J. P., Thali, M., Sodroski, J., Barbas, C. F. 3rd., and Burton, D. R. (1994) J Virol. 68, 4821-4828. [0470] 33. Burton, D. R., Pyati, J., Koduri, R., Sharp, S. J., Thornton, G. B., Parren, P. W. H. I., Sawyer, L. S. W., Hendry, R. M., Dunlop, N., Nara, P. L., Lamacchia, M., Garratty, G., Stiehm, E. R., Bryson, Y. J., Cao, Y., Moore, J. P., Ho, D. D., and Barbas, C. F. 3rd. (1994) Science 266, 1024-1027. [0471] 34. Roux, K. H. (1989) Methods Enzymol. 178, 130-144. [0472] 35. Roux, K. H. (1996) Methods 10, 247-256. [0473] 36. Zhu, P., Chertova, E., Bess, J., Jr., Lifson, J. D., Arthur, L. O., Liu, J., Taylor, K. A., and Roux, K. H. (2003) Proc Natl Aced Sci USA. 100, 15812-15817. [0474] 37. Qiao, Z. S., Kim, M., Reinhold, B., Montefiori, D., Wang, J. H., and Reinherz, E. L. (2005) J Biol Chem. 280, 23138-23146. [0475] 38. Olson, W. C., and Maddon, P. J. (2003) Curr Drug Targets Infect Disord. 3, 255-262.

Experimental Details IV:

[0476] According to the present invention, the gp41/gp120 trimeric conformation can be stabilized by one or more of the following changes in the gp120 and gp41 sequences: [0477] (1) specific, targeted amino acid sequence changes in the N-terminal region of the gp41 subunit that stabilize the gp120-gp41 trimeric conformation; [0478] (2) an isoleucine to proline substitution at a position equivalent to KNH1144 position 559 (1559P) in the N-terminal heptad region of gp41 ectodomain to promote association between gp41-gp41 association; and [0479] (3) inter-subunit disulfide bonds (SOS) between gp120 and gp41.

[0480] Several molecular determinants of enhanced trimer stability are described herein.

[0481] Many examples of nucleotide and amino acids for gp160 sequences are available, for example, in the database provided by the National Center for Biotechnology Information (NCBI) (see http://www.ncbi.nlm.nih.gov/).

[0482] One example of a gp160 glycoprotein sequence is that of the HIV-1 KNH1144 isolate. A sequence for the KNH1144 gp160 is available at NCBI accession number AAW72237 (gi: 58374202); a nucleotide sequence encoding this gp160 protein is available at accession number AY736812 (gi: 58374201). See website at ncbi.nlm.nih.gov. The amino acid sequence for this KNH1144 gp160 protein is provided below (SEQ ID NO:5).

TABLE-US-00005 1 MIVMGTQRNY QHLLRWGTMI LGLIIICSAA DNLWVTVYYG 41 VPVWKDAETT LFCASDAKAY ETEKHNVWAT HACVPTDPNP 81 QEIPLENVTE EFNMWKNKMV EQMHTDIISL WDQSLQPCVK 121 LTPLCVTLNC TDATNGTIGN ITDEMKGEIK NCSFNITTEI 161 RDKKQKVYSL FYRLDVVPIE PDSSNSSRNS SEYRLINCNT 201 SAITQACPKV SFEPIPIHYC APAGFAILKC RDKEFNGTGK 241 CKNVSTVQCT HGIKPVVSTQ LLLNGSLAEG EVRIRSENIT 281 NNAKTIIVQL VEPVRINCTR PNNNTRESVR IGPGQAFFAT 321 GDIIGDIRQA HCNVSRSQWN KTLQQVAAQL GEHFKNKAIT 361 FNSSSGGDLE ITTHSFNCGG EFFYCNTSGL FNSTWKANNG 401 TWKANISESN NTEITLQCRI KQIINMWQRT GQAIYAPPIQ 441 GVIRCESNIT GLLLTRDGGE GNNESEIFRP GGGDMRDNWR 481 SELYKYKVVK IEPLGVAPTR ARRRVVGREK RAVGIGAVFL 521 GFLGAAGSTM GAASITLTVQ ARQLLSGIVQ QQSNLLRAIE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICTTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0483] According to the invention, the KNH1144 HIV gp160 protein gives rise to modified gp120 and gp41 polypeptides that have improved gp41/gp120 trimer stability relative gp41/gp120 trimers from other HIV strains. Such stability is due in part to five amino acids differences between the KNH1144 HIV gp160 protein and other HIV gp160 proteins. These five amino acid differences are found at amino acid positions 535, 543, 553, 567 and 588 of the KNH1144 amino acid sequence. Thus, the modified, stabilized KNH1144 HIV gp160 protein comprises isoleucine at position 535 (I535), glutamine at position 543 (Q543), serine at position 553 (S553), lysine at position 567 (K567) and arginine at position 588 (R588). These "stabilizing" amino acids are highlighted and underlined in the KNH1144 HIV gp160 sequence shown above. Of the foregoing five amino acid residues, Q543, S553 and K567 have the greatest effect when introduced in combination. I535 and R588 make an additional minor contribution. All five of the amino acid residues may be included in an HIV isolate for the production of stable trimers. Alternatively, Q543, S553 and K567 are included, while I535 and R588 may be optionally included, for stabilization in modified HIV-1 isolates. The introduction of these changes did not impair the exposure of various neutralizing antibody epitopes on the resulting gp140 proteins, suggesting the overall antigenic structure of the trimer is not adversely affected.

[0484] As provided by the present invention, stabilized gp41/gp120 trimers are formed by modifying an HIV isolate to contain isoleucine at position 535, glutamine at position 543, serine at position 553, and lysine at position 567 and/or arginine at position 588 in any HIV gp160 or gp41 polypeptide.

[0485] Moreover, according to the invention, a gp41 protein has improved stability if a proline is used at an amino acid position equivalent to amino acid position 559, for example of the below KNH1144 gp160 polypeptide. The KNH1144 gp160 polypeptide typically has isoleucine instead of proline at position 559. The sequence of the 1559P mutant polypeptide of the KNH1144 gp160 protein is provided below (SEQ ID NO:6).

TABLE-US-00006 1 MIVMGTQRNY QHLLRWGTMI LGLIIICSAA DNLWVTVYYG 41 VPVWKDAETT LFCASDAKAY ETEKHNVWAT HACVPTDPNP 81 QEIPLENVTE EFNMWKNKMV EQMHTDIISL WDQSLQPCVK 121 LTPLCVTLNC TDATNGTIGN ITDEMKGEIK NCSFNITTEI 161 RDKKQKVYSL FYRLDVVPIE PDSSNSSRNS SEYRLINCNT 201 SAITQACPKV SFEPIPIHYC APAGFAILKC RDKEFNGTGK 241 CKNVSTVQCT HGIKPVVSTQ LLLNGSLAEG EVRIRSENIT 281 NNAKTIIVQL VEPVRINCTR PNNNTRESVR IGPGQAFFAT 321 GDIIGDIRQA HCNVSRSQWN KTLQQVAAQL GEHFKNKAIT 361 FNSSSGGDLE ITTHSFNCGG EFFYCNTSGL FNSTWKANNG 401 TWKANISESN NTEITLQCRI KQIINMWQRT GQAIYAPPIQ 441 GVIRCESNIT GLLLTRDGGE GNNESEIFRP GGGDMRDNWR 481 SELYKYKVVK IEPLGVAPTR ARRRVVGREK RAVGIGAVFL 521 GFLGAAGSTM GAASITLTVQ ARQLLSGIVQ QQSNLLRAPE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICTTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0486] In addition, in some embodiments, a KNH1144 gp41 protein has improved stability if methionine is used at position 535 instead of isoleucine. The sequence of this I535M mutant of the KNH1144 gp160 protein is provided below (SEQ ID NO:7).

TABLE-US-00007 1 MIVMGTQRNY QHLLRWGTMI LGLIIICSAA DNLWVTVYYG 41 VPVWKDAETT LFCASDAKAY ETEKHNVWAT HACVPTDPNP 81 QEIPLENVTE EFNMWKNKMV EQMHTDIISL WDQSLQPCVK 121 LTPLCVTLNC TDATNGTIGN ITDEMKGEIK NCSFNITTEI 161 RDKKQKVYSL FYRLDVVPIE PDSSNSSRNS SEYRLINCNT 201 SAITQACPKV SFEPIPIHYC APAGFAILKC RDKEFNGTGK 241 CKNVSTVQCT HGIKPVVSTQ LLLNGSLAEG EVRIRSENIT 281 NNAKTIIVQL VEPVRINCTR PNNNTRESVR IGPGQAFFAT 321 GDIIGDIRQA HCNVSRSQWN KTLQQVAAQL GEHFKNKAIT 361 FNSSSGGDLE ITTHSFNCGG EFFYCNTSGL FNSTWKANNG 401 TWKANISESN NTEITLQCRI KQIINMWQRT GQAIYAPPIQ 441 GVIRCESNIT GLLLTRDGGE GNNESEIFRP GGGDMRDNWR 481 SELYKYKVVK IEPLGVAPTR ARRRVVGREK RAVGIGAVFL 521 GFLGAAGSTM GAASMTLTVQ ARQLLSGIVQ QQSNLLRAIE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICTTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0487] Additionally, methionine can be used in any HIV gp160 or gp41 glycoprotein to replace a non-methionine amino acid at an amino acid position equivalent to position 535 of the KNH1144 gp160 protein to stabilize the HIV gp160 or gp41. In addition, the I535M mutation can be used in combination with any of the other mutations or amino acid substitutions contemplated herein. Thus, for example, the I535M mutation can be combined with the I559P mutation described above (see SEQ ID NO:6) to generate the following mutant KNH1144 gp160 protein (SEQ ID NO:8):

TABLE-US-00008 1 MIVMGTQRNY QHLLRWGTMI LGLIIICSAA DNLWVTVYYG 41 VPVWKDAETT LFCASDAKAY ETEKHNVWAT HACVPTDPNP 81 QEIPLENVTE EFNMWKNKMV EQMHTDIISL WDQSLQPCVK 121 LTPLCVTLNC TDATNGTIGN ITDEMKGEIK NCSFNITTEI 161 RDKKQKVYSL FYRLDVVPIE PDSSNSSRNS SEYRLINCNT 201 SAITQACPKV SFEPIPIHYC APAGFAILKC RDKEFNGTGK 241 CKNVSTVQCT HGIKPVVSTQ LLLNGSLAEG EVRIRSENIT 281 NNAKTIIVQL VEPVRINCTR PNNNTRESVR IGPGQAFFAT 321 GDIIGDIRQA HCNVSRSQWN KTLQQVAAQL GEHFKNKAIT 361 FNSSSGGDLE ITTHSFNCGG EFFYCNTSGL FNSTWKANNG 401 TWKANISESN NTEITLQCRI KQIINMWQRT GQAIYAPPIQ 441 GVIRCESNIT GLLLTRDGGE GNNESEIFRP GGGDMRDNWR 481 SELYKYKVVK IEPLGVAPTR ARRRVVGREK RAVGIGAVFL 521 GFLGAAGSTM GAASMTLTVQ ARQLLSGIVQ QQSNLLRAPE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICTTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0488] Another example of a gp160 sequence is that of the HIV-1 JR-FL isolate. The JR-FL gp160 amino acid sequence is described as NCBI accession number AAB05604 (gi: 1465781); a nucleotide sequence for this gp160 protein is available at accession number U63632 (gi: 1465777). See website at ncbi.nlm.nih.gov. The amino acid sequence for this JR-FL gp160 protein is provided below (SEQ ID NO:9).

TABLE-US-00009 1 MRVKGIRKSY QYLWKGGTLL LGILMICSAV EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVVLENVTE HFNMWKNNMV EQMQEDIISL WDQSLKPCVK 121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE IKNCSFNITT 161 SIRDEVQKEY ALFYKLDVVP IDNNNTSYRL ISCDTSVITQ 201 ACPKISFEPI PIHYCAPAGF AILKCNDKTF NGKGPCKNVS 241 TVQCTHGIRP VVSTQLLLNG SLAEEEVVIR SDNFTNNAKT 281 IIVQLKESVE INCTRPNNNT RKSIHIGPGR AFYTTGEIIG 321 DIRQAHCNIS RAKWNDTLKQ IVIKLREQFE NKTIVFNHSS 361 GGDPEIVMHS FNCGGEFFYC NSTQLFNSTW NNNTEGSNNT 401 EGNTITLPCR IKQIINMWQE VGKAMYAPPI RGQIRCSSNI 441 TGLLLTRDGG INENGTEIFR PGGGDMRDNW RSELYKYKVV 481 KIEPLGVAPT KAKRRVVQRE KRAVGIGAVF LGFLGAAGST 521 MGAASMTLTV QARLLLSGIV QQQNNLLRAI EAQQRMLQLT 561 VWGIKQLQAR VLAVERYLGD QQLLGIWGCS GKLICTTAVP 601 WNASWSNKSL DRIWNNMTWM EWEREIDNYT SEIYTLIEES 641 QNQQEKNEQE LLELDKWASL WNWFDITKWL WYIKIFIMIV 681 GGLVGLRLVF TVLSIVNRVR QGYSPLSFQT LLPAPRGPDR 721 PEGIEEEGGE RDRDRSGRLV NGFLALIWVD LRSLCLFSYH 761 RLRDLLLTVT RIVELLGRRG WEVLKYWWNL LQYWSQELKN 801 SAVSLLNATA IAVAEGTDRI IEALQRTYRA ILHIPTRIRQ 841 GLERALL

[0489] According to the invention, the amino acid sequence for this JR-FL gp160 protein can also have a proline instead of an isoleucine at an amino acid position equivalent to the position of isoleucine at amino acid position 559 in the KNH1144 gp160 protein. This mutant JR-FL gp160 protein is provided below (SEQ ID NO:10).

TABLE-US-00010 1 MRVKGIRKSY QYLWKGGTLL LGILMICSAV EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVVLENVTE HFNMWKNNMV EQMQEDIISL WDQSLKPCVK 121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE IKNCSFNITT 161 SIRDEVQKEY ALFYKLDVVP IDNNNTSYRL ISCDTSVITQ 201 ACPKISFEPI PIHYCAPAGF AILKCNDKTF NGKGPCKNVS 241 TVQCTHGIRP VVSTQLLLNG SLAEEEVVIR SDNFTNNAKT 281 IIVQLKESVE INCTRPNNNT RKSIHIGPGR AFYTTGEIIG 321 DIRQAHCNIS RAKWNDTLKQ IVIKLREQFE NKTIVFNHSS 361 GGDPEIVMHS FNCGGEFFYC NSTQLFNSTW NNNTEGSNNT 401 EGNTITLPCR IKQIINMWQE VGKAMYAPPI RGQIRCSSNI 441 TGLLLTRDGG INENGTEIFR PGGGDMRDNW RSELYKYKVV 481 KIEPLGVAPT KAKRRVVQRE KRAVGIGAVF LGFLGAAGST 521 MGAASMTLTV QARLLLSGIV QQQNNLLRAP EAQQRMLQLT 561 VWGIKQLQAR VLAVERYLGD QQLLGIWGCS GKLICTTAVP 601 WNASWSNKSL DRIWNNMTWM EWEREIDNYT SEIYTLIEES 641 QNQQEKNEQE LLELDKWASL WNWFDITKWL WYIKIFIMIV 681 GGLVGLRLVF TVLSIVNRVR QGYSPLSFQT LLPAPRGPDR 721 PEGIEEEGGE RDRDRSGRLV NGFLALIWVD LRSLCLFSYH 761 RLRDLLLTVT RIVELLGRRG WEVLKYWWNL LQYWSQELKN 801 SAVSLLNATA IAVAEGTDRI IEALQRTYRA ILHIPTRIRQ 841 GLERALL

[0490] As provided by the invention, the amino acid sequence for this JR-FL gp160 protein can also have isoleucine at a position equivalent to position 535 of the KNH1144 gp160 protein, glutamine at a position equivalent position 543 of the KNH1144 gp160 protein, serine at position 553 of the KNH1144 gp160 protein, lysine at position 567 of the KNH1144 gp160 protein and arginine at position 588 of the KNH1144 gp160 protein, as well as a proline at a position equivalent to the position of the KNH1144 gp160 protein. This mutant JR-FL gp160 protein is provided below (SEQ ID NO:11).

TABLE-US-00011 1 MRVKGIRKSY QYLWKGGTLL LGILMICSAV EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVVLENVTE HFNMWKNNMV EQMQEDIISL WDQSLKPCVK 121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE IKNCSFNITT 161 SIRDEVQKEY ALFYKLDVVP IDNNNTSYRL ISCDTSVITQ 201 ACPKISFEPI PIHYCAPAGF AILKCNDKTF NGKGPCKNVS 241 TVQCTHGIRP VVSTQLLLNG SLAEEEVVIR SDNFTNNAKT 281 IIVQLKESVE INCTRPNNNT RKSIHIGPGR AFYTTGEIIG 321 DIRQAHCNIS RAKWNDTLKQ IVIKLREQFE NKTIVFNHSS 361 GGDPEIVMHS FNCGGEFFYC NSTQLFNSTW NNNTEGSNNT 401 EGNTITLPCR IKQIINMWQE VGKAMYAPPI RGQIRCSSNI 441 TGLLLTRDGG INENGTEIFR PGGGDMRDNW RSELYKYKVV 481 KIEPLGVAPT KAKRRVVQRE KRAVGIGAVF LGFLGAAGST 521 MGAASITLTV QARQLLSGIV QQQSNLLRAP EAQQRMLKLT 561 VWGIKQLQAR VLAVERYLRD QQLLGIWGCS GKLICTTAVP 601 WNASWSNKSL DRIWNNMTWM EWEREIDNYT SEIYTLIEES 641 QNQQEKNEQE LLELDKWASL WNWFDITKWL WYIKIFIMIV 681 GGLVGLRLVF TVLSIVNRVR QGYSPLSFQT LLPAPRGPDR 721 PEGIEEEGGE RDRDRSGRLV NGFLALIWVD LRSLCLFSYH 761 RLRDLLLTVT RIVELLGRRG WEVLKYWWNL LQYWSQELKN 801 SAVSLLNATA IAVAEGTDRI IEALQRTYRA ILHIPTRIRQ 841 GLERALL

[0491] Another example of a sequence for gp160 is the HIV-1 Ba-L gp160 amino acid sequence at NCBI accession number AAT67504 (gi: 49617617); a nucleotide sequence for this gp160 protein is available at accession number AY669732 (gi: 49617616). See website at ncbi.nlm.nih.gov. The amino acid sequence for the HIV-1 Ba-L gp160 protein is provided below (SEQ ID NO:12):

TABLE-US-00012 1 MRVTEIRKSY QHWWRWGIML LGXLMICNAE EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVXXXNVTE NFNMWKNNMV EQMHEDILSL WDQSLKPCVK 121 LTPLCVTLNC TDLRNATXXN XTXTTSSSRG MVGGGEXKNC 161 SFNITTXIRG KVQKEYALFY ELDIVPIDNX IDRYRLISCN 201 TSVITQACPK VSFEPIPIHY CAPAGFAILK CKDKKFNGKG 241 PCXNVSTVQC THGIRPVVST QLLLNGSLAE EEVVIRSXNF 281 XBNAKXIIVQ LNESVEINCT RPNNNTRKSI HIGPGRAFYT 321 TGEIIGDIRQ AHCNLSRAKW NDTLNKIVXK LREQFGNKTI 361 VFKHSSGGDP EIVTHSFNCG GEFFYCNSTQ LFNSTWNVTE 401 ESNNTVENNT ITLPCRIKQI INMWQXVGRA MYAPPIRGQI 441 RCSSNITGLL LTRDGGPEDN KTEVFRPGGG DMRDNWRSEL 481 YKYKVVKIEP LGVAPTKAKR RVVQREKRAV GIGAVFLGFL 521 GAAGSTMGAA SMTLTVQARL LLSGIVQQQN NLLRAIEAQQ 561 HLLQLTVWGI KQLQARVLAV ERYLRDQQLL GIWGCSGKLI 601 CTTAVPWNAS WSNKSLNKIW DNMTWMEWDR EINNYTSIIY 641 SLIEESQNQQ EKNEQELLEL DKWASLWNWF DITXWLWYIK 681 IFIMIVGGLI GLRIVFSVLS IVNRVRQGYS PLSFQTHLPA 721 SRGPDRPGGI EEEGGERDRD RSGPLVNGFL XLIWVDLRSL 761 XLFSYHRLRD LLLIVTRIVE LLGRRGWEVL KYWWXLLQYW 801 SQELKNSAVS LLNXXAXAVA EGTDRVIEVX QRAVRAILHI 841 PRRIRQGLER ALL

[0492] According to the invention, the amino acid sequence for this Ba-L gp160 protein can also have a proline instead of an isoleucine at an amino acid position equivalent to the amino acid position of the specified isoleucine in the KNH1144 gp160 protein. Such a modified mutant Ba-L gp160 protein is provided below (SEQ ID NO:13):

TABLE-US-00013 1 MRVTEIRKSY QHWWRWGIML LGXLMICNAE EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVXXXNVTE NFNMWKNNMV EQMHEDIISL WDQSLKPCVK 121 LTPLCVTLNC TDLRNATXXN XTXTTSSSRG MVGGGEXKNC 161 SFNITTXIRG KVQKEYALFY ELDIVPIDNX IDRYRLISCN 201 TSVITQACPK VSFEPIPIHY CAPAGFAILK CKDKKFNGKG 241 PCXNVSTVQC THGIRPVVST QLLLNGSLAE EEVVIRSXNF 281 XBNAKXIIVQ LNESVEINCT RPNNNTRKSI HIGPGRAFYT 321 TGEIIGDIRQ AHCNLSRAKW NDTLNKIVXK LREQFGNKTI 361 VFKHSSGGDP EIVTHSFNCG GEFFYCNSTQ LFNSTWNVTE 401 ESNNTVENNT ITLPCRIKQI INMWQXVGRA MYAPPIRGQI 441 RCSSNITGLL LTRDGGPEDN KTEVFRPGGG DMRDNWRSEL 481 YKYKVVKIEP LGVAPTKAKR RVVQREKRAV GIGAVFLGFL 521 GAAGSTMGAA SMTLTVQARL LLSGIVQQQN NLLRAPEAQQ 561 HLLQLTVWGI KQLQARVLAV ERYLRDQQLL GIWGCSGKLI 601 CTTAVPWNAS WSNKSLNKIW DNMTWMEWDR EINNYTSIIY 641 SLIEESQNQQ EKNEQELLEL DKWASLWNWF DITXWLWYIK 681 IFIMIVGGLI GLRIVFSVLS IVNRVRQGYS PLSFQTHLPA 721 SRGPDRPGGI EEEGGERDRD RSGPLVNGFL XLIWVDLRSL 761 XLFSYHRLRD LLLIVTRIVE LLGRRGWEVL KYWWXLLQYW 801 SQELKNSAVS LLNXXAXAVA EGTDRVIEVX QRAVRAILHI 841 PRRIRQGLER ALL

[0493] Additionally, the amino acid sequence for this Ba-L gp160 protein can have isoleucine at a position equivalent to position 535 of the KNH1144 gp160 protein, glutamine at a position equivalent position 543 of the KNH1144 gp160 protein, serine at position 553 of the KNH1144 gp160 protein, lysine at position 567 of the KNH1144 gp160 protein and arginine at position 588 of the KNH1144 gp160 protein, as well as a proline at a position equivalent to the position of the specified isoleucine in the KNH1144 gp160 protein. Such a modified Ba-L gp160 protein is provided below (SEQ ID NO:14):

TABLE-US-00014 1 MRVTEIRKSY QHWWRWGIML LGXLMICNAE EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT RACVPTDPNP 81 QEVXXXNVTE NFNMWKNNMV EQMHEDIISL WDQSLKPCVK 121 LTPLCVTLNC TDLRNATXXN XTXTTSSSRG MVGGGEXKNC 161 SFNITTXIRG KVQKEYALFY ELDIVPIDNX IDRYRLISCN 201 TSVITQACPK VSFEPIPIHY CAPAGFAILK CKDKKFNGKG 241 PCXNVSTVQC THGIRPVVST QLLLNGSLAE EEVVIRSXNF 281 XBNAKXIIVQ LNESVEINCT RPNNNTRKSI HIGPGRAFYT 321 TGEIIGDIRQ AHCNLSRAKW NDTLNKIVXK LREQFGNKTI 361 VFKHSSGGDP EIVTHSFNCG GEFFYCNSTQ LFNSTWNVTE 401 ESNNTVENNT ITLPCRIKQI INMWQXVGRA MYAPPIRGQI 441 RCSSNITGLL LTRDGGPEDN KTEVFRPGGG DMRDNWRSEL 481 YKYKVVKIEP LGVAPTKAKR RVVQREKRAV GIGAVFLGFL 521 GAAGSTMGAA SITLTVQARQ LLSGIVQQQS NLLRAPEAQQ 561 HLLKLTVWGI KQLQARVLAV ERYLRDQQLL GIWGCSGKLI 601 CTTAVPWNAS WSNKSLNKIW DNMTWMEWDR EINNYTSIIY 641 SLIEESQNQQ EKNEQELLEL DKWASLWNWF DITXWLWYIK 681 IFIMIVGGLI GLRIVFSVLS IVNRVRQGYS PLSFQTHLPA 721 SRGPDRPGGI EEEGGERDRD RSGPLVNGFL XLIWVDLRSL 761 XLFSYHRLRD LLLIVTRIVE LLGRRGWEVL KYWWXLLQYW 801 SQELKNSAVS LLNXXAXAVA EGTDRVIEVX QRAVRAILHI 841 PRRIRQGLER ALL

[0494] Another example of a sequence for gp160 is the amino acid sequence at NCBI accession number AAA76668 (gi: 665491); a nucleotide sequence for this gp160 protein is available at accession number U12032 (gi: 665490). See website at ncbi.nlm.nih.gov. The amino acid sequence for this gp160 protein is provided below (SEQ ID NO:15):

TABLE-US-00015 1 MRVKEKYQHL RRWGWRWGTM LLGMLMICSA TEKLWVTVYY 41 GVPVWKEATT TLFCASDAKA YDTEVHNVWA THACVPTDPN 81 PQEVVLVNVT ENFNMWKNDM VEQMHEDIIS LWDQSLKPCV 121 KLTPLCVSLK CTDLKNDTNT NSSSGGMIME KGEIKNCSFN 161 ISTSIRGKVQ KEYAFFYKLD IIPIDNDTTS YTLTSCNTSV 201 ITQACPKVSF EPIPIHYCAP AGFAILKCNN KTFNGTGPCT 241 NVSTVQCTHG IRPVVSTQLL LNGSLAEEEV VIRSANFTDN 281 VKTIIVQLNQ SVEINCTKPN NNTGKRIRIQ RGPGRTFVTI 321 GKIGNMRQAH CNISRAKWNN TLKQIASKLR EQYGNNKTII 361 FKQSSGGDLE IVTHSFNCGG EFFYCNSTQL FNSTWFNSTG 401 SNNTEGSDTI TLPCRIKQII NMWQEVGKAM YAPPISGQIR 441 CSSNITGLLL TRDGGNNNNG SEIFRPGGGD MRDNWRSELY 481 KYKVVKIEPL GVAPTKAKRR VVQREKRAVG IGALFLGFLG 521 AAGSTMGAAS MTLTVQARQL LSGIVQQQNN LLRAIEAQQH 561 LLQLTVWGIK QLQARILAVE RYLKDQQLLG IWGCSGKLIC 601 TTAVPWNASW SNKSLERIWN HTTWMEWDRE INNYTSLIHS 641 LIEESQNQQE KNEQELLELD KWASLWNWFN ITNWLWYVKI 681 FIMIVGGLVG LRIVFAVLSI VNRVRQGYSP LSFQTHLPTP 721 GGPDRPEGIE EEGGERDRDR SIRLVNGS

[0495] According to the invention, the amino acid sequence for this gp160 protein can also be modified to include a proline instead of an isoleucine at an amino acid position equivalent to the amino acid position of the specified isoleucine in the KNH1144 gp160 protein. This mutant gp160 protein is provided below (SEQ ID NO:16):

TABLE-US-00016 1 MRVKEKYQHL RRWGWRWGTM LLGMLMICSA TEKLWVTVYY 41 GVPVWKEATT TLFCASDAKA YDTEVHNVWA THACVPTDPN 81 PQEVVLVNVT ENFNMWKNDM VEQMHEDIIS LWDQSLKPCV 121 KLTPLCVSLK CTDLKNDTNT NSSSGGMIME KGEIKNCSFN 161 ISTSIRGKVQ KEYAFFYKLD IIPIDNDTTS YTLTSCNTSV 201 ITQACPKVSF EPIPIHYCAP AGFAILKCNN KTFNGTGPCT 241 NVSTVQCTHG IRPVVSTQLL LNGSLAEEEV VIRSANFTDN 281 VKTIIVQLNQ SVEINCTKPN NNTGKRIRIQ RGPGRTFVTI 321 GKIGNMRQAH CNISRAKWNN TLKQIASKLR EQYGNNKTII 361 FKQSSGGDLE IVTHSFNCGG EFFYCNSTQL FNSTWFNSTG 401 SNNTEGSDTI TLPCRIKQII NMWQEVGKAM YAPPISGQIR 441 CSSNITGLLL TRDGGNNNNG SEIFRPGGGD MRDNWRSELY 481 KYKVVKIEPL GVAPTKAKRR VVQREKRAVG IGALFLGFLG 521 AAGSTMGAAS MTLTVQARQL LSGIVQQQNN LLRAPEAQQH 561 LLQLTVWGIK QLQARILAVE RYLKDQQLLG IWGCSGKLIC 601 TTAVPWNASW SNKSLERIWN HTTWMEWDRE INNYTSLIHS 641 LIEESQNQQE KNEQELLELD KWASLWNWFN ITNWLWYVKI 681 FIMIVGGLVG LRIVFAVLSI VNRVRQGYSP LSFQTHLPTP 721 GGPDRPEGIE EEGGERDRDR SIRLVNGS

[0496] Further, as provided by the invention, the amino acid sequence for this gp160 protein can also have isoleucine at a position equivalent to position 535 of the KNH1144 gp160 protein, glutamine at a position equivalent position 543 of the KNH1144 gp160 protein, serine at position 553 of the KNH1144 gp160 protein, lysine at position 567 of the KNH1144 gp160 protein and arginine at position 588 of the KNH1144 gp160 protein, as well as a proline at a position equivalent to the position of the specified isoleucine in the KNH1144 gp160 protein. Such a modified gp160 protein is provided below (SEQ ID NO:17):

TABLE-US-00017 1 MRVKEKYQHL RRWGWRWGTM LLGMLMICSA TEKLWVTVYY 41 GVPVWKEATT TLFCASDAKA YDTEVHNVWA THACVPTDPN 81 PQEVVLVNVT ENFNMWKNDM VEQMHEDIIS LWDQSLKPCV 121 KLTPLCVSLK CTDLKNDTNT NSSSGGMIME KGEIKNCSFN 161 ISTSIRGKVQ KEYAFFYKLD IIPIDNDTTS YTLTSCNTSV 201 ITQACPKVSF EPIPIHYCAP AGFAILKCNN KTFNGTGPCT 241 NVSTVQCTHG IRPVVSTQLL LNGSLAEEEV VIRSANFTDN 281 VKTIIVQLNQ SVEINCTKPN NNTGKRIRIQ RGPGRTFVTI 321 GKIGNMRQAH CNISRAKWNN TLKQIASKLR EQYGNNKTII 361 FKQSSGGDLE IVTHSFNCGG EFFYCNSTQL FNSTWFNSTG 401 SNNTEGSDTI TLPCRIKQII NMWQEVGKAM YAPPISGQIR 441 CSSNITGLLL TRDGGNNNNG SEIFRPGGGD MRDNWRSELY 481 KYKVVKIEPL GVAPTKAKRR VVQREKRAVG IGALFLGFLG 521 AAGSTMGAAS ITLTVQARQL LSGIVQQQSN LLRAPEAQQH 561 LLKLTVWGIK QLQARILAVE RYLRDQQLLG IWGCSGKLIC 601 TTAVPWNASW SNKSLERIWN HTTWMEWDRE INNYTSLIHS 641 LIEESQNQQE KNEQELLELD KWASLWNWFN ITNWLWYVKI 681 FIMIVGGLVG LRIVFAVLSI VNRVRQGYSP LSFQTHLPTP 721 GGPDRPEGIE EEGGERDRDR SIRLVNGS

[0497] Another example of an amino acid sequence for a HIV gp160 protein is available in the NCBI database at accession number AAA76666 (gi: 665487); the nucleotide sequence for this HIV gp160 protein can be found at accession number U12030 (gi: 665486). See website at ncbi.nlm.nih.gov. Many more sequences for HIV gp160 are available, for example, at the ncbi.nlm.nih.gov website.

[0498] The gp120 protein derived from the gp160 precursor directs target-cell recognition and viral tropism through interaction with the cell-surface receptor CD4 and one of several co-receptors that are members of the chemokine receptor family. (Broder, C. C. et al., Pathobiology. 64:171-179 (1996); D'Souza, M. P. at al., Nature Medicine. 2:1293-1300 (1996); Wilkinson, D., Current Biology. 6:1051-1053 (1996)). The membrane-spanning gp41 subunit then promotes fusion of the viral and cellular membranes, a process that results in the release of viral contents into the host cell.

[0499] Binding of gp120/gp41 complexes to cellular receptors (e.g., CD4 and a chemokine receptor such as CCR5 or CXCR4) triggers a series of structural rearrangements in the envelope glycoprotein. A transient species arises, termed the prehairpin intermediate, in which gp41 exists as a membrane protein simultaneously in both the viral and cellular membranes. This extended gp41 prehairpin intermediate ultimately collapses into a trimer-of-hairpins structure that provides sufficient tension to drive membrane fusion. The core of the HIV-1 trimer-of-hairpins is a bundle of six .alpha.-helices from three gp41 ectodomains. Three .alpha.-helices derived from the N-terminal HR1 regions form a central, trimeric coiled coil, around which three .alpha.-helices derived from the C-terminal HR2 regions pack in an anti-parallel manner into hydrophobic grooves on the surface of the coiled coil. Thus, formation of the timer-of-hairpins structure is believed to bring the membranes into close apposition necessary for the fusion event.

[0500] The gp120 and gp41 envelope glycoproteins can, of course, have a variety of sequences, depending upon the strain, clade, or type of HIV. For example, the KNH1144 gp41 protein can have the following sequence (SEQ ID NO:18):

TABLE-US-00018 508 REK RAVGIGAVFL 521 GFLGAAGSTM GAASITLTVQ ARQLLSGIVQ QQSNLLRAIE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICTTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0501] A modified KNH1144 gp41 protein can include a proline rather than an isoleucine at position 559, as indicated in the following sequence (SEQ ID NO:19):

TABLE-US-00019 508 REK RAVGIGAVFL 521 GFLGAAGSTM GAASITLTVQ ARQLLSGIVQ QQSNLLRAPE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICTTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0502] In some embodiments, a KNH1144 gp41 protein has improved stability if an isoleucine is used at position 535 instead of a methionine residue. The amino acid sequence of this M535I mutant of the KNH1144 gp41 protein is provided below (SEQ ID NO:20).

TABLE-US-00020 REK RAVGIGAVFL 521 GFLGAAGSTM GAASITLTVQ ARQLLSGIVQ QQSNLLRAIE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICTTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0503] Additionally, the M535I mutation can be included in combination with any of the other mutations or amino acid substitutions contemplated herein. Thus, for example, the M535I mutation can be combined with the I559P mutation described above (see SEQ ID NO:19) to generate the following modified or mutant KNH1144 gp41 protein (SEQ ID NO:21):

TABLE-US-00021 REK RAVGIGAVFL 521 GFLGAAGSTM GAASITLTVQ ARQLLSGIVQ QQSNLLRAPE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICTTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 ANLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0504] Another example of a gp41 amino acid is that of the HIV-1 JR-FL isolate. The amino acid sequence for the HIV-1 JR-FL gp41 protein is provided below (SEQ ID NO:22):

TABLE-US-00022 499 RE KRAVGIGAVF LGFLGAAGST 521 MGAASMTLTV QARLLLSGIV QQQNNLLRAI EAQQRMLQLT 561 VWGIKQLQAR VLAVERYLGD QQLLGIWGCS GKLICTTAVP 601 WNASWSNKSL DRIWNNMTWM EWEREIDNYT SEIYTLIEES 641 QNQQEKNEQE LLELDKWASL WNWFDITKWL WYIKIFIMIV 681 GGLVGLRLVF TVLSIVNRVR QGYSPLSFQT LLPAPRGPDR 721 PEGIEEEGGE RDRDRSGRLV NGFLALIWVD LRSLCLFSYH 761 RLRDLLLTVT RIVELLGRRG WEVLKYWWNL LQYWSQELKN 801 SAVSLLNATA IAVAEGTDRI IEALQRTYRA ILHIPTRIRQ 841 GLERALL

[0505] According to the invention, the amino acid sequence for the JR-FL gp41 protein may also include a proline instead of an isoleucine at an amino acid position equivalent to amino acid position 559 of the KNH1144 gp41 protein. This modified or mutant JR-FL gp41 protein is provided below (SEQ ID NO:23):

TABLE-US-00023 499 RE KRAVGIGAVF LGFLGAAGST 521 MGAASMTLTV QARLLLSGIV QQQNNLLRAP EAQQRMLQLT 561 VWGIKQLQAR VLAVERYLGD QQLLGIWGCS GKLICTTAVP 601 WNASWSNKSL DRIWNNMTWM EWEREIDNYT SEIYTLIEES 641 QNQQEKNEQE LLELDKWASL WNWFDITKWL WYIKIFIMIV 681 GGLVGLRLVF TVLSIVNRVR QGYSPLSFQT LLPAPRGPDR 721 PEGIEEEGGE RDRDRSGRLV NGFLALIWVD LRSLCLFSYH 761 RLRDLLLTVT RIVELLGRRG WEVLKYWWNL LQYWSQELKN 801 SAVSLLNATA IAVAEGTDRI IEALQRTYRA ILHIPTRIRQ 841 GLERALL

[0506] As further provided by the invention, the amino acid sequence for the modified HIV-1 JR-FL gp41 protein may also include isoleucine at an amino acid position equivalent to amino acid position 535 of the KNH1144 gp160 protein, glutamine at an amino acid position equivalent amino acid position 543 of the KNH1144 gp160 protein, serine at an amino acid position equivalent to amino acid position 553 of the KNH1144 gp160 protein, lysine at an amino acid position equivalent to amino acid position 567 of the KNH1144 gp160 protein and arginine at an amino acid position equivalent to amino acid position 588 of the KNH1144 gp160 protein, as well as proline at an amino acid position equivalent to amino acid position 559 of the KNH1144 gp160 protein. This modified or mutant JR-FL gp41 protein is provided below (SEQ ID NO:24):

TABLE-US-00024 RE KRAVGIGAVF LGFLGAAGST 521 MGAASITLTV QARQLLSGIV QQQSNLLRAP EAQQRMLKLT 561 VWGIKQLQAR VLAVERYLRD QQLLGIWGCS GKLICTTAVP 601 WNASWSNKSL DRIWNNMTWM EWEREIDNYT SEIYTLIEES 641 QNQQEKNEQE LLELDKWASL WNWFDITKWL WYIKIFIMIV 681 GGLVGLRLVF TVLSIVNRVR QGYSPLSFQT LLPAPRGPDR 721 PEGIEEEGGE RDRDRSGRLV NGFLALIWVD LRSLCLFSYH 761 RLRDLLLTVT RIVELLGRRG WEVLKYWWNL LQYWSQELKN 801 SAVSLLNATA IAVAEGTDRI IEALQRTYRA ILHIPTRIRQ 841 GLERALL

[0507] Another example of a sequence for gp41 is the HIV-1 Ba-L gp41 amino acid sequence. The amino acid sequence for the HIV-1 Ba-L gp41 protein is provided below (SEQ ID NO:25):

TABLE-US-00025 505 REKRAV GIGAVFLGFL 521 GAAGSTMGAA SMTLTVQARL LLSGIVQQQN NLLRAIEAQQ 561 HLLQLTVWGI KQLQARVLAV ERYLRDQQLL GIWGCSGKLI 601 CTTAVPWNAS WSNKSLNKIW DNMTWMEWDR EINNYTSIIY 641 SLIEESQNQQ EKNEQELLEL DKWASLWNWF DITXWLWYIK 681 IFIMIVGGLI GLRIVFSVLS IVNRVRQGYS PLSFQTHLPA 721 SRGPDRPGGI EEEGGERDRD RSGPLVNGFL XLIWVDLRSL 761 XLFSYHRLRD LLLIVTRIVE LLGRRGWEVL KYWWXLLQYW 801 SQELKNSAVS LLNXXAXAVA EGTDRVIEVX QRAVRAILHI 841 PRRIRQGLER ALL

[0508] According to the invention, the amino acid sequence for a modified Ba-L gp41 protein may include proline instead of an isoleucine at an amino acid position equivalent to the position of the proline amino acid in the KNH1144 gp41 protein. This mutant Ba-L gp41 protein is provided below (SEQ ID NO:26):

TABLE-US-00026 505 REKRAV GIGAVFLGFL 521 GAAGSTMGAA SMTLTVQARL LLSGIVQQQN NLLRAPEAQQ 561 HLLQLTVWGI KQLQARVLAV ERYLRDQQLL GIWGCSGKLI 601 CTTAVPWNAS WSNKSLNKIW DNMTWMEWDR EINNYTSIIY 641 SLIEESQNQQ EKNEQELLEL DKWASLWNWF DITXWLWYIK 681 IFIMIVGGLI GLRIVFSVLS IVNRVRQGYS PLSFQTHLPA 721 SRGPDRPGGI EEEGGERDRD RSGPLVNGFL XLIWVDLRSL 761 XLFSYHRLRD LLLIVTRIVE LLGRRGWEVL KYWWXLLQYW 801 SQELKNSAVS LLNXXAXAVA EGTDRVIEVX QRAVRAILHI 841 PRRIRQGLER ALL

[0509] As provided by the invention, the amino acid sequence for the modified HIV-1 Ba-L gp41 protein can also include isoleucine at an amino acid position equivalent to amino acid position 535 of the KNH1144 gp160 protein, glutamine, at an amino acid position equivalent to amino acid position 543 of the KNH1144 gp160 protein, serine at an amino acid position equivalent to amino acid position 553 of the KNH1144 gp160 protein, lysine at an amino acid position equivalent to amino acid position 567 of the KNH1144 gp160 protein and arginine at an amino acid position equivalent to amino acid position 588 of the KNH1144 gp160 protein, as well as a proline at a position equivalent to the position of the specified isoleucine in the KNH1144 gp160 protein. Such a mutant or modified Ba-L gp41 protein is provided below (SEQ ID NO:27):

TABLE-US-00027 REKRAV GIGAVFLGFL 521 GAAGSTMGAA SITLTVQARQ LLSGIVQQQS NLLRAPEAQQ 561 HLLKLTVWGI KQLQARVLAV ERYLRDQQLL GIWGCSGKLI 601 CTTAVPWNAS WSNKSLNKIW DNMTWMEWDR EINNYTSIIY 641 SLIEESQNQQ EKNEQELLEL DKWASLWNWF DITXWLWYIK 681 IFIMIVGGLI GLRIVFSVLS IVNRVRQGYS PLSFQTHLPA 721 SRGPDRPGGI EEEGGERDRD RSGPLVNGFL XLIWVDLRSL 761 XLFSYHRLRD LLLIVTRIVE LLGRAGWEVL KYWWXLLQYW 801 SQELKNSAVS LLNXXAXAVA EGTDRVIEVX QRAVRAILHI 841 PRRIRQGLER ALL

[0510] Another example of a sequence for the envelope gp41 glycoprotein is the amino acid sequence at accession number 521998 (gi: 94245). See website at ncbi.nlm.nih.gov. The amino acid sequence for this gp41 protein is provided below (SEQ ID NO:28):

TABLE-US-00028 1 KAKRRVVQRE KRAVGMGAAF FLGFLGAAGS TMGAASLTLT 41 VQARLLLSGI VQQQNNLLRA IEAHEHLLQL TVWGIKQLQA 81 RILAVERYLK DQQLLGIWGC SGKLICTTTV PWNASWSNKS 121 LDKIWNNMTW MEWDREINNY TSLIYTLIEQ SQNQQEKNEQ 161 ELLELDKWAS LWNWFDITQW LWYIKIFIMI VGGLIGLRIV 201 FTVLSIVNRV RQGYSPLSFQ TRRPARRGPD RPEGIEEEGG 241 ERDRDRSGRL VNGFLALIWD DLRSLCLFSY HRLRDLLLIV 281 TRIVELLGRR GWEVLKYLWN LLQYWSQELK NSAVSLLNAT 321 AIAVAEGTDR VIELLQRAFR AILHIPRRXR QGLERALL

[0511] The gp41 polypeptide of accession number S21998 (gi: 94245) can also have a proline instead of an isoleucine at a position equivalent to that of the KNH1144 gp41. Such a mutant gp41 protein has the following sequence (SEQ ID NO:29):

TABLE-US-00029 1 KAKRRVVQRE KRAVGMGAAF FLGFLGAAGS TMGAASLTLT 41 VQARLLLSGI VQQQNNLLRA PEAHEHLLQL TVWGIKQLQA 81 RILAVERYLK DQQLLGIWGC SGKLICTTTV PWNASWSNKS 121 LDKIWNNMTW MEWDREINNY TSLIYTLIEQ SQNQQEKNEQ 161 ELLELDKWAS LWNWFDITQW LWYIKIFIMI VGGLIGLRIV 201 FTVLSIVNRV RQGYSPLSFQ TRRPARRGPD RPEGIEEEGG 241 ERDRDRSGRL VNGFLALIWD DLRSLCLFSY HRLRDLLLIV 281 TRIVELLGRR GWEVLKYLWN LLQYWSQELK NSAVSLLNAT 321 AIAVAEGTDR VIELLQRAFR AILHIPRRXR QGLERALL

[0512] Moreover, as provided by the invention, the amino acid sequence for the gp41 protein having accession number S21998 (gi: 94245) can also be modified to contain isoleucine at an amino acid position equivalent to amino acid position 535 of the KNH1144 gp160 protein, glutamine at an amino acid position equivalent to amino acid position 543 of the KNH1144 gp160 protein, serine at an amino acid position equivalent to amino acid position 553 of the KNH1144 gp160 protein, lysine at an amino acid position equivalent to amino acid position 567 of the KNH1144 gp160 protein and arginine at an amino acid position equivalent to amino acid position 588 of the KNH1144 gp160 protein, as well as a proline at an amino acid position equivalent to the 559 position of the specified isoleucine in the KNH1144 gp160 protein. This modified or mutant. gp41 protein is provided below (SEQ ID NO:30):

TABLE-US-00030 1 KAKRRVVQRE KRAVGMGAAF FLGFLGAAGS TMGAASITLT 41 VQARQLLSGI VQQQSNLLRA PEAHEHLLKL TVWGIKQLQA 81 RILAVERYLR DQQLLGIWGC SGKLICTTTV PWNASWSNKS 121 LDKIWNNMTW MEWDREINNY TSLIYTLIEQ SQNQQEKNEQ 161 ELLELDKWAS LWNWFDITQW LWYIKIFIMI VGGLIGLRIV 201 FTVLSIVNRV RQGYSPLSFQ TRRPARRGPD RPEGIEEEGG 241 ERDRDRSGRL VNGFLALIWD DLRSICLFSY HRLRDLLLIV 281 TRIVELLGRR GWEVLKYLWN LLQYWSQELK NSAVSLLNAT 321 AIAVAEGTDR VIELLQRAFR AILHIPRRXR QGLERALL

[0513] As would be appreciated by the skilled practitioner, many more sequences for HIV gp41 polypeptides are available, for example, at the ncbi.nlm.nih.gov website.

[0514] According to the invention, in addition to any of the foregoing amino acid changes or substitutions, at least one intermolecular disulfide bond can also be placed between the gp41 and gp120 proteins of the HIV-1 strains. The one or more disulfide bonds are generated by placement of cysteine residues at selected locations in the gp41 and gp120 proteins. Thus, for example, in the gp160 glycoprotein, one. cysteine can be placed at any of positions 470 to 505 and another cysteine can be placed at any of positions 570 to 620.

[0515] For example, cysteine residues can be placed at positions 492 and 596 in the HIV-1 JR-FL gp160 amino acid sequence (NCBI accession number AAB05604; gi: 1465781). The amino acid sequence for this A492C and T596C double mutant JR-FL gp160 protein is provided below (SEQ ID NO:31):

TABLE-US-00031 1 MRVKGIRKSY QYLWKGGTLL LGILMICSAV EKLWVTVYYG 41 VPVWREATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVVLENVTE HFNMWKNNMV EQMQEDIISL WDQSLKPCVK 121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE IKNCSFNITT 161 SIRDEVQKEY ALFYKLDVVP IDNNNTSYRL ISCDTSVITQ 201 ACPKISFEPI PIHYCAPAGF AILKCNDKTF NGKGPCKNVS 241 TVQCTHGIRP VVSTQLLLNG SLAEEEVVIR SDNFTNNAKT 281 IIVQLKESVE INCTRPNNNT RKSIHIGPGR AFYTTGEIIG 321 DIRQAHCNIS RAKWNDTLKQ IVIKLREQFE NKTIVFNHSS 361 GGDPEIVMHS FNCGGEFFYC NSTQLFNSTW NNNTEGSNNT 401 EGNTITLPCR IKQIINMWQE VGKAMYAPPI RGQIRCSSNI 441 TGLLLTRDGG INENGTEIFR PGGGDMRDNW RSELYKYKVV 481 KIEPLGVAPT KCKRRVVQRE KRAVGIGAVF LGFLGAAGST 521 MGAASMTLTV QARLLLSGIV QQQNNLLRAI EAQQRMLQLT 561 VWGIKQLQAR VLAVERYLGD QQLLGIWGCS GKLICCTAVP 601 WNASWSNKSL DRIWNNMTWM EWEREIDNYT SEIYTLIEES 641 QNQQEKNEQE LLELDKWASL WNWFDITKWL WYIKIFIMIV 681 GGLVGLRLVF TVLSIVNRVR QGYSPLSFQT LLPAPRGPDR 721 PEGIEEEGGE RDRDRSGRLV NGFLALIWVD LRSLCLFSYH 761 RLRDLLLTVT RIVELLGRRG WEVLKYWWNL LQYWSQELKN 801 SAVSLLNATA IAVAEGTDRI IEALQRTYRA ILHIPTRIRQ 841 GLERALL

[0516] Thus, after cleavage of the JR-FL gp160 glycoprotein, a gp120 glycoprotein with a cysteine instead of an alanine at position 492 has the following sequence (SEQ ID NO:32):

TABLE-US-00032 1 MRVKGIRKSY QYLWKGGTLL LGILMICSAV EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVVLENVTE HFNMWKNNMV EQMQEDIISL WDQSLKPCVK 121 LTPLCVTLNC KDVNATNTTN DSEGTMERGE IKNCSFNITT 161 SIRDEVQKEY ALFYKLDVVP IDNNNTSYRL ISCDTSVITQ 201 ACPKISFEPI PIHYCAPAGF AILKCNDKTF NGKGPCKNVS 241 TVQCTHGIRP VVSTQLLLNG SLAEEEVVIR SDNFTNNAKT 281 IIVQLKESVE INCTRPNNNT RKSIHIGPGR AFYTTGEIIG 321 DIRQAHCNIS RAKWNDTLKQ IVIKLREQFE NKTIVFNHSS 361 GGDPEIVMHS FNCGGEFFYC NSTQLFNSTW NNNTEGSNNT 401 EGNTITLPCR IKQIINMWQE VGKAMYAPPI RGQIRCSSNI 441 TGLLLTRDGG INENGTEIFR PGGGDMRDNW RSELYKYKVV 481 KIEPLGVAPT KCKRRVVQ

[0517] Similarly, after cleavage of the JR-FL gp160 glycoprotein, a gp41 glycoprotein with a cysteine instead of an threonine at position 596 has the following sequence (SEQ ID NO:33):

TABLE-US-00033 RE KRAVGIGAVF LGFLGAAGST 521 MGAASMTLTV QARLLLSGIV QQQNNLLRAI EAQQRMLQLT 561 VWGIKQLQAR VLAVERYLGD QQLLGIWGCS GKLICCTAVP 601 WNASWSNKSL DRIWNNMTWM EWEREIDNYT SEIYTLIEES 641 QNQQEKNEQE LLELDKWASL WNWFDITKWL WYIKIFIMIV 681 GGLVGLRLVF TVLSIVNRVR QGYSPLSFQT LLPAPRGPDR 721 PEGIEEEGGE RDRDRSGRLV NGFLALIWVD LRSLCLFSYH 761 RLRDLLLTVT RIVELLGRRG WEVLKYWWNL LQYWSQELKN 801 SAVSLLNATA IAVAEGTDRI IEALQRTYRA ILHIPTRIRQ 841 GLERALL

[0518] Moreover, such cysteine residues can be placed in gp160, gp120 and/or gp41 polypeptides of other HIV-1 isolates at amino acid positions equivalent to the alanine 492 and threonine 596 amino acid positions of the JR-FL glycoprotein. For example, the amino acid sequence for the KNH1144 gp160 protein can be modified to contain cysteines at positions equivalent to the alanine 492 and threonine 596 positions of the JR-FL glycoprotein as provided below (SEQ ID NO:34):

TABLE-US-00034 1 MIVMGTQRNY QHLLRWGTMI LGLIIICSAA DNLWVTVYYG 41 VPVWKDAETT LFCASDAKAY ETEKHNVWAT HACVPTDPNP 81 QEIPLENVTE EFNMWKNKMV EQMHTDIISL WDQSLQPCVK 121 LTPLCVTLNC TDATNGTIGN ITDEMKGEIK NCSFNITTEI 161 RDKKQKVYSL FYRLDVVPIE PDSSNSSRNS SEYRLINCNT 201 SAITQACPKV SFEPIPIHYC APAGFAILKC RDKEFNGTGK 241 CKNVSTVQCT HGIKPVVSTQ LLLNGSLAEG EVRIRSENIT 281 NNAKTIIVQL VEPVRINCTR PNNNTRESVR IGPGQAFFAT 321 GDIIGDIRQA HCNVSRSQWN KTLQQVAAQL GEHFKNKAIT 361 FNSSSGGDLE ITTHSFNCGG EFFYCNTSGL FNSTWKANNG 401 TWKANISESN NTEITLQCRI KQIINMWQRT GQAIYAPPIQ 441 GVIRCESNIT GLLLTRDGGE GNNESEIFRP GGGDMRDNWR 481 SELYKYKVVK IEPLGVAPTR CRRRVVGREK RAVGIGAVFL 521 GFLGAAGSTM GAASITLTVQ ARQLLSGIVQ QQSNLLRAIE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICCTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0519] Thus, after cleavage of the KNH1144 gp160 glycoprotein, a gp120 glycoprotein containing a cysteine at the amino acid position equivalent to amino acid position 492 of the HIV-1 JR-FL strain has the following sequence (SEQ ID NO:35):

TABLE-US-00035 1 MIVMGTQRNY QHLLRWGTMI LGLIIICSAA DNLWVTVYYG 41 VPVWKDAETT LFCASDAKAY ETEKHNVWAT HACVPTDPNP 81 QEIPLENVTE EFNMWKNKMV EQMHTDIISL WDQSLQPCVK 121 LTPLCVTLNC TDATNGTIGN ITDEMKGEIK NCSFNITTEI 161 RDKKQKVYSL FYRLDVVPIE PDSSNSSRNS SEYRLINCNT 201 SAITQACPKV SFEPIPIHYC APAGFAILKC RDKEFNGTGK 241 CKNVSTVQCT HGIKPVVSTQ LLLNGSLAEG EVRIRSENIT 281 NNAKTIIVQL VEPVRINCTR PNNNTRESVR IGPGQAFFAT 321 GDIIGDIRQA HCNVSRSQWN KTLQQVAAQL GEHFKNKAIT 361 FNSSSGGDLE ITTHSFNCGG EFFYCNTSGL FNSTWKANNG 401 TWKANISESN NTEITLQCRI KQIINMWQRT GQAIYAPPIQ 441 GVIRCESNIT GLLLTRDGGE GNNESEIFRP GGGDMRDNWR 481 SELYKYKVVK IEPLGVAPTR CRRRVVG

[0520] Also after cleavage, a KNH1144 gp41 glycoprotein modified to contain a cysteine at the amino acid position equivalent to amino acid position 596 in the HIV-1 JR-FL isolate has the following sequence (SEQ ID NO:36):

TABLE-US-00036 REK RAVGIGAVFL 521 GFLGAAGSTM GAASITLTVQ ARQLLSGIVQ QQSNLLRAIE 561 AQQHMLKLTV WGIKQLQARV LAVERYLRDQ QLLGIWGCSG 601 KLICCTNVPW NSSWSNKSHD EIWNNMTWLQ WDKEISNYTN 641 LIYSLIEESQ NQQEKNEQDL LALDKWASLW NWFDISKWLW 681 YIKIFIMIVG GLIGLRIVFA VLAVIKRVRQ GYSPVSFQIH 721 NPNPGGLDRP GRIEEEGGEP GRGRSIRLVS GFLALAWDDL 761 RNLCLFSYHR LRDFALIVAR TVELLGHSSL KGLRLGWEGL 801 KYLWNLLVYW SQELKTSAIN LVDTIAIAVA GWTDRVIEIG 841 QGIGRAFLHI PRRIRQGLER ALL

[0521] In another example, the amino acid sequence for the Ba-L gp160 protein can be modified to contain cysteines at amino acid positions equivalent to amino acid position 492 (alanine) and amino acid position 596 (threonine) of the JR-FL glycoprotein as provided below (SEQ ID NO:37):

TABLE-US-00037 1 MRVTEIRKSY QHWWRWGIML LGXLMICNAE EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVXXXNVTE NFNMWKNNMV EQMHEDIISL WDQSLKPCVK 121 LTPLCVTLNC TDLRNATXXN XTXTTSSSRG MVGGGEXKNC 161 SFNITTXIRG KVQKEYALFY ELDIVPIDNX IDRYRLISCN 201 TSVITQACPK VSFEPIPIHY CAPAGFAILK CKDKKFNGKG 241 PCXNVSTVQC THGIRPVVST QLLLNGSLAE EEVVIRSXNF 281 XBNAKXIIVQ LNESVEINCT RPNNNTRKSI HIGPGRAFYT 321 TGEIIGDIRQ AHCNLSRAKW NDTLNKIVXK LREQFGNKTI 361 VFKHSSGGDP EIVTHSFNCG GEFFYCNSTQ LFNSTWNVTE 401 ESNNTVENNT ITLPCRIKQI INMWQXVGRA MYAPPIRGQI 441 RCSSNITGLL LTRDGGPEDN KTEVFRPGGG DMRDNWRSEL 481 YKYKVVKIEP LGVAPTKCKR RVVQREKRAV GIGAVFLGFL 521 GAAGSTMGAA SMTLTVQARL LLSGIVQQQN NLLRAIEAQQ 561 HLLQLTVWGI KQLQARVLAV ERYLRDQQLL GIWGCSGKLI 601 CCTAVPWNAS WSNKSLNKIW DNMTWMEWDR EINNYTSIIY 641 SLIEESQNQQ EKNEQELLEL DKWASLWNWF DITXWLWYIK 681 IFIMIVGGLI GLRIVFSVLS IVNRVRQGYS PLSFQTHLPA 721 SRGPDRPGGI EEEGGERDRD RSGPLVNGFL XLIWVDLRSL 761 XLFSYHRLRD LLLIVTRIVE LLGRRGWEVL KYWWXLLQYW 801 SQELKNSAVS LLNXXAXAVA EGTDRVIEVX QRAVRAILHI 841 PRRIRQGLER ALL

[0522] After cleavage of the HIV-1 Ba-L gp160 glycoprotein, a gp120 glycoprotein modified to contain a cysteine at an amino acid position equivalent to amino acid position 492 of HIV-1 JR-FL has the following sequence (SEQ ID NO:38):

TABLE-US-00038 1 MRVTEIRKSY QHWWRWGIML LGXLMICNAE EKLWVTVYYG 41 VPVWKEATTT LFCASDAKAY DTEVHNVWAT HACVPTDPNP 81 QEVXXXNVTE NFNMWKNNMV EQMHEDIISL WDQSLKPCVK 121 LTPLCVTLNC TDLRNATXXN XTXTTSSSRG MVGGGEXKNC 161 SFNITTXIRG KVQKEYALFY ELDIVPIDNX IDRYRLISCN 201 TSVITQACPK VSFEPIPIHY CAPAGFAILK CKDKKFNGKG 241 PCXNVSTVQC THGIRPVVST QLLLNGSLAE EEVVIRSXNF 281 XBNAKXIIVQ LNESVEINCT RPNNNTRKSI HIGPGRAFYT 321 TGEIIGDIRQ AHCNLSRAKW NDTLNKIVXK LREQFGNKTI 361 VFKHSSGGDP EIVTHSFNCG GEFFYCNSTQ LFNSTWNVTE 401 ESNNTVENNT ITLPCRIKQI INMWQXVGRA MYAPPIRGQI 441 RCSSNITGLL LTRDGGPEDN KTEVFRPGGG DMRDNWRSEL 481 YKYKVVKIEP LGVAPTKCKR RVVQ

[0523] Also after cleavage, a Ba-L gp41 glycoprotein modified to contain a cysteine at an amino acid position equivalent to amino acid position 596 of HIV-1 JR-FL has the following sequence (SEQ ID NO:39):

TABLE-US-00039 REKRAV GIGAVFLGFL 521 GAAGSTMGAA SMTLTVQARL LLSGIVQQQN NLLRAIEAQQ 561 HLLQLTVWGI KQLQARVLAV ERYLRDQQLL GIWGCSGKLI 601 CCTAVPWNAS WSNKSLNKIW DNMTWMEWDR EINNYTSIIY 641 SLIEESQNQQ EKNEQELLEL DKWASLWNWF DITXWLWYIK 681 IFIMIVGGLI GLRIVFSVLS IVNRVRQGYS PLSFQTHLPA 721 SRGPDRPGGI EEEGGERDRD RSGPLVNGFL XLIWVDLRSL 761 XLFSYHRLRD LLLIVTRIVE LLGRRGWEVL KYWWXLLQYW 801 SQELKNSAVS LLNXXAXAVA EGTDRVIEVX QRAVRAILHI 841 PRRIRQGLER ALL

[0524] Similarly, any other HIV gp120 and gp41 glycoproteins, including any of the gp160, gp120 and/or gp41 polypeptides described herein, can be modified to contain cysteines residues at amino acid positions equivalent to the amino acid positions of the HIV-1 JR-FL isolate, e.g., at amino acid positions equivalent to amino acid positions 492 and 596 in JR-FL. In addition, any of the other "stabilizing" mutations described herein can be combined with the substitution of cysteine at amino acid positions equivalent to the amino acids at positions 492 and 596 in JR-FL.

REFERENCES FOR EXPERIMENTAL DETAILS IV

[0525] 1. Broder, C. C. et al., (1996). Pathobiology. 64:171-179. [0526] 2. D'Souza, M. P. et al., (1996). Nature Medicine. 2:1293-1300. [0527] 3. Wilkinson, D., (1996). Current Biology. 6:1051-1053.

Sequence CWU 1

1

39184PRTArtificial SequenceKNH1144 gp41 1Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 20 25 30Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln His Met Leu Lys Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln284PRTArtificial SequenceJRFL gp41 2Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 20 25 30Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Gly Asp Gln Gln384PRTArtificial SequenceBa-L gp41 3Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ala Met Thr Leu Thr Val 20 25 30Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln439DNAArtificial SequenceT7 Translation Initiation Sequence 4tctagaaata attttgttta actttaagaa ggagatata 395863PRTArtificial SequenceKNH1144 with 5 trimer stability enhancing amino acids 5Met Ile Val Met Gly Thr Gln Arg Asn Tyr Gln His Leu Leu Arg Trp1 5 10 15Gly Thr Met Ile Leu Gly Leu Ile Ile Ile Cys Ser Ala Ala Asp Asn 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Lys 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Ile Pro Leu Glu Asn Val Thr Glu Glu Phe Asn Met Trp Lys 85 90 95Asn Lys Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Ala Thr Asn Gly Thr Ile Gly Asn Ile Thr Asp Glu 130 135 140Met Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Ile145 150 155 160Arg Asp Lys Lys Gln Lys Val Tyr Ser Leu Phe Tyr Arg Leu Asp Val 165 170 175Val Pro Ile Glu Pro Asp Ser Ser Asn Ser Ser Arg Asn Ser Ser Glu 180 185 190Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro 195 200 205Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly 210 215 220Phe Ala Ile Leu Lys Cys Arg Asp Lys Glu Phe Asn Gly Thr Gly Lys225 230 235 240Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val 245 250 255Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Val 260 265 270Arg Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile Ile Val 275 280 285Gln Leu Val Glu Pro Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn 290 295 300Thr Arg Glu Ser Val Arg Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr305 310 315 320Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Val Ser Arg 325 330 335Ser Gln Trp Asn Lys Thr Leu Gln Gln Val Ala Ala Gln Leu Gly Glu 340 345 350His Phe Lys Asn Lys Ala Ile Thr Phe Asn Ser Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Thr Ser Gly Leu Phe Asn Ser Thr Trp Lys Ala Asn Asn Gly385 390 395 400Thr Trp Lys Ala Asn Ile Ser Glu Ser Asn Asn Thr Glu Ile Thr Leu 405 410 415Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Arg Thr Gly Gln 420 425 430Ala Ile Tyr Ala Pro Pro Ile Gln Gly Val Ile Arg Cys Glu Ser Asn 435 440 445Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Gly Asn Asn Glu 450 455 460Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg465 470 475 480Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val 485 490 495Ala Pro Thr Arg Ala Arg Arg Arg Val Val Gly Arg Glu Lys Arg Ala 500 505 510Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 515 520 525Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu 530 535 540Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu545 550 555 560Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu 565 570 575Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu 580 585 590Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val 595 600 605Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His Asp Glu Ile Trp Asn 610 615 620Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser Asn Tyr Thr Asn625 630 635 640Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 645 650 655Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 660 665 670Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile 675 680 685Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu Ala Val 690 695 700Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val Ser Phe Gln Ile His705 710 715 720Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly Arg Ile Glu Glu Glu 725 730 735Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg Leu Val Ser Gly Phe 740 745 750Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr 755 760 765His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala Arg Thr Val Glu Leu 770 775 780Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu785 790 795 800Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser Gln Glu Leu Lys Thr 805 810 815Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile Ala Val Ala Gly Trp 820 825 830Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile Gly Arg Ala Phe Leu 835 840 845His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 850 855 8606863PRTArtificial SequenceKNH1144 gp160 with I559P mutation 6Met Ile Val Met Gly Thr Gln Arg Asn Tyr Gln His Leu Leu Arg Trp1 5 10 15Gly Thr Met Ile Leu Gly Leu Ile Ile Ile Cys Ser Ala Ala Asp Asn 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Lys 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Ile Pro Leu Glu Asn Val Thr Glu Glu Phe Asn Met Trp Lys 85 90 95Asn Lys Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Ala Thr Asn Gly Thr Ile Gly Asn Ile Thr Asp Glu 130 135 140Met Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Ile145 150 155 160Arg Asp Lys Lys Gln Lys Val Tyr Ser Leu Phe Tyr Arg Leu Asp Val 165 170 175Val Pro Ile Glu Pro Asp Ser Ser Asn Ser Ser Arg Asn Ser Ser Glu 180 185 190Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro 195 200 205Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly 210 215 220Phe Ala Ile Leu Lys Cys Arg Asp Lys Glu Phe Asn Gly Thr Gly Lys225 230 235 240Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val 245 250 255Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Val 260 265 270Arg Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile Ile Val 275 280 285Gln Leu Val Glu Pro Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn 290 295 300Thr Arg Glu Ser Val Arg Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr305 310 315 320Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Val Ser Arg 325 330 335Ser Gln Trp Asn Lys Thr Leu Gln Gln Val Ala Ala Gln Leu Gly Glu 340 345 350His Phe Lys Asn Lys Ala Ile Thr Phe Asn Ser Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Thr Ser Gly Leu Phe Asn Ser Thr Trp Lys Ala Asn Asn Gly385 390 395 400Thr Trp Lys Ala Asn Ile Ser Glu Ser Asn Asn Thr Glu Ile Thr Leu 405 410 415Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Arg Thr Gly Gln 420 425 430Ala Ile Tyr Ala Pro Pro Ile Gln Gly Val Ile Arg Cys Glu Ser Asn 435 440 445Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Gly Asn Asn Glu 450 455 460Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg465 470 475 480Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val 485 490 495Ala Pro Thr Arg Ala Arg Arg Arg Val Val Gly Arg Glu Lys Arg Ala 500 505 510Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 515 520 525Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu 530 535 540Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Pro Glu545 550 555 560Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu 565 570 575Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu 580 585 590Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val 595 600 605Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His Asp Glu Ile Trp Asn 610 615 620Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser Asn Tyr Thr Asn625 630 635 640Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 645 650 655Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 660 665 670Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile 675 680 685Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu Ala Val 690 695 700Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val Ser Phe Gln Ile His705 710 715 720Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly Arg Ile Glu Glu Glu 725 730 735Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg Leu Val Ser Gly Phe 740 745 750Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr 755 760 765His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala Arg Thr Val Glu Leu 770 775 780Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu785 790 795 800Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser Gln Glu Leu Lys Thr 805 810 815Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile Ala Val Ala Gly Trp 820 825 830Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile Gly Arg Ala Phe Leu 835 840 845His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 850 855 8607863PRTArtificial SequenceKNH1144 gp160 with I535M mutation 7Met Ile Val Met Gly Thr Gln Arg Asn Tyr Gln His Leu Leu Arg Trp1 5 10 15Gly Thr Met Ile Leu Gly Leu Ile Ile Ile Cys Ser Ala Ala Asp Asn 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Lys 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Ile Pro Leu Glu Asn Val Thr Glu Glu Phe Asn Met Trp Lys 85 90 95Asn Lys Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Ala Thr Asn Gly Thr Ile Gly Asn Ile Thr Asp Glu 130 135 140Met Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Ile145 150 155 160Arg Asp Lys Lys Gln Lys Val Tyr Ser Leu Phe Tyr Arg Leu Asp Val 165 170 175Val Pro Ile Glu Pro Asp Ser Ser Asn Ser Ser Arg Asn Ser Ser Glu 180 185 190Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro 195 200 205Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly 210 215 220Phe Ala Ile Leu Lys Cys Arg Asp Lys Glu Phe Asn Gly Thr Gly Lys225 230 235 240Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val 245 250 255Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Val 260 265 270Arg Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile Ile Val 275 280 285Gln Leu Val Glu Pro Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn 290 295 300Thr Arg Glu Ser Val Arg Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr305 310 315 320Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Val Ser Arg 325 330 335Ser Gln Trp Asn Lys Thr Leu Gln Gln Val Ala Ala Gln Leu Gly Glu 340 345 350His Phe Lys Asn Lys Ala Ile Thr Phe Asn Ser Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Thr Ser Gly Leu Phe Asn Ser Thr Trp Lys Ala Asn Asn Gly385 390 395 400Thr Trp Lys Ala Asn Ile Ser Glu Ser Asn Asn Thr Glu Ile Thr Leu 405 410 415Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Arg Thr Gly Gln 420 425 430Ala Ile Tyr Ala Pro Pro

Ile Gln Gly Val Ile Arg Cys Glu Ser Asn 435 440 445Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Gly Asn Asn Glu 450 455 460Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg465 470 475 480Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val 485 490 495Ala Pro Thr Arg Ala Arg Arg Arg Val Val Gly Arg Glu Lys Arg Ala 500 505 510Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 515 520 525Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu 530 535 540Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu545 550 555 560Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu 565 570 575Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu 580 585 590Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val 595 600 605Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His Asp Glu Ile Trp Asn 610 615 620Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser Asn Tyr Thr Asn625 630 635 640Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 645 650 655Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 660 665 670Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile 675 680 685Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu Ala Val 690 695 700Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val Ser Phe Gln Ile His705 710 715 720Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly Arg Ile Glu Glu Glu 725 730 735Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg Leu Val Ser Gly Phe 740 745 750Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr 755 760 765His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala Arg Thr Val Glu Leu 770 775 780Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu785 790 795 800Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser Gln Glu Leu Lys Thr 805 810 815Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile Ala Val Ala Gly Trp 820 825 830Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile Gly Arg Ala Phe Leu 835 840 845His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 850 855 8608863PRTArtificial SequenceKNH1144 gp160 with I535M and I559P mutations 8Met Ile Val Met Gly Thr Gln Arg Asn Tyr Gln His Leu Leu Arg Trp1 5 10 15Gly Thr Met Ile Leu Gly Leu Ile Ile Ile Cys Ser Ala Ala Asp Asn 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Lys 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Ile Pro Leu Glu Asn Val Thr Glu Glu Phe Asn Met Trp Lys 85 90 95Asn Lys Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Ala Thr Asn Gly Thr Ile Gly Asn Ile Thr Asp Glu 130 135 140Met Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Ile145 150 155 160Arg Asp Lys Lys Gln Lys Val Tyr Ser Leu Phe Tyr Arg Leu Asp Val 165 170 175Val Pro Ile Glu Pro Asp Ser Ser Asn Ser Ser Arg Asn Ser Ser Glu 180 185 190Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro 195 200 205Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly 210 215 220Phe Ala Ile Leu Lys Cys Arg Asp Lys Glu Phe Asn Gly Thr Gly Lys225 230 235 240Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val 245 250 255Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Val 260 265 270Arg Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile Ile Val 275 280 285Gln Leu Val Glu Pro Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn 290 295 300Thr Arg Glu Ser Val Arg Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr305 310 315 320Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Val Ser Arg 325 330 335Ser Gln Trp Asn Lys Thr Leu Gln Gln Val Ala Ala Gln Leu Gly Glu 340 345 350His Phe Lys Asn Lys Ala Ile Thr Phe Asn Ser Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Thr Ser Gly Leu Phe Asn Ser Thr Trp Lys Ala Asn Asn Gly385 390 395 400Thr Trp Lys Ala Asn Ile Ser Glu Ser Asn Asn Thr Glu Ile Thr Leu 405 410 415Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Arg Thr Gly Gln 420 425 430Ala Ile Tyr Ala Pro Pro Ile Gln Gly Val Ile Arg Cys Glu Ser Asn 435 440 445Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Gly Asn Asn Glu 450 455 460Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg465 470 475 480Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val 485 490 495Ala Pro Thr Arg Ala Arg Arg Arg Val Val Gly Arg Glu Lys Arg Ala 500 505 510Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 515 520 525Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu 530 535 540Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Pro Glu545 550 555 560Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu 565 570 575Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu 580 585 590Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Asn Val 595 600 605Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His Asp Glu Ile Trp Asn 610 615 620Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser Asn Tyr Thr Asn625 630 635 640Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 645 650 655Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 660 665 670Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile 675 680 685Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu Ala Val 690 695 700Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val Ser Phe Gln Ile His705 710 715 720Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly Arg Ile Glu Glu Glu 725 730 735Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg Leu Val Ser Gly Phe 740 745 750Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr 755 760 765His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala Arg Thr Val Glu Leu 770 775 780Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu785 790 795 800Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser Gln Glu Leu Lys Thr 805 810 815Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile Ala Val Ala Gly Trp 820 825 830Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile Gly Arg Ala Phe Leu 835 840 845His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 850 855 8609847PRTArtificial SequenceJRFL gp160 9Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly1 5 10 15Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 130 135 140Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr145 150 155 160Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 165 170 175Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser 180 185 190Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu 195 200 205Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys 210 215 220Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser225 230 235 240Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu 245 250 255Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp 260 265 270Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser 275 280 285Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile 290 295 300His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly305 310 315 320Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp 325 330 335Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys 340 345 350Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met 355 360 365His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln 370 375 380Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr385 390 395 400Glu Gly Asn Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn 405 410 415Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly 420 425 430Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp 435 440 445Gly Gly Ile Asn Glu Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly 450 455 460Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val465 470 475 480Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 485 490 495Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly 500 505 510Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu 515 520 525Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn 530 535 540Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr545 550 555 560Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg 565 570 575Tyr Leu Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys 580 585 590Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys 595 600 605Ser Leu Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg 610 615 620Glu Ile Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser625 630 635 640Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys 645 650 655Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr 660 665 670Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu 675 680 685Val Phe Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser 690 695 700Pro Leu Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg705 710 715 720Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser 725 730 735Gly Arg Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg 740 745 750Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr 755 760 765Val Thr Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu 770 775 780Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn785 790 795 800Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly 805 810 815Thr Asp Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu 820 825 830His Ile Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 835 840 84510847PRTArtificial SequenceJRFL with P mutation at position equivalent to position 559 in KNH1144 10Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly1 5 10 15Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 130 135 140Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr145 150 155 160Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 165 170 175Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser 180 185 190Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu 195 200 205Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys 210 215 220Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser225 230 235 240Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu 245 250 255Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp 260 265 270Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser 275 280 285Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile 290 295 300His Ile Gly Pro Gly Arg Ala Phe Tyr Thr

Thr Gly Glu Ile Ile Gly305 310 315 320Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp 325 330 335Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys 340 345 350Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met 355 360 365His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln 370 375 380Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr385 390 395 400Glu Gly Asn Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn 405 410 415Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly 420 425 430Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp 435 440 445Gly Gly Ile Asn Glu Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly 450 455 460Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val465 470 475 480Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 485 490 495Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly 500 505 510Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu 515 520 525Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn 530 535 540Asn Leu Leu Arg Ala Pro Glu Ala Gln Gln Arg Met Leu Gln Leu Thr545 550 555 560Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg 565 570 575Tyr Leu Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys 580 585 590Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys 595 600 605Ser Leu Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg 610 615 620Glu Ile Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser625 630 635 640Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys 645 650 655Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr 660 665 670Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu 675 680 685Val Phe Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser 690 695 700Pro Leu Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg705 710 715 720Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser 725 730 735Gly Arg Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg 740 745 750Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr 755 760 765Val Thr Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu 770 775 780Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn785 790 795 800Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly 805 810 815Thr Asp Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu 820 825 830His Ile Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 835 840 84511847PRTArtificial SequenceJRFL gp160 modified to contain P at position equivalent to position 559 in KNH1144 and the 5 trimer stability enhancing amino acids 11Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly1 5 10 15Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 130 135 140Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr145 150 155 160Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 165 170 175Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser 180 185 190Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu 195 200 205Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys 210 215 220Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser225 230 235 240Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu 245 250 255Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp 260 265 270Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser 275 280 285Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile 290 295 300His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly305 310 315 320Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp 325 330 335Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys 340 345 350Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met 355 360 365His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln 370 375 380Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr385 390 395 400Glu Gly Asn Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn 405 410 415Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly 420 425 430Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp 435 440 445Gly Gly Ile Asn Glu Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly 450 455 460Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val465 470 475 480Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Ala Lys Arg Arg Val 485 490 495Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly 500 505 510Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu 515 520 525Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser 530 535 540Asn Leu Leu Arg Ala Pro Glu Ala Gln Gln Arg Met Leu Lys Leu Thr545 550 555 560Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg 565 570 575Tyr Leu Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys 580 585 590Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys 595 600 605Ser Leu Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg 610 615 620Glu Ile Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser625 630 635 640Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys 645 650 655Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr 660 665 670Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu 675 680 685Val Phe Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser 690 695 700Pro Leu Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg705 710 715 720Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser 725 730 735Gly Arg Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg 740 745 750Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr 755 760 765Val Thr Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu 770 775 780Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn785 790 795 800Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly 805 810 815Thr Asp Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu 820 825 830His Ile Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 835 840 84512853PRTArtificial SequenceBa-L gp160 12Met Arg Val Thr Glu Ile Arg Lys Ser Tyr Gln His Trp Trp Arg Trp1 5 10 15Gly Ile Met Leu Leu Gly Xaa Leu Met Ile Cys Asn Ala Glu Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Xaa Xaa Xaa Asn Val Thr Glu Asn Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Leu Arg Asn Ala Thr Xaa Xaa Asn Xaa Thr Xaa Thr 130 135 140Thr Ser Ser Ser Arg Gly Met Val Gly Gly Gly Glu Xaa Lys Asn Cys145 150 155 160Ser Phe Asn Ile Thr Thr Xaa Ile Arg Gly Lys Val Gln Lys Glu Tyr 165 170 175Ala Leu Phe Tyr Glu Leu Asp Ile Val Pro Ile Asp Asn Xaa Ile Asp 180 185 190Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 195 200 205Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 210 215 220Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Lys Gly225 230 235 240Pro Cys Xaa Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro 245 250 255Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 260 265 270Val Val Ile Arg Ser Xaa Asn Phe Xaa Asx Asn Ala Lys Xaa Ile Ile 275 280 285Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn 290 295 300Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr305 310 315 320Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser 325 330 335Arg Ala Lys Trp Asn Asp Thr Leu Asn Lys Ile Val Xaa Lys Leu Arg 340 345 350Glu Gln Phe Gly Asn Lys Thr Ile Val Phe Lys His Ser Ser Gly Gly 355 360 365Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 370 375 380Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Val Thr Glu385 390 395 400Glu Ser Asn Asn Thr Val Glu Asn Asn Thr Ile Thr Leu Pro Cys Arg 405 410 415Ile Lys Gln Ile Ile Asn Met Trp Gln Xaa Val Gly Arg Ala Met Tyr 420 425 430Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly 435 440 445Leu Leu Leu Thr Arg Asp Gly Gly Pro Glu Asp Asn Lys Thr Glu Val 450 455 460Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu465 470 475 480Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr 485 490 495Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile 500 505 510Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 515 520 525Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Leu Leu Leu Ser Gly 530 535 540Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln545 550 555 560His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg 565 570 575Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu Leu Gly Ile 580 585 590Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn 595 600 605Ala Ser Trp Ser Asn Lys Ser Leu Asn Lys Ile Trp Asp Asn Met Thr 610 615 620Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Ile Ile Tyr625 630 635 640Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu 645 650 655Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile 660 665 670Thr Xaa Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly 675 680 685Leu Ile Gly Leu Arg Ile Val Phe Ser Val Leu Ser Ile Val Asn Arg 690 695 700Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro Ala705 710 715 720Ser Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly Gly Glu 725 730 735Arg Asp Arg Asp Arg Ser Gly Pro Leu Val Asn Gly Phe Leu Xaa Leu 740 745 750Ile Trp Val Asp Leu Arg Ser Leu Xaa Leu Phe Ser Tyr His Arg Leu 755 760 765Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly Arg 770 775 780Arg Gly Trp Glu Val Leu Lys Tyr Trp Trp Xaa Leu Leu Gln Tyr Trp785 790 795 800Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Xaa Xaa Ala 805 810 815Xaa Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Xaa Gln Arg 820 825 830Ala Val Arg Ala Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu 835 840 845Glu Arg Ala Leu Leu 85013853PRTArtificial SequenceBa-L containing mutation to P at position equivalent to position 559 in KNH1144 13Met Arg Val Thr Glu Ile Arg Lys Ser Tyr Gln His Trp Trp Arg Trp1 5 10 15Gly Ile Met Leu Leu Gly Xaa Leu Met Ile Cys Asn Ala Glu Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Xaa Xaa Xaa Asn Val Thr Glu Asn Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Leu Arg Asn Ala Thr Xaa Xaa Asn Xaa Thr Xaa Thr 130 135 140Thr Ser Ser Ser Arg Gly Met Val Gly Gly Gly Glu Xaa Lys Asn Cys145 150 155 160Ser Phe Asn Ile Thr Thr Xaa Ile Arg Gly Lys Val Gln Lys Glu Tyr 165 170 175Ala Leu Phe Tyr Glu Leu Asp Ile Val Pro Ile Asp Asn Xaa Ile Asp 180

185 190Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 195 200 205Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 210 215 220Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Lys Gly225 230 235 240Pro Cys Xaa Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro 245 250 255Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 260 265 270Val Val Ile Arg Ser Xaa Asn Phe Xaa Asx Asn Ala Lys Xaa Ile Ile 275 280 285Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn 290 295 300Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr305 310 315 320Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser 325 330 335Arg Ala Lys Trp Asn Asp Thr Leu Asn Lys Ile Val Xaa Lys Leu Arg 340 345 350Glu Gln Phe Gly Asn Lys Thr Ile Val Phe Lys His Ser Ser Gly Gly 355 360 365Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 370 375 380Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Val Thr Glu385 390 395 400Glu Ser Asn Asn Thr Val Glu Asn Asn Thr Ile Thr Leu Pro Cys Arg 405 410 415Ile Lys Gln Ile Ile Asn Met Trp Gln Xaa Val Gly Arg Ala Met Tyr 420 425 430Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly 435 440 445Leu Leu Leu Thr Arg Asp Gly Gly Pro Glu Asp Asn Lys Thr Glu Val 450 455 460Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu465 470 475 480Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr 485 490 495Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile 500 505 510Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 515 520 525Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Leu Leu Leu Ser Gly 530 535 540Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Pro Glu Ala Gln Gln545 550 555 560His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg 565 570 575Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu Leu Gly Ile 580 585 590Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn 595 600 605Ala Ser Trp Ser Asn Lys Ser Leu Asn Lys Ile Trp Asp Asn Met Thr 610 615 620Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Ile Ile Tyr625 630 635 640Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu 645 650 655Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile 660 665 670Thr Xaa Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly 675 680 685Leu Ile Gly Leu Arg Ile Val Phe Ser Val Leu Ser Ile Val Asn Arg 690 695 700Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro Ala705 710 715 720Ser Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly Gly Glu 725 730 735Arg Asp Arg Asp Arg Ser Gly Pro Leu Val Asn Gly Phe Leu Xaa Leu 740 745 750Ile Trp Val Asp Leu Arg Ser Leu Xaa Leu Phe Ser Tyr His Arg Leu 755 760 765Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly Arg 770 775 780Arg Gly Trp Glu Val Leu Lys Tyr Trp Trp Xaa Leu Leu Gln Tyr Trp785 790 795 800Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Xaa Xaa Ala 805 810 815Xaa Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Xaa Gln Arg 820 825 830Ala Val Arg Ala Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu 835 840 845Glu Arg Ala Leu Leu 85014853PRTArtificial SequenceBa-L gp160 modified to contain P at position equivalent to position 559 in KNH1144 and the 5 trimer mutations 14Met Arg Val Thr Glu Ile Arg Lys Ser Tyr Gln His Trp Trp Arg Trp1 5 10 15Gly Ile Met Leu Leu Gly Xaa Leu Met Ile Cys Asn Ala Glu Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Xaa Xaa Xaa Asn Val Thr Glu Asn Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Leu Arg Asn Ala Thr Xaa Xaa Asn Xaa Thr Xaa Thr 130 135 140Thr Ser Ser Ser Arg Gly Met Val Gly Gly Gly Glu Xaa Lys Asn Cys145 150 155 160Ser Phe Asn Ile Thr Thr Xaa Ile Arg Gly Lys Val Gln Lys Glu Tyr 165 170 175Ala Leu Phe Tyr Glu Leu Asp Ile Val Pro Ile Asp Asn Xaa Ile Asp 180 185 190Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 195 200 205Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 210 215 220Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Lys Gly225 230 235 240Pro Cys Xaa Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro 245 250 255Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 260 265 270Val Val Ile Arg Ser Xaa Asn Phe Xaa Asx Asn Ala Lys Xaa Ile Ile 275 280 285Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn 290 295 300Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr305 310 315 320Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser 325 330 335Arg Ala Lys Trp Asn Asp Thr Leu Asn Lys Ile Val Xaa Lys Leu Arg 340 345 350Glu Gln Phe Gly Asn Lys Thr Ile Val Phe Lys His Ser Ser Gly Gly 355 360 365Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 370 375 380Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Val Thr Glu385 390 395 400Glu Ser Asn Asn Thr Val Glu Asn Asn Thr Ile Thr Leu Pro Cys Arg 405 410 415Ile Lys Gln Ile Ile Asn Met Trp Gln Xaa Val Gly Arg Ala Met Tyr 420 425 430Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly 435 440 445Leu Leu Leu Thr Arg Asp Gly Gly Pro Glu Asp Asn Lys Thr Glu Val 450 455 460Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu465 470 475 480Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr 485 490 495Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile 500 505 510Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 515 520 525Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly 530 535 540Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Pro Glu Ala Gln Gln545 550 555 560His Leu Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg 565 570 575Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu Leu Gly Ile 580 585 590Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn 595 600 605Ala Ser Trp Ser Asn Lys Ser Leu Asn Lys Ile Trp Asp Asn Met Thr 610 615 620Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Ile Ile Tyr625 630 635 640Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu 645 650 655Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile 660 665 670Thr Xaa Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly 675 680 685Leu Ile Gly Leu Arg Ile Val Phe Ser Val Leu Ser Ile Val Asn Arg 690 695 700Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro Ala705 710 715 720Ser Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly Gly Glu 725 730 735Arg Asp Arg Asp Arg Ser Gly Pro Leu Val Asn Gly Phe Leu Xaa Leu 740 745 750Ile Trp Val Asp Leu Arg Ser Leu Xaa Leu Phe Ser Tyr His Arg Leu 755 760 765Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly Arg 770 775 780Arg Gly Trp Glu Val Leu Lys Tyr Trp Trp Xaa Leu Leu Gln Tyr Trp785 790 795 800Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Xaa Xaa Ala 805 810 815Xaa Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Xaa Gln Arg 820 825 830Ala Val Arg Ala Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu 835 840 845Glu Arg Ala Leu Leu 85015748PRTArtificial SequenceAAA76668 gp160 15Met Arg Val Lys Glu Lys Tyr Gln His Leu Arg Arg Trp Gly Trp Arg1 5 10 15Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu 20 25 30Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 35 40 45Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 50 55 60Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn65 70 75 80Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 85 90 95Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp 100 105 110Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 115 120 125Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 130 135 140Gly Gly Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn145 150 155 160Ile Ser Thr Ser Ile Arg Gly Lys Val Gln Lys Glu Tyr Ala Phe Phe 165 170 175Tyr Lys Leu Asp Ile Ile Pro Ile Asp Asn Asp Thr Thr Ser Tyr Thr 180 185 190Leu Thr Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val 195 200 205Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala 210 215 220Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr225 230 235 240Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser 245 250 255Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile 260 265 270Arg Ser Ala Asn Phe Thr Asp Asn Val Lys Thr Ile Ile Val Gln Leu 275 280 285Asn Gln Ser Val Glu Ile Asn Cys Thr Lys Pro Asn Asn Asn Thr Gly 290 295 300Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Thr Phe Val Thr Ile305 310 315 320Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala 325 330 335Lys Trp Asn Asn Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln 340 345 350Tyr Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Gly385 390 395 400Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu Pro Cys Arg Ile 405 410 415Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala 420 425 430Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu 435 440 445Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu Ile Phe 450 455 460Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr465 470 475 480Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys 485 490 495Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly 500 505 510Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 515 520 525Ala Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile 530 535 540Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln His545 550 555 560Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Ile 565 570 575Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp 580 585 590Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala 595 600 605Ser Trp Ser Asn Lys Ser Leu Glu Arg Ile Trp Asn His Thr Thr Trp 610 615 620Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile His Ser625 630 635 640Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu 645 650 655Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr 660 665 670Asn Trp Leu Trp Tyr Val Lys Ile Phe Ile Met Ile Val Gly Gly Leu 675 680 685Val Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile Val Asn Arg Val 690 695 700Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro Thr Pro705 710 715 720Gly Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg 725 730 735Asp Arg Asp Arg Ser Ile Arg Leu Val Asn Gly Ser 740 74516748PRTArtificial SequenceAAA76668 gp160 containing mutation to P at position equivalent to position 559 in KNH1144 16Met Arg Val Lys Glu Lys Tyr Gln His Leu Arg Arg Trp Gly Trp Arg1 5 10 15Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu 20 25 30Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 35 40 45Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 50 55 60Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn65 70 75 80Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 85 90 95Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp 100 105 110Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 115 120 125Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 130 135 140Gly Gly Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn145 150 155 160Ile Ser Thr Ser Ile Arg Gly Lys Val Gln Lys Glu Tyr Ala Phe Phe

165 170 175Tyr Lys Leu Asp Ile Ile Pro Ile Asp Asn Asp Thr Thr Ser Tyr Thr 180 185 190Leu Thr Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val 195 200 205Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala 210 215 220Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr225 230 235 240Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser 245 250 255Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile 260 265 270Arg Ser Ala Asn Phe Thr Asp Asn Val Lys Thr Ile Ile Val Gln Leu 275 280 285Asn Gln Ser Val Glu Ile Asn Cys Thr Lys Pro Asn Asn Asn Thr Gly 290 295 300Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Thr Phe Val Thr Ile305 310 315 320Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala 325 330 335Lys Trp Asn Asn Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln 340 345 350Tyr Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Gly385 390 395 400Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu Pro Cys Arg Ile 405 410 415Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala 420 425 430Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu 435 440 445Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu Ile Phe 450 455 460Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr465 470 475 480Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys 485 490 495Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly 500 505 510Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 515 520 525Ala Ser Met Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile 530 535 540Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Pro Glu Ala Gln Gln His545 550 555 560Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Ile 565 570 575Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly Ile Trp 580 585 590Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala 595 600 605Ser Trp Ser Asn Lys Ser Leu Glu Arg Ile Trp Asn His Thr Thr Trp 610 615 620Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile His Ser625 630 635 640Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu 645 650 655Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr 660 665 670Asn Trp Leu Trp Tyr Val Lys Ile Phe Ile Met Ile Val Gly Gly Leu 675 680 685Val Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile Val Asn Arg Val 690 695 700Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro Thr Pro705 710 715 720Gly Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg 725 730 735Asp Arg Asp Arg Ser Ile Arg Leu Val Asn Gly Ser 740 74517748PRTArtificial SequenceAAA76668 gp160 modified to contain proline (P) at position equivalent to amino acid position 559 in KNH1144 and the 5 trimer stability enhancing amino acids 17Met Arg Val Lys Glu Lys Tyr Gln His Leu Arg Arg Trp Gly Trp Arg1 5 10 15Trp Gly Thr Met Leu Leu Gly Met Leu Met Ile Cys Ser Ala Thr Glu 20 25 30Lys Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala 35 40 45Thr Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu 50 55 60Val His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn65 70 75 80Pro Gln Glu Val Val Leu Val Asn Val Thr Glu Asn Phe Asn Met Trp 85 90 95Lys Asn Asp Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp 100 105 110Asp Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Ser 115 120 125Leu Lys Cys Thr Asp Leu Lys Asn Asp Thr Asn Thr Asn Ser Ser Ser 130 135 140Gly Gly Met Ile Met Glu Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn145 150 155 160Ile Ser Thr Ser Ile Arg Gly Lys Val Gln Lys Glu Tyr Ala Phe Phe 165 170 175Tyr Lys Leu Asp Ile Ile Pro Ile Asp Asn Asp Thr Thr Ser Tyr Thr 180 185 190Leu Thr Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Val 195 200 205Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala 210 215 220Ile Leu Lys Cys Asn Asn Lys Thr Phe Asn Gly Thr Gly Pro Cys Thr225 230 235 240Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser 245 250 255Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile 260 265 270Arg Ser Ala Asn Phe Thr Asp Asn Val Lys Thr Ile Ile Val Gln Leu 275 280 285Asn Gln Ser Val Glu Ile Asn Cys Thr Lys Pro Asn Asn Asn Thr Gly 290 295 300Lys Arg Ile Arg Ile Gln Arg Gly Pro Gly Arg Thr Phe Val Thr Ile305 310 315 320Gly Lys Ile Gly Asn Met Arg Gln Ala His Cys Asn Ile Ser Arg Ala 325 330 335Lys Trp Asn Asn Thr Leu Lys Gln Ile Ala Ser Lys Leu Arg Glu Gln 340 345 350Tyr Gly Asn Asn Lys Thr Ile Ile Phe Lys Gln Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Phe Asn Ser Thr Gly385 390 395 400Ser Asn Asn Thr Glu Gly Ser Asp Thr Ile Thr Leu Pro Cys Arg Ile 405 410 415Lys Gln Ile Ile Asn Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala 420 425 430Pro Pro Ile Ser Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu 435 440 445Leu Leu Thr Arg Asp Gly Gly Asn Asn Asn Asn Gly Ser Glu Ile Phe 450 455 460Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr465 470 475 480Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys 485 490 495Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly 500 505 510Ala Leu Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 515 520 525Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser Gly Ile 530 535 540Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Pro Glu Ala Gln Gln His545 550 555 560Leu Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Ile 565 570 575Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu Leu Gly Ile Trp 580 585 590Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Ala Val Pro Trp Asn Ala 595 600 605Ser Trp Ser Asn Lys Ser Leu Glu Arg Ile Trp Asn His Thr Thr Trp 610 615 620Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile His Ser625 630 635 640Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu 645 650 655Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn Ile Thr 660 665 670Asn Trp Leu Trp Tyr Val Lys Ile Phe Ile Met Ile Val Gly Gly Leu 675 680 685Val Gly Leu Arg Ile Val Phe Ala Val Leu Ser Ile Val Asn Arg Val 690 695 700Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro Thr Pro705 710 715 720Gly Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg 725 730 735Asp Arg Asp Arg Ser Ile Arg Leu Val Asn Gly Ser 740 74518356PRTArtificial SequenceKNH1144 gp41 18Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 20 25 30Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln His Met Leu Lys Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His 100 105 110Asp Glu Ile Trp Asn Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile 115 120 125Ser Asn Tyr Thr Asn Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ala Val Leu Ala Val Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val 195 200 205Ser Phe Gln Ile His Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly 210 215 220Arg Ile Glu Glu Glu Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg225 230 235 240Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala 260 265 270Arg Thr Val Glu Leu Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu 275 280 285Gly Trp Glu Gly Leu Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser 290 295 300Gln Glu Leu Lys Thr Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile305 310 315 320Ala Val Ala Gly Trp Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile 325 330 335Gly Arg Ala Phe Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu 340 345 350Arg Ala Leu Leu 35519356PRTArtificial SequenceKNH1144 gp41 containing mutation to proline (P) at position equivalent to amino acid position 559 in KNH1144 19Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 20 25 30Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 35 40 45Leu Arg Ala Pro Glu Ala Gln Gln His Met Leu Lys Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His 100 105 110Asp Glu Ile Trp Asn Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile 115 120 125Ser Asn Tyr Thr Asn Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ala Val Leu Ala Val Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val 195 200 205Ser Phe Gln Ile His Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly 210 215 220Arg Ile Glu Glu Glu Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg225 230 235 240Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala 260 265 270Arg Thr Val Glu Leu Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu 275 280 285Gly Trp Glu Gly Leu Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser 290 295 300Gln Glu Leu Lys Thr Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile305 310 315 320Ala Val Ala Gly Trp Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile 325 330 335Gly Arg Ala Phe Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu 340 345 350Arg Ala Leu Leu 35520356PRTArtificial SequenceKNH1144 gp41 containing I535M mutation 20Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 20 25 30Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln His Met Leu Lys Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His 100 105 110Asp Glu Ile Trp Asn Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile 115 120 125Ser Asn Tyr Thr Asn Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ala Val Leu Ala Val Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val 195 200 205Ser Phe Gln Ile His Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly 210 215 220Arg Ile Glu Glu Glu Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg225 230 235 240Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala 260 265 270Arg Thr Val Glu Leu Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu 275 280 285Gly Trp Glu Gly Leu Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser 290 295 300Gln Glu Leu Lys Thr Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile305 310 315 320Ala Val Ala Gly Trp Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile 325 330 335Gly Arg Ala Phe Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu 340 345 350Arg Ala Leu Leu 35521356PRTArtificial SequenceKNH1144 gp41 containing I535M mutation and mutation to proline (P) at position equivalent to amino acid position 559 in

KNH1144 21Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 20 25 30Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 35 40 45Leu Arg Ala Pro Glu Ala Gln Gln His Met Leu Lys Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His 100 105 110Asp Glu Ile Trp Asn Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile 115 120 125Ser Asn Tyr Thr Asn Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ala Val Leu Ala Val Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val 195 200 205Ser Phe Gln Ile His Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly 210 215 220Arg Ile Glu Glu Glu Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg225 230 235 240Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala 260 265 270Arg Thr Val Glu Leu Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu 275 280 285Gly Trp Glu Gly Leu Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser 290 295 300Gln Glu Leu Lys Thr Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile305 310 315 320Ala Val Ala Gly Trp Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile 325 330 335Gly Arg Ala Phe Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu 340 345 350Arg Ala Leu Leu 35522349PRTArtificial SequenceJR-FL gp41 22Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 20 25 30Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 100 105 110Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile 115 120 125Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu Val Phe 180 185 190Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu 195 200 205Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu 210 215 220Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Arg225 230 235 240Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg Ser Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr Val Thr 260 265 270Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr 275 280 285Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala 290 295 300Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp305 310 315 320Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu His Ile 325 330 335Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 340 34523349PRTArtificial SequenceJR-FL gp41 containing mutation to proline (P) at position equivalent to amino acid position 559 in KNH1144 23Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 20 25 30Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu 35 40 45Leu Arg Ala Pro Glu Ala Gln Gln Arg Met Leu Gln Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 100 105 110Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile 115 120 125Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu Val Phe 180 185 190Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu 195 200 205Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu 210 215 220Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Arg225 230 235 240Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg Ser Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr Val Thr 260 265 270Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr 275 280 285Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala 290 295 300Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp305 310 315 320Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu His Ile 325 330 335Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 340 34524349PRTArtificial SequenceJR-FL gp41 containing mutation to proline (P) at position equivalent to amino acid position 559 in KNH1144 and the 5 trimer stability enhancing amino acids of the invention 24Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 20 25 30Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 35 40 45Leu Arg Ala Pro Glu Ala Gln Gln Arg Met Leu Lys Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 100 105 110Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile 115 120 125Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu Val Phe 180 185 190Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu 195 200 205Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu 210 215 220Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Arg225 230 235 240Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg Ser Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr Val Thr 260 265 270Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr 275 280 285Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala 290 295 300Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp305 310 315 320Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu His Ile 325 330 335Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 340 34525349PRTArtificial SequenceBa-L gp41 25Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 20 25 30Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 100 105 110Asn Lys Ile Trp Asp Asn Met Thr Trp Met Glu Trp Asp Arg Glu Ile 115 120 125Asn Asn Tyr Thr Ser Ile Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Thr Xaa Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ser Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu 195 200 205Ser Phe Gln Thr His Leu Pro Ala Ser Arg Gly Pro Asp Arg Pro Gly 210 215 220Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Pro225 230 235 240Leu Val Asn Gly Phe Leu Xaa Leu Ile Trp Val Asp Leu Arg Ser Leu 245 250 255Xaa Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Ile Val Thr 260 265 270Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr 275 280 285Trp Trp Xaa Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala 290 295 300Val Ser Leu Leu Asn Xaa Xaa Ala Xaa Ala Val Ala Glu Gly Thr Asp305 310 315 320Arg Val Ile Glu Val Xaa Gln Arg Ala Val Arg Ala Ile Leu His Ile 325 330 335Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 340 34526349PRTArtificial SequenceBa-L gp41 containing mutation to proline (P) at position equivalent to amino acid position 559 in KNH1144 26Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 20 25 30Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu 35 40 45Leu Arg Ala Pro Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 100 105 110Asn Lys Ile Trp Asp Asn Met Thr Trp Met Glu Trp Asp Arg Glu Ile 115 120 125Asn Asn Tyr Thr Ser Ile Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Thr Xaa Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ser Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu 195 200 205Ser Phe Gln Thr His Leu Pro Ala Ser Arg Gly Pro Asp Arg Pro Gly 210 215 220Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Pro225 230 235 240Leu Val Asn Gly Phe Leu Xaa Leu Ile Trp Val Asp Leu Arg Ser Leu 245 250 255Xaa Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Ile Val Thr 260 265 270Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr 275 280 285Trp Trp Xaa Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala 290 295 300Val Ser Leu Leu Asn Xaa Xaa Ala Xaa Ala Val Ala Glu Gly Thr Asp305 310 315 320Arg Val Ile Glu Val Xaa Gln Arg Ala Val Arg Ala Ile Leu His Ile 325 330 335Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 340 34527349PRTArtificial SequenceBa-L gp41 containing mutation to proline (P) at position equivalent to amino acid position 559 in KNH1144 and the 5 trimer stability enhancing amino acids of the invention 27Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 20 25 30Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 35 40 45Leu Arg Ala Pro Glu Ala Gln Gln His Leu Leu Lys Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 100 105 110Asn Lys Ile Trp Asp Asn Met Thr Trp Met Glu Trp Asp Arg Glu Ile 115 120 125Asn Asn Tyr Thr Ser Ile Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Thr Xaa Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ser Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu 195 200 205Ser Phe Gln Thr His Leu Pro Ala Ser Arg Gly Pro Asp Arg Pro Gly 210 215 220Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Pro225 230 235 240Leu Val Asn Gly Phe Leu Xaa Leu Ile Trp Val Asp Leu Arg Ser Leu 245 250 255Xaa Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Ile Val Thr 260 265

270Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr 275 280 285Trp Trp Xaa Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala 290 295 300Val Ser Leu Leu Asn Xaa Xaa Ala Xaa Ala Val Ala Glu Gly Thr Asp305 310 315 320Arg Val Ile Glu Val Xaa Gln Arg Ala Val Arg Ala Ile Leu His Ile 325 330 335Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 340 34528358PRTArtificial SequenceS21998 gp41 28Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Met1 5 10 15Gly Ala Ala Phe Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 20 25 30Gly Ala Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Leu Leu Leu Ser 35 40 45Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala His 50 55 60Glu His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala65 70 75 80Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly 85 90 95Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Thr Val Pro Trp 100 105 110Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Lys Ile Trp Asn Asn Met 115 120 125Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile 130 135 140Tyr Thr Leu Ile Glu Gln Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln145 150 155 160Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp 165 170 175Ile Thr Gln Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly 180 185 190Gly Leu Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile Val Asn 195 200 205Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Arg Arg Pro 210 215 220Ala Arg Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly225 230 235 240Glu Arg Asp Arg Asp Arg Ser Gly Arg Leu Val Asn Gly Phe Leu Ala 245 250 255Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg 260 265 270Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly 275 280 285Arg Arg Gly Trp Glu Val Leu Lys Tyr Leu Trp Asn Leu Leu Gln Tyr 290 295 300Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr305 310 315 320Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Leu Leu Gln 325 330 335Arg Ala Phe Arg Ala Ile Leu His Ile Pro Arg Arg Xaa Arg Gln Gly 340 345 350Leu Glu Arg Ala Leu Leu 35529358PRTArtificial SequenceS21998 gp41 containing mutation to proline (P) at position equivalent to amino acid position 559 in KNH1144 29Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Met1 5 10 15Gly Ala Ala Phe Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 20 25 30Gly Ala Ala Ser Leu Thr Leu Thr Val Gln Ala Arg Leu Leu Leu Ser 35 40 45Gly Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Pro Glu Ala His 50 55 60Glu His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala65 70 75 80Arg Ile Leu Ala Val Glu Arg Tyr Leu Lys Asp Gln Gln Leu Leu Gly 85 90 95Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Thr Val Pro Trp 100 105 110Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Lys Ile Trp Asn Asn Met 115 120 125Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile 130 135 140Tyr Thr Leu Ile Glu Gln Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln145 150 155 160Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp 165 170 175Ile Thr Gln Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly 180 185 190Gly Leu Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile Val Asn 195 200 205Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Arg Arg Pro 210 215 220Ala Arg Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly225 230 235 240Glu Arg Asp Arg Asp Arg Ser Gly Arg Leu Val Asn Gly Phe Leu Ala 245 250 255Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg 260 265 270Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly 275 280 285Arg Arg Gly Trp Glu Val Leu Lys Tyr Leu Trp Asn Leu Leu Gln Tyr 290 295 300Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr305 310 315 320Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Leu Leu Gln 325 330 335Arg Ala Phe Arg Ala Ile Leu His Ile Pro Arg Arg Xaa Arg Gln Gly 340 345 350Leu Glu Arg Ala Leu Leu 35530358PRTArtificial SequenceS21998 gp41 containing mutation to proline (P) at position equivalent to amino acid position 559 in KNH1144 and the 5 trimer stability enhancing amino acids of the invention 30Lys Ala Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Met1 5 10 15Gly Ala Ala Phe Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met 20 25 30Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu Leu Ser 35 40 45Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Pro Glu Ala His 50 55 60Glu His Leu Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala65 70 75 80Arg Ile Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu Leu Gly 85 90 95Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Thr Thr Thr Val Pro Trp 100 105 110Asn Ala Ser Trp Ser Asn Lys Ser Leu Asp Lys Ile Trp Asn Asn Met 115 120 125Thr Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Leu Ile 130 135 140Tyr Thr Leu Ile Glu Gln Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln145 150 155 160Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp 165 170 175Ile Thr Gln Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly 180 185 190Gly Leu Ile Gly Leu Arg Ile Val Phe Thr Val Leu Ser Ile Val Asn 195 200 205Arg Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr Arg Arg Pro 210 215 220Ala Arg Arg Gly Pro Asp Arg Pro Glu Gly Ile Glu Glu Glu Gly Gly225 230 235 240Glu Arg Asp Arg Asp Arg Ser Gly Arg Leu Val Asn Gly Phe Leu Ala 245 250 255Leu Ile Trp Asp Asp Leu Arg Ser Leu Cys Leu Phe Ser Tyr His Arg 260 265 270Leu Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly 275 280 285Arg Arg Gly Trp Glu Val Leu Lys Tyr Leu Trp Asn Leu Leu Gln Tyr 290 295 300Trp Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Ala Thr305 310 315 320Ala Ile Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Leu Leu Gln 325 330 335Arg Ala Phe Arg Ala Ile Leu His Ile Pro Arg Arg Xaa Arg Gln Gly 340 345 350Leu Glu Arg Ala Leu Leu 35531847PRTArtificial SequenceJR-FL gp160, "double mutant" containing A492C and T596C mutations 31Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly1 5 10 15Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 130 135 140Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr145 150 155 160Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 165 170 175Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser 180 185 190Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu 195 200 205Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys 210 215 220Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser225 230 235 240Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu 245 250 255Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp 260 265 270Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser 275 280 285Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile 290 295 300His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly305 310 315 320Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp 325 330 335Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys 340 345 350Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met 355 360 365His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln 370 375 380Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr385 390 395 400Glu Gly Asn Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn 405 410 415Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly 420 425 430Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp 435 440 445Gly Gly Ile Asn Glu Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly 450 455 460Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val465 470 475 480Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Cys Lys Arg Arg Val 485 490 495Val Gln Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly 500 505 510Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu 515 520 525Thr Val Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn 530 535 540Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr545 550 555 560Val Trp Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg 565 570 575Tyr Leu Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys 580 585 590Leu Ile Cys Cys Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys 595 600 605Ser Leu Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg 610 615 620Glu Ile Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser625 630 635 640Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys 645 650 655Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr 660 665 670Ile Lys Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu 675 680 685Val Phe Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser 690 695 700Pro Leu Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg705 710 715 720Pro Glu Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser 725 730 735Gly Arg Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg 740 745 750Ser Leu Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr 755 760 765Val Thr Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu 770 775 780Lys Tyr Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn785 790 795 800Ser Ala Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly 805 810 815Thr Asp Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu 820 825 830His Ile Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 835 840 84532498PRTArtificial SequenceJR-FL gp120 containing A492C mutation 32Met Arg Val Lys Gly Ile Arg Lys Ser Tyr Gln Tyr Leu Trp Lys Gly1 5 10 15Gly Thr Leu Leu Leu Gly Ile Leu Met Ile Cys Ser Ala Val Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Val Leu Glu Asn Val Thr Glu His Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met Gln Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Lys Asp Val Asn Ala Thr Asn Thr Thr Asn Asp Ser Glu Gly 130 135 140Thr Met Glu Arg Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr145 150 155 160Ser Ile Arg Asp Glu Val Gln Lys Glu Tyr Ala Leu Phe Tyr Lys Leu 165 170 175Asp Val Val Pro Ile Asp Asn Asn Asn Thr Ser Tyr Arg Leu Ile Ser 180 185 190Cys Asp Thr Ser Val Ile Thr Gln Ala Cys Pro Lys Ile Ser Phe Glu 195 200 205Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly Phe Ala Ile Leu Lys 210 215 220Cys Asn Asp Lys Thr Phe Asn Gly Lys Gly Pro Cys Lys Asn Val Ser225 230 235 240Thr Val Gln Cys Thr His Gly Ile Arg Pro Val Val Ser Thr Gln Leu 245 250 255Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu Val Val Ile Arg Ser Asp 260 265 270Asn Phe Thr Asn Asn Ala Lys Thr Ile Ile Val Gln Leu Lys Glu Ser 275 280 285Val Glu Ile Asn Cys Thr Arg Pro Asn Asn Asn Thr Arg Lys Ser Ile 290 295 300His Ile Gly Pro Gly Arg Ala Phe Tyr Thr Thr Gly Glu Ile Ile Gly305 310 315 320Asp Ile Arg Gln Ala His Cys Asn Ile Ser Arg Ala Lys Trp Asn Asp 325 330 335Thr Leu Lys Gln Ile Val Ile Lys Leu Arg Glu Gln Phe Glu Asn Lys 340 345 350Thr Ile Val Phe Asn His Ser Ser Gly Gly Asp Pro Glu Ile Val Met 355 360 365His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr Cys Asn Ser Thr Gln 370 375 380Leu Phe Asn Ser Thr Trp Asn Asn Asn Thr Glu Gly Ser Asn Asn Thr385 390 395

400Glu Gly Asn Thr Ile Thr Leu Pro Cys Arg Ile Lys Gln Ile Ile Asn 405 410 415Met Trp Gln Glu Val Gly Lys Ala Met Tyr Ala Pro Pro Ile Arg Gly 420 425 430Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly Leu Leu Leu Thr Arg Asp 435 440 445Gly Gly Ile Asn Glu Asn Gly Thr Glu Ile Phe Arg Pro Gly Gly Gly 450 455 460Asp Met Arg Asp Asn Trp Arg Ser Glu Leu Tyr Lys Tyr Lys Val Val465 470 475 480Lys Ile Glu Pro Leu Gly Val Ala Pro Thr Lys Cys Lys Arg Arg Val 485 490 495Val Gln33349PRTArtificial SequenceJR-FL gp41 containing T596C mutation 33Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 20 25 30Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln Arg Met Leu Gln Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Gly Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Cys Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 100 105 110Asp Arg Ile Trp Asn Asn Met Thr Trp Met Glu Trp Glu Arg Glu Ile 115 120 125Asp Asn Tyr Thr Ser Glu Ile Tyr Thr Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Thr Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Val Gly Leu Arg Leu Val Phe 180 185 190Thr Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu 195 200 205Ser Phe Gln Thr Leu Leu Pro Ala Pro Arg Gly Pro Asp Arg Pro Glu 210 215 220Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Arg225 230 235 240Leu Val Asn Gly Phe Leu Ala Leu Ile Trp Val Asp Leu Arg Ser Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Thr Val Thr 260 265 270Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr 275 280 285Trp Trp Asn Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala 290 295 300Val Ser Leu Leu Asn Ala Thr Ala Ile Ala Val Ala Glu Gly Thr Asp305 310 315 320Arg Ile Ile Glu Ala Leu Gln Arg Thr Tyr Arg Ala Ile Leu His Ile 325 330 335Pro Thr Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 340 34534863PRTArtificial SequenceKNH1144 gp160 modified to contain cysteines at positions equivalent to 492 and 596 in JR-FL 34Met Ile Val Met Gly Thr Gln Arg Asn Tyr Gln His Leu Leu Arg Trp1 5 10 15Gly Thr Met Ile Leu Gly Leu Ile Ile Ile Cys Ser Ala Ala Asp Asn 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Lys 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Ile Pro Leu Glu Asn Val Thr Glu Glu Phe Asn Met Trp Lys 85 90 95Asn Lys Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Ala Thr Asn Gly Thr Ile Gly Asn Ile Thr Asp Glu 130 135 140Met Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Ile145 150 155 160Arg Asp Lys Lys Gln Lys Val Tyr Ser Leu Phe Tyr Arg Leu Asp Val 165 170 175Val Pro Ile Glu Pro Asp Ser Ser Asn Ser Ser Arg Asn Ser Ser Glu 180 185 190Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro 195 200 205Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly 210 215 220Phe Ala Ile Leu Lys Cys Arg Asp Lys Glu Phe Asn Gly Thr Gly Lys225 230 235 240Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val 245 250 255Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Val 260 265 270Arg Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile Ile Val 275 280 285Gln Leu Val Glu Pro Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn 290 295 300Thr Arg Glu Ser Val Arg Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr305 310 315 320Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Val Ser Arg 325 330 335Ser Gln Trp Asn Lys Thr Leu Gln Gln Val Ala Ala Gln Leu Gly Glu 340 345 350His Phe Lys Asn Lys Ala Ile Thr Phe Asn Ser Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Thr Ser Gly Leu Phe Asn Ser Thr Trp Lys Ala Asn Asn Gly385 390 395 400Thr Trp Lys Ala Asn Ile Ser Glu Ser Asn Asn Thr Glu Ile Thr Leu 405 410 415Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Arg Thr Gly Gln 420 425 430Ala Ile Tyr Ala Pro Pro Ile Gln Gly Val Ile Arg Cys Glu Ser Asn 435 440 445Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Gly Asn Asn Glu 450 455 460Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg465 470 475 480Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val 485 490 495Ala Pro Thr Arg Cys Arg Arg Arg Val Val Gly Arg Glu Lys Arg Ala 500 505 510Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser 515 520 525Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val Gln Ala Arg Gln Leu 530 535 540Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu Leu Arg Ala Ile Glu545 550 555 560Ala Gln Gln His Met Leu Lys Leu Thr Val Trp Gly Ile Lys Gln Leu 565 570 575Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu 580 585 590Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile Cys Cys Thr Asn Val 595 600 605Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His Asp Glu Ile Trp Asn 610 615 620Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile Ser Asn Tyr Thr Asn625 630 635 640Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn 645 650 655Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp 660 665 670Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile 675 680 685Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe Ala Val Leu Ala Val 690 695 700Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val Ser Phe Gln Ile His705 710 715 720Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly Arg Ile Glu Glu Glu 725 730 735Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg Leu Val Ser Gly Phe 740 745 750Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu Cys Leu Phe Ser Tyr 755 760 765His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala Arg Thr Val Glu Leu 770 775 780Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu Gly Trp Glu Gly Leu785 790 795 800Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser Gln Glu Leu Lys Thr 805 810 815Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile Ala Val Ala Gly Trp 820 825 830Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile Gly Arg Ala Phe Leu 835 840 845His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 850 855 86035507PRTArtificial SequenceKNH1144 gp120 modified to contain cysteine at position equivalent to 492 in JR-FL 35Met Ile Val Met Gly Thr Gln Arg Asn Tyr Gln His Leu Leu Arg Trp1 5 10 15Gly Thr Met Ile Leu Gly Leu Ile Ile Ile Cys Ser Ala Ala Asp Asn 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Asp Ala Glu 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Glu Thr Glu Lys 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Ile Pro Leu Glu Asn Val Thr Glu Glu Phe Asn Met Trp Lys 85 90 95Asn Lys Met Val Glu Gln Met His Thr Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Gln Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Ala Thr Asn Gly Thr Ile Gly Asn Ile Thr Asp Glu 130 135 140Met Lys Gly Glu Ile Lys Asn Cys Ser Phe Asn Ile Thr Thr Glu Ile145 150 155 160Arg Asp Lys Lys Gln Lys Val Tyr Ser Leu Phe Tyr Arg Leu Asp Val 165 170 175Val Pro Ile Glu Pro Asp Ser Ser Asn Ser Ser Arg Asn Ser Ser Glu 180 185 190Tyr Arg Leu Ile Asn Cys Asn Thr Ser Ala Ile Thr Gln Ala Cys Pro 195 200 205Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala Gly 210 215 220Phe Ala Ile Leu Lys Cys Arg Asp Lys Glu Phe Asn Gly Thr Gly Lys225 230 235 240Cys Lys Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Lys Pro Val 245 250 255Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Gly Glu Val 260 265 270Arg Ile Arg Ser Glu Asn Ile Thr Asn Asn Ala Lys Thr Ile Ile Val 275 280 285Gln Leu Val Glu Pro Val Arg Ile Asn Cys Thr Arg Pro Asn Asn Asn 290 295 300Thr Arg Glu Ser Val Arg Ile Gly Pro Gly Gln Ala Phe Phe Ala Thr305 310 315 320Gly Asp Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Val Ser Arg 325 330 335Ser Gln Trp Asn Lys Thr Leu Gln Gln Val Ala Ala Gln Leu Gly Glu 340 345 350His Phe Lys Asn Lys Ala Ile Thr Phe Asn Ser Ser Ser Gly Gly Asp 355 360 365Leu Glu Ile Thr Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe Tyr 370 375 380Cys Asn Thr Ser Gly Leu Phe Asn Ser Thr Trp Lys Ala Asn Asn Gly385 390 395 400Thr Trp Lys Ala Asn Ile Ser Glu Ser Asn Asn Thr Glu Ile Thr Leu 405 410 415Gln Cys Arg Ile Lys Gln Ile Ile Asn Met Trp Gln Arg Thr Gly Gln 420 425 430Ala Ile Tyr Ala Pro Pro Ile Gln Gly Val Ile Arg Cys Glu Ser Asn 435 440 445Ile Thr Gly Leu Leu Leu Thr Arg Asp Gly Gly Glu Gly Asn Asn Glu 450 455 460Ser Glu Ile Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg465 470 475 480Ser Glu Leu Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val 485 490 495Ala Pro Thr Arg Cys Arg Arg Arg Val Val Gly 500 50536356PRTArtificial SequenceKNH1144 gp41 modified to contain cysteine at position equivalent to 596 in JR-FL 36Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Ile Thr Leu Thr Val 20 25 30Gln Ala Arg Gln Leu Leu Ser Gly Ile Val Gln Gln Gln Ser Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln His Met Leu Lys Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Cys Thr Asn Val Pro Trp Asn Ser Ser Trp Ser Asn Lys Ser His 100 105 110Asp Glu Ile Trp Asn Asn Met Thr Trp Leu Gln Trp Asp Lys Glu Ile 115 120 125Ser Asn Tyr Thr Asn Leu Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Asp Leu Leu Ala Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Ser Lys Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ala Val Leu Ala Val Ile Lys Arg Val Arg Gln Gly Tyr Ser Pro Val 195 200 205Ser Phe Gln Ile His Asn Pro Asn Pro Gly Gly Leu Asp Arg Pro Gly 210 215 220Arg Ile Glu Glu Glu Gly Gly Glu Pro Gly Arg Gly Arg Ser Ile Arg225 230 235 240Leu Val Ser Gly Phe Leu Ala Leu Ala Trp Asp Asp Leu Arg Asn Leu 245 250 255Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Phe Ala Leu Ile Val Ala 260 265 270Arg Thr Val Glu Leu Leu Gly His Ser Ser Leu Lys Gly Leu Arg Leu 275 280 285Gly Trp Glu Gly Leu Lys Tyr Leu Trp Asn Leu Leu Val Tyr Trp Ser 290 295 300Gln Glu Leu Lys Thr Ser Ala Ile Asn Leu Val Asp Thr Ile Ala Ile305 310 315 320Ala Val Ala Gly Trp Thr Asp Arg Val Ile Glu Ile Gly Gln Gly Ile 325 330 335Gly Arg Ala Phe Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu Glu 340 345 350Arg Ala Leu Leu 35537853PRTArtificial SequenceBa-L gp160 modified to contain cysteines at positions equivalent to 492 and 596 in JR-FL 37Met Arg Val Thr Glu Ile Arg Lys Ser Tyr Gln His Trp Trp Arg Trp1 5 10 15Gly Ile Met Leu Leu Gly Xaa Leu Met Ile Cys Asn Ala Glu Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Xaa Xaa Xaa Asn Val Thr Glu Asn Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Leu Arg Asn Ala Thr Xaa Xaa Asn Xaa Thr Xaa Thr 130 135 140Thr Ser Ser Ser Arg Gly Met Val Gly Gly Gly Glu Xaa Lys Asn Cys145 150 155 160Ser Phe Asn Ile Thr Thr Xaa Ile Arg Gly Lys Val Gln Lys Glu Tyr 165 170 175Ala Leu Phe Tyr Glu Leu Asp Ile Val Pro Ile Asp Asn Xaa Ile Asp 180 185 190Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 195 200 205Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 210 215 220Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe

Asn Gly Lys Gly225 230 235 240Pro Cys Xaa Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro 245 250 255Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 260 265 270Val Val Ile Arg Ser Xaa Asn Phe Xaa Asx Asn Ala Lys Xaa Ile Ile 275 280 285Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn 290 295 300Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr305 310 315 320Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser 325 330 335Arg Ala Lys Trp Asn Asp Thr Leu Asn Lys Ile Val Xaa Lys Leu Arg 340 345 350Glu Gln Phe Gly Asn Lys Thr Ile Val Phe Lys His Ser Ser Gly Gly 355 360 365Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 370 375 380Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Val Thr Glu385 390 395 400Glu Ser Asn Asn Thr Val Glu Asn Asn Thr Ile Thr Leu Pro Cys Arg 405 410 415Ile Lys Gln Ile Ile Asn Met Trp Gln Xaa Val Gly Arg Ala Met Tyr 420 425 430Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly 435 440 445Leu Leu Leu Thr Arg Asp Gly Gly Pro Glu Asp Asn Lys Thr Glu Val 450 455 460Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu465 470 475 480Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr 485 490 495Lys Cys Lys Arg Arg Val Val Gln Arg Glu Lys Arg Ala Val Gly Ile 500 505 510Gly Ala Val Phe Leu Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly 515 520 525Ala Ala Ser Met Thr Leu Thr Val Gln Ala Arg Leu Leu Leu Ser Gly 530 535 540Ile Val Gln Gln Gln Asn Asn Leu Leu Arg Ala Ile Glu Ala Gln Gln545 550 555 560His Leu Leu Gln Leu Thr Val Trp Gly Ile Lys Gln Leu Gln Ala Arg 565 570 575Val Leu Ala Val Glu Arg Tyr Leu Arg Asp Gln Gln Leu Leu Gly Ile 580 585 590Trp Gly Cys Ser Gly Lys Leu Ile Cys Cys Thr Ala Val Pro Trp Asn 595 600 605Ala Ser Trp Ser Asn Lys Ser Leu Asn Lys Ile Trp Asp Asn Met Thr 610 615 620Trp Met Glu Trp Asp Arg Glu Ile Asn Asn Tyr Thr Ser Ile Ile Tyr625 630 635 640Ser Leu Ile Glu Glu Ser Gln Asn Gln Gln Glu Lys Asn Glu Gln Glu 645 650 655Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asp Ile 660 665 670Thr Xaa Trp Leu Trp Tyr Ile Lys Ile Phe Ile Met Ile Val Gly Gly 675 680 685Leu Ile Gly Leu Arg Ile Val Phe Ser Val Leu Ser Ile Val Asn Arg 690 695 700Val Arg Gln Gly Tyr Ser Pro Leu Ser Phe Gln Thr His Leu Pro Ala705 710 715 720Ser Arg Gly Pro Asp Arg Pro Gly Gly Ile Glu Glu Glu Gly Gly Glu 725 730 735Arg Asp Arg Asp Arg Ser Gly Pro Leu Val Asn Gly Phe Leu Xaa Leu 740 745 750Ile Trp Val Asp Leu Arg Ser Leu Xaa Leu Phe Ser Tyr His Arg Leu 755 760 765Arg Asp Leu Leu Leu Ile Val Thr Arg Ile Val Glu Leu Leu Gly Arg 770 775 780Arg Gly Trp Glu Val Leu Lys Tyr Trp Trp Xaa Leu Leu Gln Tyr Trp785 790 795 800Ser Gln Glu Leu Lys Asn Ser Ala Val Ser Leu Leu Asn Xaa Xaa Ala 805 810 815Xaa Ala Val Ala Glu Gly Thr Asp Arg Val Ile Glu Val Xaa Gln Arg 820 825 830Ala Val Arg Ala Ile Leu His Ile Pro Arg Arg Ile Arg Gln Gly Leu 835 840 845Glu Arg Ala Leu Leu 85038504PRTArtificial SequenceBa-L gp120 modified to contain cysteine at position equivalent to 492 in JR-FL 38Met Arg Val Thr Glu Ile Arg Lys Ser Tyr Gln His Trp Trp Arg Trp1 5 10 15Gly Ile Met Leu Leu Gly Xaa Leu Met Ile Cys Asn Ala Glu Glu Lys 20 25 30Leu Trp Val Thr Val Tyr Tyr Gly Val Pro Val Trp Lys Glu Ala Thr 35 40 45Thr Thr Leu Phe Cys Ala Ser Asp Ala Lys Ala Tyr Asp Thr Glu Val 50 55 60His Asn Val Trp Ala Thr His Ala Cys Val Pro Thr Asp Pro Asn Pro65 70 75 80Gln Glu Val Xaa Xaa Xaa Asn Val Thr Glu Asn Phe Asn Met Trp Lys 85 90 95Asn Asn Met Val Glu Gln Met His Glu Asp Ile Ile Ser Leu Trp Asp 100 105 110Gln Ser Leu Lys Pro Cys Val Lys Leu Thr Pro Leu Cys Val Thr Leu 115 120 125Asn Cys Thr Asp Leu Arg Asn Ala Thr Xaa Xaa Asn Xaa Thr Xaa Thr 130 135 140Thr Ser Ser Ser Arg Gly Met Val Gly Gly Gly Glu Xaa Lys Asn Cys145 150 155 160Ser Phe Asn Ile Thr Thr Xaa Ile Arg Gly Lys Val Gln Lys Glu Tyr 165 170 175Ala Leu Phe Tyr Glu Leu Asp Ile Val Pro Ile Asp Asn Xaa Ile Asp 180 185 190Arg Tyr Arg Leu Ile Ser Cys Asn Thr Ser Val Ile Thr Gln Ala Cys 195 200 205Pro Lys Val Ser Phe Glu Pro Ile Pro Ile His Tyr Cys Ala Pro Ala 210 215 220Gly Phe Ala Ile Leu Lys Cys Lys Asp Lys Lys Phe Asn Gly Lys Gly225 230 235 240Pro Cys Xaa Asn Val Ser Thr Val Gln Cys Thr His Gly Ile Arg Pro 245 250 255Val Val Ser Thr Gln Leu Leu Leu Asn Gly Ser Leu Ala Glu Glu Glu 260 265 270Val Val Ile Arg Ser Xaa Asn Phe Xaa Asx Asn Ala Lys Xaa Ile Ile 275 280 285Val Gln Leu Asn Glu Ser Val Glu Ile Asn Cys Thr Arg Pro Asn Asn 290 295 300Asn Thr Arg Lys Ser Ile His Ile Gly Pro Gly Arg Ala Phe Tyr Thr305 310 315 320Thr Gly Glu Ile Ile Gly Asp Ile Arg Gln Ala His Cys Asn Leu Ser 325 330 335Arg Ala Lys Trp Asn Asp Thr Leu Asn Lys Ile Val Xaa Lys Leu Arg 340 345 350Glu Gln Phe Gly Asn Lys Thr Ile Val Phe Lys His Ser Ser Gly Gly 355 360 365Asp Pro Glu Ile Val Thr His Ser Phe Asn Cys Gly Gly Glu Phe Phe 370 375 380Tyr Cys Asn Ser Thr Gln Leu Phe Asn Ser Thr Trp Asn Val Thr Glu385 390 395 400Glu Ser Asn Asn Thr Val Glu Asn Asn Thr Ile Thr Leu Pro Cys Arg 405 410 415Ile Lys Gln Ile Ile Asn Met Trp Gln Xaa Val Gly Arg Ala Met Tyr 420 425 430Ala Pro Pro Ile Arg Gly Gln Ile Arg Cys Ser Ser Asn Ile Thr Gly 435 440 445Leu Leu Leu Thr Arg Asp Gly Gly Pro Glu Asp Asn Lys Thr Glu Val 450 455 460Phe Arg Pro Gly Gly Gly Asp Met Arg Asp Asn Trp Arg Ser Glu Leu465 470 475 480Tyr Lys Tyr Lys Val Val Lys Ile Glu Pro Leu Gly Val Ala Pro Thr 485 490 495Lys Cys Lys Arg Arg Val Val Gln 50039349PRTArtificial SequenceBa-L gp41 modified to contain cysteine at position equivalent to 596 in JR-FL 39Arg Glu Lys Arg Ala Val Gly Ile Gly Ala Val Phe Leu Gly Phe Leu1 5 10 15Gly Ala Ala Gly Ser Thr Met Gly Ala Ala Ser Met Thr Leu Thr Val 20 25 30Gln Ala Arg Leu Leu Leu Ser Gly Ile Val Gln Gln Gln Asn Asn Leu 35 40 45Leu Arg Ala Ile Glu Ala Gln Gln His Leu Leu Gln Leu Thr Val Trp 50 55 60Gly Ile Lys Gln Leu Gln Ala Arg Val Leu Ala Val Glu Arg Tyr Leu65 70 75 80Arg Asp Gln Gln Leu Leu Gly Ile Trp Gly Cys Ser Gly Lys Leu Ile 85 90 95Cys Cys Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys Ser Leu 100 105 110Asn Lys Ile Trp Asp Asn Met Thr Trp Met Glu Trp Asp Arg Glu Ile 115 120 125Asn Asn Tyr Thr Ser Ile Ile Tyr Ser Leu Ile Glu Glu Ser Gln Asn 130 135 140Gln Gln Glu Lys Asn Glu Gln Glu Leu Leu Glu Leu Asp Lys Trp Ala145 150 155 160Ser Leu Trp Asn Trp Phe Asp Ile Thr Xaa Trp Leu Trp Tyr Ile Lys 165 170 175Ile Phe Ile Met Ile Val Gly Gly Leu Ile Gly Leu Arg Ile Val Phe 180 185 190Ser Val Leu Ser Ile Val Asn Arg Val Arg Gln Gly Tyr Ser Pro Leu 195 200 205Ser Phe Gln Thr His Leu Pro Ala Ser Arg Gly Pro Asp Arg Pro Gly 210 215 220Gly Ile Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser Gly Pro225 230 235 240Leu Val Asn Gly Phe Leu Xaa Leu Ile Trp Val Asp Leu Arg Ser Leu 245 250 255Xaa Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu Ile Val Thr 260 265 270Arg Ile Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Val Leu Lys Tyr 275 280 285Trp Trp Xaa Leu Leu Gln Tyr Trp Ser Gln Glu Leu Lys Asn Ser Ala 290 295 300Val Ser Leu Leu Asn Xaa Xaa Ala Xaa Ala Val Ala Glu Gly Thr Asp305 310 315 320Arg Val Ile Glu Val Xaa Gln Arg Ala Val Arg Ala Ile Leu His Ile 325 330 335Pro Arg Arg Ile Arg Gln Gly Leu Glu Arg Ala Leu Leu 340 345

* * * * *

References

ncbi.nlm.nih.gov