Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
STABILIZATION OF HUMAN IMMUNODEFICIENCY VIRUS (HIV) ENVELOPES
Document Type and Number:
WIPO Patent Application WO/2024/091921
Kind Code:
A1
Abstract:
Disclosed are stabilized type I fusion proteins (e.g., HIV envelope proteins). For HIV-1, the stabilization of Env due to the disclosed modications are compatible and, in some embodiments, synergistic with stabilization mutations known as SOS, IP and SOSIP. The disclosed stabilized Env proteins retain the antigenicity of the unmodified protein.

Inventors:
WRAPP DANIEL (US)
SAUNDERS KEVIN (US)
HAYNES BARTON F (US)
Application Number:
PCT/US2023/077611
Publication Date:
May 02, 2024
Filing Date:
October 24, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV DUKE (US)
International Classes:
C07K14/005; A61K39/00; A61K39/21; A61P31/18
Domestic Patent References:
WO2016037154A12016-03-10
WO2017055522A12017-04-06
Foreign References:
US20180072777A12018-03-15
Other References:
GUENAGA JAVIER; GARCES FERNANDO; DE VAL NATALIA; STANFIELD ROBYN L.; DUBROVSKAYA VIKTORIYA; HIGGINS BRETT; CARRETTE BARBARA; WARD : "Glycine Substitution at Helix-to-Coil Transitions Facilitates the Structural Determination of a Stabilized Subtype C HIV Envelope Glycoprotein", IMMUNITY, CELL PRESS, AMSTERDAM, NL, vol. 46, no. 5, 16 May 2017 (2017-05-16), AMSTERDAM, NL , pages 792, XP085024972, ISSN: 1074-7613, DOI: 10.1016/j.immuni.2017.04.014
DANIEL WRAPP: "Structure-Based Stabilization of SOSIP Env Enhances Recombinant Ectodomain Durability and Yield", JOURNAL OF VIROLOGY, THE AMERICAN SOCIETY FOR MICROBIOLOGY, US, vol. 97, no. 1, 31 January 2023 (2023-01-31), US , XP093165666, ISSN: 0022-538X, DOI: 10.1128/jvi.01673-22
Attorney, Agent or Firm:
ESTRADA DE MARTIN, Paula (US)
Download PDF:
Claims:
CLAIMS

What is claimed:

1. A modified type I fusion protein from a retrovirus, comprising substituted amino acids at one or more of positions 536, 545, 568 and 569 (HXB2 numbering).

2. The modified type I fusion protein of claim 1, wherein the retrovirus comprises a lentivirus.

3. The modified type I fusion protein of claim 2, wherein the retrovirus comprises a human immunodeficiency virus (HIV).

4. The modified type I fusion protein of claim 3, wherein the HIV comprises HIV-1 or HIV - 2.

5. The modified type I fusion protein of claim 4, wherein the HIV-1 comprises a Group M, N or O HIV-1.

6. The modified type I fusion protein of claim 5, wherein the HIV-1 comprises a Group M HIV-1 from subtype/clade A, B. C, D. F, G. H, J or K.

7. The modified type I fusion protein of claim 6, wherein the HIV-1 comprises a Group M HIV-1 from subtype/clade B or C.

8. The modified type I fusion protein of claim 2, wherein the retrovirus comprises a simian immunodeficiency virus (SIV).

9. The modified type I fusion protein of claim 2, comprising an envelope protein (Env).

10. The modified type I fusion protein of claim 9, wherein the Env protein comprises a gpl60, agp!20 or a gp41.

11. The modified type I fusion protein of claim 9. wherein the Env protein comprises an ectodomain of the Env protein.

12. The modified type I fusion protein of claim 9, comprising the substituted amino acids at one of positions 536, 545, 568 and 569.

13. The modified type I fusion protein of claim 9, comprising the substituted amino acids at any two of positions 536, 545, 568 and 569.

14. The modified type I fusion protein of claim 9, comprising the substituted amino acids at any three of positions 536, 545, 568 and 569.

15. The modified type I fusion protein of claim 9. comprising the substituted amino acids at positions 536, 545, 568 and 569.

16. The modified type I fusion protein of claim 9, comprising the substituted amino acids at positions 536 and 545, 536 and 568, 536 and 569. 545 and 568. 545 and 569. or 568 and 569.

17. The modified type I fusion protein of claim 16, comprising the substituted amino acids at positions 536 and 545 or at positions 568 and 569.

18. The modified type I fusion protein of claim 17, comprising the substituted amino acids at positions 568 and 569.

19. The modified type I fusion protein of claim 9, wherein the substituted amino acids at one or more of positions 536, 545, 568 and 569 are the only substituted amino acids in the modified protein.

20. The modified type I fusion protein of claim 9, wherein the modified protein additionally comprises an I559P (IP) amino acid substitution.

21. The modified type I fusion protein of claim 9, wherein the modified protein additionally comprises a disulfide bond (SOS) linking a gpl20 and gp41 subunit.

22. The modified type I fusion protein of claim 9, wherein the modified protein additionally comprises an IP amino acid substitution and an SOS linking a gpl20 and gp41 subunit (SOSIP).

23. The modified type I fusion protein of claim 9, wherein the Env protein comprises the substituted amino acids at positions 568 and 569 and comprises a proline at position 559.

24. The modified type I fusion protein of any one of claims 9 and 12-23, wherein the modified protein additionally comprises mutations DS (1201C+A433C), DT (N133D+N138T), chimeric BG 505 gp41, Q171K, SOSIPv6, F14 (V68I+A204V+V208L+V255L) or combinations thereof.

25. The modified type I fusion protein of claim 1, wherein the substituted amino acids at the one or more positions comprise proline.

26. The modified type I fusion protein of claim 1, wherein the substituted amion acids at the one or more positions comprise glycine.

27. The modified type I fusion protein of claim 9, wherein the substituted amino acid at position 568 replaces leucine, at position 569 replaces threonine, or at position 568 replaces leucine and at position 569 replaces threonine.

28. The modified type I fusion protein of claim 1, wherein the modified protein substantially retains an antibody binding profile of an unmodified type I fusion protein from which the modified type I fusion protein is derived.

29. The modified type I fusion protein of claim 28, wherein the antibody binding profile comprises binding to the modified protein of neutralizing and non-neutralizing antibodies that span multiple epitopes.

30. The modified type I fusion protein of claim 28, wherein antibodies used to obtain the antibody binding profile comprise DH270 UCA3, DH270.6, PGT125, PGR128, 2G12, PGT151. N6, Fl 05, A32, F93F, 17B, 19B, and combinations thereof.

31. The modified ty pe I fusion protein of claim 1, wherein the modified protein substantially retains antigenicity of an unmodified type I fusion protein from which the modified type I fusion protein is derived.

32. The modified ty pe I fusion protein of claim 9, wherein coordinates of the modified type I fusion protein compared with a similar or identical protein that does not contain the substituted amino acids at one or more of positions 536, 545, 568 and 569 yield a root-mean- square-distance of less than 1.5, 1.4, 1.3, 1.2, 1.1, 1.0, 0.9, 0.8 or 0.7 angstroms, over 1200 alpha atoms.

33. The modified type I fusion protein of claim 32, wherein the coordinates of the modified ty pe I fusion protein and the identical protein that does not contain the substituted amino acids are obtained from structural models of the proteins derived from cryogenic electronic microscopy images of the proteins.

34. The modified type I fusion protein of claim 32, wherein the similar or identical protein that does not contain the substituted amino acids comprises CH848 10.17DT DS-SOSIP.

35. The modified type I fusion protein of claim 9, wherein the modified protein is expressed at higher amounts in expression studies, is more stable, in a prefusion conformation is less likely to form a postfusion conformation than an unmodified type 1 fusion protein from which the modified ty pe I fusion protein is derived, and combinations thereof.

36. A modified HIV envelope protein (Env) comprising prolines at positions 559, 568 and 569.

37. A method for modifying an envelope protein (Env) from a retrovirus, comprising: substituting one amino acid in an unmodified Env protein with a substituting amino acid to obtain a modified Env protein; and determining a stability of the modified Env protein as compared to a stability of the unmodified Env protein.

38. The method of claim 37. wherein a position of the one amino acid substitution is selected to inhibit the modified Env protein from assuming a postfusion conformation.

39. The method of claim 37, wherein a position of the one amino acid substitution is selected to inhibit the modified Env protein from forming an elongated alpha helix.

40. The method of claim 37, wherein a position of the one amino acid substitution is in a flexible loop of the unmodified Env protein.

41. The method of claim 37, wherein the substituting amino acid has a lower configuration entropy than an amino acid it replaces.

42. The method of claim 37. wherein the modified protein has a lower configurational or conformational entropy than the unmodified protein.

43. The method of claim 37, wherein the Env protein from a retrovirus comprises an Env protein from HIV-1, HIV -2 or SIV.

44. The method of claim 43, wherein the substituted one amino acid is at position 536, 545, 568 or 569 of the unmodified Env protein (HXB2 numbering).

45. The method of claim 41. wherein the substituting amino acid comprises proline.

46. The method of claim 37, further comprising selecting a modified Env from the method that is more stable than the unmodified Env.

47. The method of claim 46, wherein a substituting amino acid position from a first modified Env protein is combined with a substituting amino acid position from a second modified Env protein to form a third modified Env protein having the substituted amino acids of the first and the second modified Env proteins.

48. The method of claim 47, further comprising selecting a third modified Env protein from the method that is more stable than either the first or the second modified Env protein.

49. The method of claim 37, wherein the stability of the modified and unmodified Env proteins is determined by determining yield of the proteins in a recombinant protein expression system.

50. The method of claim 37, wherein an increased stability7 of the modified Env protein as compared to the unmodified Env protein results from a lower probability that the modified Env protein will assume a postfusion conformation as compared to the unmodified protein.

51. The method of claim 37, wherein an increased stability of the modified Env protein as compared to the unmodified Env protein results from a decreased metastability of the modified protein as compared to the unmodified protein.

52. A method for modifying an envelope protein (Env) from a retrovirus, comprising: identifying amino acid positions within flexible loops of an unmodified Env protein; substituting one or more of the identified amino acid positions in the unmodified Env protein with a proline to produce a modified Env; and determining yield of the modified Env in an expression system.

53. The method of claim 52, wherein the flexible loops of the unmodified Env protein comprise loops within a gp41 subunit which are unstructured in a prefusion conformation of the unmodified protein and rearrange into alpha helices in a postfusion conformation of the unmodified protein.

54. The method of claim 52, further comprising selecting a modified Env from the method that produces a greater yield than the unmodified Env under similar experimental conditions.

55. A method for modifying an envelope protein (Env) of HIV-1, HIV-2 or SIV, comprising: selecting an Env protein that has a threonine at position 568, a leucine at position 569, or a threonine at position 568 and a leucine at position 569; and substituting the threonine at position 568, the leucine at position 569, or the threonine at position 568 and the leucine at position 569 with proline.

56. A method for modifying an envelope protein (Env) of HIV. comprising: identifying amino acids at position 568, 569, or 568 and 569 in the Env; and substituting one or more of the amino acid(s) with proline or glycine.

57. A vaccine composition comprising the modified type I fusion protein of any one of claim 1-34 or the modified HIV envelope protein of claim 35..

58. A nucleic acid encoding the modified type I fusion proteins of any one of claims 1-34 or the modified HIV envelope protein of claim 35.

59. A vector comprising the nucleic acid of claim 58.

60. A cell expressing the vector of claim 59.

61. A vaccine composition comprising the nucleic acid of claim 58, the vector of claim 59, the cell of claim 60, and combinations thereof.

Description:
STABILIZATION OF HUMAN IMMUNODEFICIENCY VIRUS (HIV) ENVELOPES

[0001] This application claims priority to U.S. Provisional Application No. 63/418,720, filed on October 24, 2022, and U.S. Provisional Application No. 63/434,373, filed on December 21, 2023, the entire contents of which are incorporated herein by reference.

[0002] All patents, patent applications and publications cited herein are hereby incorporated by reference in their entirety. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art as known to those skilled therein as of the date of the invention described and claimed herein.

[0003] This patent disclosure contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it appears in the U.S. Patent and Trademark Office patent file or records, but otherwise reserves any and all copyright rights.

GOVERNMENT INTERESTS

[0004] This invention was made with government support from the Center for HIV/AIDS Vaccine Development (CHAVID) grant 5UM1AI44371.

FIELD OF THE INVENTION

[0005] Aspects of the invention are drawn to compositions and methods for stabilizing and/or increasing yield of lentiviral (e.g., HIV) envelope proteins.

SEQUENCE LISTING

[0006] This application contains a Sequence Listing which has been submitted electronically in XML format. The Sequence Listing XML is incorporated herein by reference. Said XML file, created on October 24, 2023, is named 2933311-000052- W01_SL.xml and is 30,683 bytes in size.

BACKGROUND OF THE INVENTION

[0007] The Env glycoprotein is a focus of HIV- 1 vaccine development due to the role that it plays during the process of viral entry. Despite significant advances in protein engineering, many Env proteins remain recalcitrant to recombinant expression due to their inherent metastability, and propensity to change conformation from a prefusion form to a postfusion form, making biochemical and immunologic experiments difficult.

SUMMARY OF THE INVENTION

[0008] Here we describe amino acid substitutions (e.g., proline substitutions) that significantly increase the yield of recombinantly expressed prefusion Env, including Env ectodomains, from lentiviruses. In some embodiments, substitutions referred to as “2P’’ increase yield of Env and can function synergistically with previously reported SOSIP mutations. The modified Env proteins do not alter the antigenicity or the overall structure of the prefusion Env trimer. While not wishing to be held to a mechanism, these mutations can increase yield through enhanced durability of the prefusion conformation of Env, rather than through boosting expression levels. We show that the 2P mutations can effectively be applied to a broad range of antigenically and evolutionarily diverse Envs. By facilitating the expression and purification of Env, the 2P mutations can be a useful tool for researchers investigating and developing medical countermeasures against viruses like HIV-1. These findings provide a useful Env stabilization platform to facilitate biochemical research and can improve the development of future HIV-1 vaccine candidates.

[0009] Disclosed is a modified type I fusion protein from a retrovirus having substituted amino acids at one or more of positions 536, 545, 568 and 569 (HBX2 numbering). The retrovirus can be a lentivirus. The retrovirus can be HIV. The retrovirus can be HIV-1, HIV- 2 or SIV. The type I fusion protein from these viruses can be an Env protein, including gp!60, gp!20 or gp41. The Env protein can include an ectodomain.

[0010] The substituted amino acids can be at one, two, three or all of positions 536, 545, 568 and 569 in the proteins. The substituted amino acids can be at amino acid positions 536 and 545, 536 and 568, 536 and 569, 545 and 568, 545 and 569 or 568 and 569. In some embodiments, threonines at positiosn 568 and 569 are substituted. The amino acids at these positions can be substituted with proline. These substituted Env proteins can contain additional/ other mutations elsewhere in the molecules.

[0011] The disclosed molecules can be expressed at higher levels in recombinant expression systems compared to proteins that do not have the amino acid substitutions. The disclosed molecules have a higher probability of remaining in a prefusion conformation and a lower probability of forming a postfusion conformation compared to molecules that do not have the amino acid substitutions. The disclosed molecules retain the antigenticity of the unsubstituted molecules. The disclosed molecules retain the antibody binding profile, with a panel of multiple antibodies, of the unsubstituted molecules. Coordinates from molecular models obtained using cryo-EM images were used to calculate root-mean-square-distance (RMSD) of the substituted molecules versus the unsubstituted molecules. In some embodiments, RMSD from these calculations was less than 1 A.

[0012] Methods for contracting the disclosed proteins or modifying an existing type I fusion protein to obtain the disclosed proteins are described. Also disclosed are vaccine compostions containing the disclosed proteins, nucleic acids encoding the disclosed proteins, vectors containing the nucleic acids, and cells containing the vectors.

BRIEF DESCRIPTION OF THE DRAWINGS

[0013] Certain illustrations, charts, or flow charts are provided to allow for a better understanding for the present invention. It is to be noted, however, that the drawings illustrate only selected embodiments of the inventions and are therefore not to be considered limiting of scope. Additional and equally effective embodiments and applications of the present invention exist.

[0014] FIG. 1. Illustrates design and evaluation of proline-stabilization mutations in gp41. (A, left) A single monomer of gp41 in the prefusion conformation (PDB ID: 6VZI) is shown as ribbons, colored white. Residues that were targeted for proline stabilization are shown as colored sticks surrounded by a transparent molecular surface. (A, right) A single monomer of gp41 in the postfusion conformation (homology model based on PDB ID: 7AEJ) is shown, colored according to the prefusion monomer. (B) Size-exclusion chromatograms from a Superose 6 Increase column for each individual mutant after affinity chromatography are overlaid. (C) Size-exclusion chromatograms from a Superose 6 Increase column are shown for individual proline mutants, colored either blue or red. The combination of the two mutations is colored purple and the unmutated CH848 10.17DT DS-SOSIP is colored black and labeled “WT”.

[0015] FIG. 2. Illustrates that 2P mutations do not alter the antigenic landscape of CH848 10. 17DT DS-SOSIP. ELISA binding curves for Env-directed monoclonal antibodies against CH848 10.17DT DS-SOSIP and CH848 10.17DT DS-SOSIP-2P are shown. CH848 10.17DT DS-SOSIP is shown in black (filled squares) and CH848 10.17DT DS-SOSIP-2P is shown in purple (filled circles). Data points represent the average of three replicates and the standard deviations are plotted as error bars. The y-axes show absorbance measured at 450 nm. Neutralizing antibody curves are shown in (A) and non-neutralizing antibody curves are shown in (B).

[0016] FIG. 3. Illustrates the cryo-EM structure of CH848 10.17DT DS-SOSIP-2P. (A) 2D class averages of the CH848 10.17DT DS-SOSIP-2P Env ectodomain, calculated in cryoSPARC v3, are shown. (B) The 3.73 A-resolution reconstruction is shown from side (left) and top (right) views. Two protomers are displayed as cryo-EM map in either white or dark gray and the third protomer is shown as a ribbon diagram of the corresponding model, with gpl20 colored red and gp41 colored yellow. (C) A single monomer of gp41 from CH848 10.17DT DS-SOSIP-2P is shown in yellow, aligned to a monomer of gp41 from a previously determined cryo-EM structure of CH848 10.17DT DS-SOSIP (PDB ID: 6UM7), colored purple.

[0017] FIG. 4. Illustrates investigations of the underlying mechanism of enhanced protein yield upon proline-stabilization. (A) Four cell populations were stained with a panel of Env- directed monoclonal antibodies. The average of three independent experiments is plotted ± standard deviation. Each antibody has been assigned a color and is labeled above the corresponding bars. ‘'Mock” = untransfected, “WT” = CH848 10. 17DT gpl60, “2P” = CH848 10.17DT 2P gp!60, “2P I559P” = CH848 10.17DT 2P I559P gpl60. (B, top left) CH848 10. 17DT DS-SOSIP Tm is plotted for each round of freeze/thaw. (B, top right) DSF melting curves used to calculate CH848 10. 17DT DS-SOSIP Tm are plotted, colored according to the bar graph on the left. The dotted line at 73.6 °C represents the average Tm of rounds 1-4 and the dotted line at 66.4 °C represents the average Tm for rounds 5-10. (B, bottom left) CH848 10.17DT DS-SOSIP-2P is plotted for each round of freeze/thaw. (B. bottom right) MST melting curves used to calculate CH848 10. 17DT DS-SOSIP-2P Tm are plotted, colored according to the bar graph on the left. The dotted line at 71.0 °C represents the average Tm of rounds 1-8 and the dotted line at 67.5 °C represents the average Tm for rounds 9-10. (C, top) Size-exclusion chromatograms for three aliquots of CH848 10. 17DT DS-SOSIP, incubated at 42 °C for 0, 48 or 96 hours, are overlaid. The 0 hour sample is colored blue, the 48 hour sample is colored black and the 96 hour sample is colored red. Data are plotted as a percentage of the maximum mAU value from the 0 hour sample. (C, bottom) Size-exclusion chromatograms for three aliquots of CH848 10.17DT DS-SOSIP-2P, incubated at 42 °C for 0, 48 or 96 hours, are overlaid. The 0 hour sample is colored blue, the 48 hour sample is colored black and the 96 hour sample is colored red. Data are plotted as a percentage of the maximum mAU value from the 0 hour sample. [0018] FIG. 5. Illustrates applying 2P mutations to evolutionarily diverse Envs. Sizeexclusion chromatograms from a Superose 6 Increase column are shown for each construct after PGT145 affinity chromatography. Curves for the 2P-stabilized construct are colored and the curves for the corresponding non-2P-stabilized constructs (“WT”) are overlaid in black. Chromatograms were generated using a Superose 6 Increase column.

[0019] FIG. 6. Illustrates evaluation of the effects of quadruple proline-substitution on Env homogeneity and yield. Size-exclusion chromatograms from a Superose 6 Increase column are shown. CH848 10.17DT DS-SOSIP is colored black (labeled “WT”), CH848 10.17DT DS-SOSIP L568P+T569P is colored purple and CH848 10.17DT DS-SOSIP T536P+L545P+L568P+T569P is colored orange. The curves from CH848 10.17DT DS- SOSIP and CH848 10.17DT DS-SOSIP L568P+T569P are the same as those shown in FIG.

1C.

[0020] FIG. 7. Illustrates structural comparison of CoV S-2P mutations and Env 2P mutations. (A) A monomer of the gp41 subunit in the prefusion conformation (PDB ID: 6VZI) is shown as a ribbon diagram, with the HR1 colored yellow, the CH colored orange, and residues 568-569 shown as purple spheres. (B) A monomer of the gp41 subunit in the occluded-open state (PDB ID: 6CM3) is shown as a ribbon diagram, colored according to panel A.

[0021] FIG. 8. Illustrates that the I559P mutation and the 2P mutations act synergistically to boost CH848 10. 17DT Env yield. (A) Size-exclusion chromatograms from a Superose 6 Increase column are shown. CH848 10.17DT DS-SOSIP is colored red (“SOSIP”), CH848 10.17DT DS A501C+T605C+2P is colored blue (“SOS 2P”) and CH848 10.17DT DS- S0SIP-2P is colored purple (“SOSIP 2P”). The dashed box denotes the zoomed-in view that is shown in panel B.

[0022] FIG. 9. Illustrates cryo-EM data processing workflow.

[0023] FIG. 10. Illustrates cryo-EM validation. (A) FSC curves for different masking strategies employed by cryoSPARC v3 are plotted. The dotted line represents an FSC value of 0. 143. (B) The viewing direction distribution plot, generated in cryoSPARC v3, is shown for the CH848 10.17DT DS-SOSIP-2P reconstruction. (C) Side (left) and top (right) views of the 3.73 A CH848 10.17DT DS-SOSIP-2P reconstruction are shown, colored according to local resolution. (D) The cryo-EM map is show n as a transparent surface with the corresponding atomic model shown as a ribbon diagram, colored by protomer. (E) Portions of the cryo-EM map are shown as a transparent surface with the atomic model shown as a purple ribbon diagram. Side chains are shown as sticks with oxygen atoms colored red and nitrogen atoms colored blue.

[0024] FIG. 11. Illustrates flow cytometric analysis of proline-stabilized CH848 10.17DT gpl60s. (A) Transiently transfected HEK293-F cells were gated to exclude doublets, debris and dead cells. mAb binding was detected using a PE-labeled goat anti-human IgG Fc secondary antibody. (B) Raw histograms for each mAb that is displayed in FIG. 4A are shown. CH848 10.17DT gp!60 curves are colored orange, CH848 10.17DT 2P gp!60 curves are colored purple, CH848 10.17DT 2P I559P gpl60 curves are colored blue and untransfected curves are colored gray. The number of cells, normalized to mode, is plotted on the y-axis and PE signal is plotted on the x-axis.

[0025] FIG. 12. Conservation of residues Leu568 and Thr569. A WebLogo plot, calculated using 6,599 Env sequences curated in the LANL HIV database, is shown for positions 558-579. Residues have been colored according to their chemical properties (polar = green, neutral = purple, basic = blue, acidic = red, hydrophobic = black).

[0026] FIG. 13. Representative negative-stain EM 2D class averages of CH848 10. 17DT DS-SOSIP L568P+T569P. calculated in Relion 3.0, are shown.

DETAILED DESCRIPTION OF THE INVENTION

[0027] Disclosed here are amino acid substitutions (e.g., proline substitutions) that increase the yield of recombinantly expressed prefusion Env, including Env ectodomains. By targeting the region in gp41 that connects the central helix to the rest of heptad repeat 1 (HR1), these amino acid substitutions disfavor formation of elongated alpha helices that drive transition of Env proteins to the postfusion conformation. We showed that this strategy did not alter the antigenicity or structure of prefusion Env. While not wanting to be bound to a mechanism, we showed that the substitutions can increase the yield of Env trimers by enhancing the durability of the prefusion conformation, rather than by increasing overall expression. During the harsh, prolonged conditions of transient transfection, Env ectodomains that can retain the prefusion conformation are less likely to spontaneously and irreversibly trigger to the postfusion conformation and be lost as insoluble aggregates (28). In addition to facilitating the recombinant expression and purification of recombinant prefusion Envs for use as biochemical reagents in the laboratory, the enhanced durability of example 2P Envs can have implications for future vaccine development efforts. For example, by enhancing durability of the prefusion conformation of Env, the residence of Env immunogens in the body can be increased. Improved stability of the modified Envs can stabilize vaccines prior to administration. For example, vaccines containing these Env proteins can be more stable under circumstances where it is not possible to maintain a reliable cold chain.

[0028] Many proposed vaccination regimens currently rely on extensive priming and boosting schedules, which can complicate vaccine adherence among the general population (32, 33). One reason for this fastidiousness is the difficulty 7 associated with eliciting even autologous neutralizing antibodies (36, 37). By enhancing the durability of the prefusion conformation of Env, it can increase the residence time of Env immunogens in the body, mimicking the conditions of natural infection where natively expressed Env is perpetually present and exposed to the adaptive immune system (20). Furthermore, the results of our thermostability and forced degradation assays indicate that 2P-stabilized Envs do not need the same fastidious storage conditions that are a requirement for unstabilized Env immunogens. In addition to the advantages that this poses in a laboratory environment, it is also a consideration when thinking about administering vaccines in regions that may lack sufficient infrastructure to reliably maintain a cold chain.

[0029] We also show that the 2P stabilization strategy 7 is not solely effective in the context of the CH848 10.17DT Env. By introducing these mutations into a diverse panel of different Env ectodomains, we were able to enhance the yield of multiple prefusion trimers, albeit to differing degrees of success depending on the variant in question. The 2P stabilization strategy can be a broadly applicable means to enhance the yield of antigenically diverse prefusion trimers and will prove to be useful for immunogen design.

[0030] Detailed descriptions of one or more embodiments are provided herein. It is to be understood, however, that the present invention may be embodied in various forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but rather as a basis for the claims and as a representative basis for teaching one skilled in the art to employ the present invention in any appropriate manner.

[0031] The singular forms ‘'a”, “an” and ’‘the” include plural reference unless the context clearly dictates otherwise. The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.” [0032] Wherever any of the phrases “for example,” “such as,” “including” and the like are used herein, the phrase “and without limitation” is understood to follow unless explicitly stated otherwise. Similarly, “an example,” “exemplary” and the like are understood to be nonlimiting. [0033] The term ‘‘substantially'’ allows for deviations from the descriptor that do not negatively impact the intended purpose. Descriptive terms are understood to be modified by the term ‘'substantially” even if the word “substantially” is not explicitly recited.

[0034] The terms “comprising” and “including” and “having” and “involving” (and similarly “comprises”, “includes,” “has,” and “involves”) and the like are used interchangeably and have the same meaning. Specifically, each of the terms is defined consistent with the common United States patent law definition of “comprising” and is therefore interpreted to be an open term meaning “at least the following,” and is also interpreted not to exclude additional features, limitations, aspects, etc. Thus, for example, “a process involving steps a, b, and c” means that the process includes at least steps a. b and c. Wherever the terms “a” or “an” are used, “one or more” is understood, unless such interpretation is nonsensical in context.

[0035] As used herein, the term “about” can refer to approximately, roughly, around, or in the region of. When the term “about” is used in conjunction with a numerical range, it modifies that range by extending the boundaries above and below the numerical values set forth. The term “about” is used herein to modify a numerical value above and below the stated value by a variance of 20 percent up or down (higher or lower).

Type I Fusion Proteins, Prefusion and Postfusion Forms, and Protein Instability

[0036] Herein are disclosed modified type I fusion proteins. In some embodiments the modified proteins have amino acid substitutions as compared to the naturally-occurring proteins. In some embodiments, the type I fusion proteins can be from viruses like influenza, some retroviruses (e.g., lentiviruses like human immunodeficiency viruses, including HIV-1 or HIV-2, and SIV), coronaviruses, Ebola viruses and the like. In some embodiments, the type I fusion proteins can be syncytins, which are proteins that can be used in mammalian cell fusions. In some embodiments, the ty pe I fusion proteins can be made of three subunits (i.e., trimeric). The ty pe I fusion proteins can be from HIV-1 Groups M, N or O. The HIV-1 can be from Group M and from subtype or clade A, B, C, D, F, G, H, J or K. In some embodiments, the proteins can be viral envelope proteins (Env). The Env protein can be a gpl60, a gpl20, or a gp41 protein. The Env protein can be an ectodomain.

[0037] HIV-1 is a lentivirus that makes use of a class I viral fusion protein, called Env, to gain entry' into host cells to begin the process of integration and replication (7). Like other class I viral fusion proteins, Env is proteolytically processed into two distinct subunits. gp!20 and gp41(8). These subunits remain associated and oligomerize with other protomers to form the functional prefusion conformation of Env, which is composed of a trimer of gpl20/gp41 heterodimers. The N-terminal gpl20 subunit is responsible for mediating binding to both CD4 and CCR5 (9-1 1), while the gp41 subunit contains the hydrophobic fusion peptide and the helical heptad repeats that drive membrane fusion (12).

[0038] Due to the role that Env plays during the process of infection, it is the main target for vaccine development. However, several characteristics of Env make it a complex and problematic immunogen. To begin, it is covered by a dense glycan shield composed of N- linked glycans that hamper the elicitation of neutralizing antibodies (13). Furthermore, constant immune pressure (14) and the infidelity of the HIV-1 reverse transcriptase has resulted in sequence diversity among Env variants (15). Env has also evolved to be metastable in the prefusion conformation, such that it is primed to rapidly and irreversibly transition to the postfusion conformation (16). This inherent metastability makes it difficult to recombinantly express the Env ectodomain in the antigenically desirable prefusion conformation.

[0039] In an effort to stabilize the prefusion conformation of the Env ectodomain without altering its antigenicity, Sanders et al. developed the SOSIP mutations, composed of an engineered disulfide bond (SOS) linking the gpl20 and gp41 subunits and an I559P (IP) substitution that disfavors formation of elongated postfusion helices that make up the six- helix bundle (17, 18; All amino acid numbering herein uses HXB2 numbering; Korber, Bette, et al. "Numbering positions in HIV relative to HXB2CG." Human retroviruses and AIDS 3 (1998): 102-1 1 1). Since their initial description, the SOSIP mutations have undergone iterative improvements to enhance the thermostability and the antigenic characteristics of the prefusion trimer (19-21). Alternative and complimentary protein engineering approaches have also been described as a means of yielding improved Env immunogens (22-26). However, despite these advancements in immunogen engineering, many Env variants remain recalcitrant to recombinant in vitro expression.

[0040] In some embodiments, the modified type I fusion proteins disclosed herein have substitutions of certain amino acids that are found in the naturally-occurring or wild-type forms of the proteins. In some embodiments, the amino acid positions that are substituted can be positions in a protein that are involved in a prefusion conformation of the protein changing to a postfusion conformation. Generally, modification of the protein by substitution of amino acids at these locations disfavors conformational change of the modified protein from a prefusion to postfusion state as compared to the unmodified protein (e.g., disfavors a- helices). In some embodiments, the amino acid positions that are substituted can be found in regions of the protein that are parts of flexible loops.

[0041] In some embodiments, the amino acid positions in a retroviral (e.g., HIV-1) envelope (Env) protein can be 536, 545, 568, 569, and combinations thereof (HXB2 numbering). In various embodiments, the amino acid substitution can be made at one of positions 536, 545,

568, 569. at any two of positions 536, 545, 568, 569, at any three of positions 536, 545, 568,

569, or at all of positions 536, 545, 568, 569. In some embodiments, the amino acid substitutions can be made at one or more of positions 536, 545, 568, 569 and are the only substituted amino acids in the protein, or the only substituted amino acids in the protein that affect protein stability. In some embodiments, the amino acid positions in a retroviral (e g., HIV-1) Env protein can be made at positions 536 and 545, 536 and 568, 536 and 569, 545 and 568, 545 and 569, or 568 and 569. In some embodiments, the substitution at position 568 can replace a leucine in the unmodified protein. In some embodiments, the substitution at position 569 can replace a threonine. In some embodiments, the leucine at position 568 and the threonine at position 569 can be replaced or substituted.

[0042] The amino acid substitutions can substitute an amino acid in a type I fusion protein for an amino acid that has a lower configurational entropy than the existing amino acid at a location. In some embodiments, the amino acids at these locations are substituted with proline. Proline substitution can reduce the conformational/configurational entropy of the protein. The cyclized side chain of proline also disfavors formation of alpha helices in proteins. In some embodiments, proline can be used as the substituting amino acid. In some embodiments, glycine can be used as the substituting amino acid.

[0043] In some embodiments, in addition to the substitutions at the 536, 545, 568 and/or 569 positions as above, the protein can have additional mutations as compared to the naturally- occurring protein. For example, the protein can have an amino acid substitution at position 559. In some embodiments, the amino acid substitution can be I559P (IP). In some embodiments, the protein can have a modification that is a disulfide bond linking a gpl20 and gp41 subunit (SOS). In some embodiments, the protein can have an IP and an SOS modification (SOSIP). In some embodiments, the protein can have one or more of modifications DS (1201C+A433C), DT (N133D+N138T), chimeric BG 505 gp41, Q171K, SOSIPv6, F14 (V68I+A204V+V208L+V255L), and combinations thereof.

[0044] In some embodiments, the modified lentiviral fusion proteins can be as below. The substituted 2P amino acids are indicated by underlining/highlighting. [0045] CH848,3.d949.10.17DTchim DS-SOSIP-2P

[0046] MPMGSLQPLATLYLLGMLVASVLAAENLWVTVYYGVPVWKEAKTTLFCAS

DARAYEKEVHNVWATHACVPTDPSPQELVLGNVTENFNMWKNDMVDQMHEDIISL

WDQSLKPCVKLTPLCVTLICSDATVKTGTVEEMKNCSFNTTTEIRDKEKKEYALFYK

PDIVPLSETNNTSEYRLINCNTSACTQACPKVTFEPIPIHYCAPAGYAILKCNDETF NG

TGPCSNVSTVQCTHGIRPVVSTQLLLNGSLAEKEIVIRSENLTNNAKIIIVHLHTPV EIV

CTRPNNNTRKSVRIGPGQTFYATGDIIGDIKQAHCNISEEKWNDTLQKVGIELQKHF P

NKTIKYNQSAGGDMEITTHSFNCGGEFFYCNTSNLFNGTYNGTYISTNSSANSTSTI TL

QCRIKQIINMWQGVGRCMYAPPIAGNITCRSNITGLLLTRDGGTNSNETETFRPAGG D

MRDNWRSELYKYKVVKIEPLGVAPTRCKRRVVGRRRRRRAVGIGAVFLGFLGAAG

STMGAASMTLTVQARNLLSGIVQQQSNLLRAPEAQQHLLKPPVWGIKQLQARVLAV

ERYLRDQQLLGIWGCSGKLICCTNVPWNSSWSNRNLSEIWDNMTWLQWDKEISNYT

QIIYGLLEESQNQQEKNEQDLLALD (SEQ ID NO: 1)

[0047] SIVcpzCAM13 Q171K DS-S0SIP-2P

[0048] MPMGSLQPLATLYLLGMLVASVLASDEWVTVYYGVPVWREVNTVLFCASD

AKAHSTEAHNIWATHACVPTDPNPQEVVLHNVTEDFNMWNNQMVEQMQEDISSL

WDQSLKPCVKLTPLCVTMKCSNVTRQNSTSNSNNNPTLKNNTSGKNETGVLQMKN

CTFNTTTELRDKKKKVYSLFYVDDLQSLGSGTGDTYTMINCNTTACTQACPKVSFEP

IPIHYCAPAGFAILKCNDVNFSGKGKCRNVSTVHCTHGIKPVVTTQLIINGSLATEN IT

VRVNNASKNTHDWIVQLSTAVNLTCKRVGNNTRGKVQIGPGMTFYNMDHIFGDTR

KAFCELNGTTWNETLQKVRESLIKEIKANANGTYNITFEPSSGGDPEIANHMFNCGG E

FFYCDTRKMFNESEPFHENMTIPCRIRQIVNSWMRVGRCIYAPPIPGHITCNSLITG LIL

TRDHVNNTNNTFRPIGGDMKNIWRSELYKYKVVRIEPLSVAPTKCKRHTVGERRRR

RRAAFGLGALFLGFLGAAGSTMGAAAVTLTVQARQLLSGIVQQQNNLLRAPEAQQH LLQPPVWGVKQLQARLLAVERYLQDQQILGLWGCSGKSICCTTVPWNKTWSGKSM SDIWNNLTWQQWDKLITNYTGTIFGLLEEAQSQQEKNEKDLLELD (SEQ ID NO: 2)

[0049] B41 S0SIP-2P

[0050] MPMGSLQPLATLYLLGMLVASVLAAAKKWVTVYYGVPVWKEATTTLFCAS

DAKAYDTEVHNVWATHACVPTDPNPQEIVLGNVTENFNMWKNNMVEQMHEDIISL

WDQSLKPCVKLTPLCVTLNCNNVNTNNTNNSTNATISDWEKMETGEMKNCSFNVT

TSIRDKIKKEYALFYKLDVVPLENKNNINNTNITNYRLINCNTSVITQACPKVSFEP IPI

HYCAPAGFAILKCNSKTFNGSGPCTNVSTVQCTHGIRPVVSTQLLLNGSLAEEEIVI RS

ENITDNAKTIIVQLNEAVEINCTRPNNNTRKSIHIGPGRAFYATGDIIGNIRQAHCN ISK

ARWNETLGQIVAKLEEQFPNKTIIFNHSSGGDPEIVTHSFNCGGEFFYCNTTPLFNS TW

NNTRTDDYPTGGEQNITLQCRIKQIINMWQGVGKAMYAPPIRGQIRCSSNITGLLLT R

DGGRDQNGTETFRPGGGNMRDNWRSELYKYKVVKIEPLGIAPTACKRRVVQRRRR

RRAVGLGAFILGFLGAAGSTMGAASMALTVQARLLLSGIVQQQNNLLRAPEAQQHM

LQPPVWGIKQLQARVLAVERYLRDQQLLGIWGCSGKIICCTNVPWNDSWSNKTINEI

WDNMTWMQWEKEIDNYTQHIYTLLEVSQIQQEKNEQELLELD (SEQ ID NO: 3)

[0051] JRFL SOSIPv6-2P

[0052] MPMGSLQPLATLYLLGMLVASCLGVEKLWVTVYYGVPVWKEACTTLFCAS

DAKAYDTKVRNVWATHCCVPTDPNPQEVVLENVTEHFNMWKNNMVEQMQEDIIS

LWDQSLKPCVKLTPLCVTLNCKDVNATNTTNDSEGTMERGEIKNCSFNITTSIRDKV

QKEYALFYKLDVVPIDNNNTSYRLISCDTSVITQACPKISFEPIPIHYCAPAGFAIL KCN

DKTFNGKGPCKNVSTVQCTHGIRPVVSTQLLLNGSLAEEEVVIRSDNFTNNAKTIIV Q

LKESVEINCTRPNNNTRKSIHIGPGRWFYTTGEIIGDIRQAHCNISRAKWNDTLKQI VI

KLREQFENKTIVFNHSSGGDPEIVMHSFNCGGEFFYCNSTQLFNSTWNNNTEGSNNT EGNTITLPCRIKQIINMWQEIGKAMYAPPIRGQIRCSSNITGLLLTRDGGINENGTEIFR

PGGGDMRDNWRSELYKYKVVKIEPLGVAPTKCKRRVVGRRRRRRAVGIGAVFLGF

LGAAGSTMGAASMTLTVQARQLLSGIVQQQNNCLRAPECQQRMLQPPVWGIKQLQ ARVLAVERYLGDQQLLGIWGCSGKLICCTAVPWNASWSNKSLDRIWNNMTWMEW

EREIDNYTSEIYTLIEESQNQQEKNEQELLELD (SEQ ID NO: 4)

[0053] T250-4 DS-S0SIP-2P

[0054] MDAMKRGLCCVLLLCGAVFVSPSQEIHARFRRGARAEKLWVTVYYGVPVW

READTTLFCASDAKGYDTEAHNVWATHACVPTDPRPQEMYLENVTENFNMWKNS

MVEQMHTDIISLWDESLKPCVKLTPLCVTLDCQAFNSSSHTNSSIAMQEMKNCSFNV

TTELRDKKKKEYSFFYKTDIEQINKNGRQYRLINCNTSACTQACPKVSFEPIPIHFC AP

AGFAILKCNEKHFNGKGPCKNVSTVQCTHGIKPVVSTQLLLNGSLAEEEVVIRVENT I

DNAKTIIVQLAKPVKINCTRPNNNTRKSIRIGPGQTFYATGDIIGNIRKAYCNVSKR EW

NNTLQQVAAQLSKSFNNTKIVFEKHSGGDLEVITHSFVCGGEFFYCNTSGLFNSTWT

NSTWTNSTTGSNGTESNDTITLQCEIKQFINMWQRVGRCMYAPPIPGVIRCESDITG L

LLTRDGPNSTQNETFRPGGGDMRDNWRSELYKYKVVQIEPLGVAPTHCKRRVVERR

RRRRAVGLGAVFFGFLGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLKAPEAQQ

QLLRPPVWGIKQLQARVLALERYLKDQQLLGIWGCSGKLICCTTVPWNSSWSNKNY

TDIWDNMTWLQWDREISNYTDEIYRLIEQSQNQQEKNEQDLLALD (SEQ ID NO: 5)

[0055] CH505w24 F14 DS-S0SIP-2P

[0056] MPMGSLQPLATLYLLGMLVASVLAAENLWVTVYYGVPVWKEAKTTLFCAS

DAKAYEKEVHNIWATHACVPTDPNPQEMVLKNVTENFNMWKNDMVDQMHEDVIS

LWDQSLKPCVKLTPLCVTLNCTNATANATASNSSIIEGMKNCSFNITTELRDKREKK

NALFYKLDIVQLDGNSSQYRLINCNTSVITQVCPKVSFDPIPIHYCAPAGYAILKCN NK

TFTGTGPCNNVSTVQCTHGIKPVLSTQLLLNGSLAEGEIIIRSENITNNGKTIIVQL NES

VKIECTRPNNKTRTSIRIGPGQAFYATGQVIGNIREAYCNISESKWNETLQRVSKKL KE

YFPHKNITFQPSSGGDLEITTHSFNCGGEFFYCNTSSLFNRTYMANSTDMANSTETN S

TRIITIHCRIKQIINMWQEVGRAMYAPPIAGNITCISNITGLLLTRDGGKNNTDTET FRP

GGGNMKDNWRSELYKYKVVEVKPLGVAPTNCRRRVVERRRRRRAVGMGAVFLGF

LGAAGSTMGAASITLTVQARQLLSGIVQQQSNLLKAPEAQQHMLKPPVWGIKQLQA RVLALERYLKDQQLLGMWGC SGKLICCTNVYWNS S WSNKTYGDIWDNMTWMQW

EREISNYTEIIYELLEESQNQQEKNEQDLLALD (SEQ ID NO: 6)

[0057] In some embodiments, the modified lentiviral fusion proteins can be as below (i.e., mma3_CAP256SU_UCA_OPT_4.0, mma4_CAP256SU_UCA_OPT_4.0 and mma5_CAP256SU_UCA_OPT_4.0). The substituted 2P amino acids are indicated by underlining/highlighting.

[0058] >mma3 CAP256SU UCA OPT 4,0

[0059] MAISGVPVLGFFIIAVLMSAQESWAGLWVTVYYGVPVWREAKTTLFCASDA

KSYEKEVANIWATHACVPTDPNPQELVLKNVTENFNMWKNDMVDQMHEDIISLWD

QSLKPCVKLTPLCVTLHCSTYNNTHNISKEIKNCSFNATTELRDKRRKEYALFYRLD I

VPLNKNGRQYRLINCNTSVCTQVCPKLTFDPIPIHYCAPAGYAILKCNNKTFNGTGP C

NNVSTVQCTHGIKPVLSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEINC TRP

NNNTRKSIRIGPGQTFYATGDIIGDIRQAHCNISEIKWNKTLQRVSEKLREHFNKTI IFN

QSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSNITLPCRIKQ IIN

MWQEVGRCMYASPIAGEITCKSNITGLLLTRDGGGNNSTEETFRPGGGNMRDNWRS ELYKYKVVEVKPLGIAPTECRRRVVQGGGGSGGGGSAVVGLGAVFLGFLGTAGSTM GAASITLTVQARQLLSGIVQQQSNLLRAPEAQQHMLQPPVWGIKQLQARVLTIERYA KDQQLLGMWGCSGKLICCTNVYWNSSWSNKTYNEIWDNMTWMQWDREIDNYTD TIYKLLEVSQKQQESNEKDLLALDSWNNLWNWFDISKWLWYIKIFIMIVGGLIGLRII

FAVLSLVNRVRQGIRPVFSSPPSYFQ (SEQ ID NO: 7)

[0060] >mma4 CAP256SU UCA OPT 4,0

[0061] MAISGVPVLGFFIIAVLMSAQESWAGLWVTVYYGVPVWREAKTTLFCASDA

KSYEKEVANIWATHACVPTDPNPQELVLKNVTENFNMWKNDMVDQMHEDIISLWD

QSLKPCVKLTPLCVTLHCSTYNNTHNISKEIKNCSFNATTELRDKRRKEYALFYRLD I

VPLNKNGRQYRLINCNTSVCTQVCPKLTFDPIPIHYCAPAGYAILKCNNKTFNGTGP C

NNVSTVQCTHGIKPVLSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEINC TRP

NNMTRKSIRIGPGQTFYALGDIIGDIRQPHCNISEIKWNKTLQRVSEKLREHFNKTI IFN

QSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSNITLPCRIKQ IIN

MWQEVGRCMYAPPIAGNITCKSNITGLLLTRDGGGNNSTEETFRPGGGNMRDNWRS

ELYKYKVVEVKPLGIAPTECRRRVVQGGGGSGGGGSAVVGLGAVFLGFLGTAGSTM

GAASNTLTVQARQLLSGIVQQQSNLLRAPEAQQHMLQPPVWGFKQLQARVLTIERY

AEVQQLLGMWGCSGKLICCTNVPWNSSWSNKTYNEIWDNMTWMQWDREIGNYTD TIYKLLEVSQFQQEINEKDLLALDSWNNLWNWFDISKWLWYIKIFIMIVGGLIGLRIIF AVLSLVNRVRQGIRPVFSSPPSYFQ (SEQ ID NO: 8)

[0062] mma5 CAP256SU UCA OPT

[0063] MAISGVPVLGFFIIAVLMSAQESWAGLWVTVYYGVPVWREAKTTLFCASDA

KSYEKEVHNVWATHACVPTDPNPQELVLKNVTENFNMWKNDMVDQMHEDIISLW

DQSLKPCVKLTPLCVTLHCSTYNNTHNISKEIKNCSFNATTELRDKRRKEYALFYRL D

IVPLNKNGRQYRLINCNTSVCTQICPKVTFDPIPIHYCAPAGYAILKCNNKTFNGTG PC

NNVSTVQCTHGIKPVVSTQLLLNGSLAEEEIIIRSENLTDNVKTIIVHLNESVEINC TRP

NNMTRKSIRIGPGQTFYALGDIIGDIRQPHCNISEIKWNKTLQRVSEKLREHFNKTI IFN

QSSGGDLEITTHSFNCGGEFFYCNTSDLFFNKTFNETYSTGSNSTNSNITLPCRIKQ IIN

MWQEVGRCMYAPPIAGNITCKSNITGLLLTRDGGGNNSTEETFRPGGGNMRDNWRS

ELYKYKVVEVKPLGIAPTECRRRVVQGGGGSGGGGSAVVGLGAVFLGFLGTAGSTM

GAASNTLTVQARQLLSGIVQQQSNLLRAPEAQQHMLQPPVWGFKQLQARVLAIERY

LEVQQLLGMWGCSGKLICCTNVPWNSSWSNKTYNEIWDNMTWMQWDREIGNYTD TIYKLLEVSQFQQEINEKDLLALDSWNNLWNWFDISKWLWYIKIFIMIVGGLIGLRIIF AVLSLVNRVRQGIRPVFSSPPSYFQ (SEQ ID NO: 9)

[0064] The nucleic acid sequences encoding the mma3, mma4 and mma5 proteins shown above are shown below (i.e., 2560_pUC-ccTEV-co.mRNA3_CAP256SU_UCA_OPT_4.0-

A101, 2560_pUC-ccTEV-co.mRNA4_CAP256SU_UCA_OPT_4.0-A101 and 2560_pUC- ccTEV-co.mRNA5_CAP256SU_UCA_OPT_4.0-A101).

[0065] >2560 pUC-ccTEV-co.mRNA3 CAP256SU UCA OPT 4.0-A101

[0066] aGcATAAAAGTCTCAACACAACATATACAAAACAAACGAATCTCAAGCAA

TCAAGCATTCTACTTCTATTGCAGCAATTTAAATCATTTCTTTTAAAGCAAAAGCA

ATTTTCTGAAAATTTTCACCATTTACGAACGATAGCGCTgccaccATGGCCATCTCC GGCGTGCCCGTGCTGGGCTTCTTCATCATCGCCGTGCTGATGTCCGCCCAGGAGT

CCTGGGCCGGCCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGCGCGAGG CCAAGACCACCCTGTTCTGCGCCTCCGACGCCAAGTCCTACGAGAAGGAGGTGG CCAACATCTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCCAGGAGC TGGTGCTGAAGAACGTGACCGAGAACTTCAACATGTGGAAGAACGACATGGTGG ACCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTCCCTGAAGCCCTGCGT

GAAGCTGACCCCCCTGTGCGTGACCCTGCACTGCTCCACCTACAACAACACCCAC

AACATCTCCAAGGAGATCAAGAACTGCTCCTTCAACGCCACCACCGAGCTGCGC

GACAAGCGCCGCAAGGAGTACGCCCTGTTCTACCGCCTGGACATCGTGCCCCTG

AACAAGAACGGCCGCCAGTACCGCCTGATCAACTGCAACACCTCCGTGTGCACC CAGGTGTGCCCCAAGCTGACCTTCGACCCCATCCCCATCCACTACTGCGCCCCCG CCGGCTACGCCATCCTGAAGTGCAACAACAAGACCTTCAACGGCACCGGCCCCT

GCAACAACGTGTCCACCGTGCAGTGCACCCACGGCATCAAGCCCGTGCTGTCCA

CCCAGCTGCTGCTGAACGGCTCCCTGGCCGAGGAGGAGATCATCATCCGCTCCG AGAACCTGACCGACAACGTGAAGACCATCATCGTGCACCTGAACGAGTCCGTGG AGATCAACTGCACCCGCCCCAACAACAACACCCGCAAGTCCATCCGCATCGGCC

CCGGCCAGACCTTCTACGCCACCGGCGACATCATCGGCGACATCCGCCAGGCCC ACTGCAACATCTCCGAGATCAAGTGGAACAAGACCCTGCAGCGCGTGTCCGAGA AGCTGCGCGAGCACTTCAACAAGACCATCATCTTCAACCAGTCCTCCGGCGGCG

ACCTGGAGATCACCACCCACTCCTTCAACTGCGGCGGCGAGTTCTTCTACTGCAA CACCTCCGACCTGTTCTTCAACAAGACCTTCAACGAGACCTACTCCACCGGCTCC AACTCCACCAACTCCAACATCACCCTGCCCTGCCGCATCAAGCAGATCATCAACA

TGTGGCAGGAGGTGGGCCGCTGCATGTACGCCTCCCCCATCGCCGGCGAGATCA

CCTGCAAGTCCAACATCACCGGCCTGCTGCTGACCCGCGACGGCGGCGGCAACA ACTCCACCGAGGAGACCTTCCGCCCCGGCGGCGGCAACATGCGCGACAACTGGC GCTCCGAGCTGTACAAGTACAAGGTGGTGGAGGTGAAGCCCCTGGGCATCGCCC

CCACCGAGTGCCGCCGCCGCGTGGTGCAGGGCGGCGGCGGCTCCGGCGGCGGCG

GCTCCGCCGTGGTGGGCCTGGGCGCCGTGTTCCTGGGCTTCCTGGGCACCGCCGG CTCCACCATGGGCGCCGCCTCCATCACCCTGACCGTGCAGGCCCGCCAGCTGCTG TCCGGCATCGTGCAGCAGCAGTCCAACCTGCTGCGCGCCCCCGAGGCCCAGCAG

CACATGCTGCAGCCCCCCGTGTGGGGCATCAAGCAGCTGCAGGCCCGCGTGCTG

ACCATCGAGCGCTACGCCAAGGACCAGCAGCTGCTGGGCATGTGGGGCTGCTCC GGCAAGCTGATCTGCTGCACCAACGTGTACTGGAACTCCTCCTGGTCCAACAAGA CCTACAACGAGATCTGGGACAACATGACCTGGATGCAGTGGGACCGCGAGATCG

ACAACTACACCGACACCATCTACAAGCTGCTGGAGGTGTCCCAGAAGCAGCAGG

AGTCCAACGAGAAGGACCTGCTGGCCCTGGACTCCTGGAACAACCTGTGGAACT GGTTCGACATCTCCAAGTGGCTGTGGTACATCAAGATCTTCATCATGATCGTGGG CGGCCTGATCGGCCTGCGCATCATCTTCGCCGTGCTGTCCCTGGTGAACCGCGTG

CGCCAGGGCATCCGCCCCGTGTTCTCCTCCCCCCCCTCCTACTTCCAGtaataaact agtA

GTGACTGACTAGGATCTGGTTACCACTAAACCAGCCTCAAGAACACCCGAATGG AGTCTCTAAGCTACATAATACCAACTTACACTTACAAAATGTTGTCCCCCAAAAT GTAGCCATTCGTATCTGCTCCTAATAAAAAGAAAGTTTCTTCACATTCTAAAAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO: 10)

[0067] >2560 pUC-ccTEV-co.mRNA4 CAP256SU UCA OPT 4.0-A101

[0068] aGcATAAAAGTCTCAACACAACATATACAAAACAAACGAATCTCAAGCAA

TCAAGCATTCTACTTCTATTGCAGCAATTTAAATCATTTCTTTTAAAGCAAAAGCA

ATTTTCTGAAAATTTTCACCATTTACGAACGATAGCGCTgccaccATGGCCATCTCC

GGCGTGCCCGTGCTGGGCTTCTTCATCATCGCCGTGCTGATGTCCGCCCAGGAGT CCTGGGCCGGCCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGCGCGAGG CCAAGACCACCCTGTTCTGCGCCTCCGACGCCAAGTCCTACGAGAAGGAGGTGG CCAACATCTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCCAGGAGC

TGGTGCTGAAGAACGTGACCGAGAACTTCAACATGTGGAAGAACGACATGGTGG

ACCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTCCCTGAAGCCCTGCGT

GAAGCTGACCCCCCTGTGCGTGACCCTGCACTGCTCCACCTACAACAACACCCAC

AACATCTCCAAGGAGATCAAGAACTGCTCCTTCAACGCCACCACCGAGCTGCGC

GACAAGCGCCGCAAGGAGTACGCCCTGTTCTACCGCCTGGACATCGTGCCCCTG

AACAAGAACGGCCGCCAGTACCGCCTGATCAACTGCAACACCTCCGTGTGCACC CAGGTGTGCCCCAAGCTGACCTTCGACCCCATCCCCATCCACTACTGCGCCCCCG

CCGGCTACGCCATCCTGAAGTGCAACAACAAGACCTTCAACGGCACCGGCCCCT

GCAACAACGTGTCCACCGTGCAGTGCACCCACGGCATCAAGCCCGTGCTGTCCA

CCCAGCTGCTGCTGAACGGCTCCCTGGCCGAGGAGGAGATCATCATCCGCTCCG AGAACCTGACCGACAACGTGAAGACCATCATCGTGCACCTGAACGAGTCCGTGG

AGATCAACTGCACCCGCCCCAACAACATGACCCGCAAGTCCATCCGCATCGGCC

CCGGCCAGACCTTCTACGCCCTGGGCGACATCATCGGCGACATCCGCCAGCCCCA

CTGCAACATCTCCGAGATCAAGTGGAACAAGACCCTGCAGCGCGTGTCCGAGAA

GCTGCGCGAGCACTTCAACAAGACCATCATCTTCAACCAGTCCTCCGGCGGCGAC

CTGGAGATCACCACCCACTCCTTCAACTGCGGCGGCGAGTTCTTCTACTGCAACA

CCTCCGACCTGTTCTTCAACAAGACCTTCAACGAGACCTACTCCACCGGCTCCAA

CTCCACCAACTCCAACATCACCCTGCCCTGCCGCATCAAGCAGATCATCAACATG

TGGCAGGAGGTGGGCCGCTGCATGTACGCCCCCCCCATCGCCGGCAACATCACC

TGCAAGTCCAACATCACCGGCCTGCTGCTGACCCGCGACGGCGGCGGCAACAAC

TCCACCGAGGAGACCTTCCGCCCCGGCGGCGGCAACATGCGCGACAACTGGCGC

TCCGAGCTGTACAAGTACAAGGTGGTGGAGGTGAAGCCCCTGGGCATCGCCCCC

ACCGAGTGCCGCCGCCGCGTGGTGCAGGGCGGCGGCGGCTCCGGCGGCGGCGGC

TCCGCCGTGGTGGGCCTGGGCGCCGTGTTCCTGGGCTTCCTGGGCACCGCCGGCT

CCACCATGGGCGCCGCCTCCAACACCCTGACCGTGCAGGCCCGCCAGCTGCTGTC

CGGCATCGTGCAGCAGCAGTCCAACCTGCTGCGCGCCCCCGAGGCCCAGCAGCA

CATGCTGCAGCCCCCCGTGTGGGGCTTCAAGCAGCTGCAGGCCCGCGTGCTGACC

ATCGAGCGgTACGCCGAGGTGCAGCAGCTGCTGGGCATGTGGGGCTGCTCCGGC

AAGCTGATCTGCTGCACCAACGTGCCCTGGAACTCCTCCTGGTCCAACAAGACCT

ACAACGAGATCTGGGACAACATGACCTGGATGCAGTGGGACCGCGAGATCGGCA

ACTACACCGACACCATCTACAAGCTGCTGGAGGTGTCCCAGTTCCAGCAGGAGA

TCAACGAGAAGGACCTGCTGGCCCTGGACTCCTGGAACAACCTGTGGAACTGGT

TCGACATCTCCAAGTGGCTGTGGTACATCAAGATCTTCATCATGATCGTGGGCGG

CCTGATCGGCCTGCGCATCATCTTCGCCGTGCTGTCCCTGGTGAACCGCGTGCGC

CAGGGCATCCGCCCCGTGTTCTCCTCCCCCCCCTCCTACTTCCAGtaataaactagt AGTG

ACTGACTAGGATCTGGTTACCACTAAACCAGCCTCAAGAACACCCGAATGGAGT

CTCTAAGCTACATAATACCAACTTACACTTACAAAATGTTGTCCCCCAAAATGTA

GCCATTCGTATCTGCTCCTAATAAAAAGAAAGTTTCTTCACATTCTAAAAAAAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO: 11)

[0069] >2560 pUC-ccTEV-co.mRNA5 CAP256SU UCA OPT 4.0-A101

[0070] aGcATAAAAGTCTCAACACAACATATACAAAACAAACGAATCTCAAGCAA

TCAAGCATTCTACTTCTATTGCAGCAATTTAAATCATTTCTTTTAAAGCAAAAGCA

ATTTTCTGAAAATTTTCACCATTTACGAACGATAGCGCTgccaccATGGCCATCTCC

GGCGTGCCCGTGCTGGGCTTCTTCATCATCGCCGTGCTGATGTCCGCCCAGGAGT

CCTGGGCCGGCCTGTGGGTGACCGTGTACTACGGCGTGCCCGTGTGGCGCGAGG

CCAAGACCACCCTGTTCTGCGCCTCCGACGCCAAGTCCTACGAGAAGGAGGTGC

ACAACGTGTGGGCCACCCACGCCTGCGTGCCCACCGACCCCAACCCCCAGGAGC TGGTGCTGAAGAACGTGACCGAGAACTTCAACATGTGGAAGAACGACATGGTGG

ACCAGATGCACGAGGACATCATCTCCCTGTGGGACCAGTCCCTGAAGCCCTGCGT

GAAGCTGACCCCCCTGTGCGTGACCCTGCACTGCTCCACCTACAACAACACCCAC

AACATCTCCAAGGAGATCAAGAACTGCTCCTTCAACGCCACCACCGAGCTGCGC

GACAAGCGCCGCAAGGAGTACGCCCTGTTCTACCGCCTGGACATCGTGCCCCTG

AACAAGAACGGCCGCCAGTACCGCCTGATCAACTGCAACACCTCCGTGTGCACC

CAGATCTGCCCCAAGGTGACCTTCGACCCCATCCCCATCCACTACTGCGCCCCCG

CCGGCTACGCCATCCTGAAGTGCAACAACAAGACCTTCAACGGCACCGGCCCCT

GCAACAACGTGTCCACCGTGCAGTGCACCCACGGCATCAAGCCCGTGGTGTCCA

CCCAGCTGCTGCTGAACGGCTCCCTGGCCGAGGAGGAGATCATCATCCGCTCCG

AGAACCTGACCGACAACGTGAAGACCATCATCGTGCACCTGAACGAGTCCGTGG

AGATCAACTGCACCCGCCCCAACAACATGACCCGCAAGTCCATCCGCATCGGCC

CCGGCCAGACCTTCTACGCCCTGGGCGACATCATCGGCGACATCCGCCAGCCCCA

CTGCAACATCTCCGAGATCAAGTGGAACAAGACCCTGCAGCGCGTGTCCGAGAA

GCTGCGCGAGCACTTCAACAAGACCATCATCTTCAACCAGTCCTCCGGCGGCGAC

CTGGAGATCACCACCCACTCCTTCAACTGCGGCGGCGAGTTCTTCTACTGCAACA

CCTCCGACCTGTTCTTCAACAAGACCTTCAACGAGACCTACTCCACCGGCTCCAA

CTCCACCAACTCCAACATCACCCTGCCCTGCCGCATCAAGCAGATCATCAACATG

TGGCAGGAGGTGGGCCGCTGCATGTACGCCCCCCCCATCGCCGGCAACATCACC

TGCAAGTCCAACATCACCGGCCTGCTGCTGACCCGCGACGGCGGCGGCAACAAC

TCCACCGAGGAGACCTTCCGCCCCGGCGGCGGCAACATGCGCGACAACTGGCGC

TCCGAGCTGTACAAGTACAAGGTGGTGGAGGTGAAGCCCCTGGGCATCGCCCCC

ACCGAGTGCCGCCGCCGCGTGGTGCAGGGCGGCGGCGGCTCCGGCGGCGGCGGC

TCCGCCGTGGTGGGCCTGGGCGCCGTGTTCCTGGGCTTCCTGGGCACCGCCGGCT

CCACCATGGGCGCCGCCTCCAACACCCTGACCGTGCAGGCCCGCCAGCTGCTGTC

CGGCATCGTGCAGCAGCAGTCCAACCTGCTGCGCGCCCCCGAGGCCCAGCAGCA

CATGCTGCAGCCCCCCGTGTGGGGCTTCAAGCAGCTGCAGGCCCGCGTGCTGGCC

ATCGAGCGCTACCTGGAGGTGCAGCAGCTGCTGGGCATGTGGGGCTGCTCCGGC

AAGCTGATCTGCTGCACCAACGTGCCCTGGAACTCCTCCTGGTCCAACAAGACCT

ACAACGAGATCTGGGACAACATGACCTGGATGCAGTGGGACCGCGAGATCGGCA

ACTACACCGACACCATCTACAAGCTGCTGGAGGTGTCCCAGTTCCAGCAGGAGA

TCAACGAGAAGGACCTGCTGGCCCTGGACTCCTGGAACAACCTGTGGAACTGGT

TCGACATCTCCAAGTGGCTGTGGTACATCAAGATCTTCATCATGATCGTGGGCGG

CCTGATCGGCCTGCGCATCATCTTCGCCGTGCTGTCCCTGGTGAACCGCGTGCGC

CAGGGCATCCGCCCCGTGTTCTCCTCCCCCCCCTCCTACTTCCAGtaataaactagt AGTG

ACTGACTAGGATCTGGTTACCACTAAACCAGCCTCAAGAACACCCGAATGGAGT

CTCTAAGCTACATAATACCAACTTACACTTACAAAATGTTGTCCCCCAAAATGTA

GCCATTCGTATCTGCTCCTAATAAAAAGAAAGTTTCTTCACATTCTAAAAAAAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA (SEQ ID NO: 12)

[0071] Table 1, below, discloses amino acid modifications made in the mrna3_CAP256SU_UCA_OPT_4.0, mma4_CAP256SU_UCA_OPT_4.0 and mrna5_CAP256SU_UCA_OPT_4.0 amino acid sequences, shown above.

[0072] The modified proteins can have properties that are different than the unmodified protein from which the modified proteins are derived. In various embodiments, the modified proteins can be more likely to retain a prefusion conformation and less likely to assume a postfusion conformation as compared to the unmodified protein from which the modified protein is derived. In embodiments, the modified protein can be more stable than the unmodified protein. In embodiments, the modified protein can be produced in higher yields than the unmodified protein under similar experimental conditions. In some embodiments, increased stability can result in improved protein yield in recombinant expression systems. In some embodiments, the modified proteins can have longer half-life in the body of an individual who has been administered the protein (e.g., as a vaccine).

[0073] In embodiments, the modified proteins can substantially retain the immunogenicity or antigenicity of the unmodified protein from which the modified protein is derived. In examples, retention of immunogenicity can be determined by binding of various antibodies to the modified protein as compared to the unmodified protein (i.e, antibody binding profile). In some examples, antibody binding profiles can be obtained through use of combinations of antibodies that span multiple epitopes on the proteins. In some examples, neutralizing antibodies and/or non-neutralizing antibodies can be used.

[0074] In some examples, a nonlimiting list of antibodies that can be used to develop an antibody binding profile can include DH270 UCA3 (Saunders, Kevin O., et al. "Targeted selection of HIV-specific antibody mutations by engineering B cell maturation." Science 366.6470 (2019): eaay7199), DH270.6 (Bonsignori, Mattia, et al. "Staged induction of HIV-1 gly can-dependent broadly neutralizing antibodies." Science translational medicine 9.381 (2017): eaai7514.), PGT125 and PGR128 (Walker, Laura M., et al. "Broad neutralization coverage of HIV by multiple highly potent antibodies." Nature 477.7365 (2011): 466-470), 2G12 (Trkola, Alexandra, et al. "Human monoclonal antibody 2G12 defines a distinctive neutralization epitope on the gpl20 glycoprotein of human immunodeficiency virus type 1." Journal of virology 70.2 (1996): 1100-1108), PGT151 (Falkowska, Emilia, et al. "Broadly neutralizing HIV antibodies define a gly can-dependent epitope on the prefusion conformation of gp41 on cleaved envelope trimers." Immunity 40.5 (2014): 657-668), N6 (Huang, Jinghe, et al. "Identification of a CD4-binding-site antibody to HIV that evolved near-pan neutralization breadth." Immunity 45.5 (2016): 1108-1121), F105 (Posner, Marshall R., et al. "Neutralization of HIV-1 by F105, a human monoclonal antibody to the CD4 binding site of gpl20." Journal of acquired immune deficiency syndromes 6.1 (1993): 7-14, A32 (Liao, Hua- Xin, et al. "Immunogenicity of constrained monoclonal antibody A32-human immunodeficiency virus (HIV) Env gpl20 complexes compared to that of recombinant HIV type 1 gpl20 envelope glycoproteins." Journal of virology 78.10 (2004): 5270-5278), F93F (Moody, M. Anthony, et al. "Anti-phospholipid human monoclonal antibodies inhibit CCR5- tropic HIV-1 and induce P-chemokines." Journal of Experimental Medicine 207.4 (2010): 763-776), 17B (Zhang, Wentao, et al. "Antibody 17b binding at the coreceptor site weakens the kinetics of the interaction of envelope glycoprotein gpl20 with CD4." Biochemistry 40.6 (2001): 1662-1670), 19B (Moore, John P., et al. "A human monoclonal antibody to a complex epitope in the V3 region of gpl20 of human immunodeficiency virus type 1 has broad reactivity within and outside clade B." Journal of virology' 69.1 (1995): 122-130 ), and combinations thereof.

[0075] In some embodiments, retention of immunogenicity can be determined by modeling proteins structures obtained using cryogenic electronic microscopy (cryo-EM), modeling the structures obtained from the method, and comparing data from the models (e.g., modified protein versus unmodified protein). In some embodiments, structural modeling of the proteins can be performed using data from cryo-EM to calculate a three-dimensional map of the protein. A model of the corresponding protein can be constructed using sequence information, sterochemical bond lengths and angles, and the three-dimensional map itself. The coordinates obtained from the model can be compared to coordinates from other models using, for example, root-mean-square-distance (RMSD), which calculates structural similarities between the two proteins based on positions of the paired C-alpha backbond carbons for each amino acid (e.g., Carugo, Oliviero, and Sandor Pongor. "A normalized root- mean-spuare distance for comparing protein three-dimensional structures." Protein science 10.7 (2001): 1470-1473).

[0076] In an embodiment, the Env protein containing the 2P mutations (i.e., CH848 10.17DT DS-SOSIP-2P Env) was compared to an Env protein not containing the 2P mutations (i.e., CH848 10.17DT DS-SOSIP) and the RMSD was 0.757 A over 1.253 C alpha atoms. In some embodiments, comparison of an Env that has 2P mutations with a protein that does not have 2P mutations can be less than about 2.0, 1.9, 1.8, 1.7, 1.6, 1.5, 1.4, 1.3, 1.2, 1.1, 1.0, 0.9, 0.8, 0.7, 0.6 or 0.5 A.

Vaccine Compositions

[0077] Herein, vaccine compositions refer to compositions of the modified proteins described herein suitable for administration to an individual for the purpose of prophy lactically or therapeutically protecting the individual against, for example, an HIV-1, HIV-2 or SIV infection. Generally, the vaccine compositions contain prophy tactically- or therapeutically-effective amounts of the modified proteins. In some embodiments, the vaccine compositions can include pharmaceutically acceptable carriers, diluents or excipients. The vaccine compositions can be administered to an individual by various routes, including oral, subcutaneous, parenteral (such as, intravenous, intraperitoneal), intramuscular, rectal, epidural, intratracheal, intranasal, dermal, vaginal, buccal, ocular and pulmonary administration.

Methods

[0078] Disclosed are methods for making the modified proteins described herein. In some embodiments of the methods, amino acids within the proteins are selected for amino acid substitution based on the position and/or likely function of the amino acid in the protein. For ty pe I fusion proteins that can change conformation from a prefusion form to a postfusion form (e.g., HIV and SIV Env proteins), amino acids that may play a role in the conformational change can be selected. In some embodiments, amino acid positions can be selected that, if substituted with a different amino acid, can decrease the probability that a prefusion conformation of the Env protein will change to a postfusion conformation. In some embodiments, these amino acid positions can be in flexible loops in the protein. In some examples, amino acids within the protein can be randomly or systematically substituted (e.g., a library' of proteins can be made, each protein within the library' have one or more amino acid substitutions).

[0079] In some embodiments, the substituting amino acid (i.e., the amino acid that replaces the amino acid in the unmodified protein) can prevent or decrease the probability that the protein, when in the prefusion conformation will transition to the postfusion conformation (e.g.. inhibit the protein from forming an elongated alpha helix). In some embodiments, the substituting amino acid can have a lower configurational entropy than the amino acid that is substituted. In some embodiments, the substituting amino acid can be proline.

[0080] Generally, after the above-described amino acid substitutions are made (e.g., selected amino acids substituted or random amino acids substituted), the modified proteins containing the amino acid substitutions can be tested. In some embodiments, the ability of the modified proteins to be produced at high yields in, for example, recombinant protein expression systems, can be tested. In some embodiments, modified proteins produced at higher yield than unmodified proteins, under similar experimental conditions, are selected. In some embodiments, these high-yield modified proteins can be proteins with improved stability. Protein stability can also be tested more directly (rather than inferred from yields). For example, the ability' of a modified protein to withstand repeated cycles of freeze and thaw can be tested. In some examples, thermostability of a modified protein can be determined (see Example 5). Other methods are known for investigating protein stability under various conditions, and for comparing stability' of multiple proteins under different conditions. In some embodiments, the propensity' of a modified protein that is in a prefusion conformation to assume a postfusion conformation can be tested.

[0081] In some embodiments, single substituted amino acids from different modified proteins that have increased yield, increases stability, decreased ability to form a postfusion conformation, as described above, can be combined in a single protein to test for additive, synergistic, or even antagonistic effects of the combined substitutions on the protein.

[0082] In some embodiments, amino acid positions to be substituted in a protein can be determined by referencing related proteins. For example, as discussed in Example 5 and illustrated in FIG. 12, Leu568 and Thr569 are 99.8% and 98.9% conserved among the 6,599 HIV-1 and SIVcpz Env sequences curated in the LANL HIV database (calculated using the AnalyzeAlign tool within the database). Therefore, in some embodiments, in proteins that have a threonine at position 568. a leucine at position 569, or a threonine at position 568 and a leucine at position 569, position 568 and/or 569 can be substituted with another amino acid. In some embodiments, the substituting amino acid can be proline.

[0083] In some embodiments, the amino acids at positions 568, 569, or 568 and 569 in an HIV Env can be identified and substituted with another amino acid. In some embodiments, the substituting aminoacid can be proline.

EXAMPLES

[0084] Examples are provided below to facilitate a more complete understanding of the invention. The following examples illustrate the exemplary modes of making and practicing the invention. However, the scope of the invention is not limited to specific embodiments disclosed in these Examples, which are for purposes of illustration only, since alternative methods can be utilized to obtain similar results.

EXAMPLE 1 - Single Mutants

[0085] To enhance the yield of recombinantly expressed prefusion Env, we designed a series of single proline substitutions throughout the truncated gp41 subunit. Based on structural analysis of both the prefusion and postfusion conformations of gp41, nine positions were selected specifically to disfavor the formation of the elongated alpha helices which are characteristic of class I fusion proteins in the postfusion state (FIG. 1A). These substitutions were then made in the CH848 10. 17DT Env, a clade C Envelope that has been modified to engage the UCA (unmutated common ancestor) from the DH270 V3 glycan bnAb lineage (27). This Env was expressed using a chimeric gp41 subunit from BG505 and was stabilized with both SOSIP.664 and DS mutations (CH848 10.17DT DS-SOSIP)(17, 24, 28).

Purification of these constructs after transient transfection revealed that several of the individual proline substitutions had a substantial effect on Env yield (FIG. IB). Envs with proline substitutions at positions 536, 545, 568 or 569 (HXB2 numbering) in particular showed a marked increase in the yield of prefusion trimer relative to the unmutated construct, therefore these four mutations were downselected for further analysis.

EXAMPLE 2 - Double Mutants

[0086] Individual proline mutations at positions 536, 545, 568 or 569 were combined to form double-mutants in an effort to determine whether these substitutions were capable of functioning synergistically (FIG. 1C). Modest improvements in yield over the single substitutions could be observed after combining substitutions 536+545 or 568+569, but other combinations resulted in either decreased yield (536+568) or the appearance of a prominent contaminating low-molecular-weight species which was readily detectable by size-exclusion chromatography (536+569, 545+568, 545+569). Similarly, combining proline substitutions at all four positions did not result in enhanced yield as compared to the double mutant at positions 568 and 569 (FIG. 6), which showed a -8-fold increase in prefusion trimer relative to CH848 10.17DT DS-SOSIP. Therefore, the double mutant at positions 568 and 569 was selected for subsequent characterization and these mutations were termed “2P”. Generally the residues upstream of this helical capping position in gp41 cannot be clearly resolved during structural characterization, presumably due to a high degree of conformational flexibility. However, the recently reported structure of Env in the occluded-open conformation shows these amino acids forming a 5-residue helical extension of the central helix compared to what can normally be observed in the gp41 prefusion conformation (FIG. 7B suggesting that the 2P mutations may be stabilizing the prefusion conformation of gp41 by disfavoring the sampling of this early intermediate (30).

EXAMPLE 3 - Reverting I559P

[0087] We next generated an “SOS 2P” Env by reverting the I559P mutation to investigate whether the new 2P mutations were capable of increasing the yield of prefusion Env in the absence of the stabilizing effect of the IP mutation. The SOS 2P construct yielded only slightly more prefusion trimer than the SOSIP Env, while the dramatic boost in yield that was observed previously could only be recapitulated when both the IP and 2P mutations were present (FIG. 8A-B).

EXAMPLE 4 - Antigenic Landscape

[0088] To determine whether the 2P mutations altered the antigenic landscape of the Env ectodomain, we performed an ELISA to compare the binding profiles of CH848 10.17DT DS-SOSIP and CH848 10.17DT DS-SOSIP-2P against a panel of well-characterized monoclonal antibodies (mAbs), described earlier in this application. Both neutralizing and non-neutralizing mAbs spanning multiple epitopes were included to comprehensively evaluate what effect the 2P mutations might have on Env folding (FIG. 2A, neutralizing antibodies; FIG. 2B, non-neutralizing antibodies). Overall, the binding characteristics exhibited by the two Envs were extremely similar, although the 2P-stabilized construct appeared to bind to some neutralizing mAbs directed against the V3 glycan epitope (DH270 UCA3) and the CD4-binding site (N6) with slightly higher affinity (FIG. 2A). [0089] To further validate that the newly introduced 2P mutations did not disrupt the overall folding of the gp41 subunit, we determined the cryo-EM structure of the CH848 10. 17DT DS-SOSIP-2P Env to a resolution of 3.73 A (FIG. 3A-B, FIG. 9, FIG. 10A-E, Table 2).

[0090] Our model spanned residues 32-664 and we were able to build A-linked glycans at 15 of the 25 putative sequons. There were several small, flexible loops in the gpl20 subunit (59-66, 458-459, 400-411) that could not be confidently modeled, but overall the CH848 10. 17DT DS-SOSIP-2P Env was nearly indistinguishable from the structures of previously reported CH848 10. 17DT Envs which lack 2P stabilization (27). Like most other previously reported structures of gp41s in the prefusion conformation, the flexibility at the N-terminus of the central helix precluded us from observing both of our newly introduced proline substitutions, and only Pro569 could be confidently modeled into our reconstruction. However, the overall RMSD between our 2P-stabilized Env and a previously determined structure of CH848 10.17DT DS-SOSIP was only 0.757 A over 1,253 Ca atoms (FIG. 3B), confirming that the 2P mutations are capable of enhancing the yield of recombinantly expressed Env ectodomain without altering the conformation of the SOSIP trimer.

EXAMPLE 5 - Investigating Yields

[0091] While not wanting to be held to a mechanism, the increase in the yield of prefusion trimer could be due to either an increase in Env expression or due to enhanced durability of the prefusion conformation upon proline-stabilization. To delineate between these two possibilities, we began by investigating what impact proline-stabilization would have on cellsurface expression of full-length Env. FreeStyle 293-F cells were transiently transfected with CH848 10. 17DT gpl60s containing either no proline mutations, 2P mutations, or 2P mutations in combination with the I559P mutation. After 48 hours, the relative expression level of each transmembrane construct was estimated by flow cytometry after staining with a panel of monoclonal antibodies. Despite the dramatic differences in yield that could be observed when evaluating the recombinant expression of soluble ectodomains, no significant differences in surface expression levels of gpl60 could be detected among the three constructs that were tested (FIG. 4A, FIG. 11A-B). The only clear difference between the three gpl60s was a decrease in PGT151 binding in the 2P and the 2P+I559P constructs relative to the unstabilized gpl60. However, this trend was not conserved throughout the rest of the antibody panel, nor was there a detectable difference between the 2P-stabilized and non-2P-stabilized Envs when testing PGT151 binding to soluble ectodomains (FIG. 2A). [0092] Based on these similarities in cell-surface expression levels, we next evaluated the durability of recombinantly expressed, proline-stabilized Env ectodomains. Aliquots of CH848 10.17DT DS-SOSIP and CH848 10.17DT DS-SOSIP-2P were subjected to ten rounds of rapid freezing and thawing. Thermostability’ measurements of both Envs were collected by differential scanning fluorimetiy (DSF) during each round (FIG. 4B). The melting temperature (Tm) of CH848 10.17DT DS-SOSIP began at 73.6 °C, a value that was maintained until after four freeze/thaw cycles, when it abruptly dropped to 66.4 °C, where it remained throughout the rest of the experiment. The Tm of CH848 10. 17DT DS-SOSIP-2P was initially measured to be 71.0 °C, a value that was maintained until after eight freeze/thaw cycles, when the Tm dropped to 67.5 °C. To further investigate this phenomenon, we performed a forced degradation assay in which aliquots of CH848 10.17DT DS-SOSIP and CH848 10. 17DT DS-SOSIP-2P were incubated at 42 °C for either 48 or 96 hours. The integrity of the incubated trimers was then evaluated by size-exclusion chromatography (FIG. 4C). After 48 hours at 42 °C, we observed the appearance of a high-molecular weight aggregate peak in the non-2P-stabilized sample. The emergence of this peak corresponded with a decrease in the amount of properly folded trimer (FIG. 13), centered around 14.8 mL of elution volume. A similar, albeit less prominent, peak was not observed in the 2P- stabilized sample until after 96 hours of incubation at 42 °C, and the decrease of the peak at 14.8 mL was less pronounced, as measured by quantifying the area under the curve (Table 3).

[0093] These data, in conjunction with our evaluation of cell-surface expression levels, indicate that proline-stabilization of the gp41 subunit has no impact on the level of protein expression, and that observed increases in the yield of recombinant ectodomain after prolinestabilization are instead due to the increased durability of the prefusion conformation of Env.

EXAMPLE 5 - Conservation Among HIV Envs

[0094] Leu568 and Thr569 are 99.8% and 98.9% conserved among the 6,599 Env sequences curated in the LANL HIV database (FIG. 12), possibly reflecting their functional importance during the transition from prefusion-to-postfusion and their insulation from antibody- mediated selective pressure. This high degree of sequence conservation at positions 568 and 569 prompted us to evaluate whether our 2P stabilization strategy might also be effective outside of the context of the CH848 10.17DT Env. A panel of evolutionarily diverse Envs (Table 4) was selected and the yield of recombinantly expressed prefusion trimer after transient transfection of a 2P-stabilized construct was compared to that of a non-2P-stabilized construct (FIG. 5).

[0095] Overall, 2P-stabilization improved the yield of Envs from HIV-1 clades B and C, subgroup 02_AG, and SIV. The degree to which 2P-stabilization enhanced Env yield was variable for the constructs that were evaluated, ranging from ~2-fold increases in JRFL and B41 Envs to dramatic increases in the yields of CAM13, T250-4 and CH505 Envs, for which a fold-increase could not be reliably calculated due to an inability to purify the non-2P- stabilized construct. These findings suggest that 2P-stabilization is a broadly applicable approach to stabilizing the prefusion conformation of Env, albeit to varying degrees depending on the Env variant in question. Furthermore, the 2P mutations are compatible for addition to a variety of alternative Env stabilization strategies, including the SOSIP, DS and F14 mutations (Table 4).

EXAMPLE 6 - Materials and Methods

[0096] Protein production and purification

[0097] Plasmids encoding for Env ectodomains (CH848 10.17DT DS-SOSIP, CH848 10.17DT DS-SOS-2P, CH848 10.17DT DS-SOSIP -2P, CAM13 Q171K DS-SOSIP, CAM13 Q171K DS-SOSIP-2P. B41 SOSIP, B41 SOSIP-2P, JRFL SOSIPv6, JRFL SOSIPv6-2P, T250-4 DS-SOSIP, T250-4 DS-SOSIP-2P, CH505w24 F14 DS-SOSIP and CH505w24 F14 DS-SOSIP-2P) were mixed with a plasmid encoding furin at a ratio of 4: 1. All Env constructs contained a C-terminal HRV3C cleavage site, a TwinStrepTag and an 8xHisTag (SEQ ID NO: 18). These plasmid mixtures were transfected into FreeStyle 293-F cells (Thermo Fisher) using polyethylenimine. Transfected supernatants were harvested and filtered five days post-transfection. Because CH848 Envs do not bind to PGT145, all CH848 constructs were purified by StrepTactin resin (IBA). All other Envs were purified by PGT145 affinity chromatography, as described previously (19). After affinity' chromatography, Envs were further purified by size-exclusion chromatography using a Superose 6 Increase 10/300 GL column (Cytiva) in 2 mM Tris pH 8.0. 200 mM NaCl, 0.02% NaN3.

[0098] Plasmids encoding the heavy and light chains of mAbs used for ELISA or flow cytometric analysis (CH65, N6, DH270 UCA, DH270.6, PGT151, PGT128, VRC26.25, 2F5, 17b, 19b, RM19R, DH1029, 2G12, F93F, PGT125, F105, A32 and CH58) were combined at a ratio of 1 : 1 and used to transiently transfect Expi293 cells (Thermo Fisher) with ExpiFectamine (Thermo Fisher). Transfected supernatants were harvested and filtered five days post-transfection and antibodies were purified using Protein A resin (Thermo Fisher). Eluted antibodies were then buffer exchanged into PBS.

[0099] ELISA

[00100] Env protein containing a C-terminal StrepTag was bound in wells of 384-well plates, which were previously coated with streptavidin (Thermo Fisher Scientific) at 2 pg/ml and blocked with PBS containing 4% (w/v) whey protein, 15% normal goat serum, 0.5% Tween-20, and 0.05% sodium azide. Proteins were incubated at room temperature for 1 hour, washed with PBS and 0.1% Tween-20, then mAbs were added in serial dilutions beginning at 100 pg/ml. Antibodies were incubated at room temperature for 1 hour, washed, and binding detected with goat anti-human HRP (Jackson ImmunoResearch) and TMB substrate (Sera Care Life Sciences).

[00101] Cryo-EM sample preparation and data collection

[00102] Purified CH848 10. 17DT DS-SOSIP-2P was diluted to a final concentration of 1.8 mg/mL in 10 mM Tris pH 8.0. To prevent interaction of the trimer with the air-water interface during vitrification, the sample was incubated in 0.085 mM n-dodecyl [>-D- maltoside (DDM). A 3.5 pL drop of protein was deposited on a Quantifoil-1.2/1.3 grid (Electron Microscopy Sciences) that had been glow discharged for 10 seconds using an easiGlow Glow Discharge Cleaning System (PELCO). After a 30 second incubation in >95% humidity, excess protein was blotted away for 2.5 seconds before being plunge frozen into liquid ethane using a Leica EM GP2 plunge freezer (Leica Microsystems). Frozen grids were imaged using a Titan Krios (Thermo Fisher) equipped with a K3 detector (Gatan). Data were collected using the Gatan Latitude software.

[00103] Crvo-EM data analysis and refinement

[00104] Movies were imported into cry oSPARC v3.3. 1 (38) and aligned using patchbased motion correction. Patch-based CTF estimation was then performed before 5,780,212 particles were selected using a non-templated blob picking strategy. Junk particles were then removed by 2D classification, leaving a stack of 683,406 particles that were subjected to iterative rounds of ab initio volume calculation and heterogeneous 3D classification, leaving a final stack of 111,026 particles. These particles yielded a 3.95 A reconstruction after performing asymmetrical (Cl) non-uniform refinement, with the resulting map exhibiting C3 symmetry. Non-uniform refinement (39) was then performed again using the same particle stack and applying C3 symmetry which improved the resolution of the resulting reconstruction to 3.73 A. This map was then subjected to post-processing using DeepEMhancer (40). A full description of the cryo-EM data processing workflow can be found in FIG. 9. Chain A, D, E, H, I and 1 from PDB ID: 6UM7 were used as a starting model that was docked into the sharpened map and re-modeled through iterative rounds of building and refinement in Coot (41), PHENIX (42) and ISOLDE (43).

[00105] gp!60 cell-surface expression and characterization

[00106] 293-F cells (ThermoFisher, cat #R79007) were diluted to 1.25 x 10 6 cells/mL and seeded to 12-well plates 2-3 hours before transfection. Transient transfection of DNA plasmids was performed with jetPRIME transfection reagent (Polyplus, cat #101000046) following the manufacturer’s instruction. Transfected cells were cultured in an incubator at 37 °C with 8% CO2 and shaking at 125 rpm for 24 hours before flow cytometry staining.

[00107] 24 hours after transfection, 293-F cells were harvested, counted and then were rinsed with 1% BSA/PBS and pelleted at 500 g for 5 minutes. Next, cells were resuspended to a density of 1 x 10 6 cells/mL in 1%BSA/PBS and 50,000 cells were aliquoted to each well of a U-bottom 96-well plates. An equal volume of recombinant anti -HIV- 1 Env antibodies at 4 pg/mL were added to cells to reach a working concentration of 2 pg/mL. Antibodies were incubated with cells at 4 °C for 30 minutes. Cells were then washed once with 150 pL 1% BSA/PBS and then incubated with 50 pL Goat anti-Human IgG Fc secondary antibody PE (ThermoFisher, cat #12-4998-82) at a final concentration of 2.5 pg/mL in 1% BSA/PBS.

After a 30 minute incubation at 4 °C protected from light, cells were washed once with lx PBS and incubated with 100 pL LIVE/DEAD Aqua dead cell stain (L34966, 1:1000 in PBS) for 20 minutes at room temperature, protected from light. Next, cells were washed once with 100 pL 1% BSA/BSA. then resuspended in 50 pL 1% BSA 2mM EDTA 1% PFA in PBS. Flow cytometric data were acquired on an iQue 3 high-throughput flow cytometry system (Sartorius). Data were analyzed using FlowJo vlO (FlowJo).

[00108] Freeze/thaw thermostability analysis

[00109] Purified CH848 10.17DT DS-SOSIP and CH848 10.17DT DS-SOSIP-2P were diluted to 0.2 mg/mL in 2 mM Tris pH 8.0, 200 mM NaCl, 0.02% NaN3. Samples were rapidly frozen in liquid nitrogen and thawed by incubation at 30 DC for 5 minutes.

Thermostability measurements were collected using a Tycho NT.6 by increasing the temperature from 35 °C to 95 °C at a rate of 30.0 °C/minute. Tm was determined as the inflection temperature using the Tycho Nanotemper data processing software.

[00110] Forced degradation assay [00111] 250 pig aliquots of purified CH848 10.17DT DS-SOSIP or CH848 10.17DT

DS-SOSIP-2P were incubated at 42 °C for 0 hours, 48 hours or 96 hours. Aliquots were then run over a Superose 6 Increase 10/300 GL column (Cytiva Life Sciences) to evaluate Env integrity.

[00112] Negative stain electron microscopy

[00113] CH848 10. 17DT DS-SOSIP-2P was diluted to 0.2 mg/mL with buffer containing 20 mM HEPES pH 7.4, 150 mM NaCl, 8 mM glutaraldehyde and 5 g/dL glycerol. After 5 minutes of incubation, excess glutaraldehyde was quenched by the addition 1 M TRIS pH 8.0 for a final concentration of 80 mM TRIS and incubated for an additional 5 minutes. Quenched sample was applied to a glow-discharged, carbon-coated EM grid for 8-10 seconds, blotted, and stained with 2 g/dL uranyl formate for 1 minute before being blotted and allowed to air-dry. The stained grid was examined on a Philips EM420 electron microscope operating at 120 kV and a nominal magnification of 49,000x. 12 images were collected on a 76 Mpix CCD camera at 2.4 A/pix. Images were analyzed and 2D class averaging was performed using standard protocols within Relion 3.0 (Zivanov, Jasenko, et al. "New tools for automated high-resolution cryo-EM structure determination in RELION-3." elife 7 (2018): e42166).

[00114] References

[00115] 1. Centers for Disease C. 1981. Kaposi's sarcoma and Pneumocystis pneumonia among homosexual men— New York City and California. MMWR Morb Mortal Wkly Rep 30:305-8.

[00116] 2. Eisinger RW, Fauci AS. 2018. Ending the HIV/AIDS Pandemic(l).

Emerg Infect Dis 24:413-416.

[00117] 3. Porter K, Johnson AM, Phillips AN, Darbyshire JH. 1999. The practical significance of potential biases in estimates of the AIDS incubation period distribution in the UK register of HIV seroconverters. AIDS 13: 1943-51.

[00118] 4. Grant RM, Lama JR, Anderson PL, McMahan V, Liu AY, Vargas L,

Goicochea P, Casapia M, Guanira-Carranza JV, Ramirez-Cardich ME, Montoya-Herrera O, Fernandez T, Veloso VG, Buchbinder SP, Chariyalertsak S, Schechter M, Bekker LG, Mayer KH, Kallas EG, Amico KR, Mulligan K, Bushman LR, Hance RJ, Ganoza C, Defechereux P, Postle B, Wang F, McConnell JJ, Zheng JH, Lee J, Rooney JF, Jaffe HS, Martinez Al, Bums DN, Glidden DV, iPrEx Study T. 2010. Preexposure chemoprophylaxis for HIV prevention in men who have sex with men. N Engl J Med 363:2587-99.

[00119] 5. Palella FJ, Jr., Delaney KM, Moorman AC, Loveless MO, Fuhrer J,

Satten GA, Aschman DJ, Holmberg SD. 1998. Declining morbidity and mortality among patients with advanced human immunodeficiency virus infection. HIV Outpatient Study Investigators. N Engl J Med 338:853-60.

[00120] 6. Fauci AS. 2017. An HIV Vaccine Is Essential for Ending the

HIV/AIDS Pandemic. JAMA 318:1535-1536.

[00121] 7. Allan JS, Coligan JE, Barin F, McLane MF, Sodroski JG, Rosen CA,

Haseltine WA, Lee TH, Essex M. 1985. Major glycoprotein antigens that induce antibodies in AIDS patients are encoded by HTLV-III. Science 228: 1091-4.

[00122] 8. Willey RL, Bonifacino JS, Potts BJ, Martin MA, Klausner RD. 1988.

Biosynthesis, cleavage, and degradation of the human immunodeficiency virus 1 envelope glycoprotein gpl60. Proc Natl Acad Sci U S A 85:9580-4.

[00123] 9. Dalgleish AG, Beverley PC, Clapham PR, Crawford DH, Greaves MF,

Weiss RA. 1984. The CD4 (T4) antigen is an essential component of the receptor for the AIDS retrovirus. Nature 312:763-7.

[00124] 10. Klatzmann D, Champagne E, Chamaret S, Gruest J, Guetard D,

Hercend T, Gluckman JC, Montagnier L. 1984. T-lymphocyte T4 molecule behaves as the receptor for human retrovirus LAV. Nature 312:767-8.

[00125] 11 . Alkhatib G, Combadiere C, Broder CC, Feng Y, Kennedy PE, Murphy

PM, Berger EA. 1996. CC CKR5: a RANTES, MIP-lalpha, MIP-1 beta receptor as a fusion cofactor for macrophage-tropic HIV-1. Science 272: 1955-8.

[00126] 12. Kowalski M, Potz J, Basiripour L, Dorfman T. Goh WC, Terwilliger

E, Dayton A, Rosen C, Haseltine W, Sodroski J. 1987. Functional regions of the envelope glycoprotein of human immunodeficiency virus type 1. Science 237: 1351-5.

[00127] 13. Wei X, Decker JM, Wang S, Hui H, Kappes JC, Wu X, Salazar-

Gonzalez JF, Salazar MG, Kilby JM, Saag MS, Komarova NL, Nowak MA, Hahn BH, Kwong PD, Shaw GM. 2003. Antibody neutralization and escape by HIV-1. Nature 422:307- 12.

[00128] 14. Roark RS, Li H, Williams WB, Chug H, Mason RD, Gorman J, Wang

S, Lee FH, Rando J, Bonsignori M, Hwang KK, Saunders KO, Wiehe K, Moody MA, Hraber PT. Wagh K. Giorgi EE. Russell RM, Bibollet-Ruche F, Liu W, Connell J, Smith AG, DeVoto J, Murphy Al, Smith J, Ding W, Zhao C, Chohan N, Okumura M, Rosario C, Ding Y, Lindemuth E, Bauer AM, Bar KJ, Ambrozak D, Chao CW, Chuang GY, Geng H, Lin BC, Louder MK, Nguyen R, Zhang B, Lewis MG. Raymond DD, Doria-Rose NA, Schramm CA, Douek DC, Roederer M, Kepler TB, Kelsoe G, et al. 2021. Recapitulation of HIV-1 Env- antibody coevolution in macaques leading to neutralization breadth. Science 371. [00129] 15. Patel PH, Preston BD. 1994. Marked infidelity of human immunodeficiency virus type 1 reverse transcriptase at RNA and DNA template ends. Proc Natl Acad Sci U S A 91 :549-53.

[00130] 16. Weissenhom W, Wharton SA, Calder LJ, Earl PL, Moss B, Aliprandis

E, Skehel JJ, Wiley DC. 1996. The ectodomain of HIV- 1 env subunit gp41 forms a soluble, alpha-helical, rod-like oligomer in the absence of gpl20 and the N-terminal fusion peptide. EMBO J 15: 1507-14.

[00131] 17. Sanders RW, Vesanen M, Schuelke N, Master A, Schiffner L,

Kalyanaraman R, Paluch M, Berkhout B, Maddon PJ, Olson WC, Lu M, Moore JP. 2002. Stabilization of the soluble, cleaved, trimeric form of the envelope glycoprotein complex of human immunodeficiency virus type 1. J Virol 76:8875-89.

[00132] 18. Binley JM, Sanders RW. Clas B. Schuelke N, Master A. Guo Y.

Kajumo F, Anselma DJ, Maddon PJ, Olson WC, Moore JP. 2000. A recombinant human immunodeficiency virus type 1 envelope glycoprotein complex stabilized by an intermolecular disulfide bond between the gpl20 and gp41 subunits is an antigenic mimic of the trimeric virion-associated structure. J Virol 74:627-43.

[00133] 19. de Taeye SW, Ozorowski G, Torrents de la Pena A, Guttman M, Julien

JP, van den Kerkhof TL, Burger JA, Pritchard LK, Pugach P, Yasmeen A, Crampton J, Hu J, Bontjer I, Torres JL, Arendt H, DeStefano J, Koff WC, Schuitemaker H, Eggink D, Berkhout

B, Dean H. LaBranche C, Crotty S, Crispin M, Montefiori DC, Klasse PJ, Lee KK, Moore JP, Wilson IA, Ward AB, Sanders RW. 2015. Immunogenicity of Stabilized HIV-1 Envelope Trimers with Reduced Exposure of Non-neutralizing Epitopes. Cell 163: 1702-15.

[00134] 20. Del Moral-Sanchez I, Russell RA, Schermer EE, Cottrell CA, Allen

JD, Torrents de la Pena A, LaBranche CC, Kumar S, Crispin M, Ward AB. Montefiori DC, Sattentau QJ. Sliepen K, Sanders RW. 2022. High thermostability' improves neutralizing antibody responses induced by native-like HIV-1 envelope trimers. NPJ Vaccines 7:27.

[00135] 21. Torrents de la Pena A, Julien JP, de Taeye SW, Garces F, Guttman M,

Ozorowski G, Pritchard LK, Behrens AJ, Go EP, Burger JA, Schermer EE, Sliepen K, Ketas TJ, Pugach P, Yasmeen A. Cottrell CA, Torres JL, Vavourakis CD. van Gils MJ. LaBranche

C, Montefiori DC, Desaire H, Crispin M, Klasse PJ, Lee KK, Moore JP, Ward AB, Wilson IA, Sanders RW. 2017. Improving the Immunogenicity of Native-like HIV-1 Envelope Trimers by Hyperstabilization. Cell Rep 20: 1805-1817.

[00136] 22. Henderson R, Lu M, Zhou Y, Mu Z, Parks R, Han Q, Hsu AL, Carter

E, Blanchard SC, Edwards RJ, Wiehe K, Saunders KO, Borgnia MJ, Bartesaghi A, Mothes W, Haynes BF, Acharya P, Munir Alam S. 2020. Disruption of the HIV-1 Envelope allosteric network blocks CD4-induced rearrangements. Nat Commun 11:520.

[00137] 23. Kong L, He L, de Vai N, Vora N, Moms CD, Azadma P, Sok D, Zhou

B, Burton DR, Ward AB, Wilson IA, Zhu J. 2016. Uncleaved prefusion-optimized gpl40 trimers derived from analysis of HIV-1 envelope metastability. Nat Commun 7: 12040.

[00138] 24. Kwon YD, Pancera M, Acharya P, Georgiev IS, Crooks ET, Gorman J,

Joyce MG. Guttman M, Ma X, Narpala S. Soto C. Terry DS, Yang Y, Zhou T, Ahlsen G, Bailer RT, Chambers M, Chuang GY, Doria-Rose NA, Druz A, Hallen MA, Hamed A, Kirys T, Louder MK, O'Dell S, Ofek G, Osawa K, Prabhakaran M, Sastry M, Stewart-Jones GB, Stuckey J, Thomas PV, Tittley T, Williams C, Zhang B, Zhao H, Zhou Z, Donald BR, Lee LK, Zolla-Pazner S, Baxa U, Schon A, Freire E, Shapiro L, Lee KK, Arthos J, Munro JB, Blanchard SC, Mothes W, Binley JM, et al. 2015. Crystal structure, conformational fixation and entry -related interactions of mature ligand-free HIV-1 Env. Nat Struct Mol Biol 22:522- 31.

[00139] 25. Rutten L, Lai YT, Blokland S, Truan D, Bisschop IJM, Strokappe NM,

Koomneef A, van Manen D. Chuang GY. Famey SK. Schuitemaker H. Kwong PD, Langedijk JPM. 2018. A Universal Approach to Optimize the Folding and Stability of Prefusion-Closed HIV-1 Envelope Trimers. Cell Rep 23:584-595.

[00140] 26. Leaman DP, Zwick MB. 2013. Increased functional stability and homogeneity of viral envelope spikes through directed evolution. PLoS Pathog 9:el003184. [00141] 27. Saunders KO, Wiehe K, Tian M, Acharya P, Bradley T, Alam SM, Go

EP, Scearce R, Sutherland L, Henderson R, Hsu AL, Borgnia MJ, Chen H, Lu X, Wu NR, Watts B, Jiang C, Easterhoff D, Cheng HL, McGovern K, Waddicor P, Chapdelaine- Williams A, Eaton A, Zhang J, Rountree W, Verkoczy L, Tomai M, Lewis MG, Desaire HR, Edwards RJ, Cain DW, Bonsignori M, Montefiori D, Alt FW, Haynes BF. 2019. Targeted selection of HIV-specific antibody mutations by engineering B cell maturation. Science 366. [00142] 28. Joyce MG, Georgiev IS, Yang Y, Druz A, Geng H, Chuang GY, Kwon

YD, Pancera M, Rawi R, Sastry M, Stewart-Jones GBE, Zheng A, Zhou T, Choe M, Van Galen JG, Chen RE, Lees CR, Narpala S, Chambers M. Tsybovsky Y, Baxa U. McDermott AB, Mascola JR, Kwong PD. 2017. Soluble Prefusion Closed DS-SOSIP.664-Env Trimers of Diverse HIV-1 Strains. Cell Rep 21:2992-3002.

[00143] 29. Pallesen J, Wang N, Corbet KS, Wrapp D, Kirchdoerfer RN, Turner

HL, Cotrell CA, Becker MM, Wang L, Shi W, Kong WP, Andres EL, Ketenbach AN, Denison MR, Chappell JD, Graham BS, Ward AB, McLellan JS. 2017. Immunogenicity and structures of a rationally designed prefusion MERS-CoV spike antigen. Proc Natl Acad Sci U S A 114:E7348-E7357.

[00144] 30. Yang Z, Dam KA, Bridges MD, Hoffmann MAG, DeLaitsch AT,

Gristick HB, Escolano A, Gautam R, Martin MA, Nussenzweig MC, Hubbell WL, Bjorkman PJ. 2022. Neutralizing antibodies induced in immunized macaques recognize the CD4- binding site on an occluded-open HIV-1 envelope trimer. Nat Commun 13:732.

[00145] 31. Hsieh CL, Goldsmith JA, Schaub JM, Di Venere AM, Kuo HC,

Javanmardi K, Le KC, Wrapp D, Lee AG, Liu Y, Chou CW, Byrne PO, Hjorth CK, Johnson NV, Ludes-Meyers J, Nguyen AW, Park J, Wang N, Amengor D, Lavinder JJ, Ippolito GC, Maynard JA, Finkelstein IJ, McLellan JS. 2020. Structure-based design of prefusion- stabilized SARS-CoV-2 spikes. Science 369: 1501-1505.

[00146] 32. Barouch DH, O'Brien KL, Simmons NL, King SL, Abbink P, Maxfield

LF, Sun YH, La Porte A, Riggs AM, Lynch DM, Clark SL, Backus K, Perry 7 JR, Seaman MS, Carville A, Mansfield KG, Szinger JJ, Fischer W, Muldoon M, Korber B. 2010. Mosaic HIV- 1 vaccines expand the breadth and depth of cellular immune responses in rhesus monkeys. Nat Med 16:319-23.

[00147] 33. Haynes BF, Kelsoe G, Harrison SC, Kepler TB. 2012. B-cell-lineage immunogen design in vaccine development with HIV-1 as a case study. Nat Biotechnol 30:423-33.

[00148] 34. Rawi R, Ruten L, Lai YT, Olia AS, Blokland S, Juraszek J, Shen CH,

Tsybovsky Y, Verardi R, Yang Y, Zhang B, Zhou T, Chuang GY, Kwong PD, Langedijk JPM. 2020. Automated Design by Structure-Based Stabilization and Consensus Repair to Achieve Prefusion-Closed Envelope Trimers in a Wide Variety of HIV Strains. Cell Rep 33: 108432.

[00149] 35. Steichen JM, Kulp DW, Tokatlian T, Escolano A, Dosenovic P,

Stanfield RL, McCoy LE, Ozorowski G, Hu X, Kalyuzhniy O, Briney B, Schiffher T, Garces F, Freund NT, Gitlin AD, Menis S, Georgeson E, Kubitz M, Adachi Y, Jones M, Mutafyan AA, Yun DS, Mayer CT, Ward AB. Burton DR, Wilson IA, Irvine DJ, Nussenzweig MC, Schief WR. 2016. HIV Vaccine Design to Target Germline Precursors of Gly can-Dependent Broadly Neutralizing Antibodies. Immunity 45:483-496.

[00150] 36. Feng Y, Tran K, Bale S, Kumar S, Guenaga J, Wilson R, de Vai N,

Arendt H, DeStefano J, Ward AB, Wyatt RT. 2016. Thermostability of Well-Ordered HIV Spikes Correlates with the Elicitation of Autologous Tier 2 Neutralizing Antibodies. PLoS Pathog 12:el005767.

[00151] 37. Guttman M, Cupo A, Julien JP. Sanders RW, Wilson IA, Moore JP,

Lee KK. 2015. Antibody potency relates to the ability to recognize the closed, pre-fusion form of HIV Env. Nat Commun 6:6144.

[00152] 38. Punjani A, Rubinstein JL, Fleet DJ, Brubaker MA. 2017. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat Methods 14:290- 296.

[00153] 39. Punjani A, Zhang H, Fleet DJ. 2020. Non-uniform refinement: adaptive regularization improves single-particle cryo-EM reconstruction. Nat Methods 17: 1214-1221.

[00154] 40. Sanchez-Garcia R, Gomez-Blanco J, Cuervo A, Carazo JM, Sorzano

COS, Vargas J. 2021. DeepEMhancer: a deep learning solution for cryo-EM volume postprocessing. Commun Biol 4:874.

[00155] 41. Emsley P. Cowtan K. 2004. Coot: model-building tools for molecular graphics. Acta Crystallogr D Biol Crystallogr 60:2126-32.

[00156] 42. Adams PD, Grosse-Kunstleve RW, Hung LW, loerger TR, McCoy AJ,

Moriarty NW, Read RJ, Sacchettini JC, Sauter NK, Terwilliger TC. 2002. PHENIX: building new software for automated crystallographic structure determination. Acta Crystallogr D Biol Crystallogr 58: 1948-54.

[00157] 43. Croll TI. 2018. ISOLDE: a physically realistic environment for model building into low-resolution electron-density maps. Acta Crystallogr D Struct Biol 74:519- 530. EQUIVALENTS

Those skilled in the art will recognize, or be able to ascertain, using no more than routine experimentation, numerous equivalents to the specific substances and procedures described herein. Such equivalents are considered to be within the scope of this invention.