Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
FUSION PROTEINS
Document Type and Number:
WIPO Patent Application WO/2016/178905
Kind Code:
A1
Abstract:
The present invention relates to fusion proteins comprising an insulin receptor agonist fused to a human IgG Fc region through the use of a peptide linker, and the use of such fusion proteins in the treatment of diabetes. The fusion protein of the present invention has an extended time action profile and is useful for providing basal glucose control for an extended period of time.

Inventors:
BALDWIN DAVID BRUCE (US)
BEALS JOHN MICHAEL (US)
DAY JONATHAN WESLEY (US)
DICKINSON CRAIG DUANE (US)
KORYTKO ANDREW IHOR (US)
LAZAR GREGORY ALAN (US)
Application Number:
PCT/US2016/029807
Publication Date:
November 10, 2016
Filing Date:
April 28, 2016
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
LILLY CO ELI (US)
International Classes:
C07K14/62; A61K38/28; C12N15/62
Domestic Patent References:
WO2005000892A22005-01-06
Foreign References:
CN103509118A2014-01-15
KR101324828B12013-11-01
CN103509118A2014-01-15
KR101324828B12013-11-01
US5122464A1992-06-16
EP0619322A21994-10-12
US5120712A1992-06-09
US5424286A1995-06-13
US6268343B12001-07-31
US20140044717A12014-02-13
US7452966B22008-11-18
Other References:
IGLESIAS P ET AL., DIABETES OBES. METAB., vol. 10, 2008, pages 811 - 823
VLASAK J; IONESCU R: "Fragmentation of Monoclonal Antibodies", MABS, vol. 3, no. 3, May 2011 (2011-05-01), pages 253 - 263
HAMILTON SR ET AL.: "Production of complex human glycoproteins in yeast", SCIENCE, vol. 301, no. 5637, August 2003 (2003-08-01), pages 1244 - 1246
METHODS IN ENZYMOLOGY, vol. 182, 1990, pages 83 - 89
SCOPES: "Protein Purification: Principles and Practice", 1994, SPRINGER
LEWIS SM ET AL., N . BIOTECHNOL, vol. 32, no. 2, 2014, pages 191 - 198
CARTER, J. IMMUNOL. METHODS, vol. 248, no. 1-2, 2001, pages 7 - 15
RIDGWAY, J. B. ET AL., PROTEIN E G., vol. 9, no. 7, 1996, pages 617 - 612
MAA YF; HSU CC, INT J PHARM, vol. 140, 1996, pages 155 - 168
FRANSSON J ET AL., PHARM. RES., vol. 14, 1997, pages 606 - 612
LAM XM ET AL., PHARM. RES., vol. 14, 1997, pages 725 - 729
REMMELE RL JR ET AL., PHARM RES, vol. 15, 1998, pages 200 - 208
THIRUMANGALATHU R ET AL., J PHARM SCI, vol. 95, 2006, pages 1480 - 1497
Attorney, Agent or Firm:
LORD, Matthew T. et al. (P.O. Box 6288Indianapolis, Indiana, US)
Download PDF:
Claims:
WE CLAIM:

1. A fusion protein comprising:

a) an insulin receptor agonist having the general formula Z1-Z2-Z3, wherein:

i) Z1 is an insulin B-chain analog, comprising the amino acid

sequence:

X1 X2X3QHLCGSHLVEALX4LVCGERGFX5YX6X7X8X9

wherein X. is F, Q or A; X2 is V or G; X3 is N, K, D, G, Q, A or E; X4 is E, Y, Q, or H; X5 is H or F; X6 is G, T, S, H, V or is absent; X7 is G, E, P, K, D, S, H or is absent: X8 is G, E, K, P, Q, D, H or is absent; X9 is G, T, S, E, K, A or is absent, provided that the insulin B-chain analog includes at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X4, X5, X6, X7 , X8, or X9 (SEQ ID NO: l);

ii) Z2 is a first peptide linker comprising 5 to 10 amino acids, wherein at least 5 of said amino acids are G residues; and

iii) Z3 is an insulin A-chain analog comprising the amino acid

sequence:

GIVEQCCT SX1 CSLX 2QLENYCX3X4

wherein X. is T or I; X2 is D, Y, Q or E; X3 is G, N, S or A; and X4 is any naturally occurring amino acid, or is absent, provided that if X3 is N, then X4 must be an amino acid other than G or N (SEQ ID NO: 2);

b) a second peptide linker; and

c) a human IgG Fc region;

wherem the C-terminal residue of the insulin receptor agonist is directly fused to the N-terminal residue of the second peptide linker, and the C-terminal residue of the second peptide linker is directly fused to the N-terminal residue of the human IgG Fc region.

2. The fusion protein of claim 1 , wherein:

the insulin B-chain analog includes at least one modification from the amino acid sequence of the human insulin B-chain at X4 or X5 of SEQ ID NO: 1 ; and and the insulin A-chain analog includes at least one modification from the amino acid sequence of the human insulin A-cihna at X1 or X2 of SEQ ID NO:2.

3. The fusion protein of either of claims 1 or 2, wherein:

the insulin B-chain analog comprises the sequence of SEQ ID NO: 1 , wherein: X1 is F; X2 is V; X3 is N or D; X4 is E; X5 is H; and

the insulin A-chain analog comprises the sequence of SEQ ID NO:2, wherein: X1 is I or T; X2 is D; X3 is G; and X4 is absent.

4. The fusion protein of any of claims 1-3, wherein the insulin B-chain analog comprises the sequence of SEQ ID NO: 1, wherein X6,-X9 are each G.

5. The fusion protein of any of claims 1-4, wherein the first peptide linker comprises the following amino acid sequence:

X1 GX2GGGG

wherein X1 is G or is absent; and X2 is G, S or is absent (SEQ ID NQ:3).

6. The fusion protem of claim 5, wherein X1 and X2 of SEQ ID NO:3 are G and S, respectively.

7. The fusion protem of any of claims 1 -6, wherein the msulin receptor agonist has the following amino acid sequence:

FVNQHLCGSHLVEALELVCGERGFHYGGGGGGSGGGGGIVEQCCTSTCSLDQLENYC

G (SEQ ID NO:5).

8. The fusion protein of any of claims 1 -7, wherein the second peptide linker is a peptide having between 10 and 25 amino acids, wherein at least 50% of said amino acids are G residues.

9. The fusion protein of any of claims 1 -8, wherein the second peptide linker comprises a peptide having the sequence [GGGGX]n

wherein X is Q, E or S; and wherein n is 2-5.

10. The fusion protem of any of claims 1-9, wherein the second peptide linker comprises the following amino acid sequence:

GGGGX1GGGGX 2GGGGX3GGGGX4 X5X6

X1 is Q or E

X2 is Q or E X3 is Q or E

X4 is G, E, Q or is absent

X5 is G or absent; and

X6 is G or is absent

(SEQ ID NO:6).

11. The fusion protein of any of claims 1-10, wherein the second peptide linker has the following amino acid sequence:

GGGGQGGGGQGGGGQGGGGG (SEQ ID NO:7).

12. The fusion protein of any of claims 1-11, wherein the human IgG Fc region is an Fc region from an IgGl , IgG2 or IgG4.

13. The fusion protein of any of claims 1-12, wherein the human IgG Fc region comprises an amino acid sequence selected from the group consisting of SEQ ID NO:8, SEQ ID NO:9 and SEQ ID NO: 10.

14. A fusion protein having the amino acid sequence of SEQ ID NO: 12.

15. A homodimer of two fusion proteins of any of claims 1-14,

16. A pharmaceutical composition comprising either a fusion protein of any of claims 1- 14 or a homodimer of claim 15, and at least one excipient.

17. The pharmaceutical composition of claim 16, wherem the composition further

comprises one or more buffering agents, one or more surfactants, and one or more tsotonicity agents.

18. The pharmaceutical composition of claim 16, further comprising citrate, citric acid, polysorbate 80, and mannitol.

19. The pharmaceutical composition of any of claims 16-18 wherein the pH ranges from about 5.5 to about 8.0.

20. The pharmaceutical composition of any of claims 16-19 wherem the pH ranges from about 6.0 to about 6.75.

21. The pharmaceutical composition of any of claims 16-20, further comprising an

additional active ingredient.

22. The pharmaceutical composition of claim 21, wherein the additional active ingredient is an incretin-based therapy.

23. The pharmaceutical composition of claim 23, wherein the incretin-based therapy is a GLP-1 R agonist.

24. The pharmaceutical composition of claim 23 wherein the GLP-1R agonist is

dulaglutide.

25. A pharmaceutical composition comprising a homodimer of claim 15 and dulaglutide.

26. A method of treating a patient with diabetes mellitus comprising administering to a patient in need thereof a therapeutically effective amount of the fusion protein of any of claims 1-14.

27. The method of claim 26, wherein the fusion protein of any of claims 1 -14 is

administered in combination with an additional active ingredient.

28. The method of claim 27, wherein the additional active ingredient is an incretin-based therapy.

29. The method of claim 28, wherein the incretin-based therapy is a GLP-1 R agonist.

30. The method of claim 29, wherem the GLP-1R agonist is dulaglutide.

31. A method of treating a patient with diabetes mellitus comprising administering to a patient in need thereof a homodimer of claim 15 in combination with dulaglutide.

32. The fusion protein of any of claims 1 -14 for use in therapy.

33. The fusion protein of any of claims 1-14 for use in the treatment of diabetes mellitus.

34. The fusion protein of any of claims 1-14 in for use in simultaneous, separate or

sequential combination with an additional active ingredient.

35. The fusion protein of any of claims 1-14 in for use in simultaneous, separate or

sequential combination with dulaglutide.

36. The homodimer of claim 15 for use in simultaneous, separate or sequential

combination with dulaglutide.

37. A polynucleotide encoding the fusion protein of any of claims 1-14.

Description:
FUSION PROTEINS

The present invention relates to fusion proteins for use in the treatment of diabetes. More particularly, the invention relates to fusion proteins comprising an insulin receptor agonist fused to a human IgG Fc region with a peptide linker, and the use of such proteins in the treatment of diabetes. The fusion proteins of the present invention have an extended time action profile and are useful for providing protracted basal glucose control and suppression of hepatic glucose output.

Diabetes mellitus is a chronic disorder characterized by hyperglycemia resulting from defects in insulin secretion, msulin action, or both. Type 1 diabetes mellitus is characterized by little or no insulin secretory capacity, and patients with type 1 diabetes mellitus require insulin for survival. Type 2 diabetes mellitus is characterized by elevated blood glucose levels resulting from impaired insulin secretion, insulin resistance, excessive hepatic glucose output, and/or contributions from all of the above. In at least one-third of patients with Type 2 diabetes mellitus, the disease progresses to an absolute requirement for insulin therapy.

In order to achieve normal glycemia, insulin replacement therapy is desired to parallel, as closely as possible, the pattern of endogenous insulin secretion in normal individuals. The daily physiological demand for msulin fluctuates and can be separated into two phases: (a) the mealtime phase requiring a pulse (bolus) of insulin to dispose of the meal-related blood glucose surge, and (b) the inter-meal phase requiring a sustained (basal) amount of insulin to regulate hepatic glucose output for maintaining optimal fasting blood glucose.

Because Type 1 diabetes patients produce little or no insulin, effective insulin therapy for Type 1 diabetics generally involves the use of two types of exogenously administered insulin: a rapid-acting, mealtime insulin provided by bolus injections, and a long-acting, basal insulin, administered once or twice daily to control blood glucose levels between meals. Treatment of patients with Type 2 diabetes typically begins with prescribed weight loss, exercise, and a diabetic diet, but when these measures fail to control elevated blood sugars, then oral medications and incretin-based therapy, such as administration of glucagon-like peptide-] (GLP-1) receptor agonists and/or dipeptidyl peptidase 4 (DPP-4) inhibitors that enable increased incretin levels, may be necessary. When these medications are still insufficient, treatment with insulin is considered. Type 2 diabetes patients whose disease has progressed to the point that insulin therapy is required are generally started on a single daily injection of a long-acting, basal insulin, although mealtime injections of rapid-acting insulins may be included, as necessary, in some cases.

Several types of basal insulins are currently available. Insulin glargine, sold under the tradename LANT US®, comprises a modified insulin structure in which the asparagine at position 21 in the insulin A-chain is replaced with glycine, and two arginines are added to the C-terminus of the B-chain. insulin detemir, sold under the tradename LEVEMIR®, comprises a modified insulin structure in which the threonine at position 30 of the B-chain has been deleted and the lysine at position 29 of the B-chain has been derivatized through the covalent linkage of a 14-carbon, myristovl fatty acid to the ε-amine group of lysine at B29. Insulin degludec, available in Europe and Japan under the tradename TRESIBA®, comprises a modified insulin structure in which the threonine at position 30 of the B-chain has been deleted, and the ε-amino group of the lysine at position 29 of the B-chain is covaiently derivatized with hexadecandioic acid via a γ-L-glutamic acid linker. Ail of these insulins are indicated for once-daily

administration.

Treatment regimens involving daily injections of existing insulin therapies can be complicated and painful to administer and can result in undesired side effects, such as hypoglycemia and weight gain. Therefore, many diabetic patients are unwilling or unable to comply, or are incapable of complying, with the insulin therapy necessary to maintain close control of blood glucose levels. Poor glycemic control increases a patient's risk for developing serious diabetes-related complications, including heart disease, stroke, nerve damage, lower limb amputation, vision loss, and kidney disease. Research is being conducted to identify insulin products with longer duration of action; thus, requiring fewer injections than currently available insulin products to improve acceptance and compliance. CN1035091 18 describes proteins with a human insulin B-chain and human insulin A-chain joined by a 4 to 50 amino acid C-peptide connection sequence, and with the insulin A-chain attached directly, without an additional linker, to an immunoglobulin Fc fragment, and states that testing in mice shows that such proteins have an in vivo half-life as long as about 3 days. KR101324828 describes proinsulin analogs linked to immunoglobulin Fc regions through the use of non-peptidyl linkers, and states that such proteins provide increased serum half-life over existing therapies. The publication states that the non-peptidyl linkers represent an improvement over peptide linkers, asserting that fusion proteins using peptide linkers cannot increase the half-life of an active medication in the blood because peptide linkers are easily severed by enzymes in the body.

Despite the disclosures above, and/or in any other publications, the present inventors overcame multiple obstacles to discover fusion proteins comprising insulin receptor agonists, peptide linkers, and human IgG Fc regions that meet the ongoing need for a glucose-lowering product with increased duration of action, sufficient for less frequent dosing than currently available insulin products, including dosing as infrequently as once-weekly. For example, in order to achieve the desired prolonged time action profile, it was necessary to engineer fusion proteins with attenuated potency to avoid rapid receptor mediated clearance, a major route of insulin clearance, but that still have enough potency to provide sufficient glucose lowering. Further, in order to minimize renal clearance, the other major route of insulin clearance, and to regulate peripheral exposure through hydrodynamic size-limited paracellular diffusion, fusion proteins had to be engineered which were sufficiently large, and which would not have the insulin receptor agonist proteolytically cleaved from the human IgG Fc region after being administered, as such cleavage would result in faster than desired renal clearance of the insulin receptor agonist even if the human IgG Fc region remained in circulation. In addition, IgG Fc domains have evolved to self-associate to form stable dimers, and when such a dimer is formed from two fusion proteins, each comprising an insulin moiety fused with an IgG Fc region, the two insulin moieties are brought into close proximity to one another, enabling self-association, or dimerization, of the insulin moieties, mediated through the insulin B-chain self-association regions. Such insulin dimers are inactive, so fusion proteins with reduced self-association of the insulin receptor agonist moieties had to be engineered. Multiple additional challenges were overcome to create a fusion protein suitable for commercial manufacture and formulation as a therapeutic. The present invention provides fusion proteins which have prolonged duration of action compared to existing insulin treatments, allowing for less frequent injections than existing insulin products, including up to once weekly, thus reducing the complexity of and pain associated with existing treatment regimens involving more frequent injections. The fusion proteins of the present invention have a flat pharmacokinetic profile and restricted peripheral exposure, resulting in low day-to-day variability, and minimal incidence of hypoglycemia, including when provided in combination with an additional diabetes medication, such as an incretin-based therapy. The fusion proteins of the present invention may also provide prolonged duration of action without causing weight gain.

The present invention provides a fusion protein comprising:

a) an insulin receptor agonist having the general formula Z1-Z 2 -Z 3 , wherein t) Z 1 is an insulin B-chain analog, comprising the amino acid

sequence:

X 1 X 2 X 3 QHLCGSHLVEALX 4 Lv CGERGFX 5 YX 5 X 7 X 8 X 9

wherein X 1 is F, Q or A; X 2 is V or G; X 3 is N, K, D, G, Q, A or E; X 4 is E, Y, Q, or H; X 5 is H or F; X 6 is G, T, S, H, V or is absent; X 7 is G, E, P, K, D, S, H or is absent; X 8 is G, E, K, P, Q, D, H or is absent; X 9 is G, T, S, E, K, A or is absent, provided that the insulin B-chain analog includes at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 , X 5 , X 6 , X 7 , X 8 , or X 9 (SEQ ID NO: 1);

ti) Z- 2 is a first peptide linker comprising 5 to 10 amino acids, wherein at least 5 of said amino acids are G residues; and

iii) Z 3 is an insulin A-chain analog comprising the amino acid

sequence:

G IVEQCC TS X 1 CS LX 2 Q LEN Y C X 3 X 4

X: is T or I; X 2 is D, Y, Q or E; X 3 is G, N, S or A; and X 4 is any naturally occurring amino acid, or is absent, provided that if X 3 is N, then X 4 must be an amino acid other than G or N (SEQ ID NG:2); b) a second peptide linker; and

c) a human IgG Fc region;

wherein the C-terminal residue of the insulin receptor agonist is directly fused to the N- terminal residue of the second peptide linker, and the C-terminal residue of the second peptide linker is directly fused to the N-terminal residue of the IgG Fc region.

The present invention also provides a fusion protein consisting of:

a) an insulin receptor agonist having the general formula Z 1 - Z 2 - Z 3 , wherein t) Z 1 is an insulin B-chain analog, having the amino acid sequence:

X 1 X 2 X 3 QHLCGSHLVEALX 4 LVCGERGFX 5 YX 6 X 7 X 8 X 9

wherein X 1 is F, Q or A; X 2 is V or G; X 3 is N, K, D, G, Q, A or E; X 4 is E, Y, Q, or H; X 5 is H or F; Xf, is G, T, S, FT, V or is absent; X 7 is G, E, P, K, D, S, H or is absent; X 8 is G, E, K, P, Q, D, H or is absent; X 9 is G, T, S, E, K, A or is absent, provided that the insulin B-chain analog mcludes at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 , X 5 , X 6 , X 7 , X 8 , or XQ (SEQ ID NO: 1);

ii) Z 2 is a first peptide linker comprising 5 to 10 amino acids, wherein at least 5 of said amino acids are G residues; and

iii) Z 3 is an insulin A -chain analog having the amino acid sequence: G IVEQCCTSX 1 CSLX 2 QLENYCX 3 X 4

X·, is T or i; X 2 is D, Y, Q or E; X 3 is G, N, S or A; and X 4 is any naturally occurring amino acid, or is absent, provided that if X 3 is N, then X 4 must be an amino acid other than G or N (SEQ ID NO:2);

b) a second peptide linker: and

c) a human IgG Fc region;

wherein the C-terminal residue of the insulin receptor agonist is directly fused to the N- terminal residue of the second peptide linker, and the C-terminal residue of the second peptide linker is directly fused to the N-terminal residue of the IgG Fc region.

The present invention also provides a pharmaceutical composition comprising a fusion protein of the present invention and at least one pharmaceutically acceptable excipient. The present invention also provides a method of treating a patient with diabetes mellitus, obesity, dyslipidemia or metabolic syndrome, comprising administering to a patient in need thereof a therapeutically effective amount of a fusion protein of the present invention. The present invention also provides a method of treating or preventing a diabetes-related condition selected from the group consisting of heart disease, stroke, nephropathy, retinopathy, and kidney disease, comprising administering to a patient in need thereof a therapeutically effective amount of a fusion protein of the present invention.

The invention also provides a fusion protein of the present invention for use in therapy.

The present invention also provides the use of a fusion protein of the present invention in the manufacture of a medicament.

The present invention also provides polynucleotides encoding a fusion protein of the present invention.

The present invention also provides a process for producing a fusion protein of the present invention, said process comprising the steps of:

1. culturing a mammalian host cell comprising a polynucleotide encoding a fusion protein of the present invention under conditions such that said fusion protein is expressed; and

2. recovering from said host cell a fusion protein;

The present invention also provides a fusion protein produced by the process described above.

Figure 1. Figure 1 provides pharmacodynamic data for exemplary fusion proteins of the present invention in a streptozotocin (STZ)-treated rat diabetes model.

Figure 2. Figure 2 provides a schematic diagram of configurations of proteins described herein. It should be noted that the particular shapes (e.g., ovals, half-circles, etc.) used in the diagrams in Figure 2 are not intended to describe or characterize, and should not be used to construe, the meaning or structure of the individual components of the fusion proteins of the present invention. In certain embodiments, the insulin B -chain analog includes at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 or X 5 of SEQ ID NO: 1 , and at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 6 , X 7 , X 8 , or X 9 of SEQ ID NO: 1.

In certain embodiments, the insulin B-chain analog includes at least two modifications from the amino acid sequence of the B-chain of a molecule of human insulin at X 6 , X 7 , X 8 , or X 9 of SEQ ID NO: 1.

In certain embodiments, the insulin B-chain analog includes modifications from the amino acid sequence of the B-chain of a molecule of human insulin at X 6 , X 7 , X 8 , and X 9 of SEQ ID NO: 1.

In certain embodiments, the insulin B-chain analog has the amino acid sequence of SEQ ID NO: 1 wherein X 6 ,- X 9 are each G.

In certain embodiments, the insulin B-chain analog includes modifications from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 and X 5 of SEQ ID NO: 1,

In certain embodiments, the insulin B-chain analog has the sequence of SEQ ID NO: 1, wherein: X, is F; X 2 is V; X 3 is N or D; X 4 is E; X 5 is H.

In certain embodiments, the insulin B-chain analog includes modification from the amino acid sequence of the B-chain of a molecule of human insulin at each of positions X 4 , X 5 , X 6 , X 7 , X 8 , and X 9 of SEQ ID NO : 1.

In certain embodiments, the insulin B-chain analog comprises the sequence of SEQ ID NO: !, wherein: X 1 is F; X 2 is V; X 3 is N or D; X 4 is E; X 5 is H; and X 6 -X 9 are each G.

In certain embodiments, the insulin A-chain analog includes at least one modification from the amino acid sequence of the human insulin A-chain at X- or X 2 of SEQ ID NO:2.

In certain embodiments, the insulin A-chain analog includes modifications from the amino acid sequence of the human insulin A-chain at both X 1 and X 2 of SEQ ID

NO:2. In certain embodiments, the insulin A-chain analog has the sequence of SEQ ID NO:2, wherein: X 1 is I or T; X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the insulin A-chain analog has the sequence of SEQ ID NO:2, wherein X 3 is N and wherein X 4 is an amino acid other than G, N, S, V L or P.

In certain embodiments, the insulin B -chain analog includes at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 or X 5 of SEQ ID NO: 1 , and at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 6 , X 7 , X 8 , or X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog includes at least one modification from the amino acid sequence of the human insulin A-chain at X·, or X 2 of SEQ ID NQ:2.

In certain embodiments, the insulin B-chain analog includes at least two modifications from the amino acid sequence of the B-chain of a molecule of human insulin at X 6 , X 7 , X 8 , or X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog includes at least one modification from the amino acid sequence of the human insulin A-chain at Xi or X 2 of SEQ ID NO:2,

In certain embodiments, the insulin B-chain analog includes modifications from the amino acid sequence of the B-chain of a molecu le of human insulin at X 6 , X 7 , X 8 , and X 9 of SEQ ID NO: 1 ; and the msulin A-chain analog includes at least one modification from the amino acid sequence of the human msulin A-chain at X: or X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog has the amino acid sequence of SEQ ID NO: 1 wherein X 6 - X 9 are each G; and the insulin A-chain analog includes at least one modification from the amino acid sequence of the human insulin A-chain at X 1 or X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog includes modifications from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 and X 5 of SEQ ID NO: 1 ; and the insulin A-chain analog includes at least one modification from the amino acid sequence of the human insulin A-chain at X: or X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog comprises the sequence of SEQ ID NO: 1, wherein X: 1 is F; X 2 is V; X 3 is N or D; X 4 is E; X 5 is H; and the insulin A-chain analog includes at least one modification from the amino acid sequence of the human insulin A-chain at X 1 or X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B -chain analog includes modification from the amino acid sequence of the B-chain of a molecule of human insulin at each of positions X 4 , X 5 , X 6 , X-7, X 8 , and X 9 of SEQ ID NO: 1 ; and the insulin Λ-chain analog includes at least one modification from the amino acid sequence of the human insulin A-chain at X 1 or X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog has the sequence of SEQ ID NO: 1 , wherein: X, is F; X 2 is V; X 3 is N or D; X 4 is E; X 5 is H; and X 6 -X 9 are each G; and the msulin A-chain analog includes at least one modification from the amino acid sequence of the human insulin A-chain at X: or X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog includes at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 or X 5 of SEQ ID NO: 1, and at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 6 , X 7 , X 8 , or X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog includes modifications from the amino acid sequence of the human insulin A-chain at both X 1 and X 2 of SEQ ID NO:2.

In certain embodiments, the tnsulm B-chain analog includes at least two modifications from the amino acid sequence of the B-chain of a molecule of human insulin at X 6 , X 7 , X 8 , or X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog includes modifications from the amino acid sequence of the human insulin A-chain at both X 1 and X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog includes modifications from the amino acid sequence of the B-chain of a molecu le of human insulin at X 6 , X 7 , X 8 , and X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog includes modifications from the amino acid sequence of the human insulin A-chain at both X 1 and X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog has the amino acid sequence of SEQ ID NO: 1 wherein X 6 -X 9 are each G; and the insulin A-chain analog includes modifications from the amino acid sequence of the human tnsulm A-chain at both X 1 and

X 2 of SEQ ID NO: In certain embodiments, the insulin B-chatn analog includes modifications from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 and X 5 of

SEQ ID NO: l; and the insulin A-chain analog includes modifications from the amino acid sequence of the human insulin A-chain at both X 1 and X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog comprises the sequence of

SEQ ID NO: 1, wherein: X 1 is F: X 2 is V; X 3 is N or D; X 4 is E; X 5 is H; and the insulin

A-chain analog includes modifications from the amino acid sequence of the human

insulin A-chain at both X: and X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog includes modification from the amino acid sequence of the B-chain of a molecule of human insulin at each of positions X 4 , X 5 , X 6 , X 7 , X 8 , and X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog includes modifications from the amino acid sequence of the human insulin A-chain at both and X 1 X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog has the sequence of SEQ ID

NO: 1 , wherein: X 1 is F; X 2 is V; X 3 is N or D; X 4 is E; X 5 is H; and X 6 -X 9 are each G;

and the insulin A-chain analog includes modifications from the amino acid sequence of the human insulin A-chain at both X 1 and X 2 of SEQ ID NO:2.

In certain embodiments, the insulin B-chain analog includes at least one

modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 or X 5 of SEQ ID NO: 1 , and at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at X 6 , X 7 , X 8 , or X 9 of S EQ ID NO: 1 ; and the insulin A-chain analog has the sequence of SEQ ID NO:2, wherein: X 1 is I or T; X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the insulin B-chain analog includes at least two

modifications from the amino acid sequence of the B-chain of a molecule of human

insulin at X 6 , X 7 , X 8 , or X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog has the

sequence of SEQ ID NO:2, wherein: X 1 is I or T; X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the insulin B-chain analog includes modifications from the amino acid sequence of the B-chatn of a molecule of human insulin at X 6 , X 7 , X 8 , and X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog has the sequence of SEQ ID NO:2, wherein: X 1 is I or T; X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the insulin B-chain analog has the amino acid sequence of SEQ ID NO: 1 wherein X 6 -X 9 are each G, wherein: X. is I or T; X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the insulin B-chain analog includes modifications from the amino acid sequence of the B-chain of a molecule of human insulin at X 4 and X 5 of SEQ ID NO: 1 ; and the insulin A-chain analog has the sequence of SEQ ID NO:2, wherein: X 1 is I or T; X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the insulin B-chain analog comprises the sequence of

SEQ ID NO: 1, wherein: X 1 is F; X 2 is V; X 3 is N or D; X 4 is E; X 5 is H; and the insulin A-chain analog has the sequence of SEQ ID NO:2, wherein: X 1 is I or T; X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the insulin B-chain analog includes modification from the amino acid sequence of the B-chain of a molecule of human insulin at each of positions X 4 , X 5 , X 6 , X 7 , X 8 , and X 9 of SEQ ID NO: 1 ; and the insulin A-chain analog has the sequence of SEQ ID NO:2, wherein: X 1 is I or T: X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the insulin B-chain analog has the sequence of SEQ ID NO: 1 , wherein: X: is F; X 2 is V; X 3 is N or D; X 4 is E; X 5 is II; and X 6 -X 9 are each G; and the insulin A-chain analog has the sequence of SEQ ID NO:2, wherein: X 1 is I or T; X 2 is D; X 3 is G; and X 4 is absent.

In certain embodiments, the first peptide linker (Z 2 ) comprises the amino acid sequence: X 1 GX 2 GGGG, wherein X 1 is G or is absent; and X 2 is G, S or is absent (SEQ ID NO:3).

In certain embodiments, the insulin receptor agonist, i.e., Z 1 Z 2 -Z 3 in the fusion protein described above, comprises the amino acid sequence:

X 1 X 2 X 3 QHLCGSHLVEALX 4 LVCGERGFX 5 YX 6 X 7 X 8 X 9 X 10 GX 1 1 GGGGGIVEQCCTSX 12 CSL X 13 QLENYCX 1 4 X 1 5

wherein X 1 is F, Q or A; X 2 is V or G; X 3 is N, K, D, G, Q, A or E; X 4 is E, Y, Q, or H; X 5 is H or F; X 6 is G, T, S, H, V or is absent; X 7 is G, E, P, K, D, S, H or is absent; X 8 is G, E, K, P, Q, D, H or is absent; X 9 is G, T, S, E, K, A or is absent; X 10 is G or is absent; X 11 is G, S or is absent; X 12 is T or I; X 13 is D, Y, Q or E; X 14 is G, N, S or A; and X 15 is any naturally occurring amino acid, or is absent, provided that at least one of X 4 , X 5 , X 6 , X 7 , X 8 , or X 9 must be a different amino acid than that found, respectively, at position B 16 , B 25 , B 27 , B 28 , B 29 or B 30 of the B-chain of a molecule of human insu lin, and further provided that if X 14 is N, then X 15 must be an amino acid other than G or N (SEQ ID NO:4).

In certain preferred embodiments, the insulin receptor agonist, i.e., Z 1 -Z 2 -Z 3 in the fusion protein described above has the following amino acid sequence:

FVNQHLCGSHLVEALELVCGERGFHYGGGGGGSGGGGGIVEQCCT STCSLDQLENYCG

(SEQ ID NO:5).

In certain embodiments, the second peptide linker is a peptide having between 10 and 25 amino acids, wherein at least 50% of said amino acids are G residues. In certain embodiments, the second peptide linker is a peptide comprising the amino acid sequence [GGGGX] n , wherein X is Q, E or S; and wherein n is 2-5. In certain embodiments, the second peptide linker comprises the amino acid sequence:

GGGGX 1 GGGGX 2 GGGGX 3 GGGGX 4 X 5 X 6

wherein X 1 is Q or E; X 2 is Q or E; X 3 is Q or E; X 4 is G, E, Q or is absent; X 5 is G or absent; and X 6 is G or is absent (SEQ ID NO: 6).

In certain preferred embodiments, the second peptide linker has the amino acid sequence:

GGGGQGGGGQGGGGQGGGGG ( SEQ I D NO : 7 ) .

In certain embodiments, the human IgG Fc region comprises fragments from one heavy chain of an IgG antibody. A schematic diagram of a fusion protein comprising such an IgG region is provided in diagram (A) in Figure 2. In other embodiments, the human IgG Fc region comprises fragments from two heavy chains of an IgG antibody. A schematic diagram of a fusion protein comprising such an IgG region is provided in diagram (B) in Figure 2.

In certain embodiments, the human IgG Fc region is an Fc region from an IgGl , IgG2 or IgG4 antibody. In certain embodiments, the human IgG Fc region is an Fc region from an IgGl antibody comprising the following amino acid sequence:

CPPCPAPELLGGPS VX, LX 2 PPKPKDTLMISRTPEVTCX 3 VX 4 DVSHEDPEVKFNWY

VDGVEVHNAKTKPREEQYXjSTYRVVSVLTVLHQDWLNGKEYKCKVSNKALPA PIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDIAVEWESNGQ PENNYKTTPPVLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQKS LSLSPGX 6

wherein X 1 is F, Q or E; X 2 is F, Q or E; X 3 is V or T; X 4 is V or T; X 5 is N, D or Q; and X 6 is K or is absent. (SEQ ID NO: 8)

In certain embodiments, the IgG Fc region comprises the amino acid sequence of SEQ ID NO: 8 and further comprises some or all of the amino acids that would be found in a wild-type IgGl Fc sequence to the N-terminal side of the C residue at position 1 in SEQ ID NO:8.

Preferably, the human IgG Fc region is from either an IgG2 or IgG4 antibody. In certain embodiments, the human IgG Fc region is an Fc region from an IgG4 antibody comprising the following amino acid sequence:

PCPPCPAPEAAGGPSVX 1 LX 2 PPKPKDTLMISRTPEVTCX 3 VX 4 DVSQEDPEVQFNW

Y^DGVEVIMAKTKPREEQFXsSTYRWSVLTYLHQDWLNGKEYKCKVSNKGLP SSIEKTISKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNG

QPENNYKTTPPVLDSDGSFFLYSX 6 LTVDKSRWQEGNVFSCSVMHEALHNHYTQ KSLSLSLGX 7

wherein X 1 is F, Q or E; X 2 F, Q or E; X 3 is V or T; X 4 is V or T; X 5 is N, D or Q; X 6 is R, or K; X 7 is K or is absent. (SEQ ID NO: 9)

In certain embodiments, the IgG Fc region comprises the amino acid sequence of SEQ ID NO: 9, and further comprises some or all of the amino acids that would be found in a wild type IgG4 Fc sequence to the N-terminal side of the C residue at position 1 in SEQ ID NO: 9. In certain embodiments, the human IgG Fc region comprises the amino acid sequence of SEQ ID NO:9, wherein X 1 is F; X 2 is F; X 3 is V; X 4 is V; X 5 is N; X 6 is R; and X 7 is absent. In certain embodiments, the human IgG Fc region is an Fc region from an XgG2 antibody having the following amino acid sequence:

wherein X 1 is F, Q or E; X 2 is F, Q or E; X 3 is V or T; X 4 is V or T; X 5 is N, D or Q; and X 6 is K or absent, (SEQ ID NO: 10)

In certain embodiments, the IgG Fc region comprises the amino acid sequence of

SEQ ID NO: 10, and further comprises some or all of the amino acids that would be found in a wild type IgG2 Fc sequence to the N -terminal side of the E residue at position 1 in SEQ ID NO: 10. In certain embodiments, the human IgG Fc region comprises the amino acid sequence of SEQ ID NO: 10, wherein X 1 is F; X 2 is F; X 3 is V; X 4 is V; X 5 is N; and X 6 is absent.

In certain embodiments, the fusion protein comprises the amino acid sequence:

wherein X 1 is F, Q or A; X 2 is V or G; X 3 is N, K, D, G, Q, A or E; X 4 is E, Y, Q, or H; X 5 is H or F; X 6 is G, T, S, H, V or is absent; X 7 is G, E, P, K, D, S, H or is absent; X 8 is G, E, K, P, Q, D, H or is absent; X 9 is G, T, S, E, K, A or is absent, provided that at least one of X 4 , X 5 , X 6 , X 7 , X¾ or X 9 is an amino acid other than that which is present, respectively, at position Β 16 , B 25 , B 27 , B 28 , B 29 or B 30 of a human insulin B-chain: X 10 is G or is absent; X 11 is G, S or is absent; X 12 is T or I; X 13 is D, Y, Q or E; X 14 is G, N, S or A; X 15 is any naturally occurring amino acid, or is absent, provided that if X 14 is N, then X 15 must be an amino acid other than G or N; X 16 is Q or E; X 17 is Q or E; X 18 is Q or E; X 19 is G, E, Q or is absent; X 20 is G or absent; X 21 is G or is absent; X 22 is E or P; X 23 is E or P; X 24 is A or V; X 25 is G or is absent; X 26 is Q or H; X 27 is Y or F; X 28 is L or V; X 29 is S or A; X 30 is S or P; X 31 is A or T; X 32 is Q or R; X 33 is V or M; X 34 is R or K; X 35 is E or Q; and X 36 is L or P (SEQ ID NO: 11).

In certain embodiments, the present invention provides a fusion protein selected from the group consisting of SEQ ID NO: 12, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21 , SEQ ID NO:22, SEQ ID NO:23, and SEQ ID NO:24.

In a referred embodiment, the fusion rotein has the amino acid se uence:

In certain embodiments, the fusion proteins of the present invention are present in the form of a dimer. A schematic diagram of such a dimer is provided in diagram (C) in Figure 2. In certain embodiments, the dimer is a homodimer, wherein the amino acid sequences of the two fusion proteins that make the dimer are the same. In certain embodiments, the dimer is a heterodimer, wherein the amino acid sequences of the two fusion proteins that make up the dimer are different.

In certain embodiments, the pharmaceutical composition of the present invention comprises a fusion protein of the present invention, a buffering agent, a surfactant, and an isotonicitv agent. In certain embodiments, the buffering agent is citric acid and/or citrate, the surfactant is polysorbate 80, and the isotonicity agent is mannitol. In certain embodiments, the pH of the composition ranges from about 5.5 to about 8.0. In certain embodiments, the pH ranges from about 6,0 to about 7.4. In certain embodiments, the pH ranges from about 6.0 to 6.75.

in certain embodiments, the pharmaceutical composition further comprises an additional active ingredient. In certain embodiments, the additional active ingredient is an incretin-based therapy. In certain preferred embodiments, the incretin-based therapy is a GLP-1R agonist. Preferably, the GLP-1R agonist is dulaglutide.

In certain embodiments, the method of the present invention comprises administration of a therapeutically effective amount of a fusion protein once daily. In preferred embodiments, a therapeutically effective amount of the fusion protein is administered once weekly. In certain embodiments, a therapeutically effective amount of the fusion protein is administered once monthly. In certain embodiments, the present invention provides a method of treating a patient with diabetes mellitus while reducing the risk of hypoglycemia and/or weight gain, comprising administering to the patient a therapeutically effective amount of a fusion protein of the present invention.

The present invention also provides a method of treating a patient with diabetes mellitus, obesity, dyslipidemia, and/or metabolic syndrome, comprising administering a therapeutically effective amount of a fusion protein of the present invention in combination with an additional active ingredient. The fusion protein and additional active ingredient in such embodiments may be administered simultaneously, sequentially or in a single combined formulation. In certain embodiments, the additional active ingredient is an incretin-based therapy. In certain preferred embodiments, the incretin- based therapy is a GLP-1R agonist. Preferably, the GLP- 1 R agonist is dulaglutide. In certain embodiments, the combination is admmistered once daily. In certain preferred embodiments, the combination is administered once weekly. In certain embodiments, the combination is administered once monthly.

In certain embodiments, the present invention provides a fusion protein of the present invention for use in treatment of diabetes mellitus, obesity, dyslipidemia or metabolic syndrome. In certain embodiments, the present invention provides a fusion protein of the present invention for use in treating or preventing a diabetes-related condition selected from the group consisting of heart disease, stroke, nephropathy, retinopathy, and kidney disease. In certain embodiments, the present invention provides a fusion protein of the present invention for use in treating a patient with diabetes mellitus while reducing the risk of hypoglycemia and/or weight gain, comprising administering to the patient a fusion protein of the present invention. In certain embodiments, the fusion protein of the present invention is provided for use in simultaneous, separate or sequential combination with another active ingredient. In certain embodiments, the additional active ingredient is an incretin-based therapy. In certain preferred embodiments, the incretin- based therapy is a GLP-1R agonist.

In certain embodiments, the present invention provides the use of a fusion protein of the present invention in the manufacture of a medicament for the treatment of diabetes mellitus, obesity, dyslipidemia or metabolic syndrome. In certain embodiments, the present invention provides the use of a fusion protein of the present invention in the manufacture of a medicament for the treatment or prevention of a diabetes-related condition selected from the group consisting of heart disease, stroke, nephropathy, retinopathy, and kidney disease. In certain embodiments, the present invention provides the use of a fusion protein of the present invention in the manufacture of a medicament for treating dia betes mellitus while reducing the risk of hypoglycemia and/or weight gain, comprising administering to the patient a fusion protein of the present invention. In certain embodiments, the present invention provides the use of a fusion protein of the present invention in the manufacture of a medicament for the treatment of diabetes mellitus, obesity, dyslipidemia or metabolic syndrome, wherein the medicament is to be administered simultaneously, separately or sequentilaly in combination with another active ingredient.

When used herein, the term "insulin receptor agonist" refers to a protein that binds to and activates the insulin receptor, resulting in a lowering of blood glucose levels and/or suppression of hepatic glucose output, characteristics which can be tested and measured using known techniques, such as those shown in the studies described below.

The insulin receptor agonist portion of the fusion proteins of the present in vention includes an analog of an insulin B-chain and an analog of an insulin A-chain. When used herein, the terms "insulin A-chain" and "insulin B-chain" refer to the A and B chains of the human insulin molecule (CAS No. 1 1061-68-0), whose native wild type sequences are well-known. The human msulin A-chain consists of 21 amino acids, referred to in the art as A 1 -A 21 , having the following sequence:

GI VEQCCTS I CSLYQLENYCN (SEQ ID NO; ! 3).

The human insulin B-chain consists of 30 amino acids, referred to in the art as Bi-

B 30 , having the following sequence:

FWQHLCGS HLVEALYLVCGERGFFYTPKT (SEQ ID NO: 14).

In a molecule of human insulin, the A- and B-chains are joined by two disulfide bonds, CysA7-CysB7 and CysA20-CysB19. The A-chain has an intra-chain disulfide bond at CysA6-CysAl 1.

To achieve the desired extended time action profile, the insulin receptor agonist of the fusion protein of the present invention must remain in circulation, and be capable of interacting with the msulin receptor, over an extended period of time. In order for the fusion proteins of the present invention to remain in circulation for the desired period of time, elimination of the fusion proteins must be attenuated. The two primary routes of insulin elimination are renal clearance and insulin receptor-mediated clearance. See. Igiesias P, et al. Diabetes Obes. Metab. 2008; 10:811-823. To minimize renal clearance, a molecule with hydrodynamic size of at least about the size of human serum albumin is needed, and such a hydrodynamic size is provided in the fusions proteins of the present invention by the human IgG Fc region. As for receptor-mediated clearance, the fusion protein cannot be so potent at the insulin receptor that it results in more rapid receptor- mediated clearance than desired, but the fusion protein must be potent enough, however, to provide sufficient glucose control at doses that are commercially feasible. Thus, the potency of the fusion protein must be carefully balanced, and the structure of the fusion protein of the present invention allows it to achieve such a balance.

Insulin molecules have a tendency to self-associate into dimers and hexamers. Numerous roles have been proposed for the evolutionarily-conserved, self-association tendencies of msulin, including: (1) chemical and thermal stabilization of the molecule during intracellular vacuole storage; (2) protection of the monomeric insulin from fibrillation in vivo; (3) substitution for chaperone-assisted stabilization and folding during intracellular expression; and/or (4) essential for secretory trafficking. The active form of insulin, however, is the monomer.

The tendency of insulin molecules to self-associate, and the inactivity of such self- associated molecules, is relevant to the present invention, because human IgG Fc regions also tend to self-associate to form dimers, typically associated covalently through disulfide bonds in the hinge region, and such dimers are formed from the human IgG Fc region s in the fusion proteins of the present invention. As a result of the dimerization of the human IgG Fc regions, the two insulin receptor agonist "arms" are in close proximity to one another, and thereby exist at relatively high local concentration. In the case of human insulin, such close proximity would tend to favor self-association, or dimerization, of the insulin moieties, affecting the activity of the molecules. A schematic diagram of an Fc fusion protein dirtier with insulin moiety arms that have become self-associated, or dimerized, is provided in diagram (D) of Figure 2. The tendency of insulin molecules to self-associate could also lead to self-association, or dimerization, of insulin moieties from more than one Fc fusion protein dimer; a schematic diagram of a dimer of two Fc fusion protein dimers with self-associated, or dimerized, insulin moieties is provided in diagram (E) of Figure 2, Further, the insulin moieties in more than two Fc fusion protein dimers could also self-associate in such a fashion to form, for example, a trimer comprised of three dimers, or even higher order aggregates comprised of more than three dimers.

The insulin receptor agonist portion of the fusion protein of the present invention, however, has a reduced tendency to self-associate or dimerize, and thus fusion protein dimers comprised of fusion proteins of the present invention tend to favor the structure depicted in diagram (C) of Figure 2, as opposed to the structures depicted in diagrams (D) and (E) of Figure 2. Thus, while the present invention provides a dimer of two fusion proteins, the insulin receptor agonist "arm" of each fusion protein in the dimer maintains a predominately monomeric state, as depicted for example in diagram (C) of Figure 1 , and is thus more capable of interacting with the insulin receptor.

In the fusion proteins of the present invention, the analog of the insulin B-chain in the insulin receptor agonist includes one or more modifications to the amino acid sequence of the human insulin B-chain. In particular, in order to reduce the propensity of the insulin receptor agonist portions to self-associate, or dimerize, the insulin B-chain analog includes one or more modifications from the B-chain of a molecule of human insulin at positions B 1 6 , B 25 or B 27-30 , which are represented in SEQ ID NO: 1 as positions X 4 , X 5 and X 6-9 , respectively. For example: X 4 (which corresponds with B 16 in the B- chain of a human insulin molecule) may be modified to E, Q or H; X 5 (which corresponds with B 25 in the B-chain of a human insulin molecule) may be modified to H; X 6 (which corresponds with B 27 in the B-chain of a human insulin molecule) may be deleted or modified to G, S, H or V; X 7 (which corresponds with B 28 in the B-chain of a human insulin molecule) may be deleted or modified to G, E, K, D, S or H; X 8 (which corresponds with B 29 in the B-chain of a human insulin molecule) may be deleted or modified to G, E, P, Q, D or H; and X 9 (which corresponds with B 30 in the B-chain of a human insulin molecule) may be deleted or modified to G, S, E or K. In addition to reducing the propensity of the insulin receptor agonist portions to self-associate, modifications to positions B 16 and B 25 on the insulin B-chain analog - X 4 and X 5 of SEQ ID NO: 1 , respectively - may also be made to adjust potency, improve expression, improve chemical and/or physical stability, improve the ease with which the fusion proteins can be formulated with other commonly used excipients and/or to eliminate deamidation. The insulin B-chain analog may also include additional modifications for these reasons. Referring to the variables in SEQ ID NO: 1 , such additional modifications include the following: modification of X 1 (which corresponds with B 1 in the B-chain of a human insulin molecule) to Q or A; modification of X 2 (which corresponds with B 2 in the B-chain of a human insulin molecule) to G; and/or modification of X 3 (which corresponds with B 3 in the B-chain of a human insulin molecule) to K, D or G.

in certain preferred embodiments, the insulin B-chain analog includes more than one modification to the amino acid sequence of the human insulin B-chain at positions X 4 , X 5 and X 6 -y of SEQ ID NO : 1 . In a preferred embodiment, X 4 is E and X 5 is H. In certain embodiments, the amino acid sequence of X 6 -X 9 of SEQ ID NO: 1 is selected from the group consisting of: GGES, GGGS, GGDS, GGEG, GGGG, SSES, SSGS, GGEE, GGGE, GGEK, GGGK, TPGS, TGGS, HGES, GHES, GGHS, GGEH, HGGS, GHGS, GGGH, GGDD, VGES, ΤΈΕΤ, TKPT, GGGG, TGGG, TPGG, EPKT, TDKT, TPGS, EGGS, EGES, EEES, EPES, EPEP and GGDD. in preferred embodiments, the sequence of these four amino acids is GGG S, GGGG or TΈΕΤ. In particularly preferred embodiments, X 4 is E, X 5 is H and X 6-9 is GGGG.

It should be noted that, while X 6 -X 9 of SEQ ID NO: 1 are described above as comprising the C-terminal end of the insulin B-chain analog (Z 1 ), these amino acids are not critical to the activity of the fusion proteins at the insulin receptor, and thus may alternatively be considered an extension of the first peptide linker (Z 2 ). For example, in the context of SEQ ID NO:4, X 6 -X 9 may be considered either part of the insulin B-chain analog or the first peptide linker in that insulin receptor agonist.

In the fusion proteins of the present invention, the analog of the insulin A-chain in the insulin receptor agonist portion may include one or more modifications to the amino acid sequence of the human insulin A-chain intended to improve chemical and physical stability, adjust potency, and/or enhance expression. Referring to the variables in SEQ ID NO:2, these modifications include the following: modification of X 1 (which corresponds with Aioin the A-chain of a human insulin molecule) to T; modification of X 2 (which corresponds with A 14 in the A-chain of a human insulin molecule) to D, Q or E; and/or modification of X 3 (which corresponds with A 21 in the A-chain of a human insulin molecule) to G, S or A. In a preferred embodiment, X 1 is T, X 2 is D, and X 3 is G.

Further, in order to avoid deamidation as well as chemical and/or proteolytic cleavage, if the amino acid at position 21 in the analog of the insulin A-chain in the insulin receptor agonist, i.e., X 3 in SEQ ID NO: 2, is an N - the amino acid that is present at the corresponding position in a molecule of human insulin - it must not be immediately followed at the C-terminal side by certain amino acids, such as a G or an N, or, in certain embodiments, a P, S, V or L. See, e.g., Vlasak J, lonescu R., MAbs. 2011 May- Jun;3(3):253-63. Fragmentation of Monoclonal Antibodies. It should be noted that while this requirement is recited in the first fusion protein described above in the context of the options for positions X 4 and X 5 in SEQ ID NO:2, which pertains to the analog of the insulin A-chain, it is not critical for the non-glycine residue following an asparagine residue at position 21 in the analog of the insulin A-chain to be considered part of the insulin receptor agonist, as opposed to the second peptide linker. For example, in the context of SEQ ID NO: 1 1 , the residue corresponding with position 21 in the A-ehatn analog portion is represented by X 14 , and the following amino acid, X 1 5 , could either be considered part of the insulin receptor agonist or the second peptide linker.

As described above, in the fusion proteins of the present invention, the C-tenninal residue of the insulin B-chain analog is directly fused to the N-terminal residue of a first peptide linker and the C-terminal residue of the first peptide linker is directly fused to the N-terminal residue of the insulin B-chain analog. The first peptide linker must provide sufficient flexibility for the analogs of the insulin A-chain and B-chain to achieve the structure necessary to bind to the insulin receptor, but must not be so long that it unduly interferes with that binding. The length and composition of the first peptide linker may be adjusted in order to adjust the potency and/or expression of the fusion proteins, in some embodiments, the first peptide linker is 5 to 10 amino acids in length, at least 5 of which are G residues. In certain embodiments, the amino acid sequence of the first peptide linker is selected from the group consisting of: GGGGGG, GGGGG, EGGGGG, GEGGGG, GGEGGG, GGGEGG, GGGGEG, GGGGGE, KGGGGG, GKGGGG, GGKGGG, GGGKGG, GGGGKG, GGGGGK, HGGGGG, GHGGGG, GGHGGG, GGGHGG, GGGGHG, GGGGGH, GGGGGA, GGGGGR, SGGGGG, GSGGGG, GGSGGG, GGSGGGK, GGSGGGG and GGSGGG. In certain preferred embodiments, the sequence of the first peptide linker comprises SEQ ID NO:3. Most preferably, the sequence of the first peptide linker is GGSGGGG (SEQ ID NO: 15), The insulin receptor agonist portion of the fusion proteins of the present inven tion also includes the disulfide bonds found in a molecule of human insulin as described above, namely, two disulfide bonds joining the analogs of the insulin A-chain and B-chain at CysA7-CysB7 and CysA20-CysB19, and an intra-chain disulfide bond in the analog of the insulin A-chain at CysA6-CysA11.

As described above, the C-terminal residue of the insulin receptor agonist portion of the fusion proteins of the present invention is fused to the N -terminal residue of a second peptide linker, and the C-terminal residue of the second peptide linker is fused directly to the N-terminal residue of the Fc portion. It is preferred that the second peptide linker be glycine rich, to provide sufficient conformational flexibility. Preferably, the second peptide linker is less than 30 amino acids in length. In certain preferred embodiments, the second peptide linker is between 10 and 25 amino acids in length, with at least 50% of the amino acids being glycine residues. A preferred second peptide linker includes the sequence (GGGGX 15 ) n wherein X 15 is Q, E or S and n = 2-5. A more preferred second peptide linker has the amino acid sequence of SEQ ID NO:6. A most preferred second peptide linker has the amino acid sequence

GGGGQGGGGQGGGGQGGGGG (SEQ ID NO:7).

As used herein, the term "human IgG Fc region" has the meaning commonly given to the term in the field of immunology. Specifically, this term refers to a human IgG antibody fragment which is obtained by removing the two antigen binding regions (the Fab fragments) from the antibody. In particular, the Fc region includes the CH2 and CH3 constant region domains of the antibody, and may also include some or all of the hinge region.

As described above, in certain embodiments of the fusion protein of the present invention, the human IgG Fc region comprises fragments of the constant region from one heavy chain of an IgG antibody, a schematic depiction of which is provided in diagram (A) of Figure 2, and in other embodiments, the human IgG Fc region comprises fragments of the constant regions from two heavy chains of an IgG antibody, a schematic depiction of which is provided in diagram (B) of Figure 2. In this embodiment, the constant regions of the two heavy chains are associated with one another through non-covalent interactions and disulfide bonds.

There are four IgG subclasses (Gl, G2, G3, and G4) each of which has different structures and biological functions known as effector functions. These effector functions are generally mediated through interaction with the Fc receptor (FcyR) or by binding complement factor Clq. Binding to FcyR can lead to antibody-dependent cell-mediated cytolysis, whereas binding to complement factors can lead to complement-mediated cell lysis. The structures and properties of the Fc regions of the IgG subclasses are known in the art. The fusion proteins of the present invention may contain Fc regions from any of the IgG subclasses, although G2 and G4 have lower receptor binding and effector function activities than G1 and G3 antibodies, and are thus preferred. When used herein, the term human IgG Fc region also includes versions of such antibody fragments which have been modified, elongated and/or truncated, for example, to alter properties or characteristics such as the complement and/or Fc receptor binding functions, effector functions, disulfide bond formation, glycosylation, antibody -dependent cell-mediated cytotoxicity (ADCC), manufacturability, and/or stability. For example, the human IgG Fc regions of the fusion proteins of the present invention may be modified to reduce or remove the N-linked glycosylation site, which will reduce Clq binding affinity and cytotoxicity, and which may aid in immunogenicity, affect conformational stability and in vivo clearance rate, and/or modify the effector functions. The human IgG Fc regions of the fusion proteins of the present invention may also have some or all of the hinge region removed in order to simplify disulfide mediated Fc dimerization. Other examples of alterations include phosphorylation, sulfation, acylation, glycosylation, methylation, acetyiation, amidation, and/or modifications to enable production of heterodimer molecules. Techniques for modifying the structures and properties of the human IgG Fc regions of the IgG subclasses are known in the art.

Regardless of the final structure of the fusion protein, the human IgG Fc regi on must serve to prolong the in vivo plasma half-life of the insulin receptor agonist. In preparing heterologous Fc fusion proteins wherein the Fc portion is being utilized for its ability to extend half-life, it is important to minimize any effector function. Furthermore, the fused insulin receptor agonist must remain able to bind to and activate the insulin receptor to result in a lowering of blood glucose levels and/or suppression of hepatic glucose output, characteristics which can be tested and measured using known techniques, such as those shown in the studies described below. A long half-life in the fusion proteins of the present invention can be demonstrated using, for example, the methods described below.

One preferred human IgG Fc region is an IgG4 Fc region modified to further reduce effector function, promote homodimer formation, and having a portion of the hinge deleted, as in SEQ ID NO:9 wherein X. is F; X 2 is F; X 3 is V; X 4 is V; X 5 is N; X 6 is R; and X 7 is absent. Another preferred human IgG Fc region is an IgG2 Fc region having a portion of the hinge deleted, as in SEQ ID NO: 10 wherein X 1 is F; X 2 is F; X 3 is V; X 4 is V; X 5 is N; and X 6 is absent.

It should be noted that, although the amino acid sequences of the preferred human IgG Fc regions recited above have portions of the hinge regions removed to simplify disulfide mediated dimerization, those hinge regions may be present in certain

embodiments. For example, a wild-type IgG2 Fc region includes the six amino acid sequence ERKCCV at its N-terminal end, and although these amino acids are not recited in the IgG2 Fc region sequence set forth in SEQ ID NO: 10, it is contemplated that a human IgG Fc region which comprises the amino acid sequence set forth in SEQ ID

NO: 10 may further comprise some or all of the six amino acid sequence ERKCCV at its N-terminal end. Similarly, a human IgG Fc region which comprises the amino acid sequence set forth in SEQ ID NO: 9 may further comprise some or all of the six amino acids found at the N-terminal end of an IgG4 Fc region, namely ESKYGP. Likewise, a human IgG Fc region which comprises the amino acid sequence set forth in SEQ ID NO: 8 may further comprise some or all of the ten amino acids found at the N-terminal end of an IgGl Fc region, namely: EPKSCDKTHT. Moreover, the precise delineation between which amino acid constitutes the C-terminal end of the second peptide Itnker and which amino acid constitutes the N-terminal end of the human IgG Fc region is not critical to the structure or function of the fusion protein of the present invention. For example, in the context of SEQ ID NO: 11 , the residues corresponding with positions X< 9- X 22 could be described as either the C-terminal end of the second peptide Itnker or an N- terminal extension of the human IgG Fc region.

As described above, the present invention also relates to polynucleotides that encode any of the fusion proteins of the present invention. The polynucleotides encoding the above-described fusion proteins may be in the form of RNA or DNA, which includes cDNA and synthetic DNA, and which may be double-stranded or single-stranded. The coding sequences that encode the proteins of the present invention may vary as a result of the redundancy or degeneracy of the genetic code. The polynucleotides that encode for the fusion proteins of the present invention may include the following: only the coding sequence for the proteins, the coding sequence for the proteins and additional coding sequence, such as a leader or secretory sequence or a pro-protein sequence; the coding sequence for the proteins and non-coding sequence, such as mtrons or non-coding sequence 5' and/or 3' of the coding sequence for the proteins. Thus, the term "polynucleotide encoding a protein" encompasses a polynucleotide that may include not only coding sequence for the proteins but also a polynucleotide that includes additional coding and/or non-coding sequence.

' The polynucleotides of the present invention will be expressed in a host cell after the sequences have been operably linked to an expression control sequence. The expression vectors are typically replicable in the host organisms either as episomes or as an integral part of the host chromosomal DNA. Commonly, expression vectors will contain selection markers to permit detection of those cells transformed with the desired DNA sequences.

The fusion proteins of the present invention may readily be produced in mammalian cells such as CHO, NSO, HEK293, BHK, or COS cells; in bacterial cells such as E. coli, Bacillus subtilis, or Pseudomonas fluorescence; in insect cells, or in fungal or yeast cells, which are cultured using techniques known in the art. Insect, and yeast or other fungal cells, however, produce non-human N-glycans, so proteins with N- linked glycosylation produced in such cells may cause immunogenic reactions if administered to patients. Production in such cells thus requires elimination of N-linked glycosylation sites and/or genetic engineering of the cells to produce human N-glycans using techniques known in the art. See, e.g., Hamilton SR, et al, Production of complex human glycoproteins in yeast, 301 SCIENCE (5637): 1244-6 (August 2003). Production in mammalian cells is preferred, and the preferred mammalian host cell is the CHOK1 SV cell line containing a glutamine synthetase (GS) expression system (see US. Pat. No. 5,122,464).

The vectors containing the polynucleotide sequences of interest (e.g., the fusion proteins and expression control sequences) can be transferred into the host cell by well- known methods, which vary depending on the type of cellular host. For example, calcium chloride transformation is commonly utilized for prokaryotic cells, whereas calcium phosphate treatment or electroporation may be used for other cellular hosts.

Various methods of protein purification may be employed and such methods are known in the art and described, for example, in Deuischer, Methods in Enzymology 182: 83-89 (1990) and Scopes, Protein Purification: Principles and Practice, 3rd Edition, Springer, N.Y. (1994).

As described above, the fusion protein of the present invention in certain embodiments is produced as a dimer. In such a dimer, the human IgG Fc regions of the fusion proteins are associated with one another through non-cova!ent interactions and disulfide bonds, A schematic depiction of such a dimer is provided in diagram (C) of Figure 2. When the amino acid sequences of the two fusion protems that make up such a dimer - e.g., Fusion Protein A and Fusion Protein B in the dimer depicted in diagram (C) of Figure 2 - are the same, the dimer is referred to herein as a "homodimer." As noted above, expression of fusion proteins of the present invention in mammalian cells is preferred, and expression in such cells results in homodimers. The fusion proteins in such homodimers are associated through non-covalent interactions and intermolecular disulfide bonds in the Fc portion. For example, the protein produced by a gene which encodes the fusion protein of SEQ ID NO: 12 would be a homodimer covalently bonded through interchain disulfide bonds, namely C80 to C8Q and C83 to C83.

When the amino acid sequences of two fusion proteins that make up a dimer - e.g., Fusion Protein A and Fusion Protein B in diagram (C) of Figure 2 - are different, the dimer is referred to herein as a "heterodimer." Such a heterodimer may be prepared by techniques known in the art. See, e.g, Lewis SM, et al. NAT. BIQTECHNOL, 32(2):191-8 (2014); Carter, J. IMMUNOL. METHODS, 248(1-2):7-15 (2001): Ridgway, J. B. et al.

PROTEIN ENG. 9(7):617-2 (1996).

References herein to pharmaceutical compositions comprising a fusion protein include pharmaceutical compositions which contain a homodimer of that fusion protein, and/or which contain a heterodimer, wherein one member of the heterodimer is that fusion protein. Similarly, references herein to methods comprising administering a fusion protein, include methods comprising administering a homodimer of that fusion protein and/or administering a heterodimer, wherein one member of the heterodimer is that fusion protein. Likewise, references to a fusion protein for use in therapy and/or a fusion protein for use in the manufacture of a medicament include a homodimer of that fusion protein, and/or a heterodimer wherein one member of the heterodimer is that fusion protein, for use in therapy and/or in the manufacture of a medicament,

The term "treatment" or "treating," as used herein, refers to the management and care of a patient having diabetes or hyperglycemia, or other condition for which insulin administration is indicated for the purpose of combating or alleviating symptoms and complications of those conditions. ' Treating includes administering fusion proteins of the present invention to prevent or delay the onset of symptoms or complications, alleviating the symptoms or complications, or eliminating the disease, condition, or disorder. The patient to be treated is a mammal, and preferably, a human being.

The term "prevent" or "preventing," as used herein, refers to reducing the risk or incidence of, or eliminating or slowing the progression of, one or more conditions, symptoms, complications or disorders.

The fusion proteins of the present invention can be used to treat subjects with a wide variety of diseases and conditions. Included are subjects with hyperglycemia, insulin-dependent diabetes as well as subjects with non-insulin dependent diabetes, including treatment naive subjects as well as subjects being treated with oral medications, such as a sulfonylurea, metformin, thiazolidiiiedione such as pioglitazone, a-glucosidase inhibitor such as acarbose, and/or noninsulin injectables, including incretin-based therapies, such as DPP-4 inhibitors and G LP-1R agonists. The fusion proteins of the present invention may be used to regulate blood glucose in such patients, and may treat conditions or complications that result from insufficient blood glucose control such as retinopathy, neuropathy or kidney disease.

in certain embodiments, the fusion protein of the present invention is administered every day, every other day, twice weekly, thrice weekly, once weekly, twice monthly or once mon thly. In preferred embodiments, the duration of action is sufficiently extended to allow for once weekly dosing. Even for such long acting molecules, however, it will be recognized by those of skill in the art that effective glucose control may also be provided by gradually dose accumulating a drug with a long pharmacokinetic profile using a more frequent treatment regimen, such as once-daily. See, e.g., Heise T and Meneghini LF, 20 ENDOCR. PRACT. p75-83 (2014).

In certain embodiments, the fusion protein of the present invention is administered in combination with an additional active ingredient, such as insulin or an insulin analog, an incretin-based therapy, or an oral diabetes medication, such as a sulfonylurea, metformin, thiazolidinedione such as pioglitazone or an a-glucosidase inhibitor such as acarbose.

The term "incretin-based therapy" includes any treatment which comprises administration of, or promotes, enables, enhances and/or simulates the effects of, a group of metabolic hormones known as incretins, which group includes GLP-1 and gastric inhibitory peptide (GIP). Incretin-based therapies which are currently available include DPP-4 inhibitors and GLP-1 R agonists.

A "DPP-4 inhibitor" is a compound that blocks the DPP-4 enzyme, which is responsible for the degradation of incretins. Currently available DPP-4 inhibitors include sitagliptin (Januvia®), and linagliptin (Tradjenta®).

A "GLP-1 R agonist" is defined as a compound comprising the amino acid sequence of native human GLP-1 (SEQ ID NO:25), a GLP-1 analog, GLP- 1 derivative or GLP-1 fusion protein, which maintains activity at the GLP-1 receptor. GLP-1 R activity may be measured by methods known in the art, including using in vivo experiments and in vitro assays that measure GLP-1 receptor binding activity or receptor activation, e.g., assays employing pancreatic islet cells or insulinoma cells, as described in EP 619,322 and U.S. Patent No. 5,120,712, respectively. A GLP-1 analog is a molecule having a modification including one or more amino acid substitutions, deletions, inversions, or additions when compared with the amino acid sequence of native human GLP-1 (SEQ ID NO:25). A GLP-1 derivative is a molecule having the amino acid sequence of native human GLP-1 (SEQ ID NO:25) or of a GLP-1 analog, but additionally having at least one chemical modification of one or more of its amino acid side groups, α-carbon atoms, terminal amino group, or terminal carboxylic acid group. A GLP-1 fusion protein is a heterologous protein comprising GLP-1, a GLP-1 analog or a GLP-1 derivative portion and a second polypeptide. Currently available GLP-1 R agonists include exenatide (Byetta® and Bydureon®), liraglutide (Victoza®), albiglutide (Tanzeum®) and duiaglutide (Trulicity®), the structures of which are known in the art. See, e.g., US 5,424,286 (exenatide); US 6,268,343 (liraglutide); US 2014044717 (albiglutide); and US 7,452,966 (duiaglutide).

In embodiments wherein a fu sion protein of the present invention is provided in combination with an additional active ingredient, the fusion protein and additional active ingredient may be administered simultaneously, sequentially or in a single, combined formulation.

The fusion proteins of the present invention are effective in treating such diseases and conditions by administering to a patient in need thereof a therapeutically effective amount of a fusion protein of the present invention. As used herein, the phrase

"therapeutically effective amount" refers to that amount of a fusion protein of the present invention sufficient to regulate blood glucose in a patient without causing unacceptable side effects. A therapeutically effective amount of the fusion protein administered to a subject will depend on the type and severity of the disease and on the characteristics of the subject, such as general health, age, sex, body weight, and tolerance to drugs. The skilled artisan will be able to determine appropriate dosages depending on these and other factors. In certain embodiments, a therapeutically effective amount of a fusion protein of the present invention when administered once weekly ranges from about 0.01 nmol/kg to about 100 nmol/kg. More preferably, a therapeutically effective amount of a fusion protem of the present invention when administered once weekly ranges from about 1 nmol/kg to about 50 nmol/kg. More preferably, a therapeutically effective amount of a fusion protein of the present invention when administered once weekly ranges from about 16 nmol/kg to about 25 nmol/kg. In certain embodiments, a therapeutically effective amount of a fusion protein of the present invention when administered once weekly ranges from about 1 mg to about 200 mg. More preferably, a therapeutically effective amount of a fusion protein of the present invention when administered once weekly ranges from about 25 mg to about 175 mg. More preferably, a therapeutically effective amount of a fusion protein of the present invention when administered once weekly ranges from about 100 mg to about 160 mg.

Persons of skill in the art will understand that when the fusion protein of the present invention is administered in combination with another active ingredient, such as a GLP-1 R agonist, the dose may be adjusted so that the activity of the two treatments combined is sufficient to regulate blood glucose in a patient. Thus, the amount ef fusion protein that must be administered to regulate blood glucose levels in such combinations may be less than would be required if the fusion protein were administered as a monotherapy. For example, when the fusion protein of the present invention is provided in combination with a GLP- 1 R agonist, the amount of fusion protein to be provided in a once weekly dose may be reduced by up to 50%, as compared to the amount of the same fusion protein for use as a monotherapy, such as the doses described in the preceding paragraph.

Preferably, administration of a therapeutically effective amount of the fusion protein of the present invention in some embodiments will provide effective glucose control while reducing the risk of hypoglycemia and/or weight gain as compared to existing treatments. The incidence of hypoglycemia caused by a diabetes therapy which agonizes the insulin receptor may be minimized by avoiding a rapid spike in the concentration of the therapeutic agent following administration. The fusion proteins of the present invention have an extended time action profile without a rapid spike in concentration following administration.

Moreover, in embodiments wherein the fusion proteins of the present invention are provided in combination with another active ingredient, in particular, a GLP-IR agonist, the hepatopreierential activity of the fusion proteins of the present invention may also reduce the risk of hypoglycemia while controlling glucose levels. Because the fusion protein of the present invention can readily access the liver through the fenestrated sinusoid endothelium, the molecule can control hepatic glucose output, with little, if any, activity at the periphery, while the GLP-IR agonist promotes glucose-dependent secretion of endogenous insulin from the pancreas that is readily capable of perfusing into the periphery to control glucose uptake in muscle and fat. The fusion proteins of the present invention are administered parenterally, by- nasal administration or pulmonary inhalation. Parenteral administration is preferred, and can include, for example, systemic administration, such as by intramuscular, intravenous, subcutaneous, or intraperitoneal injection.

The fusion proteins can be administered to the subject in a pharmaceutical composition, which comprises a fusion protein of the present invention and at least one pharmaceutically acceptable excipient. Such pharmaceutical compositions are typically, though not necessarily, parenteral in nature and may be prepared by any of a variety of techniques using conventional excipients for parenteral products, which are well known in the art. See, e.g.. Remington: the Science and Practice of Pharmacy (D.B. Troy, Editor, 21 st Edition, Lippincott, Williams & Wilkins, 2006). As described above, the fusion protein of the present invention is a homodimer when expressed in mammalian cells. Thus, when used herein, the term "composition comprising a fusion protein" includes a composition, which contains a homodimer of a fusion protein. In certain embodiments, a pharmaceutical composition of the present invention includes a composition with the fusion protein of the present invention present in a concentration of at least 1 mg/mL-, at least 2 mg/mL, at least 5 mg/mL, at least 10 mg/mL, at least 20 mg/mL, at least 25 mg/mL, at least 30 mg/mL, at least 35 mg/mL, at least 50 mg/mL, at least 55 mg/mL, at least 50 mg/mL, at least 65 mg/mL, at least 75 mg/mL, at least 100 mg/mL or greater. In preferred embodiments, the fusion protein is present in a concentration of 10-100 mg/mL. In more preferred embodiments, the fusion protein is present in a concentration of 15-75 mg/mL, and in most preferred embodiments, the fusion protein is present in a

concentration of 20-65 mg/mL.

The term "excipient" means any substance added to the composition other than the fusion protein or any other additional active ingredient(s). Examples of such excipients that may be used in the compositions of the present invention include buffering agents, surfactants, isotonicity agents and preservatives.

In certain embodiments, the composition of the present invention includes one or more buffering agents to control the pH of the composition. A "buffering agent" is a substance which resists changes in pH by the action of its acid-base conjugate components. In certain embodiments, the composition of the present invention has a pH from about 5.5 to about 8.0, preferably, between about 6.0 and about 7.4, more preferably between about 6.0 and 6.75. Buffering agents suitable for controlling the pH of the compositions of the present invention in the desired range include, but are not limited to agents such as phosphate, acetate, citrate, or acids thereof, arginine, TRIS, and histidine buffers, as well as combinations thereof. "TRIS" refers to 2-amino-2-hydroxymethyl- 1,3,-propanediol, and to any pharmacologically acceptable salt thereof. The free base and the hydrochloride form (i.e., TRIS-HCl) are two common forms of TRIS. TRIS is also known in the art as trimethylol aminomethane, tromethamine, and tris(hydroxymethyl) aminomethane. Preferred buffering agents in the composition of the present invention are citrate, or citric acid, and phosphate.

In certain embodiments, the compositions of the present invention include one or more isotonicity agents to minimize pain upon injection due to cellular swelling or cellular rupture. An "isotonicity agent" is a compound that is physiologically tolerated and imparts a suitable tonicity to a formulation to prevent the net flow of water across cell membranes that are in contact with an administered pharmaceutical composition. Known isotonicity agents include glycerol, salts such as sodium chloride, and monosaccharides including, but not limited to, mannitol, dextrose and lactose. Preferred isotonicity agent(s) are mannitol and sodium chloride.

In certain embodiments, the compositions of the present invention include a surfactant. A "surfactant" is a substance that lo wers the surface tension of a liquid.

Examples of surfactants used in pharmaceutical compositions and which may be used in certain compositions of the present invention include polysorbate 20, polysorbate 80, polyethylene glycols (e.g., PEG 400, PEG 3000, TRITON X-100), polyethylene glycol alkyl ethers (e.g., BRIJ), polypropylene glycols, block copolymers (e.g., poloxamer, PLURONIC F68; poloxamer 407, PLURONIC F127; TETRONICS), sorbitan alkyl esters (e.g., SPAN), polyethoxylated castor oil (e.g., KOLLIPHOR, CREMOPHOR), and trehalose.

The pharmaceutical compositions of the present invention may also contain a preservative. The term "preservative" refers to a compound added to a pharmaceutical formulation to act as an anti-microbial agent in multi-use and/or titratable compositions. Among preservatives known in the art as being effective and acceptable in parenteral formulations are benzalkonium chloride, benzethonium, chiorohexidine, phenol, m- cresol, benzyl alcohol, methyl- or propylparaben, chlorobutanol, o-cresol, p-cresol, chlorocresol, phenylmercuric nitrate, thimerosal, benzoic acid, and various mixtures thereof. Phenolic preservative includes the compounds phenol, m-cresol, o-cresol, p- cresol, chlorocresol, methylparaben, and mixtures thereof If a preservative is necessary, the preservative used in compositions of the present invention is preferably a phenolic preservati ve, preferably either m-cresol or phenol.

Certain phenolic preservatives, such as phenol and m-cresol, are known to bind to insulin and insulin hexamers and thereby stabilize a conformational change that increases either physical or chemical stability, or both. In compositions comprising other proteins, however, such preservatives may contribute to the formation of protein aggregates, or high molecular weight polymers (HMWP). See, e.g., Maa YF and Hsu CC, Int J Pharm 140: 155-168 (1996); Fransson J, et al, Pharm. Res., 14: 606-612 (1997); Lam XM, et al, Pharm. Res., 14: 725-729 (1997); Remmele RL Jr, et al, Pharm Res 15: 200-208. (1998); Thiramangalathu R, et al., J Pharm Sci 95: 1480-1497 (2006), Such protein aggregates in therapeutic formulations are undesirable due to their tendency to induce an immune response.

In certain preferred embodiments, the amino acid sequence of the fusion protein of the present in vention is optimized to enhance the physical stability of the fusion protein in compositions which also contain a phenolic preservative. For example, the present inventors surprisingly discovered that the presence of an amino acid mutation at position 10 in the analog of the insulin A-chain (variable X= in the analog of the insulin A-chain recited in SEQ ID NO:2) reduces the accumulation of aggregates of the fusion protein in the presence of phenolic presen'atives, as shown in the stability studies below.

The invention is further illustrated by the following examples, which are not to be construed as limiting.

Expression and Purification of Fusion Proteins Fusion proteins of the present invention are produced in a mammalian cell expression system using the CHO glutamine synthetase (GS) knockout (GSKO) cell line. The GS gene knockout enables tightened selection stringency by eliminating endogenous GS background activity which can allow survival of low- or non-productive cells under selection conditions. Genes coding for fusion proteins are sub-cloned into the glutamine synthetase (GS) containing expression plasmid. The cDNA sequence encoding the fusion proteins is fused in frame with the coding sequence of a signal peptide which enhances secretion of the fusion protein into the cell culture medium. The expression is driven by the cytomegalovirus (CMV) promoter. CHO GSKO cells are stably transfected using electroporation and the appropriate amount of recombinant expression plasmid.

Transfected cells undergo bulk selection in glutamine-free media. Transfected pools are plated at low density to allow for close-to-clonal outgrowth of stable expressing cells. The masterwells are screened for fusion protein expression and scaled-up in serum- free suspension cultures to be used for production.

Fusion proteins secreted into the media may be purified by Protein A affinity chromatography followed by size exclusion chromatography following standard chromatographic techniques. Briefly, fusion proteins from clarified media are captured by Mab Select Protein A (GE) that has been equilibrated with phosphate buffered saline pH 7.4. Following a wash step with phosphate buffered saline pH 7.4, bound fusion proteins are eluted with 10 mM citric acid pH 3.0. Fractions containing fusion protein are pooled and neutralized by adding 1/10 volume of 1M Tris pFI 8.0. Soluble aggregates and mul timers may be effectively removed by common techniques, including size exclusion, hydrophobic interaction or ion exchange chromatography. Fractions containing monomeric fusion protein (covalently linked homodimer), as determined by size exclusion chromatography, are pooled, sterile filtered, and stored.

Amino acid sequences of exemplary' fusion proteins of the present invention are shown below:

Example 1 :

10 2Q 30 40 50 60

FVNQHLCGSHLVEALELVCGERGFHYGGGGGGSGGGGGIVEQCCTSTCSLDQLENYC GGG

Example 10

Test-lots of Examples 1-3, 5 and 9 are prepared in phosphate buffered saline (PBS, pH 7.4) or 10 mM citrate buffer (pH 6.5) and stored at 4 °C. Biosynthetic human insulin (Eli Lilly and Company) is prepared in 0.01 N HC1 and stored as frozen aliquots, or prepared in a formulated mixture containing m-cresol, zinc, sodium chloride, and TRIS buffer (pH 7.3) at 100 units/mL and stored at 4 °C.

Affinities of fusion proteins are determined in receptor binding assays performed on PI membranes prepared from stably-transfected 293EBNA cells (293HEK human embryonic kidney cells expressing EBNA-1 ) over-expressing either human insulin receptor isoform A (MR- A) or human insulin receptor isoform B (hIR-B) containing a C9 epitope tag at the C-terminus. Binding affinities are determined from a competitive radioligand binding assay performed at steady-state using human recombinant (3-[ 125 I]- iodotyrosyl-A 14 )-insulin. Values for test samples are calculated as percent relative to the activity of unlabeled human insulin. IC 5 0 values are determined from 4-parameter logistic non-linear regression analysis (XLFit version 4.0, IDBS). If necessary, curve top or bottom parameters are set to 100 or 0, respectively.

The affinity constant (K i ) is calculated from the IC 5 0 value based upon the equation K i = IC 50 / (1 + D /K4) where D equals the concentration of radio-ligand used in the experiment and K4 equals the equilibrium binding affinity constant of the radio- ligand for its cognate receptor determined from saturation binding analysis (hIR-A === 0.205 TiM; hIR-B = 0.251 nM). Reported values for K i are shown as geometric mean ± the standard error of the mean (SEM), with the number of replicate determinations indicated by "n" (Table 1).

The exemplified fusion proteins have affinity at both hIR-A and hlR-B.

Compared to human insulin, the exemplified fusion proteins sho w reduced binding affinity to hIR-A and hIR-B (Table 1).

Ki values are geometric means. SEM is standard error of the mean, n is the number of observations.

In Vivo Studies

Effects of Fusion Proteins are investigated in STZ-treated rat diabetes model. Male Sprague-Dawley rats, 400-425 gram body weight, are anesthetized with isoflurane and given a single injection of Zanosar ® (STZ item # 89256, Teva Parenteral Medicines, 40 mg/kg IV). The rats are used in studies 3 days after injection of Zanosar*'; only animals with non-fasted blood glucose between 400-550 mg/dL are used in these studies.

The rats are distributed into groups to provide comparable variance in blood glucose and body weight and then randomized. Blood glucose is measured using Accucheck Aviva glucometer (Roche). STZ-treated rats are given a single subcutaneous (SC) injection of 30 nmol/kg dose.

Blood samples for glucose measurements are collected by tail bleed. Animals have free access to food and water throughout the experiment. Blood glucose data are provided in Figure 1. Data shown in Figure 1 are mean ± SEM (n==6 for Examples 1 and

8; n=3 for remaining examples). Blood glucose data for time points during the initial feeding period (between 0 and 24 hours) are collected, but are not included in Figure 1 for ease of visual representation. As shown in Figure 1 , Examples 1 -10 each provide glucose lowering for a prolonged period of time.

Pharmacokinetic properties of Examples 1, 3-6 and 9-10 are also characterized following subcutaneous (SC) dosing in STZ- treated rats as described above. Data are generated using an insulin receptor ELISA assay that requires the presence of insulin that is capable of binding the insulin receptor. The insulin receptor ELISA utilizes the human insulin receptor (R&D Systems#1544-IR/CF aa28-944) as capture. The human insulin receptor is attached to an Immunlon 4 HBX plate by mouse anti-6x HisTag antibody (Novagen 70796). Fusion protein standard curve and samples are diluted in 100% rat K3 EDTA plasma and detected by mouse anti-human IgG Fc horseradish peroxidase (SouthernBiotech 9040-05). Concentrations of Examples 3-6 and 9-10 at 336 hour time point are all between 22.1 ± 9.8 mg/mL and 1498 ± 690 mg/mL. Table 3 shows concentrations of Example 1 over time, and Table 4 shows pharmacokinetic parameters following non-compartmental analysis of the data for Example 1. The data support an extended duration of bioavailability for fusion proteins of the present invention.

Table 3

Data represent mean and standard deviation of n=3.

Table 4

Data represent mean and standard deviation of n==3. Abbreviations: AUCo -∞ - area under the curve from 0 to infinity, C max - maximal concentration, T max - time at maximum concentration, CL - clearance, F - bioavailability, tl/2 - half-life.

Euglycemic Clampp Study in Normal Rats

A euglycemic clamp study in male Sprague-Dawley rats is performed to understand overall in vivo activity of Example 1 on glucose utilization and to determine activity of Example 1 in liver versus peripheral tissues. A bolus/continuous infusion of 3- 3H-glucose is initiated in chronically catheterized, overnight- fasted rats to determine endogenous glucose production (EGP) under basal conditions. A bolus/continuous intravenous infusion of test article -7nmol/kg [bolus] and lnmol/kg/hr [continuous rate] - or insulin lispro comparator - [no bolus] and 0.75 mU/lcg/min [continuous rate] - is then administered and a variable intravenous infusion of 20% glucose is initiated and periodically adjusted to maintain blood glucose concentration at 100-110 mg/dL.

Bolus/infusion rates are selected to achieve comparable glucose infusion rates (GIR) and comparable suppression of EGP under euglycemic clamp conditions in each group.

Somatostatin is administered to inhibit endogenous insulin secretion. Arterial blood samples are obtained during the experiment to monitor hematocrit, plasma insulin, and free fatty acids, C-peptide, and basal and clamp EGP. At the end of the experiment, 2-[l- 1 4 C]-deoxy glucose is administered to measure tissue glucose uptake under steady state glucose concentrations. Under these matched conditions, peripheral activity of the test article and comparators is analyzed as glucose uptake in the soleus muscle and suppression of plasma free fatty acids (FFA). All data are analyzed using a one-way Analysis of Variance with Dunnett's post-hoc test using the insulin lispro group as the control comparator.

Bolus and infusion doses of Example 1 result in steady plasma concentration of

274 ± 57 nM over the course of the clamp experiment. As shown in Table 5, both groups of animals achieve comparable GIR during the clamp phase of the experiment. Average blood glucose in the Example 1 group is slightly higher than the insulin lispro group. Both groups have similar basal and clamped EGP rates (Table 5). Furthermore, the percent change from basal EGP in the Example 1 group is comparable to the insulin lispro control group (Table 5). To assess peripheral activity under equivalently clamped conditions, suppression of plasma FFA and muscle glucose uptake are measured.

Whereas insulin lispro resulted in a reduction in plasma FFA levels over the course of the clamp experiment. Example 1 did not. (Table 5). In addition, infusion of Example 1 results in a 33% decrease in glucose uptake in the soleus muscle compared to insulin lispro. (Tabie S). Collectively, these data indicate Example 1 displays reduced peripheral activity compared to insulin lispro.

Table 5:

Values are displayed as mean ± SEM of 13 animals for insulin lispro and 17 animals for Example 1. Rg = glucose metabolic index. Statistical analysis was completed by one-way ANOVA followed by Dunnett's post-hoc test. * = Significantly different from insulin lispro (p<0.05).

In vivo Efficacy in Cynomolgus Monkeys

The pharmacokinetic (PK) parameters of Example 1 are evaluated following a single intravenous dose of 1.5 nmol/kg and a single subcutaneous dose of 3 nmol/kg in cynomolgus monkeys. Plasma samples for the PK analysis are collected from three animals per group/route, over the course of 3 weeks. Two assays are utilized for the PK analysis; insulin receptor ELISA and total IgG Fc ELISA. The insulin receptor ELISA utilizes the human insulin receptor (R&D Systems 1544-IR/CF) as capture. The human insulin receptor is attached to an Immunlon 4 HBX plate by mouse anti- HisTag antibody (Novagen 70796). Example 1 standard curve and samples are diluted in 100% cynomolgus monkey plasma (anticoagulant was K3 EDTA) and detected by mouse anti- human IgG Fc horseradish peroxidase (SouthernBiotech 9040-05). The total IgG 1:1. I SA utilizes the anti-human lgG 2 (Abeam ab1933) as the capture reagent. Example 1 is diluted in 100% cynomolgus monkey plasma and the detection antibody is the same as in the insulin receptor ELISA. Results from both assays are shown in Table 6, and the associated PK parameters are shown in Table 7. Example 1 shows complete

bioavailability in monkeys, and the active insulin assay and the total Fc assay give similar results.

Data represent mean and standard deviation from n=3. Abbreviations: IV = intravenous; SD = standard deviation; NC = not calculated

Table 7. PK Parameters derived from Non-compartmental Analysis of Data in Table 6.

Data represent mean and standard deviation from n=3. Abbreviations: AUCo-inf- under the curve from 0 to infinity, C max - maximal concentration (for IV administration C max is extrapolated concentration at time 0), T max - time at maximum concentration, CL - clearance , F - bioavailability, t l/2 - half-life, V ss - steady-state volume of distribution, NA - not applicable.

Non-preserved 65 mg/mL Formulation of Example 1

Example 1 is formulated at 65 mg/mL in 10 mM citrate, 46.4 mg/mL mannitol, 0.02% polysorbate 80, pH 6.5 and stored at 30 °C. Samples are analyzed for percent high molecular weight by size exclusion chromatography (SEC) at 0, 2, and 4 weeks by injecting 1 μΤ of the 65 mg/mL sample. Analytical SEC is performed on an Agilent 1 100 system equipped with a TSKgel SuperSW3000 (Tosoh Bioscience) column and 50 mM sodium phosphate, 300 mM NaCl, pH 7.0 mobile phase flowing at 0.4 mL/min for 15 minutes. Peaks are detected at an absorbance of 280 nm and chromatograms are analyzed using ChemStation software. Percent high molecular weight at time zero is 1.13%, at time two weeks is 1.68%, and at time four weeks is 1.74%. These results support stability of Example 1 at 65 mg/mL concentration with minimal growth of soluble aggregate after 4 weeks at 30 °C.

Preserved and Non-preserved 1 mg/mL Formulations of Examples 1 and 3 Examples 1 and 3 are formulated at 1 mg/mL in 10 mM citrate, pH 6.5 in the presence or absence of 30 mM m-Cresol and stored at 30 °C. Samples are analyzed for percent high molecular weight by size exclusion chromatography (SEC) at time zero and 2 weeks by injecting 10 μL of the 1 mg/mL sample. Analytical SEC is performed on an Agilent 1100 system equipped with a TSKgel G3000SWxl (Tosoh Bioscience) column and PBS + 350 mM NaCl, pH 7,4 mobile phase flowing at 0,5 mL/min for 35 minutes or 45 minutes for samples in the absence or presence of m-cresol, respectively. Peaks are detected at an absorbance of 214 nm and chromatograms are analyzed using ChemStation software. For Example 3, without and with m-cresol, percent high molecular weight at time zero is 0.2% and 3.06%, respectively. Percent high molecular weight at 2 weeks without and with m-cresol was 0.2% and 1 .73%, respectively. These results show an immediate increase in soluble aggregate after addition of m-cresol for Example 3 at time zero. For Example 1, in the absence and presence of m-cresol, percent high molecular weight at time zero was 0,16% and 0.15%, respectively. Percent high molecular weight at 2 weeks in the absence and presence of m-cresol was 0.18% and 0.31%, respectively. These results demonstrate stability, in the presence of preservative, of Example 1, which includes modification of the amino acid at position 10 in the analog of the insulin A-chain (variable X 1 in SEQ ID NO:2, Z 3 in the first fusion protein described above) to a T residue, relative to Example 3, which mcludes the native I residue at that position, but which otherwise comprises the same insulin receptor agonist amino acid sequence as Example 1 .

X 1 X 2 X 3 QHLCGSHLVEALX 4 LV CGERGFX 5 YX 5 X 7 X 8 X 9

wherein X 1 is F, Q or A; X 2 is V or G; X 3 is N, K, D, G, Q, A or E; X 4 is E, Y, Q, or H; X 5 is H or F; X 6 is G, T, S, H, V or is absent; X 7 is G, E, P, K, D, S, H or is absent; X 8 is G, E, K, P, Q, D, H or is absent; X 9 is G, T, S, E, K, A or is absent, provided that the insulin B-chain analog includes at least one modification from the amino acid sequence of the B-chain of a molecule of human insulin at ¾, X 5 , X 6 , X 7 , X 8 , or X 9

G IVEQCCTSX 1 CSLX 2 QLENYCX 3 X 4

X 1 is T or i; X 2 is D, Y, Q or E; X 3 is G, N, S or A; and X 4 is any naturally occurring amino acid, or is absent, provided that if X 3 is N, then X 4 must be an amino acid other than G or N

X 1 GX 2 GGGG, wherein X 1 is G or is absent; and X 2 is G, S or is absent

X 1 X 2 X 3 QHLCGSHLVEALX 4 LVCGERGFX 5 YX 6 X 7 X 8 X 9 X 1 0 G X 11 GGGGGIVEQCCTSX 12 CSL

X 13 QLENYCX 1 4 X 1 5

wherein X 1 is F, Q or A; X 2 is V or G; X 3 is N, K, D, G, Q, A or E; X 4 is E, Y, Q, or H; X 5 is H or F; X 6 is G, T, S, H, V or is absent; X 7 is G, E, P, K, D, S, H or is absent; X 8 is G, E, K, P, Q, D, H or is absent; X 9 is G, T, S, E, K, A or is absent; X t0 is G or is absent; X 11 is G, S or is absent; X 1 2 is T or I; X 13 is D, Y, Q or E; X 14 is G, N, S or A; and X 15 is any naturally occurring amino acid, or is absent, provided that at least one of X 4 , X 5 , X 6 , X 7 , X 8 , or X 9 must be a different amino acid than that found, respectively, at position B 16 , B 25 , B 27 , B 2 s, 6 29 or B 31 ) of the B-chain of a molecule of human insu lin, and further provided that if X 14 is N, then X 15 must be an amino acid other than G or N SEQ IP NO;5 - Insulin receptor agonist

FVNQHLCGSHLVEALELVCGERGFHY GGGGGGSGGGGGIVEQCCT STCSLDQLENYCG

GGGGX 1 GGGGX 2 GGGGX 3 GGGGX 4 X 5 X 6

wherein X 1 is Q or E; X 2 is Q or E; X 3 is Q or E; X 4 is G, E, Q or is absent; X 5 is G or absent; and X 6 is G or is absent

SEQ ID NO:7 - Seeond peptide linker

GGGGQGGGGQGGGGQGGGGG

SEP ID NQsS - Human IgG Fc region

CPPCPAPELLGGPS VX 1 LX 2 PPKPKDTLMISRTPEVTCX 3 VX 4 DVSHEDPEVKFNWY VDGVEVHNAKTKPREEQYX 5 STYRVVSVLTVLHQDWLNGKEYKCKVSNKALPA PIEKTISKAKGQPREPQVYTLPPSRDELTKNQVSLTCLVKGFYPSDTAVEWESNGQ

PENNYKTTPPVLDSDGSFFLYSKI;TVDKSRWQQGWFSCSVMHEALHNHYTQKS

LSLSPGX 6

wherein X 1 is F, Q or E; X 2 is F, Q or E; X 3 is V or T; X 4 is V or T; X 5 is N, D or Q; and X- 6 is K or is absent

SEQ ID NO:9 - Human IgG Fc region

PCPPCPAPEAAGGPSYX 1 LX 2 PPKPKDTLMISRTPEVTCX 3 VX 4 DVSQEDPEVQFNW YVDGVEVHNAKTKPREEQFX 5 STYRVVSVLTVLHQDWLNGKEYKCKVSNKGLP SSIE^SKAKGQPREPQVYTLPPSQEEMTKNQVSLTCLVKGFYPSDIAVEWESNG QPENNYKTTPPVLDSDGSFFLYSX 6 LTVDKSRWQEGNVFSCSVMHEALHNHYTQ KSLSLSLGX 7

wherein X 1 is F, Q or E; X 2 F, Q or E; X 3 is V or T; X 4 is V or T; X 5 is N, D or Q; X 6 is R, or K; X 7 is K or is absent SEQ ID NO: 10 - Human IgG Fc region ECPPCPAPPVAGPSVX:LX 2 PPKPKD^MISRTPEWCX 3 VX 4 DVST-ffiDPFiVQFNWY VDGVEVHNAKllCPREFX 3 FX 5 STFRVVSVLTVVHQDWi.NGKEYKCKVSNKGLPA PIEKTSSKTKGQPREPQ V YTLPPSREEMTKN Q VSLTCL VKGF YPSDIAVE WESNGQ PENNYKTTPPMLDSDGSFFLYSKLTVDKSRWQQGNVFSCSVMHEALHNHYTQK SLSLSPGX 6

wherein X 1 is F, Q or E; X 2 is F, Q or E; X 3 is V or T; X 4 is V or T; X 5 is N, t) or Q; and X 6 is K or absent

SEQ ID NO: 11 - Fusion protein

X 1 X 2 X 3 QHLCGSHLVEALX 4 LV CGERGFX 5 YX 5 X 7 X 8 X 9 X 10 GX 1 1 GGGGGIVEQCCTSX 1 2 CSL

X 1 3 QLENYCX 1 4X 1 5 GGGGX 1 6 GGGGX 1 7GGGGX 1 8 GGGGX 1 9X 20 X 21 X 22 CPPCPAPX 23 X 24 AGX 25 PSVFLFPPKPKDTLMI SRT PEVTCVWDVSX 26 EDPEVQFNWYVDGVEVHNAKTKPREE QFNS TX 27 RVVSVLTVX 28 HQDWLNGKEYKCKVSNKGLPX 29 X 30 IEKT I SKX 31 KGQPREPQ VYTLPPSX 32 EEMTKNQVS LTCLVKGFYPSDIAVEWE SNGQPENNYKTTPPX 33 LDSDGSF FLYSX 34 LTVDKSRWQX 35 GNVFSCSVMHEALHNHYTQKS LSLSX 36 G

wherein X 1 is F, Q or A; X 2 is V or G; X 3 is N, K, L), G, Q, A or E; X 4 is E, Y, Q, or H; X 5 is H or F; X 6 is G, T, S, H, V or is absent; X 7 is G, E, P, K, D, S, H or is absent; X 8 is G, E, K, P, Q, D, H or is absent; X 9 is G, T, S, E, K, A or is absent, provided that at least one of X 4 , X 5 , X- 6 , X 7 , X ¾ or X 9 is an amino acid other than that which is present, respectively, at position B^, B 25 , B 2 7, B 28 , B 2 9 or B 3 0 of a human insulin B-chain; X 10 is G or is absent; Xn is G, S or is absent; X 12 is T or I; X 13 is D, Y, Q or E; X 14 is G, N, S or A; X1 5 is any naturally occurring amino acid, or is absent, provided that if XM is N, then X 15 must be an amino acid other than G or N; X 1 6 is Q or E; X 17 is Q or E; X 18 is Q or E; X 1y is G, E, Q or is absent; X 20 is G or absent; X 21 is G or is absent; X 22 is E or P; X 23 is E or P; X 24 is A or V; Χ 25 is G or is absent; X 26 is Q or H; X 27 is Y or F; X 28 is L or V; X 29 is S or A; X 30 is S or P; X 31 is A or T; X 32 is Q or R; X 33 is V or M; X 34 is R or K; X 35 is E or

Q; and X 36 is L or P

SEQ ID NO: 12 - Fusion protem

10 20 30 4 0 50 60

FVNQHLCGSHLVEALELVCGERGFHYGGGGGGSGGGGGIVEQCCTSTCSLDQLENYC GGG

SEQ ID NO: 14 - Human insulin B-chain

SEQ ID NO: 16 - Fusion protein

SEP ID NO: 19 - Fusion protein

SEQ ID NO:20 - Fusion protein LG

SEP ID ΝΡ:2ΐ - Fusion protein

SEQ ID NO:23 - Fusion protein

SEP IP NP:24 - Fusion protein

SEP ID NO:25 - GLP-1