INFLUENZA VIRUS VACCINES AND USES THEREOF

Title:

INFLUENZA VIRUS VACCINES AND USES THEREOF

Document Type and Number:

WIPO Patent Application WO/2016/005480

Kind Code:

Abstract:

Provided herein are influenza hemagglutinin stem domain polypeptides, methods for providing hemagglutinin stem domain polypeptides, compositions comprising the same, vaccines comprising the same and methods of their use, in particular in the 5 detection, prevention and/or treatment of influenza

More Like This:

WO/2024/018364	ENVELOPED VIRUS LIKE PARTICLES COMPRISING SARS-COV-2 S PROTEIN
JP6141228	The particles like a chimera influenza virus containing hemagglutinin
JP2006513273	Compositions and Methods for Preventing or Treating Viral Infections

Inventors:

IMPAGLIAZZO ANTONIETTA (NL)
MEIJBERG JAN WILEM (NL)
RADOSEVIC KATARINA (NL)
WAGNER MICHELLE (US)
DING ZHAOQING (US)

Application Number:

PCT/EP2015/065661

Publication Date:

January 14, 2016

Filing Date:

July 09, 2015

Export Citation:

Click for automatic bibliography generation Help

Assignee:

CRUCELL HOLLAND BV (NL)

International Classes:

C07K14/11; A61K39/145

Domestic Patent References:

WO2013079473A1	2013-06-06
WO2014191435A1	2014-12-04

Other References:

V. V. A. MALLAJOSYULA ET AL: "Influenza hemagglutinin stem-fragment immunogen elicits broadly neutralizing antibodies and confers heterologous protection", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, vol. 111, no. 25, 24 June 2014 (2014-06-24), pages E2514 - E2523, XP055158737, ISSN: 0027-8424, DOI: 10.1073/pnas.1402766111
GAYATHRI BOMMAKANTI ET AL: "Design of an HA2-based Escherichia coli expressed influenza immunogen that protects mice from pathogenic challenge", NATIONAL ACADEMY OF SCIENCES. PROCEEDINGS, NATIONAL ACADEMY OF SCIENCES, UNITED STATES, vol. 107, no. 31, 3 August 2010 (2010-08-03), pages 13701 - 13706, XP002675041, ISSN: 1091-6490, [retrieved on 20100706], DOI: 10.1073/PNAS.1007465107
Y. LU ET AL: "Production and stabilization of the trimeric influenza hemagglutinin stem domain for potentially broadly protective influenza vaccines", PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES, 16 December 2013 (2013-12-16), XP055095011, ISSN: 0027-8424, DOI: 10.1073/pnas.1308701110
EKIERT DAMIAN C ET AL: "Antibody Recognition of a Highly Conserved Influenza Virus Epitope", SCIENCE, AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE, US, vol. 324, no. 5924, 1 April 2009 (2009-04-01), pages 246 - 251, XP009144786, ISSN: 0036-8075

Attorney, Agent or Firm:

MANTEN, Annemieke et al. (CN Leiden, NL)

Download PDF:

View/Download PDF PDF Help

Claims:

CLAIMS

1. An influenza hemagglutinin stem domain polypeptide comprising:

(a) an influenza hemagglutinin HAl domain that comprises an HAl N- terminal stem segment, covalently linked by a linking sequence of 0-50 amino acid residues to an HAl C- terminal stem segment, said HAl C-terminal segment being linked to

(b) an influenza hemagglutinin HA2 domain, wherein the HAl and HA2 domain are derived from an influenza A virus subtype comprising HA of the H 1 subtype;

(d) wherein said HAl N-terminal segment comprises the amino acids 1-x of HAl, preferably the amino acids p-x of HAl, and wherein the HAl C- terminal stem segment comprises the amino acids y- C-terminal amino acid of HAl, wherein x= the amino acid on position 52 of SEQ ID NO: 1 (or an equivalent position in hemagglutinin of another influenza virus strain), p= the amino acid on position 18 of

SEQ ID NO: 1 (or an equivalent position in hemagglutinin of another influenza virus) and y=the amino acid on position 320 of SEQ ID NO: 1 (or an equivalent position in another hemagglutinin);

(e) wherein the region comprising the amino acid residues 402-418 comprises the amino acid sequence XiNTQX₂TAX₃GKEX₄N(H/K)X₅E(K/R) (SEQ ID NO: 8), wherein:

Xi, is an amino acid selected from the group consisting of M, E, K, V, R and T, X₂ is an amino acid selected from the group consisting of F, I, N, T, H, L and Y, preferably I, L or Y,

X₃ is an amino acid selected from the group consisting of V, A, G, I, R, F and S, preferably A, I or F,

X4 , is an amino acid selected from the group consisting of F, I, N, S, T, Y, E, K, M, and V, preferably I, Y, M or V,

X_5i is an amino acid selected from the group consisting of L, H, I, N, R, preferably I; (f) wherein the amino acid residue on position 337 (HAl domain) is selected from the group consisting of: I, E, K, V, A, and T,

the amino acid residue on position 340 (HAl domain) is selected from the group consisting of: I, K, R, T, F, N, S and Y,

the amino acid residue on position 352 (HA2 domain) is selected from the group consisting of: D, V, Y, A, I, N, S, and T, and

the amino acid residue on position 353 (HA2 domain) is selected from the group consisting of: K, R, T, E, G, and V; and

(g) wherein the polypeptide further comprises a disulfide bridge between the amino acid on position 324 and the amino acid on position 436; and

(h) wherein the amino acid sequence RMKQIEDKIEEIESK (SEQ ID NO: 20) has been introduced at positions 419-433 or wherein sequence RMKQIEDKIEEIESKQK (SEQ ID NO: 21) has been introduced at position 417-433.

2. Polypeptide according to claim 1, wherein the HA2 domain has been truncated.

3. Polypeptide according to claim 1 or 2, wherein the C-terminal part of the HA2 domain from position 519 to the C-terminal amino acid has been deleted.

4. Polypeptide according to claim 1 or 2, wherein the C-terminal part of the HA2 domain from position 530 to the C-terminal amino acid has been deleted.

5. Polypeptide according to claim 3 or 4, wherein the C-terminal part of the HA2 domain has been replaced by the amino acid sequence

GYIPEAPRDGQAYVRKDGEWVLLSTFL (SEQ ID NO: 3), optionally connected through a linker.

6. Polypeptide according to any one of the preceding claims, wherein the C- terminal amino acid residue of the HAl C-terminal stem segment is any amino acid other than arginine (R) or lysine (K), preferably glutamine (Q).

7. Polypeptide according to any one of the preceding claims, wherein the polypeptide comprises one or more further mutations in the HAl domain and/or the HA2 domain.

8. Polypeptide according to any one of the preceding claims, wherein the

polypeptide selectively binds to the antibodies CR6261 and/or CR91 14.

9. Polypeptide according to any one of the preceding claims, wherein the

polypeptide does not bind to the antibodies CR8020 and/or CR8057.

10. Nucleic acid molecule encoding the polypeptide of any one of claims 1 to 9.

1 1. Vector comprising the nucleic acid molecule of claim 10.

12. Composition comprising the polypeptide of any one of claims 1 to 9 and/or a nucleic acid molecule according to claim 10.

13. Polypeptide according to any one of the claims 1-9, a nucleic acid molecule

according to claim 10, and/or a vector according to claim 11 , for use as a medicament.

14. Polypeptide according to any one of the claims 1-9, a nucleic acid molecule

according to claim 10, and/or a vector according to claim 11 , for use in inducing an immune response against influenza virus.

15. Polypeptide according to any one of the claims 1-9, a nucleic acid molecule

according to claim 10, and/or a vector according to claim 1 1 , for use as a vaccine.

Description:

INFLUENZA VIRUS VACCINES AND USES THEREOF

INTRODUCTION

The invention relates to the field of medicine. Provided herein are influenza hemagglutinin stem domain polypeptides, methods for providing hemagglutinin stem domain polypeptides, compositions comprising the same, vaccines comprising the same and methods of their use, in particular in the detection, prevention and/or treatment of influenza.

BACKGROUND

Influenza viruses are major human pathogens, causing a respiratory disease

(commonly referred to as "influenza" or "the flu") that ranges in severity from subclinical infection to primary viral pneumonia which can result in death. The clinical effects of infection vary with the virulence of the influenza strain and the exposure, history, age, and immune status of the host. Every year it is estimated that approximately 1 billion people worldwide undergo infection with influenza virus, leading to severe illness in 3-5 million cases and an estimated 300,000 to 500,000 of influenza related deaths. The bulk of these infections can be attributed to influenza A viruses carrying HI or H3 hemagglutinin subtypes, with a smaller contribution from Influenza B viruses, and therefore representatives of all three are included in the seasonal vaccine. The current immunization practice relies on early identification of circulating influenza viruses to allow for timely production of an effective seasonal influenza vaccine. Apart from the inherent difficulties in predicting the strains that will be dominant during the next season, antiviral resistance and immune escape also play a role in failure of current vaccines to prevent morbidity and mortality. In addition to this the possibility of a pandemic caused by a highly virulent viral strain originating from animal reservoirs and reassorted to increase human to human spread, poses a significant and realistic threat to global health.

Influenza A viruses are widely distributed in nature and can infect a variety of birds and mammals. Influenza viruses are enveloped RNA viruses that belong to the family of Orthomyxoviridae. Their genomes consist of eight single-stranded RNA segments that code for 11 different proteins, one nucleoprotein (NP), three polymerase proteins (PA, PBl, and PB2), two matrix proteins (Ml and M2), three non-structural proteins (NSl, NS2, and PBl- F2), and two external glycoproteins: hemagglutinin (HA) and neuraminidase (NA). The viruses are classified on the basis of differences in antigenic structure of the HA and NA proteins, with their different combinations representing unique virus subtypes that are further classified into specific influenza virus strains. Although all known subtypes can be found in birds, currently circulating human influenza A subtypes are H1N1 and H3N2. Phylogenetic analysis has demonstrated a subdivision of hemagglutinins into two main groups: inter alia the HI, H2, H5 and H9 subtypes in phylogenetic group 1 and inter alia the H3, H4 and H7 subtypes in phylogenetic group 2.

The influenza type B virus strains are strictly human. The antigenic variation in HA within the influenza type B virus strains is smaller than those observed within the type A strains. Two genetically and antigenically distinct lineages of influenza B virus are circulating in humans, as represented by the B/Y amagata/16/88 (also referred to as B/Yamagata) and B/Victoria/2/87 (B/Victoria) lineages (Ferguson et al., 2003). Although the spectrum of disease caused by influenza B viruses is generally milder than that caused by influenza A viruses, severe illness requiring hospitalization is still frequently observed with influenza B infection.

It is known that antibodies that neutralize the influenza virus are primarily directed against hemagglutinin (HA). Hemagglutinin or HA is a trimeric glycoprotein that is anchored to the viral coat and has a dual function: it is responsible for binding to the cell surface receptor sialic acid and, after uptake, it mediates the fusion of the viral and endosomal membrane leading to release of the viral RNA in the cytosol of the cell. HA comprises a large head domain and a smaller stem domain. Attachment to the viral membrane is mediated by a C-terminal anchoring sequence connected to the stem domain. The protein is post-translationally cleaved in a designated loop to yield two polypeptides, HA1 and HA2 (the full sequence is referred to as HA0). The membrane distal head region is mainly derived from HA1 and the membrane proximal stem region primarily from HA2 (FIG. 1).

The reason that the seasonal influenza vaccine must be updated every year is the large variability of the virus. In the hemagglutinin molecule this variation is particularly manifested in the head domain where antigenic drift and shift have resulted in a large number of different variants. Since this is also the area that is immunodominant, most neutralizing antibodies are directed against this domain and act by interfering with receptor binding. The combination of immunodominance and large variation of the head domain also explains why infection with a particular strain does not lead to immunity to other strains: the antibodies elicited by the first infection only recognize a limited number of strains closely related to the virus of the primary infection.

Recently, influenza hemagglutinin stem domain polypeptides, lacking all or substantially all of the influenza hemagglutinin globular head domain, have been described and used to generate an immune response to one or more conserved epitopes of the stem domain polypeptide. It is believed that epitopes of the stem domain polypeptide are less immunogenic than the highly immunogenic regions of a globular head domain, thus the absence of a globular head domain in the stem domain polypeptide might allow an immune response against one or more epitopes of the stem domain polypeptide to develop (Steel et al., 2010). Steel et al. thus have created a new molecule by deleting amino acid residue 53 to 276 of HA1 of the A Puerto Rico/8/1934 (H1N1) and A/Hong Kong/1968 (H3N2) strains from the HA primary sequence, and replacing this by a short flexible linking sequence GGGG. Vaccination of mice with the H3 HK68 construct did not elicit antisera that were cross-reactive with group 1 HAs. In addition, as shown in PCT/EP2012/073706, the stem domain polypeptides were highly unstable and did not adopt the correct conformation as proven by the lack of binding of antibodies that were shown to bind to conserved epitopes in the stem region.

In addition, Bommakanti et al. (2010) described an HA2 based polypeptide comprising amino acid residues 1-172 of HA2, a 7-amino acid linker (GSAGSAG), amino acid residues 7-46 of HA1, a 6-amino acid linker GSAGSA, followed by residues 290-321 of HA1, with the mutations V297T, I300E, Y302T and C305T in HA1. The design was based on the sequence of H3 HA (A/Hong Kong/1968). The polypeptide did only provide cross-protection against another influenza virus strain within the H3 subtype (A/Phil/2/82 but not against an HI subtype (A/PR/8/34). In a more recent paper by Bommakanti et al (2012) a stem domain sequence based on HA from H1 1 A/Puerto Rico/8/1934 (H1HA0HA6) is described. In this polypeptide the equivalent of residues 55 to 302 have been deleted and mutations 131 IT, V314T, I316N, C319S, F406D, F409T, and L416D have been made. Both the H3 and HA based polypeptides were expressed in E. coli and therefore lack the glycans that are aprt of the naturally occurring HA proteins. When expressed in E.coli the polypeptide is recovered mainly as high molecular weight aggregates and a minor monomeric fraction. The polypeptide binds CR6261 with two apparent dissociation constants of 9 and 0.2 μΜ. The authors show that mice can survive a challenge with 1 LD90 of the homologous HlNl A Puerto Rico/8/1934 virus after immunization (twice, 4 week interval) with 20 μg of protein adjuvanted with 100 μg of CpG7909. The authors also describe circularly permutated polypeptides comparable to those described above for A/Hong Kong/1/1968 derived polypeptides. These

polypeptides are derived from HA's from HlNl A/Puerto Rico/8/1934, HlNl A/North Carolina/20/99 or HlNl A/California/07/2009 and can provide partial protection in a mild challenge (1LD90) model in mice of HlNl A/Puerto Rico/8/1934 (i.e. within the same subtype). Sera from guinea pigs immunized with these polypeptides did not exhibit detectable levels of neutralization when tested in a neutralization assay.

More recently Lu et al (2013) also described soluble stem domain polypeptides derived from the HA of HlNl A/California/05/2009. In the final design the equivalent of residues 54-303 (numbering according to SEQ ID NO: 1) have been deleted (the leader sequence, residues 1-17 is also not present) and two mutations have been introduced in the B-loop of the protein, i.e. F407D, and L413D. Furthermore the polypeptide contained a C-terminal trimerization domain (foldon). In addition, two intermonomer disulfide bridges were introduced, one in the area of the trimeric foldon domain, and one at position 430 and 431. The polypeptide is produced in an E.coli based cell free system, (and thus lacks the glycans that arepart of the naturally occurring HA proteins) and is recovered in a denatured form, which needs to be refolded prior to use .No immunological or protection from influenza challenge data were shown.

In a recent paper Mallajosyula et al (2014) also report a stem domain polypeptide.

In this design, based on the HA from HlNl A/Puerto Rico/8/1934, not only a large part of the HA1 sequence is deleted (residue 42 to 289, numbering according to SEQ ID NO: 1), but also large part of the N- and C-terminal sequences of HA2 (residues 344 to 383 and 457 to 565, respectively). The polypeptide contains a foldon trimerization domain at the C-terminus and is also produced in E. coli, so lacks the naturally occurring glycans on viral HA. The polypeptide binds the broadly neutralizing antibodies CR6261, F10 and FI6v3. The polypeptide was also tested in an influenza challenge model (1LD90 of H1 1 A Puerto Rico/8/1934) and could protect mice from death. Equivalent polypeptides derived from HA of H1N1 A/New Caledonia/20/1999 and H1N1 A/California/04/2009 could also partially protect. A polypeptide derived from H5 1 A/Viet Nam/1203/2004 only gave limited protection in this challenge model. Moreover, the challenge model used is mild with a relatively low dose administerd (1-2 LD90).

There thus still exists a need for a safe and effective universal vaccine that stimulates the production of a robust, broadly neutralizing antibody response and that offers protection against a broad set of current and future influenza virus strains (both seasonal and pandemic), in particular providing protection against one or more influenza A virus subtypes within phylogenetic group 1 and/or group 2, for effective prevention and therapy of influenza.

SUMMARY

Provided herein are influenza hemagglutinin stem domain polypeptides, methods for providing stem domain polypeptides, compositions comprising the same, vaccines comprising the same and methods of their use.

In a first aspect, the present invention provides novel immunogenic polypeptides comprising an influenza hemagglutinin stem domain and lacking the globular head, referred to as influenza hemagglutinin (HA) stem domain polypeptides. The polypeptides are capable of inducing an immune response when administered to a subject, in particular a human subject. The polypeptides of the invention present conserved epitopes of the membrane proximal stem domain HA molecule to the immune system in the absence of dominant epitopes that are present in the membrane distal head domain. To this end, part of the primary sequence of the HAO protein making up the head domain is removed and the remaining amino acid sequence is reconnected, either directly or, in some

embodiments, by introducing a short flexible linking sequence ('linker') to restore the continuity of the amino acid chain. The resulting sequence is further modified by introducing specific mutations that stabilize the native 3-dimensional structure of the remaining part of the HAO molecule. The immunogenic polypeptides do not comprise the full-length HA1 domain of an influenza virus. The present invention provides novel influenza hemagglutinin stem domain polypeptide comprising: (a) an influenza hemagglutinin HA1 domain that comprises an HA1 N- terminal stem segment, covalently linked by a linking sequence of 0-50 amino acid residues to an HA1 C- terminal stem segment, said HA1 C-terminal segment being linked to (b) an influenza hemagglutinin HA2 domain, wherein the HA1 and HA2 domain are derived from an influenza A virus subtype comprising HA of the HI subtype; and

(d) wherein said HA1 N-terminal segment comprises the amino acids 1-x of HA1, preferably the amino acids p-x of HA1, and wherein the HA1 C- terminal stem segment comprises the amino acids y-C-terminal amino acid of HA1, wherein x= the amino acid on position 52 of SEQ ID NO: 1 (or an equivalent position in another hemagglutinin), p= the amino acid on position 18 of SEQ ID NO: 1 (or an equivalent position in another hemagglutinin) and y= the amino acid on position 321 of SEQ ID NO: 1 (or an equivalent position in another hemagglutinin);

(e) wherein the region comprising the amino acid residues 402-418 comprises the amino acid sequence XiNTQX ₂TAX ₃GKEX ₄N(H/K)X ₅E(K/R) (SEQ ID NO: 8), wherein:

Xi, is an amino acid selected from the group consisting of M, E, K, V, R and T,

X ₂ is an amino acid selected from the group consisting of F, I, N, T, H, L and Y, preferably I, L or Y,

X ₃ is an amino acid selected from the group consisting of V, A, G, I, R, F and S, preferably A, I or F,

X4 is an amino acid selected from the group consisting of F, I, N, S, T, Y, E, K, M, and V, preferably I, Y, M or V,

X ₅ is an amino acid selected from the group consisting of L, H, I, N, R, preferably I;

(f) wherein the amino acid residue on position 337 (HA1 domain) is selected from the group consisting of: I, E, K, V, A, and T,

the amino acid residue on position 340 (HA1 domain) is selected from the group consisting of: I, K, R, T, F, N, S and Y, the amino acid residue on position 352 (HA2 domain) is selected from the group consisting of: D, V, Y, A, I, N, S, and T, and

the amino acid residue on position 353 (HA2 domain) is selected from the group consisting of: K, R, T, E, G, and V; and

(g) wherein the polypeptide further comprises a disulfide bridge between the amino acid on position 324 and the amino acid on position 436; and

(h) wherein furthermore the amino acid sequence RMKQIEDKIEEIESK (SEQ ID NO: 20) has been introduced at positions 419-433 or wherein sequence RMKQIEDKIEEIESKQK (SEQ ID NO: 21) has been introduced at position 417-433.

In certain embodiments, the polypeptides comprise the complete HA2 domain, i.e. the HA2 domain including the transmembrane domain and the intracellular sequence. In certain embodiments, the HA2 domain has been truncated. Thus, in certain embodiments, the polypeptides of the invention do not contain the intracellular sequences of HA and the transmembrane domain. In certain embodiments, the amino acid sequence from position (or the equivalent of) 514, 515, 516, 517, 518, 519, 520, 521 , 522, 523, 524, 525, 526, 527, 526, 528, 529, or 530 of the HA2 domain to the C-terminus of the HA2 domain has been removed.

According to the invention, the C-terminal amino acid of the HA1 C-terminal stem segment is linked to the N-terminal amino acid of the HA2 domain, thus forming a junction between the HA1 and HA2 domain. The polypeptides of the invention do not comprise a protease cleavage site at the junction between the HA1 and HA2 domain. In certain embodiments, the C- terminal amino acid residue of the HA1 C-terminal stem segment (amino acid 343 in SEQ ID NO: 1) is any amino acid other than arginine (R) or lysine (K), preferably glutamine (Q).

The polypeptides of the invention are substantially smaller than HA0, preferably lacking all or substantially all of the globular head of HA. Preferably, the immunogenic polypeptides are no more than 360, preferably no more than 350, 340, 330, 320, 310, 305, 300, 295, 290, 285, 280, 275, or 270 amino acids in length. In certain embodiments, the immunogenic polypeptides are from about 250 to about 350, preferably from about 260 to about 340, preferably from about 270 to about 330, preferably from about 270 to about 330 amino acids in length. The polypeptides of the invention comprise the conserved stem domain epitopes of the group 1 cross-neutralizing antibody CR6261 (as disclosed in WO2008/028946) and/or of the antibody CR91 14 (as described in WO2013/007770), an antibody capable of binding to and neutralizing both group 1 and group 2 influenza A viruses, as well as influenza B viruses. It is thus another aspect of the invention to provide HA stem domain polypeptides, wherein said polypeptides stably present the epitopes of the antibody CR6261 and/or CR91 14, as indicated by binding of said antibody or antibodies to said polypeptides. In an embodiment, the polypeptides do not bind to CR8020 and CR8057 (described in WO 2010/130636), which are monoclonal antibodies that binds to H3 influenza viruses only.

The influenza hemagglutinin stem domain polypeptides provided herein are suitable for use in immunogenic compositions (e.g. vaccines) capable of generating immune responses against one/or a plurality of influenza virus A and/or B strains, in particular against an influenxa virus of the HI subtype. In an embodiment, the influenza

hemagglutinin stem domain polypeptides are capable of generating immune responses against influenza A virus strains of phylogenetic group 1 and/or group 2, in particular against influenza virus strains of both phylogenetic group 1 and group 2. In an embodiment, the polypeptides are capable of generating an immune response against homologous influenza virus strains. In an embodiment, the polypeptides are capable of generating an immune response against heterologous influenza virus strains of the same and/or different subtypes. In a further embodiment, the polypeptides are capable of generating an immune response to influenza virus strains of both phylogenetic group 1 and group 2 and influenza B virus strains.

The polypeptides according to the invention may be used e.g. in stand alone therapy and/or prophylaxis and/or diagnosis of a disease or condition caused by an influenza virus, in particular a phylogenetic group 1 or 2 influenza A virus and/or an influenza B virus, or in combination with other prophylactic and/or therapeutic

treatments, such as (existing or future) vaccines, antiviral agents and/or monoclonal antibodies.

In a further aspect, the present invention provides nucleic acid molecules encoding the influenza HA stem domain polypeptides. In yet another aspect, the invention provides vectors comprising the nucleic acids encoding the immunogenic polypeptides. In a further aspect, the invention provides methods for inducing an immune response in a subject, the method comprising administering to the subject a polypeptide and/or nucleic acid molecule and/or vector according to the invention.

In another aspect, the invention provides compositions comprising a polypeptide and/or a nucleic acid molecule and/or a vector according to the invention. The compositions preferably are immunogenic compositions. The compositions provided herein can be in any form that allows for the compositions to be administered to a subject, e.g. mice, ferrets or humans. In a specific embodiment, the compositions are suitable for human administration. The polypeptides, nucleic acid molecules and compositions may be used in methods of preventing and/or treating an influenza virus disease and/or for diagnostic purposes. The compositions may further comprise a pharmaceutically acceptable carrier or excipient. In certain embodiments, the compositions described herein comprise, or are administered in combination with, an adjuvant.

In another aspect, the invention provides polypeptides, nucleic acids and/or vectors for use as a vaccine. The invention in particular relates to immunogenic polypeptides, nucleic acids, and/or vectors for use as a vaccine in the prevention and/or treatment of a disease or condition caused by an influenza virus A subtype of phylogenetic group 1 and/or 2 and/or influenza B virus, in particular a disease or condition caused by an influenza virus comprising HA of the HI subtype.

The various embodiments and uses of the polypeptides according to the invention will become clear from the following detailed description of the invention.

BRIEF DESCRIPTION OF THE FIGURES

FIG. 1 shows a model of the HA monomer in the pre-fusion state as present in the native trimer. HA 1 is shown in light grey, HA2 is shown in dark grey. Helix A (an important part of the epitope of CR6261) and helix CD (part of the trimer interface) are indicated, as is the loop connecting these secondary structure elements. The C-terminus of HA1 and the N-terminus of HA2 are also indicated. The fusion peptide is located at the N-terminus of HA2. FIG. 2. Sandwich Elisa results obtained for supernatants of cultures expressing SEQ ID NO: 65 to 71 and SEQ ID NO: 76 to 78, disclosed in PCT/EP2014/060997. Capture and detection antibodies are indicated above the graph. Mini-HA refers to a soluble version of SEQ ID NO: 2 where the equivalent of residue 519-565 has been replaced by

RSLVPRGSPGHHHHHH; FL-HA-FFH refers to a soluble version of SEQ ID NO: 1 containing a C-terminal Flag-thrombin-foldon-His sequence (SEQ ID NO: 4) from position 520; FL-HA-7xHis refers to a soluble version of SEQ ID NO: 1 containing the C-terminal sequence EGRHHHHHHH from position 530. FIG. 3. Sandwich Elisa results obtained for supernatants of cultures expressing polypeptides of the invention comprising GCN4 derived sequence RMKQIEDKIEEIESK (SEQ ID NO: 20) at position 419-433 (t2 variants). Capture and detection antibodies are indicated above the graph. Mini-HA-t2 is derived from Mini-HA by introducing SEQ ID NO: 20 at position 419-433; FL-HA-FFH, FL-HA-7xHis as above.

FIG. 4. Sandwich Elisa results obtained for supernatants of cultures expressing polypeptides of the invention comprising GCN4 derived sequence

RMKQIEDKEEIESKQK (SEQ ID NO: 21) at position 417-433 (t3 variants). Capture and detection antibodies are indicated above the graph. Mini-HA-t3 is derived from Mini- HA by introducing SEQ ID NO: 21 at position 417-433; FL-HA-FFH, FL-HA-7xHis as above.

FIG. 5. Elution profiles of s55G7-t2, sl27Hl-t2 and s86B4-t2 from a Superdex 200 size exclusion column, the final step in the purification procedure. The numbered lines under the chromatogram indicate fractions collected during the elution process.

FIG. 6. SDS-PAGE and Western Blot analysis of fractions collected during the elution of the Superdex 200 size exclusion column. Numbers correspond to the fractions indicated in figure 5. For detection on Western Blot an antibody recognizing the C-terminal his-tag was used. FIG. 7. Size exclusion chromatography (Tosoh G2000 analytical column) of sl27Hl-t2 in the presence and absence of Fab fragments of broadly neutralizing antibodies CR91 14, CR6261, and CR8020. Molecular weights of individual proteins and/or complexes were determined by multi-angle light scattering during elution from the column and are listed in Table 8.

FIG. 8. Binding of polypeptide ofthe invention sl27Hl-t2 to monoclonal antibodies CR6261 and CR9114 using biolayer interferometry. Top panels show individual binding curves for immobilized monoclonal antibodies exposed to varying concentrations of sl27Hl-t2, bottom panels show the steady state analysis used to estimate ¾.

FIG. 9. Survival (A), body weight loss (B) and clinical score (C) for the negative (PBS, 3 immunizations at 3 weeks intervals) and positive control (15 mg/kg CR6261 , 1 day before challenge) groups. Mice were challenged four week after the last immunization with a lethal dose (25xLD50) of H1N1 A Puerto Rico/8/34 and monitored for 21 days. Error bars indicate 95% confidence interval (B) or interquartile range (C)

FIG 10. Survival (A), body weight loss (B) and clinical score (C) for the experimental groups immunized (3 immunizations at 3 weeks intervals) with 10 μg sl27Hl-t2, either in the presence or absence of 10 μg Matrix-M. Mice were challenged four week after the last immunization with a lethal dose (25xLD50) of H1N1 A/Puerto Rico/8/34 and monitored for 21 days. or reasons of comparison the negative control group (PBS) is also shown. Error bars indicate 95% confidence interval (B) or interquartile range (C) FIG 11. Elisa results for serum of the negative control and experimental groups using sl27Hl-t2 (A) or a soluble form of Full length HA (B) as the antigen. Bars represent median.

FIG 12. The antibodies induced after immunization with adjuvated polypeptide of the invention sl27Hl -t2 are capable of competing with CR9114 for binding to full length HA from H1N1 A/Brisbane/59/07 in a competition ELISA (A). For reasons of comparison competition levels by unlabeled CR9114 (i.e. self-competition) and the non-binding monoclonal antibodies CR8020 and CR-JB, both serially diluted from 5 g/ml starting concentration, are indicated in a separate graph. FIG. 13. Survival (A), relative body weight loss (B) and clinical score (C) for the negative (PBS, 3 immunizations at 3 weeks intervals) and positive control (15 mg/kg CR6261, 1 day before challenge) groups. Mice were challenged four week after the last immunization with a lethal dose (25xLD50) of H1N1 A Puerto Rico/8/34 and monitored for 21 days. Error bars indicate 95% confidence interval (B) or interquartile range (C).

FIG 14. Survival for groups immunized 1 time (A), 2 times (B) or 3 times (C) with 30 μg sl27Hl-t2-cll 81ong in the presence of 10 μg Matrix-M. Mice were challenged four week after the last immunization with a lethal dose (25xLD50) of H1N1 A/Puerto Rico/8/34 and monitored for 21 days. For reasons of comparison the negative control group (PBS) is also shown.

FIG 15. Relative body weight change for groups immunized 1 time (A), 2 times (B) or 3 times with 30 μg sl27Hl -t2-cll 81ong in the presence of 10 μg Matrix-M. Mice were challenged four week after the last immunization with a lethal dose (25xLD50) of H1N1 A/Puerto Rico/8/34 and monitored for 21 days. For reasons of comparison the negative control group (PBS) is also shown. Error bars indicate 95% confidence interval.

FIG 16. Clinical scores for groups immunized 1 time (A), 2 times (B) or 3 times with 30 μg sl27Hl-t2-cll81ong in the presence 10 μg Matrix-M. Mice were challenged four week after the last immunization with a lethal dose (25xLD50) of H1N1 A/Puerto Rico/8/34 and monitored for 21 days. For reasons of comparison the negative control group (PBS) is also shown. Error bars indicate interquartile range.

FIG 17. ELISA results for pre-challenge serum (4 weeks after the final immunization) of the negative control and experimental groups using sl27Hl-t2-cll81ong (A) or a soluble form of Full length HA (B) as the antigen. Bars represent median. FIG 18. The antibodies induced after immunization with Matrix-M adjuvated polypeptide of the invention sl27Hl-t2-cll81ong are capable of competing with CR91 14 for binding to full length HA from HlNl A/Brisbane/59/07 in a competition ELISA (A). For reasons of comparison competition levels by unlabeled CR9114 (i.e. self-competition) and the non-binding monoclonal antibodies CR8020, both serially diluted from 5 g/ml starting concentration, are indicated in a separate graph (B). Bars represent median.

FIG 19. (A) Survival for the negative (PBS, 3 immunizations at 3 weeks intervals) and positive control (15 mg/kg CR6261, 1 day before challenge) groups. Mice were challenged four week after the last immunization with a lethal dose (12.5xLD50) of H5 1 A/Hong Kong/156/97. (B) Survival (C) relative body weight change and (D) median clinical scores for the group immunized 3 times with 30 μg sl27Hl-t2 in the presence of 10 μg Matrix-M. Error bars indicate 95% confidence interval (C) or interquartile range (D) .Mice were challenged four week after the last immunization with a lethal dose (12.5xLD50) of H5N1 A/Hong Kong/156/97 and monitored for 21 days. For reasons of comparison the negative control group (PBS) is also shown in B, C, D.

FIG 20. Elisa results for sera from mice immunized 3 times with polypeptide of the invention sl27Hl -t2 as described in example 5 using full length HA's from a number of group 1 (HI, H5 and H9) and group II (H3 and H7) influenza strains as the antigen. Induced antibodies recognize all tested FL HA's from group 1.

FIG 21. (A) Survival for the negative (PBS, 3 immunizations at 3 weeks intervals) and positive control (15 mg/kg CR6261, 1 day before challenge) groups. Mice were challenged four week after the last immunization with a lethal dose (12.5xLD50) of HlNl A/Brisbane/59/2007. (B) Survival (C) relative body weight change and (D) median clinical scores for the group immunized 3 times with 30 μg sl27Hl-t2 in the presence of 10 μg Matrix-M. Error bars indicate 95% confidence interval (C) or interquartile range (D) .Mice were challenged four week after the last immunization with a lethal dose (12.5xLD50) of HlNl A/Brisbane/59/2007 and monitored for 21 days. For reasons of comparison the negative control group (PBS) is also shown in B, C, D. FIG 22. Pseudoparticle neutralizations assay using sera from mice immunized with polypeptide of the invention sl27Hl-t2 or PBS.

FIG 23. Antibody Dependent Cellular Cytotoxicity (ADCC) surrogate assay. Sera from mice immunized with polypeptide of the invention sl27Hl -t2 exhibit a 30-40 fold induction of FcyRIV signaling activity at the highest serum concentrations using target cells transfected with FL HA from H5N1 A/Hong Kong/156/97 (A) or H1N1

A/Brisbane/59/07 (B) as the source of antigen. FIG 24. Survival (A) and % body weight change (B) of mice after serum transfer and challenge with H5N1 A/Hong Kong/156/97 as described in Example 9.

FIG 25. Full length HA (H1N1 A/Brisbane/59/2007) ELISA titers of donor mice (D) at day 70, and recipient mice (R) prior to serum transfer (day -4) or challenge (day 0). Data were analyzed using a slope based weighted average approach. Open symbols denote measurements at LOD. Bars denote medians.

FIG 26. Survival (A) and % body weight change (B) of mice after immunization and challenge with H1N1 A/NL/602/09 as described in Example 10.

FIG 27. (A): Full length HA (H1N1 A/Brisbane/59/2007) ELISA titers of mice immunized as described in Example 10. Data were analyzed using a slope based weighted average approach. Open symbols denote measurements at LOD. Bars denote medians. (B): Serum IgG CR9114 competition binding obtained after immunization mice as described in Example 10. FL HA from H1N1 A/Brisbane/59/2007 was used as the antigen. Data shown are group medians, error bars denote interquartile range. Data for CR9114 and CR8020 starting from a 5 μg/ml solution and diluted in the same manner as the serum samples are indicated.

FIG 28. Primary screen of a total of 10472 clones (5544 and 4928 from set 1 and 2, respectively) Data are normalized to the average of the FL HA binding and expression included in the experiment. The top 20% clones in the CR9114 sandwich assay (panel A) also exhibiting expression >50 % of FL HA expression and binding signals to CR6261 >80% of the signals observed for FL HA (panel B) were considered hits; this procedure yielded 703 hits (596 and 107 from library 1 and 2, respectively).

FIG 29. CR91 14 sandwich Elisa results for polypeptides of the invention (A) SEQ ID NO: 158 to 162 all containing a C-terminal Flag-foldon-his sequence (B) SEQ ID NO: 163 to 166, all containing a C-terminal TCS-his sequence. FIG 30. SEC MALS results for SEQ ID NO: 158 in the presence and absence of Fab fragments of CR91 14 (indicated as CRF9114) or CR6261 (indicated as CRF6261). The molecular mass derived from the multi-angle light scattering analysis is given in example 12 and indicates formation complexes with 3 Fab fragments per trimer of the polypeptide of the invention.

FIG 31. Survival (A) and % body weight change (B) of mice after immunization and challenge with H1N1 A/Brisbane/59/07 as described in Example 13.

FIG 32. (A): Full length HA (H1N1 A Brisbane/59/2007) ELISA titers of mice immunized as described in Example 13. Data were analyzed using a slope based weighted average approach. Open symbols denote measurements at LOD. Bars denote medians. (B): Serum IgG CR91 14 competition binding obtained after immunization mice as described in Example 18. FL HA from H1N1 A Brisbane/59/2007 was used as the antigen. Data shown are group medians, error bars denote interquartile range. Levels for CR91 14 and CR8020 starting from a 5 μg/ml solution and diluted in the same manner as the serum samples are indicated.

FIG 33. Survival (A) and % body weight change (B) of mice after immunization and challenge with H5N1 A/Hon Kong/156/97 as described in Example 14. FIG 34. (A): Full length HA (H1N1 A/Brisbane/59/2007) ELISA titers of mice immunized as described in Example 14. Data were analyzed using a slope based weighted average approach. Open symbols denote measurements at LOD. Bars denote medians. (B): Serum IgG CR9114 competition binding obtained after immunization mice as described in example 18. FL HA from H1N1 A Brisbane/59/2007 was used as the antigen. Data shown are group medians, error bars denote interquartile range. Levels for CR91 14 and CR8020 starting from a 5 μg/ml solution and diluted in the same manner as the serum samples are indicated. FIG 35. Survival (A) and % body weight change (B) of mice after immunization and challenged with H1N1 A/Puerto Rico/8/1934 as described in Example 15.

FIG 36. (A): Full length HA (H1N1 A/Brisbane/59/2007) ELISA titers of mice immunized as described in Example 15. Data were analyzed using a slope based weighted average approach. Open symbols denote measurements at LOD. Bars denote medians. (B): Serum IgG CR9114 competition binding obtained after immunization mice as described in example 18. FL HA from H1N1 A/Brisbane/59/2007 was used as the antigen. Data shown are group medians, error bars denote interquartile range. Levels for CR91 14 and CR8020 starting from a 5 μg/ml solution and diluted in the same manner as the serum samples are indicated.

DEFINITIONS

Definitions of terms as used in the present invention are given below.

An amino acid according to the invention can be any of the twenty naturally occurring (or 'standard' amino acids) or variants thereof, such as e.g. D-proline (the D- enantiomer of proline), or any variants that are not naturally found in proteins, such as e.g. norleucine. The standard amino acids can be divided into several groups based on their properties. Important factors are charge, hydrophilicity or hydrophobicity, size and functional groups. These properties are important for protein structure and protein-protein interactions. Some amino acids have special properties such as cysteine, that can form covalent disulfide bonds (or disulfide bridges) to other cysteine residues, proline that forms a cycle to the polypeptide backbone, and glycine that is more flexible than other amino acids. Table 2 shows the abbreviations and properties of the standard amino acids.

The term "amino acid sequence identity" refers to the degree of identity or similarity between a pair of aligned amino acid sequences, usually expressed as a percentage. Percent identity is the percentage of amino acid residues in a candidate sequence that are identical (i.e., the amino acid residues at a given position in the alignment are the same residue) or similar (i.e., the amino acid substitution at a given position in the alignment is a conservative substitution, as discussed below), to the corresponding amino acid residue in the peptide after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence homology. Sequence homology, including percentages of sequence identity and similarity, are determined using sequence alignment techniques well- known in the art, such as by visual inspection and mathematical calculation, or more preferably, the comparison is done by comparing sequence information using a computer program. An exemplary, preferred computer program is the Genetics Computer Group (GCG; Madison, Wis.) Wisconsin package version 10.0 program, 'GAP' (Devereux et al. (1984)).

"Conservative substitution" refers to replacement of an amino acid of one class is with another amino acid of the same class. In particular embodiments, a conservative substitution does not alter the structure or function, or both, of a polypeptide. Classes of amino acids for the purposes of conservative substitution include hydrophobic (e.g. Met, Ala, Val, Leu), neutral hydrophilic (e.g. Cys, Ser, Thr), acidic (e.g. Asp, Glu), basic (e.g. Asn, Gin, His, Lys, Arg), conformation disrupters (e.g. Gly, Pro) and aromatic (e.g. Trp, Tyr, Phe).

As used herein, the terms "disease" and "disorder" are used interchangeably to refer to a condition in a subject. In some embodiments, the condition is a viral infection, in particular an influenza virus infection. In specific embodiments, a term "disease" refers to the pathological state resulting from the presence of the virus in a cell or a subject, or by the invasion of a cell or subject by the virus. In certain embodiments, the condition is a disease in a subject, the severity of which is decreased by inducing an immune response in the subject through the administration of an immunogenic composition. As used herein, the term "effective amount" in the context of administering a therapy to a subject refers to the amount of a therapy which has a prophylactic and/or therapeutic effect(s). In certain embodiments, an "effective amount" in the context of administration of a therapy to a subject refers to the amount of a therapy which is sufficient to achieve a reduction or amelioration of the severity of an influenza virus infection, disease or symptom associated therewith, such as, but not limited to a reduction in the duration of an influenza virus infection, disease or symptom associated therewith, the prevention of the progression of an influenza virus infection, disease or symptom associated therewith, the prevention of the development or onset or recurrence of an influenza virus infection, disease or symptom associated therewith, the prevention or reduction of the spread of an influenza virus from one subject to another subject , the reduction of hospitalization of a subject and/or hospitalization length, an increase of the survival of a subject with an influenza virus infection or disease associated therewith, elimination of an influenza virus infection or disease associated therewith, inhibition or reduction of influenza virus replication, reduction of influenza virus titer; and/or enhancement and/or improvement of the prophylactic or therapeutic effect(s) of another therapy. In certain embodiments, the effective amount does not result in complete protection from an influenza virus disease, but results in a lower titer or reduced number of influenza viruses compared to an untreated subject. Benefits of a reduction in the titer, number or total burden of influenza virus include, but are not limited to, less severe symptoms of the infection, fewer symptoms of the infection and a reduction in the length of the disease associated with the infection.

The term "host", as used herein, is intended to refer to an organism or a cell into which a vector such as a cloning vector or an expression vector has been introduced. The organism or cell can be prokaryotic or eukaryotic. Preferably, the host comprises isolated host cells, e.g. host cells in culture. The term "host cells" merely signifies that the cells are modified for the (over)-expression of the polypeptides of the invention. It should be understood that the term host is intended to refer not only to the particular subject organism or cell but to the progeny of such an organism or cell as well. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent organism or cell, but are still included within the scope of the term "host" as used herein.

The term "included" or "including" as used herein is deemed to be followed by the words "without limitation".

As used herein, the term "infection" means the invasion by, multiplication and/or presence of a virus in a cell or a subject. In one embodiment, an infection is an "active" infection, i.e., one in which the virus is replicating in a cell or a subject. Such an infection is characterized by the spread of the virus to other cells, tissues, and/or organs, from the cells, tissues, and/or organs initially infected by the virus. An infection may also be a latent infection, i.e., one in which the virus is not replicating. In certain embodiments, an infection refers to the pathological state resulting from the presence of the virus in a cell or a subject, or by the invasion of a cell or subject by the virus.

Influenza viruses are classified into influenza virus types: genus A, B and C. The term "influenza virus subtype" as used herein refers to influenza A virus variants that are characterized by combinations of the hemagglutinin (H) and neuramidase (N) viral surface proteins. According to the present invention influenza virus subtypes may be referred to by their H number, such as for example "influenza virus comprising HA of the H3 subtype", "influenza virus of the H3 subtype" or "H3 influenza", or by a combination of a H number and an N number, such as for example "influenza virus subtype H3N2" or "H3N2". The term "subtype" specifically includes all individual "strains", within each subtype, which usually result from mutations and show different pathogenic profiles, including natural isolates as well as man-made mutants or reassortants and the like. Such strains may also be referred to as various "isolates" of a viral subtype. Accordingly, as used herein, the terms "strains" and "isolates" may be used interchangeably. The current nomenclature for human influenza virus strains or isolates includes the type (genus) of virus, i.e. A, B or C, the geographical location of the first isolation, strain number and year of isolation, usually with the antigenic description of HA and A given in brackets, e.g. A/Moscow/10/00 (H3N2). Non-human strains also include the host of origin in the nomenclature. The influenza A virus subtypes can further be classified by reference to their phylogenetic group.

Phylogenetic analysis has demonstrated a subdivision of hemagglutinins into two main groups: inter alia the HI, H2, H5 and H9 subtypes in phylogenetic group 1 ("group 1" influenza viruses) and inter alia the H3, H4, H7 and H10 subtypes in phylogenetic group 2 ("group 2" influenza viruses).

As used herein, the term "influenza virus disease" refers to the pathological state resulting from the presence of an influenza virus, e.g. an influenza A or B virus in a cell or subject or the invasion of a cell or subject by an influenza virus. In specific

embodiments, the term refers to a respiratory illness caused by an influenza virus.

As used herein, the term "nucleic acid" is intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules (e.g., mRNA) and analogs of the DNA or RNA generated using nucleotide analogs. The nucleic acid can be single - stranded or double-stranded. The nucleic acid molecules may be modified chemically or biochemically or may contain non-natural or derivatized nucleotide bases, as will be readily appreciated by those of skill in the art. Such modifications include, for example, labels, methylation, substitution of one or more of the naturally occurring nucleotides with an analog, internucleotide modifications such as uncharged linkages (e.g., methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), charged linkages (e.g., phosphorothioates, phosphorodithioates, etc.), pendent moieties (e.g., polypeptides), intercalators (e.g., acridine, psoralen, etc.), chelators, alkylators, and modified linkages (e.g., alpha anomeric nucleic acids, etc.). A reference to a nucleic acid sequence encompasses its complement unless otherwise specified. Thus, a reference to a nucleic acid molecule having a particular sequence should be understood to encompass its complementary strand, with its complementary sequence. The complementary strand is also useful, e.g., for anti-sense therapy, hybridization probes and PCR primers.

As used herein, in certain embodiments the numbering of the amino acids in HA is based on the numbering of amino acids in HAO of a wild type influenza virus, e.g. the numbering of the amino acids of the HlNl influenza strain A Brisbane/59/2007 (SEQ ID NO: 1). As used in the present invention, the wording "the amino acid at position "x" in HA" thus means the amino acid corresponding to the amino acid at position x in HAO of the particular wild type influenza virus, e.g. A/Brisbane/59/2007 (SEQ ID NO: 1 ;

wherein the amino acids of the HA2 domain have been indicated in italics). It will be understood by the skilled person that equivalent amino acids in other influenza virus strains and/or subtypes can be determined by multiple sequence alignment. Note that, in the numbering system used throughout this application 1 refers to the N-terminal amino acid of an immature HAO protein (SEQ ID NO: 1). The mature sequence starts e.g. on position 18 of SEQ ID NO: 1. It will be understood by the skilled person that the leader sequence (or signal sequence) that directs transport of a protein during production (e.g. corresponding to amino acids 1-17 of SEQ ID NO: 1), generally is not present in the final polypeptide, that is e.g. used in a vaccine. In certain embodiments, the polypeptides according to the invention thus comprise an amino acid sequence without the leader sequence, i.e. the amino acid sequence is based on the amino acid sequence of HAO without the signal sequence.

"Polypeptide" refers to a polymer of amino acids linked by amide bonds as is known to those of skill in the art. As used herein, the term can refer to a single polypeptide chain linked by covalent amide bonds. The term can also refer to multiple polypeptide chains associated by non-covalent interactions such as ionic contacts, hydrogen bonds, Van der Waals contacts and hydrophobic contacts. Those of skill in the art will recognize that the term includes polypeptides that have been modified, for example by post-translational processing such as signal peptide cleavage, disulfide bond formation, glycosylation (e.g., N-linked and O-linked glycosylation), protease cleavage and lipid modification (e.g. S-palmitoylation).

"Stem domain polypeptide" refers to a polypeptide that comprises one or more polypeptide chains that make up a stem domain of a naturally-occurring (or wild-type) hemagglutinin (HA). Typically, a stem domain polypeptide is a single polypeptide chain (i.e. corresponding to the stem domain of a hemagglutinin HAO polypeptide) or two polypeptide chains (i.e. corresponding to the stem domain of a hemagglutinin HA1 polypeptide in association with a hemagglutinin HA2 polypeptide). According to the invention, a stem domain polypeptide comprises one or more mutations as compared to the wild-type HA molecule, in particular one or more amino acid residues of the wild- type HA may have been substituted by other amino acids, not naturally occurring on the corresponding position in a particular wild-type HA. Stem domain polypeptides according to the invention can furthermore comprise one or more linking sequences, as described below. The term "vector" denotes a nucleic acid molecule into which a second nucleic acid molecule can be inserted for introduction into a host where it will be replicated, and in some cases expressed. In other words, a vector is capable of transporting a nucleic acid molecule to which it has been linked. Cloning as well as expression vectors are

contemplated by the term "vector", as used herein. Vectors include, but are not limited to, plasmids, cosmids, bacterial artificial chromosomes (BAC) and yeast artificial

chromosomes (YAC) and vectors derived from bacteriophages or plant or animal

(including human) viruses. Vectors comprise an origin of replication recognized by the proposed host and in case of expression vectors, promoter and other regulatory regions recognized by the host. Certain vectors are capable of autonomous replication in a host into which they are introduced (e.g., vectors having a bacterial origin of replication can replicate in bacteria). Other vectors can be integrated into the genome of a host upon introduction into the host, and thereby are replicated along with the host genome.

As used herein, the term "wild-type" in the context of a virus refers to influenza viruses that are prevalent, circulating naturally and producing typical outbreaks of disease.

DETAILED DESCRIPTION

Influenza viruses have a significant impact on global public health, causing millions of cases of severe illness each year, thousands of deaths, and considerable economic losses. Current trivalent influenza vaccines elicit a potent neutralizing antibody response to the vaccine strains and closely related isolates, but rarely extend to more diverged strains within a subtype or to other subtypes. In addition, selection of the appropriate vaccine strains presents many challenges and frequently results in sub-optimal protection.

Furthermore, predicting the subtype of the next pandemic virus, including when and where it will arise, is currently impossible.

Hemagglutinin (HA) is the major envelope glycoprotein from influenza A viruses which is the major target of neutralizing antibodies. Hemagglutinin has two main functions during the entry process. First, hemagglutinin mediates attachment of the virus to the surface of target cells through interactions with sialic acid receptors. Second, after endocytosis of the virus, hemagglutinin subsequently triggers the fusion of the viral and endosomal membranes to release its genome into the cytoplasm of the target cell. HA comprises a large ectodomain of -500 amino acids that is cleaved by host- derived enzymes to generate 2 polypeptides that remain linked by a disulfide bond. The majority of the N- terminal fragment (HAl , 320-330 amino acids) forms a membrane-distal globular domain that contains the receptor-binding site and most determinants recognized by virus- neutralizing antibodies. The smaller C-terminal portion (HA2, -180 amino acids) forms a stem-like structure that anchors the globular domain to the cellular or viral membrane. The degree of sequence homology between subtypes is smaller in the HAl polypeptides (34% - 59% homology between subtypes) than in the HA2 polypeptide (51%- 80% homology). The most conserved region is the sequence around the cleavage site, particularly the HA2 N- terminal 23 amino acids, which is conserved among all influenza A virus subtypes (Lorieau et al., 2010). Part of this region is exposed as a surface loop in the HA precursor molecule (HA0), but becomes inaccessible when HA0 is cleaved into HAl and HA2.

Most neutralizing antibodies bind to the loops that surround the receptor binding site and interfere with receptor binding and attachment. Since these loops are highly variable, most antibodies targeting these regions are strain-specific, explaining why current vaccines elicit such limited, strain-specific immunity. Recently, however, fully human monoclonal antibodies against influenza virus hemagglutinin with broad cross-neutralizing potency were generated. Functional and structural analysis have revealed that these antibodies interfere with the membrane fusion process and are directed against highly conserved epitopes in the stem domain of the influenza HA protein (Throsby et al., 2008; Ekiert et al. 2009, WO 2008/028946, WO2010/130636, WO 2013/007770).

Stem domain polypeptides stably presenting the epitopes of these antibodies are described in the co-pending patent application PCT/EP2012/073706. At least some of the stem domain polypeptides described herein stably present the epitope of CR6261 and/or

CR91 14 and are immunogenic in mice. Additional immunogenic stem domain polypeptides stably presenting the epitope of CR6261 and/or CR9114 have been described in co-pending patent application PCT/EP2014/060997.

According to the present invention new HA stem domain polypeptides have been designed presenting these epitopes. These polypeptides can be used to create a universal epitope-based vaccine inducing protection against a broad range of influenza strains. Like in the previously described stem domain polypeptides, the highly variable and immunodominant part, i.e. the head domain, is first removed from the full length HA molecule to create a stem domain polypeptide, also called mini-HA, in order to redirect the immune response towards the stem domain where the epitopes for the broadly neutralizing antibodies are located. The broadly neutralizing antibodies mentioned above were used to probe the correct folding of the newly created molecules, and to confirm the presence of the neutralizing epitopes.

The new stem domain polypeptides of the invention show increased binding of the antibodies, in particular CR6261 and/or CR9114, and/or an increased propensity to multimerize and increased stability, as compared to binding of those antibodies to the stem polypeptides described earlier (PCT/EP2012/073706 and PCT/EP2014/060997).

The stem domain polypeptides of this invention are capable of presenting the conserved epitopes of the membrane proximal stem domain HA molecule to the immune system in the absence of dominant epitopes that are present in the membrane distal head domain. To this end, part of the primary sequence of the HAO protein making up the head domain is removed and reconnected, either directly or, in some embodiments, by introducing a short flexible linking sequence ('linker') to restore the continuity of the polypeptide chain. The resulting polypeptide sequence is further modified by introducing specific mutations that stabilize the native 3-dimensional structure of the remaining part of the HAO molecule.

The present invention in particular provides influenza hemagglutinin stem domain polypeptides comprising:

(a) an influenza hemagglutinin HA1 domain that comprises an HA1 N- terminal stem segment, covalently linked by a linking sequence of 0-50 amino acid residues to an HA1 C- terminal stem segment, said HA1 C-terminal segment being linked to

(b) an influenza hemagglutinin HA2 domain, wherein the HA1 and HA2 domain are derived from an influenza A virus subtype comprising HA of the HI subtype;

(d) wherein said HA1 N-terminal segment comprises the amino acids 1-x of HA1, preferably the amino acids p-x of HAl, and wherein the HAl C- terminal stem segment comprises the amino acids y- C-terminal amino acid of HAl, wherein x= the amino acid on position 52 of SEQ ID NO: 1 (or an equivalent position in hemagglutinin of another influenza virus strain), p= the amino acid on position 18 of SEQ ID NO: 1 (or an equivalent position in hemagglutinin of another influenza virus) and y=the amino acid on position 321 of SEQ ID NO: 1 (or an equivalent position in another hemagglutinin);

(e) wherein the region comprising the amino acid residues 402-418 comprises the amino acid sequence XiNTQX ₂TAX ₃GKEX ₄N(H/K)X ₅E(K/R) (SEQ ID NO: 8), wherein:

Xi, is an amino acid selected from the group consisting of M, E, K, V, R and T, X ₂ is an amino acid selected from the group consisting of F, I, N, T, H, L and Y, preferably I, L or Y,

X ₃ is an amino acid selected from the group consisting of V, A, G, I, R, F and S, preferably A, I or F ,

X4 , is an amino acid selected from the group consisting of F, I, N, S, T, Y, E, K, M, and V, preferably I, Y, M or V,

X _5i is an amino acid selected from the group consisting of L, H, I, N, R, preferably

(f) wherein the amino acid residue on position 337 (HAl domain) is selected from the group consisting of: I, E, K, V, A, and T,

the amino acid residue on position 340 (HAl domain) is selected from the group consisting of: I, K, R, T, F, N, S and Y,

the amino acid residue on position 352 (HA2 domain) is selected from the group consisting of: D, V, Y, A, I, N, S, and T, and

the amino acid residue on position 353 (HA2 domain) is selected from the group consisting of: K, R, T, E, G, and V; and

(g) wherein the polypeptide further comprises a disulfide bridge between the amino acid on position 324 and the amino acid on position 436; and (h) wherein the amino acid sequence RMKQIEDKIEEIESK (SEQ ID NO: 20) has been introduced at positions 419-433 or wherein sequence RMKQIEDKIEEIESKQK (SEQ ID NO: 21) has been introduced at position 417-433.

The present invention thus provides stable hemagglutinin stem polypeptides that mimic the three-dimensional conformation of the stem of the natural hemagglutinin molecule.

The polypeptides of the invention do not comprise the full length HAl domain.

The polypeptides thus are substantially smaller than HA0, preferably lacking all or substantially all of the globular head of HA. Preferably, the immunogenic polypeptides are no more than 360, preferably no more than 350, 340, 330, 320, 310, 305, 300, 295, 290, 285, 280, 275, or 270 amino acids in length. In certain embodiments, the immunogenic polypeptides are from about 250 to about 350, preferably from about 260 to about 340, preferably from about 270 to about 330, preferably from about 270 to about 330 amino acids in length.

According to the invention, the "HAl N-terminal segment" refers to a polypeptide segment that corresponds to the amino -terminal portion of the HAl domain of an influenza hemagglutinin (HA) molecule. The HAl N-terminal polypeptide segment comprises the amino acids from position 1 to position x of the HAl domain, wherein the amino acid on position x is an amino acid residue within HAl . The term "HAl C-terminal segment" refers to a polypeptide segment that corresponds to the carboxy-terminal portion of an influenza hemagglutinin HAl domain. The HAl C-terminal polypeptide segment comprises the amino acids from position y to and including the C-terminal amino acid of the HAl domain, wherein the amino acid on position y is an amino acid residue within HAl .

According to the invention y is greater than x, thus a segment of the HAl domain between the HAl N-terminal segment and the HAl C-terminal segment, i.e. between the amino acid on position x and the amino acid on position y of HAl, has been deleted, and in some embodiments, replaced by a linking sequence. Thus, in certain embodiments, the deletion in the HAl segment comprises the amino acid sequence from the amino acid at position x+1 up to and including the amino acid at position y-1.

In certain embodiments, the polypeptides do not comprise the signal sequence. Thus in certain embodiments, the HAl N-terminal segment comprises the amino acid p-x of HAl , wherein p is the first amino acid of the mature HA molecule (e.g. p=l 8 in case of SEQ ID NO: 1). The skilled person will be able to determine the equivalent amino acid in other hemagglutins and to prepare the polypeptides described herein without the signal peptides (e.g. amino acids 1-17 of SEQ ID NO: 1 or an equivalent position in other HI influenza virus strains (see e.g. Table 2), to position x of the HAl domain.

According to the present invention, the HAl N-terminal segment comprises the amino acids 1 -x, preferably p-x of the HAl domain, wherein x=52 and p=18 in SEQ ID NO: 1 or an equivalent amino acid position in other HA sequences of the HI subtype.

According to the invention, the HAl C-terminal polypeptide segment comprises the amino acids from position y to and including the C-terminal amino acid of the HI HAl domain, wherein y is 321 or an equivalent amino acid position in other HA sequences of the HI subtype.

According to the invention, the HAl N- terminal stem segment thus comprises the amino acid residues 1-52 of HAl , preferably the amino acid residues 18-52 of HAl, and the HAl C- terminal stem segment comprises the amino acid residues 321-343 of HAl . In certain embodiments, the HAl N- terminal stem segment consists of the amino acid residues 1 -52 of HAl, preferably the amino acid residues 18-52 of HAl, and the HAl C- terminal stem segment consists of the amino acid residues 321-343 of HAl .

According to the invention, the polypeptides do not comprise a protease cleavage site at the junction between the HAl and the HA2 domain. Thus, the hemagglutinin stem domain polypeptides are resistant to protease cleavage at the junction between HAl and HA2. It is known to those of skill in the art that the Arg (R) - Gly (G) sequence spanning HAl and HA2 (i.e. amino acid positions 343 and 344 in SEQ ID NO: 1) is a recognition site for trypsin and trypsin-like proteases and is typically cleaved for hemagglutinin activation. Since the HA stem domain polypeptides described herein should not be activated, the influenza hemagglutinin stem domain polypeptides of the invention are resistant to protease cleavage. According to the invention, the protease cleavage site thus has been removed in order to prevent cleavage of the polypeptide at the cleavage site between the HAl and HA2 domain. In certain embodiments, the protease cleavage site has been removed by mutation of the C-terminal amino acid of the C-terminal HAl segment and/or mutation of the N-terminal amino acid of the HA2 domain to obtain a sequence that is resistant to protease clevage. In certain embodiments, removal of the cleavage site between HA1 and HA2 in certain embodiments can be achieved by mutation of R (in a small number of cases K) to Q at the PI position (see e.g. Sun et al, 2010 for an explanation of the nomenclature of the cleavage site (position 343 in SEQ ID NO: 1). Thus, in certain embodiments, the C- terminal amino acid residue of the HA1 C-terminal stem segment is any amino acid other than arginine (R) or lysine (K). In certain embodiments, the HA1 C- terminal amino acid is glutamine (Q), serine (S), threonine (T), asparagine (N), aspartic acid (D) or glutamic acid (E). In certain embodiments, the C-terminal amino acid residue of the HA1 C-terminal stem segment is glutamine (Q).

According to the invention, the polypeptides are derived from or based on HI HA, i.e. HA comprising an amino acid sequence from an influenza virus of the HI subtype. In a particular embodiment, the polypeptides comprise hemagglutinin stem domains from or based on HA of an influenza A virus comprising HA of the HI subtype, such as from the influenza virus A Brisbane/59/2007 (HlNl) (SEQ ID NO: 1), as described below. It will be understood by the skilled person that also other influenza A viruses comprising HA of the HI subtype may be used according to the invention. In certain embodiments, the polypeptides comprise hemagglutinin stem domains derived from or based on HA of an influenza A HI virus selected from Table 2. With "derived from" or "based on" it is meant that the N-terminal segments, and/or C-terminal segments of the HA1 domain and/or the HA2 domains have at least 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% amino acid sequence identity with the corresponding N-terminal and/or C-terminal segments of HA1 and/or the HA2 domains of a naturally occurring influenza hemagglutinin of a HI subtype known to those of skill in the art or later discovered.

According to the invention, the HA2 domain comprises one or more mutations in the HA2 amino acid sequence connecting the C-terminal residue of helix A to the N- terminal residue of helix CD (FIG. 1). The HI HA2 amino acid sequence connecting the C-terminal residue of helix A and the N-terminal residue of helix CD comprises the amino acid sequence comprising residues 402-418 of influenza HA (numbering

according to SEQ ID NO: 1), comprising the amino acid sequence

MNTQFTAVGKEFN(H/K)LE(K/R) (SEQ ID NO: 7).

In certain embodiments, the amino acid sequence connecting the C-terminal residue of helix A to the N-terminal residue of helix CD, i.e. the region comprising the amino acid residues 402-418 of influenza HA of serotype HI (numbering according to SEQ ID NO: 1) comprises the amino acid sequence

XiNTQX ₂TAX ₃GKEX ₄N(H/K)X ₅E(K/R) (SEQ ID NO: 8).

According to the invention, one or more of the amino acids on position 402, 406, 409, 413 and 416 (numbering refers to SEQ ID NO: 1), i.e one or more of the amino acids Xi, X ₂, X ₃, X4 and Xshave been mutated, i.e. comprise an amino acid that is not occurring at those positions in a wild-type influenza virus on which the stem polypeptide is based.

In certain embodiments, the mutated amino acid on position 402, i.e. Xi, is an amino acid selected from the group consisting of M, E, K, V, R and T.

In certain embodiments, the mutated amino acid on position 406, i.e. X ₂, is an amino acid selected from the group consisting of F, I, N, T, H, L and Y, preferably I, L or Y.

In certain embodiments, the mutated amino acid on position 409, i.e. X ₃, is an amino acid selected from the group consisting of V, A, G, I, R, F and S, preferably A, I or F.

In certain embodiments, the mutated amino acid on position 413, i.e. X4 , is an amino acid selected from the group consisting of F, I, N, S, T, Y, E, K, M, and V, preferably I, Y, M or V.

In certain embodiments, the mutated amino acid on position 416, i.e. X5, is an amino acid selected from the group consisting of L, H, I, N, R, preferably I.

Combinations of these mutations are also possible.

In certain embodiments, Xi is M, X ₂ is Y, X ₃ is I, X4 is Y and X5 is S.

According to the invention, the stem polypeptides comprise one or more additional mutations, i.e. amino acid substitutions, in the HA1 domain and/or the HA2 domain, as compared to the amino acid sequence of corresponding wild-type influenza virus HA1 and/or HA2 domains, i.e. the influenza virus on which the stem polypeptides are based.

In certain embodiments, one or more amino acid residues close to the HA0 cleavage site (residue 343 in SEQ ID NO: 1) have been mutated. In certain embodiments, one or more of the amino acid residues on position 337, 340, 352, or 353 of SEQ ID NO: 1, or equivalent positions in other influenza viruses, have been mutated, i.e. are substituted by an amino acid that is not occurring at the corresponding position in the amino acid sequence of the HA of the wild-type influenza virus on which the stem polypeptide is based. Table 6 shows the the naturally occurring amino acid variation.

In certain embodiments, the polypeptides of the invention comprise at least one mutation on position 352 of SEQ ID NO: 1 , or on an equivalent position of other influenza viruses.

In certain embodiments, the polypeptides of the invention comprise at least one mutation on position 353 of SEQ ID NO: 1 , or on an equivalent position of other influenza viruses .

In certain embodiments, the polypeptides of the invention comprise at least one mutation on position 337 of SEQ ID NO: 1 , or on an equivalent position of other influenza viruses.

In certain embodiments, the polypeptides of the invention comprise at least one mutation on position 340 of SEQ ID NO: 1 , or on an equivalent position of other influenza viruses.

In certain embodiments, the mutated amino acid residue on position 337 (HAl domain) is selected from the group consisting of: I, E, K, V, A, and T.

In certain embodiments, the mutated amino acid residue on position 340 (HAl domain) is selected from the group consisting of: I, K, R, T, F, N, S and Y.

In certain embodiments, the mutated amino acid residue on position 352 (HA2 domain) is selected from the group consisting of: D, V, Y, A, I, N, S, and T.

In certain embodiments, the mutated amino acid residue on position 353 (HA2 domain) is selected from the group consisting of: K, R, T, E, G, and V.

In certain embodiments the mutated amino acid introduces a consensus N- glycoslation e.g. N-X-T/S (where X is any naturally ccuring amino acid except P) in the sequence as is for example the case for I340N in SEQ ID NO: 6.

In certain embodiments, the mutated amino acid is an amino acid that does not naturally occur in sequences of the same subtype.

In certain embodiments, the the mutated amino acid residue on position 337 (HAl domain) is K. In certain embodiments, the mutated amino acid residue on position 340 (HAl domain) is K.

In certain embodiments, the mutated amino acid residue on position 352 (HA2 domain) is F.

In certain embodiments, the mutated amino acid residue on position 353 (HA2 domain) is T.

It is again noted that throughout this application the numbering of the amino acids is based on the numbering of amino acids in HI HA0, in particular the numbering of the amino acids of the H1N1 influenza strain A/Brisbane/59/2007 (SEQ ID NO: 1). The skilled person will be able to determine the equivalent (or corresponding) amino acids in HA of other influenza viruses and thus will be able to determine equivalent mutations, see e.g. Table 2 for the sequence alignment of different HI influenza viruses.

According to the invention, the polypeptides further comprise a disulfide bridge between the amino acid on position 324 and the amino acid on position 436. Thus, according to the invention at least one disulfide bridge has been introduced in the stem domain polypeptides, preferably between amino acids of (or the equivalent of) position 324 and 436 in HI A Brisbane/59/2007 (SEQ ID NO: 1). In certain embodiments, the polypeptides thus further comprise the mutation R324C in the HAl domain and T436C in the HA2 domain. Equivalent positions can be easily determined by those skilled in the art by aligning the sequences using a suitable algorithm such as Clustal, Muscle etc.

Engineered disulfide bridges are created by mutating at least one (if the other is already a cysteine), but usually two residues that are spatially close into cysteine, that will spontaneously or by active oxidation form a covalent bond between the sulfur atoms of these residues.

In certain embodiments, the polypeptides further comprise one or more additional mutations in the HAl and/or HA2 domain, as compared to the amino acid sequence of the HA of which the HA 1 and HA2 domains are derived.Thus, the stability of the stem polypeptides is further increased.

Applicants have previously identified broadly neutralizing antibodies isolated from primary human B-cells from vaccinated individuals some of which were specific for group 1 (e.g. CR6261, as described in WO 2008/028946) and some of which were specific for group 2 influenza viruses (e.g. CR8020 as described in WO 2010/130636). Detailed analysis of the epitopes of these monoclonal antibodies has revealed the reason for the lack of cross-reactivity of these specific antibodies. In both cases the presence of glycans in group 1 or group 2 HA molecules on different positions at least partly explained the fact that the antibodies are group-specific. With the identification of CR91 14-like antibodies that cross-react with many group 1 and 2 HA molecules, as described below, it has become clear that it is possible for the human immune system to elicit very broad neutralizing antibodies against influenza viruses. However, given the need for a yearly vaccination scheme these antibodies are apparently not, or only to a very low extent elicited following infection or vaccination with (seasonal) influenza viruses of subtypes HI and/or H3.

According to the present invention polypeptides are provided that mimic the specific epitopes of CR6261 and/or CR9114, and that can be used as immunogenic polypeptides, e.g. to elicit cross-neutralizing antibodies when administered in vivo, either alone, or in combination with other prophylactic and/or therapeutic treatments. With "cross-neutralizing antibodies", antibodies are meant that are capable of neutralizing at least two, preferably at least three, four, or five different subtypes of influenza A viruses of phylogenetic group 1 , and/or at least two, preferably at least three, four, or five different subtypes of influenza A viruses of phylogenetic group 2, and/or at least two, different subtypes of influenza B viruses, in particular at least all virus strains that are neutralized by CR6261 and CR91 14.

The polypeptides of the invention comprise the epitope of the stem-binding influenza neutralizing antibodies CR6261 and/or CR9114. In certain embodiments, the polypeptides thus selectively bind to the antibodies CR6261 and/or CR9114. In certain embodiments, the polypeptides of the invention do not bind to the antibodies CR8020 and/or CR8057. As used in the present invention, CR6261 comprises a heavy chain variable region comprising the amino acid sequence of SEQ ID NO: 9 and a light chain variable region comprising the amino acid sequence of SEQ ID NO: 10; CR9114 comprises a heavy chain variable region comprising the amino acid sequence of SEQ ID NO: 1 1 and a light chain variable region comprising the amino acid sequence of SEQ ID NO: 12. CR8057 comprises a heavy chain variable region comprising the amino acid sequence of SEQ ID NO: 13 and a light chain variable region comprising the amino acid sequence of SEQ ID NO: 14. CR8020 comprises a heavy chain variable region

comprising the amino acid sequence of SEQ ID NO: 17 and a light chain variable region comprising the amino acid sequence of SEQ ID NO: 18.

As described above, the polypeptides comprise an influenza hemagglutinin HA1 domain that comprises an HA1 N- terminal stem segment that is covalently linked by a linking sequence of 0- 50 amino acid residues to the HA1 C- terminal stem segment. The linking sequence, if present, does not occur in naturally occurring, or wild-type, HA. In certain embodiments, the linker is a peptide that comprises one amino acid residue, two or less amino acid residues, three or less amino acid residues, four or less amino acid residues, five or less amino acid residues, ten or less amino acid residues, 15 or less amino acid residues, or 20 or less amino acid residues or 30 or less amino acid residues or 40 or less amino acid residues or 50 or less amino acid residues. In a specific

embodiment, the linking sequence is a sequence selected from the group consisting of G, GS, GGG, GSG, GSA, GSGS, GSAG, GGGG, GSAGS, GSGSG, GSAGSA,

GSAGSAG, and GSGSGSG.

In certain embodiments, the HA1 N-terminal segment is directly linked to the HA1 C-terminal segment, i.e. the polypeptides do not comprise a linking sequence.

Influenza HA in its native form exists as a trimer on the cell or virus membrane. In certain embodiments the intracellular and transmembrane sequence is removed so that a secreted (soluble) polypeptide is produced following expression in cells. Methods to express and purify secreted ectodomains of HA have been described (see e.g. Dopheide et al 2009; Ekiert et al 2009, 2011; Stevens et al 2004, 2006; Wilson et al 1981). A person skilled in the art will understand that these methods can also be applied directly to stem domain polypeptides of the invention in order to achieve expression of secreted (soluble) polypeptide. Therefore these polypeptides are also encompassed in the invention.

In certain embodiments, the polypeptides comprise the full HA2 domain, thus including the transmembrane and intracellular sequences. In other embodiments, the polypeptides of the invention do not comprise the intracellular sequences of HA and the transmembrane domain. In certain embodiments, the polypeptides comprise a truncated

HA2 domain. In certain embodiments, the intracellular and transmembrane sequences, e.g. the amino acid sequence from position (or the equivalent of) 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 526, 528, 529, or 530 of the HA2 domain to the C- terminus of the HA2 domain (numbering according to SEQ ID NO: 1) has been removed to produce a soluble polypeptide following expression in cells.

In certain embodiments, the C-terminal part of the HA2 domain from position 519 to the C-terminal amino acid has been deleted. In further embodiments, the C-terminal part of the HA2 domain from position 530 to the C-terminal amino acid has been deleted.

Optionally, a his-tag sequence (HHHHHH (SEQ ID NO: 15) or HHHHHHH (SEQ ID NO: 16)) may be linked to the (optionally truncated) HA2 domain, for purification purposes, optionally connected through a linker. Optionally the linker may contain a proteolytic cleavage site to enzymatically remove the his-tag after purification.

In certain embodiments, the polypeptides are further stabilized by introducing a sequence known to form trimeric structures, i.e.

GYIPEAPRDGQAYVRKDGEWVLLSTFL (SEQ ID NO: 3) at the C-terminus of HA2, optionally connected through a linker. Thus, in certain embodiments, the C-terminal part of the HA2 domain has been replaced by the amino acid sequence

GYIPEAPRDGQAYVRKDGEWVLLSTFL (SEQ ID NO: 3), optionally connected through a linker. The linker may optionally contain a cleavage site for processing afterwards according to protocols well known to those skilled in the art. To facilitate purification of the soluble form a tag sequence may be added, e.g. a his tag (HHHHHH (SEQ ID NO: 15) or HHHHHHH (SEQ ID NO: 16)) or FLAG tag (DYKDDDDK) (SEQ ID NO: 22) or a combination of these, optionally connected via short linkers. The linker may optionally contain (part of) a proteolytic cleavage site, e.g. IEGR (SEQ ID NO: 24) (Factor X) or LVPRGS (SEQ ID NO: 23) (thrombin) for processing afterwards according to protocols well known to those skilled in the art. The processed proteins are also encompassed in the invention.

In certain embodiments, the C-terminal part of the HA2 domain from position 519-565 has been deleted (numbering according to SEQ ID NO: 1) and replaced by SGRDYKDDDDKLVPRGSPGSGYIPEAPRDGQAYVRKDGEWVLLSTFLGHHHHH

H (SEQ ID NO: 4). In certain embodiments, the C-terminal part of the HA2 domain from position 530-565 has been deleted (numbering according to SEQ ID NO: 1) and replaced by SGRDYKDDDDKLVPRGSPGSGYIPEAPRDGQAYVRKDGEWVLLSTFLGHHHHH

H (SEQ ID NO: 4).

The native HA exists as a trimer on the cell surface. Most of the interactions between the individual monomers that keep the trimer together are located in the head domain while in the stem domain trimerization is mediated by the formation of a trimeric coiled coil motif. After removal of the head the tertiary structure is destabilized and therefore modifications are needed in order to increase protein stability. By strengthening the helical propensity of the helix CD a more stable protein can be created.

In the polypeptides described in the co-pending application PCT/EP2014/060997, the sequence MKQIEDKIEEIESKQ (SEQ ID NO: 5), derived from yeast transcriptional activator protein GCN4 and known to trimerize was introduced in the CD helix at (the equivalent of) position 419-433. This sequence has a high propensity to form helical secondary structures and can enhance in this way overall stability of the polypeptides of the invention.

According to the present invention, it has surprisingly been shown that the stability and multimerization state of the polypeptide is dependent on the exact location and sequence of the GCN4 derived sequence in the primary sequence of the polypeptides of the invention.

Thus, according the invention, the sequence RMKQIEDKIEEIESK (SEQ ID NO: 20) is introduced at position 419-433 (numbering according to SEQ ID NO: 1), or sequence RMKQIEDKIEEIE SKQK (SEQ ID NO: 21) is introduced at position 417-433 .

In certain embodiments, the polypeptides are glycosylated.

In the research that led to the present invention, for example s74H9 (SEQ ID NO:

65), sl27Hl(SEQ ID NO: 66), s71H2 (SEQ ID NO: 71), s86B4 (SEQ ID NO: 67), sl l5Al (SEQ ID NO: 70), s2201C9 (SEQ ID NO: 77), s55G7 (SEQ ID NO: 68), sl l3E7 (SEQ ID NO: 78), s6E12 (SEQ ID NO: 69), sl81H9 (SEQ ID NO: 76), described in the co-pending patent application PCT/EP2014/060997 were modified, using techniques of molecular biology well known to those skilled in the art, to create sequences s74H9-t2 (SEQ ID NO: 93), sl27Hl-t2 (SEQ ID NO: 91), s71H2-t2 (SEQ ID NO: 97), s86B4-t2 (SEQ ID NO: 92), si 15Al-t2 (SEQ ID NO: 96), s220C9-t2 (SEQ ID NO: 99), s55G7-t2 (SEQ ID NO: 95), si 13E7-t2 (SEQ ID NO: 100), s6E12-t2 (SEQ ID NO: 94), sl81H9-t2 (SEQ ID NO: 98) containing sequence RMKQIEDKIEEIESK (SEQ ID NO: 20) at position 419-433.

In a similar manner, polypeptides s74H9-t3 (SEQ ID NO: 123), sl27Hl-t3 (SEQ ID

NO: 121), s71H2-t3 (SEQ ID NO: 127), s86B4-t3 (SEQ ID NO: 122), sll5Al-t3 (SEQ ID NO: 126), s2201C9-t3 (SEQ ID NO: 129 ), s55G7-t3 (SEQ ID NO: 125), sl l3E7-t3 (SEQ

ID NO: 130), s6E12-t3 (SEQ ID NO: 124), sl81H9-t3 (SEQ ID NO: 128) containing sequence RMKQIEDKIEEIESKQK (SEQ ID NO: 21) at position 417-433 were created.

The polypeptides of the present invention show increased binding of the influenza antibodies, in particular CR6261 and/or CR9114, and/or an increased propensity to multimerize and/or an increased stability, as compared to stem polypeptides described earlier

(PCT/EP2012/073706 and PCT/EP2014/060997).

In certain embodiments, the polypeptides comprise the amino acid sequence:

DTICIGYHANNSTDTVDTVLEK VTVTHSVNLLENGGGGKYVCSAKLRMVTGL

RNX ₁PSX ₂QSQGLFGAIAGX ₃X ₄EGGWTGMVDGWYGYHHQNEQGSGYAADQKS TQNAINGITNKVNSVIEKX ₅NTQX ₆T AX ₇GKEX ₈NKX ₉ERRMKQIEDKIEEIESKIWC

YNAELLVLLENERTLDFHD SNVKNLYEKVKS QLKNNAKEIGNGCFEFYHKCNDE

CMESVKNGTYDYPKYSEESKLNREKIDGVSGRDYKDDDDKLVPRGSPGSGYIPE

APRDGQAYVRKDGEWVLLSTFLGHHHHHH (SEQ ID NO: 145),

wherein Xi is an amino acid selected from the group consisting of E, I, K, V, A, and T; X ₂ is an amino acid selected from the group consisting of I, K, R, T, F, N, S and Y;

X ₃ is an amino acid selected from the group consisting of D, F, V, Y, A, I, N, S, and T;

X4 is an amino acid selected from the group consisting of I, K, R, T, E, G and V;

X ₅ is an amino acid selected from the group consisting of M, E, K, V, R, T;

X ₆ is an amino acid selected from the group consisting of F, I, N, S, T, Y, H, and L;

X ₇ is an amino acid selected from the group consisting of A, G, I, R, T, V, F, and S;

Xs is an amino acid selected from the group consisting of F, I, N, S, T, Y, G, E, K, M, and

V; and

X9 is an amino acid selected from the group consisting of H, I, L, N, R, and S.

In certain embodiments, the polypeptides comprise the amino acid sequence: DTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLENGGGGKYVCSAKLRMVTGL RNX ₁PSX ₂QSQGLFGAIAGX ₃X ₄EGGWTGMVDGWYGYHHQNEQGSGYAADQKS TQNAINGITNKVNSVIEKX ₅NTQX6TAX ₇GKEX8NKX ₉ERRMKQIEDKIEEIESKIWC YNAELLVLLENERTLDFHD SNVKNLYEKVKS QLKNNAKEIGNGCFEFYHKCNDE CMESVKNGTYDYPKYSEESKLNREKIDG (SEQ ID NO: 146),

wherein Xi is an amino acid selected from the group consisting of E, I, K, V, A, and T; X ₂ is an amino acid selected from the group consisting of I, K, R, T, F, N, S and Y;

X ₃ is an amino acid selected from the group consisting of D, F, V, Y, A, I, N, S, and T; X4 is an amino acid selected from the group consisting of I, K, R, T, E, G and V;

X ₅ is an amino acid selected from the group consisting of M, E, K, V, R, T;

X ₆ is an amino acid selected from the group consisting of F, I, N, S, T, Y, H, and L;

X ₇ is an amino acid selected from the group consisting of A, G, I, R, T, V, F, and S;

Xs is an amino acid selected from the group consisting of F, I, N, S, T, Y, G, E, K, M, and V; and

X9 is an amino acid selected from the group consisting of H, I, L, N, R, and S.

In certain embodiments, the polypeptides comprise the amino acid sequence:

DTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLENGGGGKYVCSAKLRMVTGL RNX ₁PSX ₂QSQGLFGAIAGX ₃X ₄EGGWTGMVDGWYGYHHQNEQGSGYAADQKS

TQNAINGITNKVNSVIEKX ₅NTQX ₆TAX ₇GKEX ₈NKX ₉ERRMKQIEDKIEEIESKIWC

YNAELLVLLENERTLDFHD SNVKNLYEKVKS QLKNNAKEIGNGCFEFYHKCNDE

CMESVKNGTYDYPKYSEESKLNREKIDGVKLESMGVYQIEG (SEQ ID NO: 147), wherein Xi is an amino acid selected from the group consisting of E, I, K, V, A, and T; X ₂ is an amino acid selected from the group consisting of I, K, R, T, F, N, S and Y;

X ₃ is an amino acid selected from the group consisting of D, F, V, Y, A, I, N, S, and T;

X4 is an amino acid selected from the group consisting of I, K, R, T, E, G and V;

X ₅ is an amino acid selected from the group consisting of, M, E, K, V, R, T _;

X ₆ is an amino acid selected from the group consisting of F, I, N, S, T, Y, H, and L;

X ₇ is an amino acid selected from the group consisting of A, G, I, R, T, V, F, and S;

Xs is an amino acid selected from the group consisting of F, I, N, S, T, Y, G, E, K, M and

V; and

X9 is an amino acid selected from the group consisting of H, I, L, N, R, and S.

In certain embodiments, the polypeptides comprise the amino acid sequence: DTICIGYHANNSTDTVDTVLEKNVTVTHSVNLLENGGGGKYVCSAKLRMVTGL RNX ₁PSX ₂QSQGLFGAIAGX ₃X ₄EGGWTGMVDGWYGYHHQNEQGSGYAADQKS TQNAINGITNKV SVIEKX ₅NTQX ₆TAX ₇GKEX ₈NKX ₉ERRMKQIEDKIEEIESKIWC Y AELLVLLENERTLDFHD SNVK LYEKVKS QLK AKEIGNGCFEFYHKCNDE CMESVKNGTYDYPKYSEESKLNREKIDGVKLESMGWQILAIYSTVASSLVLLVS LGAISFWMCSNGSLQCRICI (SEQ ID NO: 148),

wherein Xi is an amino acid selected from the group consisting of E, I, K, V, A, and T; X ₂ is an amino acid selected from the group consisting of I, K, R, T, F, N, S and Y;

X ₃ is an amino acid selected from the group consisting of D, F, V, Y, A, I, N, S, and T; X4 is an amino acid selected from the group consisting of I, K, R, T, E, G and V;

X ₅ is an amino acid selected from the group consisting of M, E, K, V, R, T;

X ₆ is an amino acid selected from the group consisting of F, I, N, S, T, Y, H, and L;

X ₇ is an amino acid selected from the group consisting of A, G, I, R, T, V, F, and S;

X8 is an amino acid selected from the group consisting of F, I, N, S, T, Y, G, E, K, M and V; and

X9 is an amino acid selected from the group consisting of H, I, L, N, R, and S.

In certain embodiments, Xi is K, X ₂ is K, X ₃ is F, X4 is T, X5 is M, X ₆ is Y, X ₇ is

I, X ₈ is Y, and X ₉ is S in SEQ ID NO: 145-148.

The influenza hemagglutinin stem domain polypeptides can be prepared

according to any technique deemed suitable to one of skill, including techniques described below.

Thus, the immunogenic polypeptides of the invention may be synthesized as DNA sequences by standard methods known in the art and cloned and subsequently expressed, in vitro or in vivo, using suitable restriction enzymes and methods known in the art. The present invention thus also relates to nucleic acid molecules encoding the above described polypeptides. The invention further relates to vectors comprising the nucleic acids encoding the polypeptides of the invention. In certain embodiments, a nucleic acid molecule according to the invention is part of a vector, e.g. a plasmid. Such vectors can easily be manipulated by methods well known to the person skilled in the art, and can for instance be designed for being capable of replication in prokaryotic and/or eukaryotic cells. In addition, many vectors can directly or in the form of an isolated desired fragment there from be used for transformation of eukaryotic cells and will integrate in whole or in part into the genome of such cells, resulting in stable host cells comprising the desired nucleic acid in their genome. The vector used can be any vector that is suitable for cloning DNA and that can be used for transcription of a nucleic acid of interest. When host cells are used it is preferred that the vector is an integrating vector. Alternatively, the vector may be an episomally replicating vector.

The person skilled in the art is capable of choosing suitable expression vectors, and inserting the nucleic acid sequences of the invention in a functional manner. To obtain expression of nucleic acid sequences encoding polypeptides, it is well known to those skilled in the art that sequences capable of driving expression can be functionally linked to the nucleic acid sequences encoding the polypeptide, resulting in recombinant nucleic acid molecules encoding a protein or polypeptide in expressible format. In general, the promoter sequence is placed upstream of the sequences that should be expressed. Many expression vectors are available in the art, e.g. the pcDNA andpEF vector series of Invitrogen, pMSCV and pTK-Hyg from BD Sciences, pCMV-Script from Stratagene, etc, which can be used to obtain suitable promoters and/or transcription terminator sequences, polyA sequences, and the like. Where the sequence encoding the polypeptide of interest is properly inserted with reference to sequences governing the transcription and translation of the encoded polypeptide, the resulting expression cassette is useful to produce the polypeptide of interest, referred to as expression. Sequences driving expression may include promoters, enhancers and the like, and combinations thereof. These should be capable of functioning in the host cell, thereby driving expression of the nucleic acid sequences that are functionally linked to them. The person skilled in the art is aware that various promoters can be used to obtain expression of a gene in host cells. Promoters can be constitutive or regulated, and can be obtained from various sources, including viruses, prokaryotic, or eukaryotic sources, or artificially designed. Expression of nucleic acids of interest may be from the natural promoter or derivative thereof or from an entirely heterologous promoter (Kaufman, 2000). Some well-known and much used promoters for expression in eukaryotic cells comprise promoters derived from viruses, such as adenovirus, e.g. the El A promoter, promoters derived from cytomegalovirus (CMV), such as the CMV immediate early (IE) promoter (referred to herein as the CMV promoter) (obtainable for instance from pcDNA, Invitrogen), promoters derived from Simian Virus 40 (SV40) (Das et al, 1985), and the like. Suitable promoters can also be derived from eukaryotic cells, such as methallothionein (MT) promoters, elongation factor l a (EF-la) promoter (Gill et al., 2001), ubiquitin C or UB6 promoter (Gill et al., 2001), actin promoter, an immunoglobulin promoter, heat shock promoters, and the like. Testing for promoter function and strength of a promoter is a matter of routine for a person skilled in the art, and in general may for instance encompass cloning a test gene such as lacZ, luciferase, GFP, etc. behind the promoter sequence, and test for expression of the test gene. Of course, promoters may be altered by deletion, addition, mutation of sequences therein, and tested for functionality, to find new, attenuated, or improved promoter sequences. According to the present invention, strong promoters that give high transcription levels in the eukaryotic cells of choice are preferred.

The constructs may be transfected into eukaryotic cells (e.g. plant, fungal, yeast or animal cells) or suitable prokaryotic expression systems like E. coli using methods that are well known to persons skilled in the art. In some cases a suitable 'tag' sequence (such as for example, but not limited to, a his-, myc-, strep-, or flag-tag) or complete protein (such as for example, but not limited to, maltose binding protein or glutathione S transferase) may be added to the sequences of the invention to allow for purification and/or identification of the polypeptides from the cells or supernatant. Optionally a sequence containing a specific proteolytic site can be included to afterwards remove the tag by proteolytic digestion.

Purified polypeptides can be analyzed by spectroscopic methods known in the art (e.g. circular dichroism spectroscopy, Fourier Transform Infrared spectroscopy and MR spectroscopy or X-ray crystallography) to investigate the presence of desired structures like helices and beta sheets. ELISA, Octet and FACS and the like can be used to investigate binding of the polypeptides of the invention to the broadly neutralizing antibodies described before (CR6261 , CR9114, CR8057). Thus, polypeptides according to the invention having the correct conformation can be selected.

The invention further relates to immunogenic compositions comprising a therapeutically effective amount of at least one of the polypeptides and/or nucleic acids of the invention. The immunogenic compositions preferably further comprise a

pharmaceutically acceptable carrier. In the present context, the term "pharmaceutically acceptable" means that the carrier, at the dosages and concentrations employed, will not cause unwanted or harmful effects in the subjects to which they are administered. Such pharmaceutically acceptable carriers and excipients are well known in the art (see Remington's Pharmaceutical Sciences, 18th edition, A. R. Gennaro, Ed., Mack Publishing Company [1990]; Pharmaceutical Formulation Development of Peptides and Proteins, S. Frokjaer and L. Hovgaard, Eds., Taylor & Francis [2000]; and Handbook of Pharmaceutical Excipients, 3rd edition, A. Kibbe, Ed., Pharmaceutical Press [2000]). The term "carrier" refers to a diluent, adjuvant, excipient, or vehicle with which the composition is

administered. Saline solutions and aqueous dextrose and glycerol solutions can e.g. be employed as liquid carriers, particularly for injectable solutions. The exact formulation should suit the mode of administration. The polypeptides and/or nucleic acid molecules preferably are formulated and administered as a sterile solution. Sterile solutions are prepared by sterile filtration or by other methods known per se in the art. The solutions can then be lyophilized or filled into pharmaceutical dosage containers. The pH of the solution generally is in the range of pH 3.0 to 9.5, e.g. pH 5.0 to 7.5.

The invention also relates to influenza HA stem domain polypeptides, nucleic acid molecules and/or vectors as described above for use in inducing an immune response against influenza HA protein. The invention also relates to methods for inducing an immune response in a subject, the method comprising administering to a subject, a polypeptide, nucleic acid molecule and/or immunogenic composition as described above. A subject according to the invention preferably is a mammal that is capable of being infected with an infectious disease-causing agent, in particular an influenza virus, or otherwise can benefit from the induction of an immune response, such subject for instance being a rodent, e.g. a mouse, a ferret, or a domestic or farm animal, or a non- human-primate, or a human. Preferably, the subject is a human subject. The invention thus provides methods for inducing an immune response to an influenza virus

hemagglutinin (HA), in particular of a group 1 and/or group 2 influenza A virus, such as an influenza virus comprising HA of the HI, H2, H3, H4, H5, H7 and/or H10 subtype, and/or of an influenza B virus, in a subject utilizing the polypeptides, nucleic acids and/or immunogenic compositions described herein. In some embodiments, the invention provides methods for inducing an immune response to an influenza virus comprising HA of the HI subtype, in a subject utilizing the polypeptides, nucleic acids and/or

immunogenic compositions described herein. In some embodiments, the immune response induced is effective to prevent and/or treat an influenza virus infection caused by a group 1 and/or group 2 influenza A virus subtypes and/or influenza B viruses. In some embodiments, the immune response induced by the polypeptides, nucleic acids and/or immunogenic compositions described herein is effective to prevent and/or treat an influenza A and/or B virus infection caused by two, three, four, five or six subtypes of influenza A and/or B viruses. In some embodiments, the immune response induced is effective to prevent and/or treat an influenza virus infection caused by an influenza virus comprising HA of the HI subtype.

Since it is well known that small proteins and/or nucleic acid molecules do not always efficiently induce a potent immune response it may be necessary to increase the immunogenicity of the polypeptides and/or nucleic acid molecules by adding an adjuvant. In certain embodiments, the immunogenic compositions described herein comprise, or are administered in combination with, an adjuvant. The adjuvant for administration in combination with a composition described herein may be administered before,

concomitantly with, or after administration of said composition. Examples of suitable adjuvants include aluminium salts such as aluminium hydroxide and/or aluminium phosphate; oil-emulsion compositions (or oil-in-water compositions), including squalene- water emulsions, such as MF59 (see e.g. WO 90/14837); saponin formulations, such as for example QS21 and Immunostimulating Complexes (ISCOMS) (see e.g. US 5,057,540; WO 90/03184, WO 96/11711 , WO 2004/004762, WO 2005/002620); bacterial or microbial derivatives, examples of which are monophosphoryl lipid A (MPL), 3-O-deacylated MPL (3dMPL), CpG-motif containing oligonucleotides, ADP-ribosylating bacterial toxins or mutants thereof, such as E. coli heat labile enterotoxin LT, cholera toxin CT, pertussis toxin PT, or tetanus toxoid TT, Matrix M (Isconova). In addition, known immunopotentiating technologies may be used, such as fusing the polypeptides of the invention to proteins known in the art to enhance immune response (e.g. tetanus toxoid, CRM197, rCTB, bacterial flagellins or others) or including the polypeptides in virosomes, or combinations thereof. Other non-limiting examples that can be used are e.g. disclosed by Coffman et al. (2010).

In an embodiment, the influenza hemagglutinin stem domain polypeptides of the invention are incorporated into viral-like particle (VLP) vectors. VLPs generally comprise a viral polypeptide(s) typically derived from a structural protein(s) of a virus. Preferably, the VLPs are not capable of replicating. In certain embodiments, the VLPs may lack the complete genome of a virus or comprise a portion of the genome of a virus. In some embodiments, the VLPs are not capable of infecting a cell. In some embodiments, the VLPs express on their surface one or more of viral (e.g., virus surface glycoprotein) or non-viral (e.g., antibody or protein) targeting moieties known to one skilled in the art.

In a specific embodiment, the polypeptide of the invention is incorporated into a virosome. A virosome containing a polypeptide according to the invention may be produced using techniques known to those skilled in the art. For example, a virosome may be produced by disrupting a purified virus, extracting the genome, and reassembling particles with the viral proteins (e.g., an influenza hemagglutinin stem domain polypeptide) and lipids to form lipid particles containing viral proteins.

The invention also relates to the above-described polypeptides, nucleic acids and/or immunogenic compositions for inducing an immune response in a subject against influenza HA, in particular for use as a vaccine. The influenza hemagglutinin stem domain polypeptides, nucleic acids encoding such polypeptides, or vectors comprising such nucleic acids or polypeptides described herein thus may be used to elicit neutralizing antibodies against influenza viruses, for example, against the stem region of influenza virus hemagglutinin. The invention in particular relates to polypeptides, nucleic acids, and/or imunogenic compositions as described above for use as a vaccine in the prevention and/or treatment of a disease or condition caused by an influenza A virus of phylogenetic group 1 and/or phylogenetic group 2 and/or an influenza B virus. In an embodiment, the vaccine may be used in the prevention and/or treatment of diseases caused by two, three, four, five, six or more different subtypes of phylogenetic group 1 and/or 2 and/or influenza B viruses. In an embodiment, the vaccine may be used in the prevention and/or treatment of influenza infection caused by an influenza virus comprising HA of the HI subtype.

The polypeptides of the invention may be used after synthesis in vitro or in a suitable cellular expression system, including bacterial and eukaryotic cells, or

alternatively, may be expressed in vivo in a subject in need thereof, by expressing a nucleic acid coding for the immunogenic polypeptide. Such nucleic acid vaccines may take any form, including naked DNA, plasmids, or viral vectors including adenoviral vectors.

Administration of the polypeptides, nucleic acid molecules, and/or immunogenic compositions according to the invention can be performed using standard routes of administration. Non-limiting examples include parenteral administration, such as intravenous, intradermal, transdermal, intramuscular, subcutaneous, etc, or mucosal administration, e.g. intranasal, oral, and the like. The skilled person will be capable to determine the various possibilities to administer the polypeptides, nucleic acid molecules, and/or immunogenic compositions according to the invention, in order to induce an immune response. In certain embodiments, the polypeptide, nucleic acid molecule, and/or immunogenic composition (or vaccine) is administered more than one time, i.e. in a so- called homologous prime -boost regimen. In certain embodiments where the polypeptide, nucleic acid molecule, and/or immunogenic composition is administered more than once, the administration of the second dose can be performed after a time interval of, for example, one week or more after the administration of the first dose, two weeks or more after the administration of the first dose , three weeks or more after the administration of the first dose, one month or more after the administration of the first dose, six weeks or more after the administration of the first dose, two months or more after the administration of the first dose, 3 months or more after the administration of the first dose, 4 months or more after the administration of the first dose, etc, up to several years after the administration of the first dose of the polypeptide, nucleic acid molecule, and/or immunogenic composition. It is also possible to administer the vaccine more than twice, e.g. three times, four times, etc, so that the first priming administration is followed by more than one boosting administration. In other embodiments, the polypeptide, nucleic acid molecule, and/or immunogenic composition according to the invention is administered only once.

The polypeptides, nucleic acid molecules, and/or immunogenic compositions may also be administered, either as prime, or as boost, in a heterologous prime -boost regimen.

The invention further provides methods for preventing and/or treating an influenza virus disease in a subject utilizing the polypeptides, nucleic acids and/or compositions described herein. In a specific embodiment, a method for preventing and/or treating an influenza virus disease in a subject comprises administering to a subject in need thereof an effective amount of a polypeptide, nucleic acid and/or immunogenic composition, as described above. A therapeutically effective amount refers to an amount of the polypeptide, nucleic acid, and/or composition as defined herein, that is effective for preventing, ameliorating and/or treating a disease or condition resulting from infection by a group 1 or 2 influenza A virus, and/or an influenza B virus, preferably a disease resulting from infection by an influenza A virus comprising HA of the HI subtype. Prevention

encompasses inhibiting or reducing the spread of influenza virus or inhibiting or reducing the onset, development or progression of one or more of the symptoms associated with infection by an influenza virus. Ameloriation as used in herein may refer to the reduction of visible or perceptible disease symptoms, viremia, or any other measurable manifestation of influenza infection.

Those in need of treatment include those already inflicted with a condition resulting from infection with a group 1 or a group 2 influenza A virus, or an influenza B virus, as well as those in which infection with influenza virus is to be prevented. The polypeptides, nucleic acids and/or compositions of the invention thus may be administered to a naive subject, i.e., a subject that does not have a disease caused by influenza virus infection or has not been and is not currently infected with an influenza virus infection, or to subjects that already are and/or have been infected with an influenza virus.

In an embodiment, prevention and/or treatment may be targeted at patient groups that are susceptible to influenza virus infection. Such patient groups include, but are not limited to e.g., the elderly (e.g. > 50 years old, > 60 years old, and preferably > 65 years old), the young (e.g. < 5 years old, < 1 year old), hospitalized patients and patients who have been treated with an antiviral compound but have shown an inadequate antiviral response.

In another embodiment, the polypeptides, nucleic acids and/or immunogenic compositions may be administered to a subject in combination with one or more other active agents, such as existing, or future influenza vaccines, monoclonal antibodies and/or antiviral agents, and/or antibacterial, and/or immunomodulatory agents. The one or more other active agents may be beneficial in the treatment and/or prevention of an influenza virus disease or may ameliorate a symptom or condition associated with an influenza virus disease. In some embodiments, the one or more other active agents are pain relievers, anti- fever medications, or therapies that alleviate or assist with breathing.

Dosage regimens of the polypeptides and/or nucleic acid molecules of the invention can be adjusted to provide the optimum desired response (e.g., a therapeutic response). A suitable dosage range may for instance be 0.1 -100 mg/kg body weight, preferably 1 -50 mg/kg body weight, preferably 0.5-15 mg/kg body weight. The precise dosage of the polypeptides and/or nucleic acid molecules to be employed will e.g. depend on the route of administration, and the seriousness of the infection or disease caused by it, and should be decided according to the judgment of the practitioner and each subject's circumstances. For example, effective doses vary depending target site, physiological state of the patient

(including age, body weight, health), and whether treatment is prophylactic or therapeutic. Usually, the patient is a human but non-human mammals including transgenic mammals can also be treated. Treatment dosages are optimally titrated to optimize safety and efficacy.

The polypeptides of the invention may also be used to verify binding of monoclonal antibodies identified as potential therapeutic candidates. In addition, the polypeptides of the invention may be used as diagnostic tool, for example to test the immune status of an individual by establishing whether there are antibodies in the serum of such individual capable of binding to the polypeptide of the invention. The invention thus also relates to an in vitro diagnostic method for detecting the presence of an influenza infection in a patient said method comprising the steps of a) contacting a biological sample obtained from said patient with a polypeptide according to the invention; and b) detecting the presence of antibody-antigen complexes.

The polypeptides of the invention may also be used to identify new binding molecules or improve existing binding molecules, such as monoclonal antibodies and antiviral agents.

The invention is further illustrated in the following examples and figures. The examples are not intended to limit the scope of the invention in any way. EXAMPLES

Example 1 : Stem based polypeptides as described in PCT/EP2014/060997

PCT/EP2012/073706 discloses influenza hemagglutinin stem domain

polypeptides, compositions and vaccines and methods of their use in the field of prevention and/or treatment of influenza. PCT/EP2014/060997 discloses additional sequences of stem domain polypeptides derived from the full length HA of H1 1 A/Brisbane/59/2007 (SEQ ID NO: 1), which were obtained by site-directed mutation of Hl-mini2-clusterl+5+6-GCN4 (SEQ ID NO: 2) and which also stably presented the broadly neutralizing epitope of CR6261 (Throsby et al, 2009; Ekiert et al 2010) and/or CR91 14.

Hl-mini2-clusterl+5+6-GCN4 (SEQ ID NO: 2) was derived from the full length HA of H1N1 A/Brisbane/59/2007 (SEQ ID NO: 1) by taking the following steps:

1. Removal of the cleavage site in HA0. Cleavage of wild type HA at this site results in HA1 and HA2. The removal can be achieved by mutation of R to Q at the P 1 position (see e.g. Sun et al, 2010 for an explanation of the nomenclature of the cleavage site (position 343 in SEQ ID NO: 1).

2. Removal of the head domain by deleting amino acids 53 to 320 from SEQ ID NO; 1.

The remaining N- and C-terminal parts of the sequence were joined by a four residue flexible linker, GGGG.

3. Increasing the solubility of the loop (between the A-helix and the CD helix) formed by (the equivalent of) residues 402 to 418 in HI A/Brisbane/59/2007 (SEQ ID NO: 1 ) in order to both increase the stability of the pre-fusion conformation and to destabilize the post-fusion conformation of the modified HA. In Hl -mini2- clusterl+5+6-GCN4 (SEQ ID NO: 2) mutations F406S, V409T, F413G and L416S

(numbering refers to SEQ ID NO: 1) were introduced

4. Introducing a disulfide bridge between amino acids at position 324 and 436 in HI A/Brisbane/59/2007; this is achieved by introducing mutations R324C and Y436C. (numbering refers to SEQ ID NO: 1)

5. Introducing the GCN4 derived sequence MKQIEDKIEEIESKQ (SEQ ID NO: 5), that is known to trimerize, at position 419-433 (numbering refers to SEQ ID NO: 1). In certain embodiments, the sequence of the transmembrane and intracellular domain was deleted from position (or the equivalent thereof, as determined from sequence alignment) 514, 515, 516, 517, 518, 519, 520, 521 , 522, 523, 524, 525, 526, 526, 527, 528, 529, or 530 of HA2 to the C-terminus of HA2 (numbering according to SEQ ID NO: 1) so that a secreted (soluble) polypeptide was produced following expression in cells. The soluble polypeptide was further stabilized by introducing a sequence known to form trimeric structures, i.e. the foldon sequence GYIPEAPRDGQAYVRKDGEWVLLSTFL (SEQ ID NO: 3), optionally connected through a short linker, as described above. The linker may optionally contain a cleavage site for processing afterwards according to protocols well known to those skilled in the art. To facilitate purification and detection of the soluble form a tag sequence may be optionally added, e.g. a histidine tag (HHHHHH (SEQ ID NO: 15) or HHHHHHH (SEQ ID NO: 16) or a FLAG tag (DYKDDDDK; SEQ ID NO: 22) or combination of these, optionally connected via short linkers. The linker may optionally contain (part of) a proteolytic cleavage site, e.g. LVPRGS (SEQ ID NO: 23) (thrombin) or IEGR (SEQ ID NO: 24) (Factor X) for processing afterwards according to protocols well known to those skilled in the art. The processed proteins are also encompassed in the invention.

An example of such a C-teminal sequence combining FLAG-tag, thrombin cleavage site, foldon, and His sequences is SEQ ID NO: 4 FLAG- thrombin-foldon-His. This sequence was combined with a soluble form of Hl-mini2-clusterl+5+6-GCN4 (SEQ ID NO: 2) sequence to create the parental sequence (SEQ ID NO: 6) that was used to create novel polypeptides of the invention by mutagenesis. This sequence does not contain the leader sequence corresponding to amino acids 1-17 of SEQ ID NO: 1 and 2.

The stem domain polypeptides thus were created by deleting the part of the hemagglutinin sequence that encodes the head domain of the molecule and reconnecting the N- and C-terminal parts of the sequence on either side of the deletion through a linker as described in PCT/2012/073706 and above. The removal of the head domain leaves part of the molecule that was previously shielded from the aqueous solvent exposed, potentially destabilizing the structure of the polypeptides of the invention. For this reason residues in the B-loop (in particular amino acid residue 406 (F and S in SEQ ID NO: 1 and 2, respectively), 409 (V and T) 413 (F and G) and 416 (L and S) were mutated in various combinations using parental sequence SEQ ID NO: 6 as the starting point. SEQ ID NO: 6 was created from Hl -mini2-clusterl+5+6-GCN4 (SEQ ID NO: 2) by removing the leader sequence, and replacing residues 520-565 with a Flag-thrombin-foldon~his sequence (SEQ ID NO: 4).

Similarly, in the area around the fusion peptide a number of hydrophobic residues are exposed to the solvent, caused by the fact that, unlike the native full length HA, the polypeptides cannot be cleaved and undergo the associated conformational change that buries the hydrophobic fusion peptide in the interior of the protein. To address this issue some or all of the residues 1337, 1340, F352 and 1353 in SEQ ID NO: 2 were also mutated.

This way, the soluble forms of HA stem polypeptides 74H9 (SEQ ID NO: 57), 127H1(SEQ ID N0: 55), 71H2 (SEQ ID NO: 61), 86B4 (SEQ ID NO: 56), 115A1 (SEQ ID NO: 60), 2201C9 (SEQ ID NO: 63), 55G7 (SEQ ID NO: 59), 113E7 (SEQ ID NO: 64), 6E12 (SEQ ID NO: 58), 181H9 (SEQ ID NO: 62) were created.

DNA sequences encoding the polypeptides described above were transformed into

Pichia pastoris or transfected into HEK293F cells using protocols well known to persons skilled in the art. Constructs used for expression in mammalian cells contained the HA leader sequence (residue 1-17 in SEQ ID NO: 1 and 2), whereas in constructs used for expression in P. pastoris the HA leader sequence was replaced with the yeast alpha factor leader sequence (SEQ ID NO: 7). In this way expressed protein are directed towards the cell culture medium thus allowing binding and expression to be determined without further purification of the polypeptides of the invention. All sequences contained the FLAG-foldon-HIS C-terminal sequence (SEQ ID NO: 4).

Monoclonal antibody binding (CR6261 , CR91 14, CR8020) to the polypeptides was determined by ELISA. To this end ELISA plates were treated overnight with a 2 μg/ml monoclonal antibody solution (20 μΐ/well) at 4 °C. After removal of the antibody solution the remaining surface was blocked with 4% solution of non-fat dry milk powder in PBS for a minimum of 1 h at room temperature. After washing of the plates, 20 μΐ of cell culture medium (neat or diluted) was added to each well and incubated for at least 1 h at room temperature. ELISA plates were then washed and 20 μΐ of anti-FLAG-HRP antibody solution (Sigma A8952, 2000 times diluted in 4% non-fat dry milk in PBS- Tween) was added. After incubation (lh at room temperature) plates were washed once more, and 20 μΐ luminescent substrate (Thermoscientific C#34078) was added to develop the signal. Alternatively, a colorimetric detection method can be used to develop the signal.

Expression of polypeptides of the invention was determined from a homogeneous time -resolved fluorescence assay (for a general description see e.g. Degorce et al., Curr. Chem. Genomics 2009 3: 22-32). To this end a mixture of Terbium (Tb) labeled anti- FLAG monoclonal antibody (donor) and Alexa488 labeled anti-His monoclonal antibody (acceptor) (HTRF solution) was prepared by adding 210.5 μΐ Anti-FLAG-TB (stock solution 26 g/ml) and 1.68 ml of anti-HIS-488 (stock solution 50 g/ml) to 80 ml of a 1 to 1 mixture of culture medium and 50 mM HEPES + 0.1% BSA. 19 μΐ of HTRF solution was added to each well of an ELISA plate and 1 μΐ of culture medium was added. Upon excitation and after a delay to allow interfering short-lived background signals arising from other compounds (proteins, media components etc) to decay, the ratio of fluorescence emission at 520 and 665 nm was determined. This is a measure of total protein content in the sample and is used to normalize the mAb binding signals between different experiments.

The polypeptides listed in Table 3 and 4 were expressed in P.Pastoris following protocols well known to those skilled in the art. Culture medium was collected and binding to CR6261 binding of and expression of the stem domain polypeptides was determined as described above. Since the response in the binding assay scales with the concentration of expresses protein, ELISA binding signal was normalized for protein expression by comparing the ratio of binding signal over the signal in the HTRF assay for each expressed sequence. All expressed polypeptides exhibit higher ratio's of CR6261 binding to HTRF signal compared to the parental sequence of SEQ ID NO: 6.

In addition, the ratio of CR6261 binding to HTRF signals was calculated and compared to the ratio calculated for the parental sequence SEQ ID NO: 6. The results are listed in column 5 of table 3 and 4; all expressed proteins exhibit higher ratios, indicating that the stem polypeptides described above show increased binding of CR6261. Example 2 : Design and characterization of polypeptides of the invention

The polypeptides of the present invention contain sequence RMKQIEDKIEEIESK (SEQ ID NO: 20) or RMKQIEDKIEEIESKQK (SEQ ID NO: 21) derived from yeast transcriptional activator protein GCN4, in the CD helix. This sequence has a high propensity to form helical secondary structures and can enhance in this way overall stability of the polypeptide of the invention. According to the present invention, it has surprisingly been found that stability and aggregation state of the polypeptides of the invention is dependent on the exact location and sequence of the GCN4 derived sequence in the primary sequence of the polypeptides of the invention.

Thus, here we describe a novel set of polypeptides of the invention where sequence

RMKQIEDKIEEIESK (SEQ ID NO: 20) is introduced at position 419-433 (numbering according to SEQ ID NO: 1; for example SEQ ID NO. 81 tol 10) or sequence

RMKQIEDKIEEIESKQK (SEQ ID NO: 21) is introduced at position 417-433 (for example SEQ ID NO 111 to 140).

To this end, the polypeptides described in Example 1, i.e 74H9 (SEQ ID NO: 57),

127H1 (SEQ ID N0: 55), 71H2 (SEQ ID NO: 61), 86B4 (SEQ ID NO: 56), 115A1 (SEQ ID NO: 60), 2201C9 (SEQ ID NO: 63), 55G7 (SEQ ID NO: 59), 113E7 (SEQ ID NO: 64), 6E12 (SEQ ID NO: 58), 181H9 (SEQ ID NO: 62) were modified, using techniques of molecular biology well known to those skilled in the art, to create sequences 74H9-t2 (SEQ ID NO: 83), 127Hl-t2 (SEQ ID NO: 81), 71H2-t2 (SEQ ID NO: 87), 86B4-t2 (SEQ ID NO: 82), 115Al-t2 (SEQ ID NO: 86), 220C9-t2 (SEQ ID NO: 89), 55G7-t2 (SEQ ID NO: 85), 113E7-t2 (SEQ ID NO: 90), 6E12-t2 (SEQ ID NO: 84), 181H9-t2 (SEQ ID NO: 88) containing sequence RMKQIEDKIEEIESK (SEQ ID NO: 20) at position 419-433.

In a similar manner sequences 74H9-t3 (SEQ ID NO: 1 13), 127Hl-t3 (SEQ ID NO: 111), 71H2-t3 (SEQ ID NO: 117), 86B4-t3 (SEQ ID NO: 112), 115Al-t3 (SEQ ID NO: 116), 2201C9-t3 (SEQ ID NO: 119), 55G7-t3 (SEQ ID NO: 115), 113E7-t3 (SEQ ID NO: 120), 6E12-t3 (SEQ ID NO: 114), 181H9-t3 (SEQ ID NO: 118) containing sequence RMKQIEDKIEEIESKQK (SEQ ID NO: 21) at position 417-433 were created.

Polypeptides of the invention can be created on the basis of the sequence of HA molecules from different viral strains. SEQ ID NO: 149-155 for example describe polypeptides of the invention based on the HA sequence of the H1N1 A/California/07/09 strain. As described before, soluble polypeptides of the invention can be created by removing the C-terminal part of the HA based sequences for example from residue 519, 520, 521, 522, 523, 524, 525, 526, 527, 526, 528, 529, or 530 of the HA2 domain to the C- terminus of the HA2 domain (numbering according to SEQ ID NO: 1).

The polypeptidescan further be stabilized by introducing a sequence known to form trimeric structures, i.e GYIPEAPRDGQAYVRKDGEWVLLSTFL (SEQ ID NO: 3), optionally connected through a linker. The linker may optionally contain a cleavage site for processing afterwards according to protocols well known to those skilled in the art. To facilitate purification of the soluble form a tag sequence may be added, e.g. a his tag

(HHHHHHH (SEQ ID NO: 16) or HHHHHH (SEQ ID NO: 15)) or FLAG tag

(DYKDDDDK) (SEQ ID NO: 22) or a combination of these, optionally connected via short linkers. The linker may optionally contain (part of) a proteolytic cleavage site, e.g. IEGR (SEQ ID NO: 24) (Factor X) or LVPRGS (SEQ ID NO: 23) (thrombin) for processing afterwards according to protocols well known to those skilled in the art. The processed proteins are also encompassed in the invention.

Soluble forms of the polypeptides of SEQ ID NO 55-64 and 81 -90 were created by replacement of the equivalent of residue 519-565 (numbering refers to SEQ ID NO: 1) with sequence RSLVPRGSPGHHHHHH, containing both a modified thrombin cleavage site and a 6 histidine tag (SEQ ID NO: 15) and were expressed in HEK293F cells following protocols well known to those skilled in the art.

For reasons of comparison, soluble forms of Hl -mini2-clusterl+5+6-GCN4t2 (SEQ ID NO:52) and Hl-mini2-clusterl+5+6-GCN4t3 (SEQ ID NO: 53). Culture medium was collected and binding to CR6261, CR9114 was detected by a sandwich ELISA, using coated mAb CR6261 or CR9114 to capture the polypeptide of the invention directly from the culture medium and a Horse Radish Peroxidase (HRP) conjugated antibody directed against the C- terminal his-tag for detection purposes. Alternatively, biotinylated CR9114 in combination with HRP-conjugated streptavidin was used for detection of CR9114 captured polypeptides of the invention in a sandwich ELISA. This format allows the detection of the presence of multimeric forms of polypeptides of the invention. All polypeptides of the invention tested were capable of binding to CR9114 (Fig 2 A and B, Fig 3 A and B and Figure 4A and B) and CR6261 (Fig 2 C and D, Fig 3 C and D, Fig 4 C and D) as determined by ELISA. Increased levels of multimerization as detected by the CR9114 capture - biotinylated CR9114 detection sandwich ELISA were observed for s55G7-t2 (SEQ ID NO: 95), s86B4-t2 (SEQ ID NO: 92), sl l5Al-t2 (SEQ ID NO: 96), sl27Hl -t2 (SEQ ID NO: 91), sl l3E7-t2 (SEQ ID NO: 100), s220C9-t2 (SEQ ID NO: 99), s71H2-t3 (SEQ ID NO: 127), sl27Hl-t3 (SEQ ID NO: 121), s74H9-t3 (SEQ ID NO: 123) as shown in Fig 2 E and F, Fig 3 E and F and Fig 4 E and F.

In order to obtain a highly pure preparations of polypeptides of the invention for further characterization, HEK293F cells were transfected with expression vector pcDNA2004 containing the genes encoding soluble forms of 127Hl-t2 (SEQ ID NO: 81), 86B4-t2 (SEQ ID NO: 82) and 55G7-t2 (SEQ ID NO: 85). It will be understood by the skilled person that the leader sequence (or signal sequence) that directs transport of a protein during production (corresponding to amino acids 1-17 of SEQ ID NO: 1) will not be present in the secreted final polypeptide.

To produce the polypeptides of the invention 1.0* 10 ⁶ vc/mL were seeded by spinning down HEK293F cells (Invitrogen) at 300 g for 5 min and resuspending in 300 mL pre -warmed Freestyle™ medium per SF1000 flask. This culture was incubated for 1 hour at 37 °C, 10% C02 at 110 rpm in a multitron incubator. After 1 hour the plasmid DNA was pipetted in 9.9 mL Optimem medium to a concentration of 1.0 μg/mL in the 300 mL culture volume. In parallel 440 293fectin® was pipetted in 9.9 mL Optimem medium and incubated for 5 minutes at room temperature. After 5 minutes the plasmid DNA/Optimem mix was added to the 293fectin®/Optimem mix and incubated at room temperature for 20 minutes. After the incubation the plasmid DNA/293fectin® mix was added drop wise to the cell suspension. The transfected cultured was incubated at 37 °C, 10% C02 and 110 rpm in a multitron incubator. At day 7 cells were separated from the culture medium by centrifugation (30 minutes at 3000 g), while the supernatant containing the soluble polypeptides of the invention was filtrated over a 0.2 μηι bottle top filter for further processing.

For purification purposes 1500 ml (sl27Hl_t2), 1800 ml (s86B4_t2), and 2400 ml

(s55G7_t2) of culture supernatant was applied to a 24 ml Ni Sepharose HP column, pre- equilibrated in wash buffer (20 mM T IS, 500 mM NaCl, pH 7.8). Following a washing step with 10 mM Imidazole in wash buffer the bound polypeptides of the invention were eluted with a step-wise gradient of 300 mM imidazole in wash buffer. The elution peaks were collected, concentrated, and applied to a size exclusion column for further purification

(Superdex 200). Elution profiles are shown in Figure 5. For 55G7-t2 and 127Hl-t2 fractions were collected, pooled as indicated on the figure and analyzed by SDS-PAGE (Figure 6), ELISA and analytical size exclusion chromatography combined with multi-angle light scattering to estimate molecular mass (SEC-MALS). ELISA results confirmed binding of the polypeptides of the invention to CR6261 and CR9114, but not CR8020. SEC-MALS results are summarized in Table 8.

Figure 5 and Table 8 indicate that polypeptide of the invention sl27Hl -t2 has a higher yield (-30 mg protein/1 culture supernatant) compared to s55G7-t2 and s86B4-t2. The majority of the protein exhibits a molecular weight of 62 kDa, which is in between what is expected for a monomer or a dimer. To confirm the aggregation state of the protein the SEC- MALS experiment was repeated in the presence of Fab-fragments derived from CR6261, CR9114 and CR8020. Results are shown in Figure 7 and summarized in Table 8.

The results show that the soluble form of polypeptide of the invention sl27Hl-t2 forms a complex (as evidenced by the shift of the peak in SEC chromatogram) in the presence of the Fab fragments from CR6261 and CR9114, but not with CR8020. This is in line with the specificity of the binding reactions of the Fab fragments, since CR6261 and CR9114 bind to HA's derived from group 1, whereas CR8020 does not. The size of the complex is listed in Table 8, and this indicates that polypeptide sl27Hl-t2 binds one to two Fab fragments, indicating that at least part of the population of purified polypeptide of the invention sl27Hl-t2 is in dimeric form.

To further analyze the binding reaction between polypeptide of the invention 127H1- t2 and mAb's CR6261and CR9114, as well as to confirm the presence of the conformational epitopes of CR6261 and CR9114 the complexation of these antibodies with the purified protein was studied by biolayer interferometry (Octet Red ³⁸⁴, Forte Bio). To this end, biotinylated CR6261, CR9114 and CR8020 were immobilized on streptavidin coated sensors, which subsequently were exposed first to a solution of the purified polypeptide of the invention to measure the rate of association and then to a wash solution to measure the rate of dissociation. The results are shown in Figure 8.

The immobilized CR6261and CR9114 both recognize the polypeptide of the invention as evidenced by the clear responses after exposure to the soluble form of 127Hl-t2 (Figure 8). To estimate the dissociation constant for the binding interaction a titration was performed using a 2-fold dilution series. Sensors containing immobilized CR6261 or CR9114 were exposed to soluble sl27Hl-t2 solutions at concentrations of 40, 20, 10, 5, 2.5, 1.3 and 0.63 nM, respectively, and the final response after 6600 seconds recorded. The responses were plotted as a function of the stem domain polypeptide concentration, and a fit to a steady state 1 :1 binding model was performed, yielding a dissociation constant ¾ of 3.5 nM for the CR6261/stem domain polypeptide complex and 2.3 nM for the CR9114 complex (Figure 8).

In conclusion polypeptide of the invention sl27Hl-t2 (SEQ ID NO: 91 is produced in high quantities and is capable of binding broadly neutralizing monoclonal antibodies CR6261 and CR9114 with high affinity, confirming the presence of the corresponding neutralizing epitopes in this stem domain polypeptide. The polypeptide has a propensity to form dimeric structures.

Example 3: Evaluation of protective efficacy of a polypeptide of the invention in a lethal influenza challenge model

In order to evaluate the protective efficacy of polypeptides of the invention sl27Hl-t2 (SEQ ID NO: 91) in a lethal influenza challenge model, groups of 10 female BALB/c mice (age 6-8 weeks) were immunized 3 times at 3 week intervals with 10 μg of purified sl27Hl-t2 either unadjuvated or adjuvated with 10 μg Matrix-M. As a positive control for the challenge model, broadly neutralizing antibody monoclonal antibody

CR6261 (15 mg/kg) was administered i.m. 1 day prior to challenge, while immunization with PBS served as a negative control. Four weeks after the last immunization mice were challenged with 25xLD50 heterologous challenge virus (H1N1 A Puerto Rico/8/34) and monitored daily (survival, weight, clinical scores) for 3 weeks. Pre-challenge serum is tested in ELISA assays for binding to polypeptide of the invention sl27Hl-t2 that was used for immunization (to verify correct immunization), binding to soluble H1N1 A/Brisbane/59/07 full length HA (to verify recognition of full length HA) and

competition with the broadly neutralizing antibody monoclonal antibody CR9114 for binding to full length HA (to determine whether induced antibodies bind at close proximity to the broadly neutralizing CR91 14 epitope). The results are shown in Figures 9 -12.

The results show that the experiment is valid since all mice in the PBS control group succumb to infection at day 7 post challenge, whereas the positive control group (15mg/kg CR6261, 1 day before challenge) is fully protected (Figure 9). In contrast to the PBS treated mice, 3 out of 10 of the mice immunized with the unadjuavted polypeptide of the invention sl27Hl -t2 (SEQ ID NO: 91) and 10 out of 10 of the mice immunized with the adjuvated polypeptide of the invention survive the lethal challenge (See Figure 10). Compared to the PBS control group, increased survival proportion, increased survival time and reduced clinical score are observed for the groups immunized with polypeptide of the invention sl27Hl-t2. The differences are most pronounced for the group receiving the adjuvated polypeptide of the invention, but are also observed for the group receiving the unadjuvated polypeptide.

The ELISA data using sl27Hl -t2 or the soluble full length HA as the antigen indicate that the polypeptide of the invention sl27Hl is immunogenic and induces antibodies that are capable of recognizing full length HA regardless of the use of an adjuvant (Figure 1 1 A and B).

To further understand the immunological response to the immunization a competition binding ELISA was performed. To this end plate bound full length HA is incubated with serial diluted serum samples, after which CR9114-biotin at a

predetermined titrated concentration is added. After further incubation, the amount of CR91 14-biotin bound is quantified using streptavin -conjugated horse radish peroxidase following protocols well known in the art. Data are analysed using linear regression of OD versus log dilution, expressed as 'slope OD' (AOD/10 fold dilution). The data show that detectable levels of antibodies that are capable of competing for binding with the broadly neutralizing antibody CR9114 are induced by immunization with adjuvated polypeptides of the invention, as indicated by the elevated levels of competition observed in Figure 12A. As a comparison levels induced by unlabeled CR91 14 (i.e. self- competition) and the non-binding monoclonal antibodies CR8020 and CR-JB, both serially diluted from 5 g/ml starting concentration are indicated in a separate graph. In conclusion we have shown that immunization with polypeptides of the invention sl27Hl-t2 (SEQ ID NO: 91) can protect mice against lethal infection with influenza. The polypeptide is immunogenic and induces antibodies that can bind to full length HA. When the polypeptide of the invention is used in combination with an adjuvant, at least part of the induced detectable antibodies bind at, or close to, the epitope of the broadly neutralizing epitope of monoclonal antibody CR9114. Example 4: Evaluation of protective efficacy of a polypeptide of the invention in a lethal influenza challenge model

In order to further evaluate the protective efficacy of polypeptides of the invention sl27Hl-t2 (SEQ ID NO: 91) in a lethal influenza challenge model, groups of 10 female BALB/c mice (age 6-8 weeks) were immunized 1 , 2 and 3 times at 3 week intervals with 30 μg of purified sl27Hl-t2 adjuvated with 10 μg Matrix-M. As a positive control for the challenge model, broadly neutralizing antibody monoclonal antibody CR6261 (15 mg/kg) was administered i.v. 1 day prior to challenge, while immunization with PBS served as a negative control. Four weeks after the last immunization mice were challenged with 25xLD50 heterologous challenge virus (H1N1 A/Puerto Rico/8/34) and monitored daily (survival, weight, clinical scores) for 3 weeks. Pre-challenge serum obtained 4 weeks after the final immunization was tested in ELISA assays for binding to polypeptide of the invention sl27Hl -t2 that was used for immunization (to verify correct immunization), binding to soluble H1N1 A/Brisbane/59/07 full length HA (to verify recognition of full length HA) and competition with the broadly neutralizing antibody monoclonal antibody CR91 14 for binding to full length HA (to determine whether induced antibodies bind at close proximity to the broadly neutralizing CR9114 epitope). The results are shown in Figures 13-18.

The results show that the experiment is valid since all mice in the PBS control group succumbed to infection at day 7 post challenge, whereas the positive control group (15mg/kg CR6261, 1 day before challenge) was fully protected (Figure 13A). Mice immunized once with sl27Hl-t2 (SEQ ID NO: 91) all succumbed to infection between day 7 and 9 (Figure 14A). In contrast, after two immunizations 8 out of 10 mice survived, and after 3 immunizations all mice (10 out of 10) survived the lethal challenge (Figure 14B,C). Also body weight loss was reduced for groups immunized multiple times with lowest percentages observed for animals immunized three times (Figure 15B,C).

Compared to the PBS control group, statistically significant increased survival proportion, increased survival time, reduced body weight loss and reduced clinical score (see figure 16B,C) were observed for the groups immunized two or three times with polypeptide of the invention sl27Hl-t2 The ELISA data from pre-challenge timepoints 4 week after the final

immunization using sl27Hl-t2 (Figure 17A) or the soluble full length HA (Figure 17B) as the antigen indicate that the polypeptide of the invention sl27Hl is immunogenic and induces antibodies that are capable of recognizing full length HA even after one immunization, although levels are significantly higher after two and three immunizations. Using the CR9114 competition binding assay described above detectable levels of antibodies that are capable of competing for binding with the broadly neutralizing antibody CR9114 were induced after two and three immunizations with polypeptide of the invention sl27Hl -t2 (SEQ ID NO: 91) (Fig 18A). As a comparison levels induced by unlabeled CR91 14 (i.e. self-competition) and the non-binding monoclonal antibodies CR8020 and CR-JB, both serially diluted from 5 g/ml starting concentration are indicated in a separate graph (Figure 18B).

In conclusion we have shown that two and three times immunization with polypeptide of the invention sl27Hl-t2 (SEQ ID NO: 91) can protect mice against lethal infection with influenza. The polypeptide is immunogenic and induces antibodies that can bind to full length HA. At least part of the induced antibodies bind at, or close to, the epitope of the broadly neutralizing epitope of monoclonal antibody CR9114.

Example 5: Evaluation of protective efficacy of a polypeptide of the invention in a lethal heterosubtypic H5N1 influenza challenge model

In order to further evaluate the protective efficacy of polypeptides of the invention sl27Hl-t2- (SEQ ID NO: 91) in a lethal H5N1 influenza challenge model, groups of 8-12 female BALB/c mice (age 6-8 weeks) were immunized 3 times at 3 week intervals with 30 μg ofpurified sl27Hl-t2 adjuvated with 10 μg Matrix-M. As a positive control for the challenge model, broadly neutralizing antibody monoclonal antibody CR6261 (15 mg kg) was administered i.v. 1 day prior to challenge, while immunization with PBS served as a negative control. Four weeks after the last immunization mice were challenged with 12.5xLD50 heterosubtypic challenge virus (H5N1 A/Hong Kong/156/97) and monitored daily (survival, weight, clinical scores) for 3 weeks.

The results show that the experiment is valid since all mice in the PBS control group succumb to infection between day 8-10 post challenge, whereas the positive control group (15mg/kg CR6261, 1 day before challenge) is fully protected (Figure 19A). Eight out of 10 (80%) mice immunized with sl27Hl-t2 (SEQ ID NO: 91) survive the lethal challenge (Figure 19B). Mean bodyweight loss is approximately 15% at day 9, but surviving animals recover and gain bodyweight (Fig 19C). Median clinical score is 1.5 at day 3-6, but from day 8 onwards no clinical symptoms were observed for surviving mice (Figure 19D). Compared to the PBS control group, a statistical significant increased survival proportion, increased survival time, a decrease of body weight loss and reduced clinical scores are observed for the group immunized with polypeptide of the invention sl27Hl-t2. In conclusion we have shown that immunization with polypeptide of the invention sl27Hl -t2 (SEQ ID NO: 91) can protect mice against lethal infection with a heterosubtypic H5N 1 influenza strain.

Example 6: Evaluation of the breadth of binding of sera elicited through immunization with a polypeptide of the invention

Pre-challenge sera from mice immunized 3 times as described in example 5 were also tested for binding against full length HA's from a number of other group 1 (HI , H5 and H9) and group 2 (H3 and H7) influenza strains by ELISA following protocols well known in the art (Figure 20). The results demonstrate that antibodies induced with polypeptide of the invention sl27Hl-t2 (SEQ ID NO: 91) efficiently recognize epitopes present in the native sequences of FL HA and that the epitopes to which the antibodies bind are conserved among different group 1 influenza strains including HI, H5 and H9 HA.

Example 7: Evaluation of protective efficacy of a polypeptide of the invention in a lethal H1N1 A/Brisbane/ 59/2007 influenza challenge model

In order to further evaluate the protective efficacy of sl27Hl-t2 (SEQ ID NO: 91) in a lethal H1N1 influenza challenge model, groups of 8-18 female BALB/c mice (age 6- 8 weeks) were immunized 3 times at 3 week intervals with 30 μg of purified sl27Hl -t2 adjuvated with 10 μg Matrix-M. As a positive control for the challenge model, broadly neutralizing antibody monoclonal antibody CR6261 (15 mg/kg) was administered i.v. 1 day prior to challenge, while immunization with PBS served as a negative control. Four weeks after the last immunization mice were challenged with 12.5xLD50 challenge virus (H1N1 A/Brisbane/59/2007) and monitored daily (survival, weight, clinical scores) for 3 weeks.

The results show that the experiment is valid since all mice in the PBS control group succumb to infection between day 7-10 post challenge, whereas the positive control group (15mg/kg CR6261, 1 day before challenge) is fully protected (Figure 21 A). Ten out of 10 mice immunized with sl27Hl-t2 (SEQ ID NO: 91) survive the lethal challenge (Fig 21B). In addition bodyweight loss is ca 20% on average 5 days post infection (Fig 21C), but animals recover fully within the21 days follow-up period.

Median clinical scores peak at a value of 3 between 2 and 9 days post infection but return to baseline level (0) from day 16 post infection onwards (Fig 2 ID). Compared to the PBS control group, a statistical significant increased survival proportion, increased survival time, a decrease of body weight loss and reduced clinical scores are observed for the group immunized with polypeptide of the invention sl27Hl-t2

In conclusion we have shown that immunization with polypeptide of the invention sl27Hl-t2 (SEQ ID NO: 91) can protect mice against lethal infection with H1N1 A Brisbane/59/2007.

Example 8: Evaluation of the presence of influenza neutralizing antibodies in sera of mice immunized with the polypeptide of the invention

To further investigate antibody-mediated effector mechanisms that play a role in protection against influenza, pre-challenge sera were tested in pseudoparticles neutralization assay (Alberini et al 2009) using the pseudoparticles derived from H5N1 A/Vietnam/1 194/04 as described below.

Pseudoparticle neutralization assay

Pseudoparticles expressing FL HA were generated as previously described (Temperton et al., 2007). Neutralizing antibodies were determined using a single transduction round of HEK293 cells with H5 A/Vietnam/1 194/04 pseudoparticles encoding luciferase reporter gene, as described previously (Alberini et al 2009), with a few modifications. Briefly, heat-inactivated (30 minutes at 56°C) pre-challenge serum samples were 3-fold serially diluted in growth medium (MEM Eagle with EBSS (Lonza, Basel, Switserland) supplemented with 2 mM L-Glutamine (Lonza), 1% Non-Essential Amino Acid Solution (Lonza), 100 U/ml Pen/Strep (Lonza) and 10% FBS (Euroclone, Pero, Italy)) in triplicate in 96-well fiat bottom culture plates and a titrated number of H5 A Vietnam/ 1194/04 pseudoparticles (yielding 10 ⁶ Relative Luminescence Units (RLU) post-infection) was added. After lh incubation at 37°C, 5% C0 ₂ 10 ⁴ HEK293 cells were added per well. After 48h incubation at 37°C, 5% C0 ₂ luciferase substrate (Britelie Plus, Perkin Elmer, Waltham, MA) was added and luminescence measured using a

luminometer (Mithras LB 940, Berthold Technologies, Germany) according to manufacturers' instructions.

Pre challenge sera obtained from animals immunized with polypeptide of the invention sl27Hl -t2 (SEQ ID NO: 91) as described in examples 5, 6, and 7 showed detectable neutralization at high serum concentrations using the pseudoparticle neutralization assay (Figure 22). This demonstrates the ability of the polypeptide of the invention to elicit broadly neutralizing antibodies when used as an immunogen.

Besides direct virus neutralization, Fc-mediated effector mechanisms, such as Antibody Dependent Cellular Cytotoxicity (ADCC) and Antibody Dependent Cellular Phagocytosis (ADCP), contribute substantially to protection against influenza, with stem- directed bnAbs being particularly effective in these mechanisms (DiLillo et al., 2014). In order to test whether the antibodies elicited after immunization with polypeptide of the invention sl27Hl -t21181ong (SEQ ID NO: 186) were capable of inducing ADCC, we tested pre-challenge sera using an ADCC surrogate assay (Parekh et al., 2012;

Schneuriger et al., 2012; Cheng et al., 2014), adapted for mouse as described below. Antibody Dependent Cellular Cytotoxicity (ADCC) surrogate assay

Human lung carcinoma-derived A549 epithelial cells (ATCC CCL-185) were maintained in Dulbecco's modified eagle medium (DMEM) medium supplemented with 10% heat inactivated fetal calf serum at 37°C, 10% C02. Two days before the experiment, A549 cells were trans fected with plasmid DNA encoding H5 A/Hong Kong/156/97 HA or HI A/Brisbane/59/2007 HA using Lipofectamine 2000 (Invitrogen) in Opti-MEM (Invitrogen). One day before the assay, transfected cells were harvested and seeded in white 96-well plates (Costar) for ADCC, and black clear bottom 96-well plate (BD Falcon) for imaging. After 24 hours, samples were diluted in assay buffer (4% ultra-low IgG FBS (Gibco) in RPMI 1640 (Gibco)) and heat inactivated for 30 minutes at 56°C, followed by serial dilution in assay buffer. For the ADCC bioassay, A549 cells were replenished with fresh assay buffer and antibody dilutions and ADCC Bioassay Jurkat effector cells expressing mouse Fc gamma receptor IV (FcyRIV; Promega) were added to the cells and incubated for 6 hours at 37°C at a target-effector ratio of 1 :4.5. Cells were equilibrated to room temperature for 15 min before Bio-Glo Luciferase System substrate (Promega) was added. Luminescence was read out after 10 minutes on a Synergy Neo (Biotek). Data are expressed as fold induction of signal in the absence of serum.

Using this assay pre-challenge sera obtained from animals immunized with polypeptide of the invention sl27Hl-t2 (SEQ ID NO: 91) as described in examples 5, 6, and 7 were tested for FcyRIV signaling activity using target cells transfected with FL HA from H5N1 A/Hong Kong/156/97 or H1N1 A/Brisbane/59/07 as the source of antigen (Figure 23). In both cases a 30 fold induction is observed at highest serum concentration tested, demonstrating the ability of the polypeptide of the invention to elicit antibodies that activate FcyRIV signaling, indicative for ADCC/ADCP effector function in mice.

These results shown in examples 5-8 show the capability of polypeptide of the invention sl27Hl -t2 (SEQ ID NO: 91) is able to elicit stem-targeting, neutralizing and ADCC-mediating antibodies and protect mice against a lethal challenge with

homologous, heterologous and heterosubtypic group I influenza strains.

Example 9: Protection from lethal challenge with H5N1 A/Hong Kong/156/97 by passive transfer of serum from mice immunized with polypeptides of the invention

To determine the contribution of antibodies induced by polypeptides of the invention to protection observed, transfer studies were performed. The aim of this study was to assess whether passive transfer (multiple dosing) of serum from mice immunized three times with sl27Hl-t2 (SEQ ID NO: 91) and sl27Hl-t21ong (SEQ ID NO: 101) containing an additional His-tag in the presence of an adjuvant (Matrix-M) confers protection to a lethal challenge with H5N1 Influenza A/Hong Kong/156/97. Groups of female BALB/c donor mice (age 6-8 weeks) were immunized 3 times at a 3 week interval with 30 μg sl27Hl-t2 (SEQ ID NO: 91) sl27Hl-t21ong (SEQ ID NO: 101) containing a C-terminal His-tag adjuvated with 10 μg Matrix-M or PBS. Four weeks after the last immunization (d70) serum was isolated, pooled per group and transferred in recipient mice (female BALB/c, age 6-8 weeks, n=10 per group). Each mouse received 400 μΐ serum i.p. on three consecutive days before challenge (d-3, -2 and -1). As a positive control for the challenge model CR6261 (15 mg/kg) was administered 1 day prior to challenge (n=8), while injection with PBS served as a negative control (n=8). On day 0, mice were challenged with 12.5xLD50 challenge virus and monitored

(survival, weight, clinical scores) for 3 weeks.

To verify immunogenicity of polypeptides of the invention in donor mice and asses HA-specific antibody levels after transfer of serum into recipient mice, pooled serum samples of terminal bleeds (d70) of donor mice, pooled serum samples of naive recipient mice before serum transfer (d-4) as well as individual serum samples of recipient mice after 3 serum transfers just prior to challenge (dO), were tested in ELISA for binding to FL HA from H1N1 A/Brisbane/59/07.

Results

Challenge

- Experiment was valid; all mice in the PBS control group succumb to infection at or before day 13 post challenge (median 9.5 days), whereas the positive control group (15mg/kg CR6261, 1 day before challenge) is folly protected (p<0.001).

Three serum transfers of serum from Matrix-M adjuvated polypeptide of the invention SEQ ID NO: 91 immunized mice into naive recipient mice leads to significant increase in survival time (p=0.007) and reduction in clinical score

(p=0.012), compared to the PBS serum transfer control group (Fig. 24).

Three serum transfers of serum from Matrix-M adjuvated polypeptide of the invention SEQ ID NO: 101 immunized mice into naive recipient mice leads to significant increase in survival proportion (p=0.002), increase in survival time (p<0.001), decrease in bodyweight loss (p=0.002) and reduction in clinical score (p<0.001), compared to the PBS serum transfer control group. (Fig. 24)

For the polypeptides of the invention tested FL HA A Brisbane/59/07 specific antibody titers after three serum transfers wee similar to levels obtained after active immunization (Fig. 25).

Conclusion

Serum components (most likely antibodies) induced by 3 times immunization with Matrix-M adjuvated polypeptide of the inventions SEQ ID NO: 91 and 101 can protect mice from lethal challenge with H5N1 A/Hong Kong/156/97 (survival percentages are 30 and 78%, respectively).

Example 10: In vivo protective efficacy of polypeptides of the invention in H1N1 A/NL/602/09 challenge model in mice

The protective efficacy of polypeptides of the invention sl27Hl-t2 (SEQ ID NO: 91) and sl27Hl -t21ong (SEQ ID NO: 101) containing an additional His-tag with Matrix- M in a H1N1 A/NL/602/09 challenge model compared to a PBS control group was determined.

Groups of 10 female BALB/c mice (age 6-8 weeks) were immunized 3 times at a 3 week interval with 30 μg polypeptides of the invention with 10 μg Matrix-M. As a positive control for the challenge model CR6261 (15 mg/kg) was administered 1 day prior to challenge (n=8), while injection with PBS served as a negative control (n=18). Four weeks after the last immunization mice were challenged with 12.5xLD50 challenge virus and monitored (survival, weight, clinical scores) for 3 weeks.

To verify immunogenicity of polypeptides of the invention, pre-challenge sera (day -1) were tested in ELISA assays for binding to FL HA from H1N1

A/Brisbane/59/07. To determine whether induced antibodies bind at close proximity to the CR9114 epitope, a CR91 14 competition ELISA was performed. Competition data were expressed as using the slope OD to be able to quantify responses. Results

The experiment was valid; all mice in the PBS control group succumb to infection at or before day 8 post challenge (median 5 days), whereas the positive control group (15mg/kg CR6261 , 1 day before challenge) is fully protected (p<0.001). - Three immunizations with Matrix-M adjuvated s 127H 1 -t2 (SEQ ID NO : 91 ) and sl27Hl-t21ong (SEQ ID NO: 101) containing an additional His-tag lead to significant increase in survival proportion (p<0.001), increase in survival time (p<0.001) and reduction in clinical score (p<0.001), compared to the PBS control group (Fig. 26). - Three immunizations with Matrix-M adjuvated H 1 mini-HA variant s 127H 1 -t2

(SEQ ID NO: 91) leads to significant decrease in bodyweight (p<0.001) compared to the PBS control group (Fig. 26).

IgG antibody titers to H1N1 A Brisbane/59/07 FL HA induced by polypeptides of the invention are significantly higher compared to PBS for all HI mini-HA variants tested (p<0.001) (Fig. 27A).

- H 1 mini-HA variant s 127H 1 -t2 (SEQ ID NO : 91 ) has significantly higher IgG antibody titers to H1N1 A/Brisbane/59/07 FL HA compared to sl27Hl-t21ong (SEQ ID NO: 101) containing an additional His-tag (p=0.021) (Fig. 27A).

All Matrix-M adjuvanted polypeptides of the invention tested have significantly higher CR9114 competition titers compared to PBS (p<0.001) (Fig. 27B).

Conclusion:

Matrix-M adjuvated polypeptides of the invention sl27Hl-t2 (SEQ ID NO: 91) and sl27Hl-t21ong (SEQ ID NO: 101) containing an additional His-tag confer protection against lethal challenge with H1N1 A/NL/602/09, seen as increase in survival proportion, survival duration and reduction of clinical scores. In addition, Matrix-M adjuvated sl27Hl-t2 (SEQ ID NO: 91) also resulted in a reduced bodyweight loss after lethal challenge with H1N1 A/NL/602/09. Example 11 : Library screening

PCT/EP2012/073706 discloses influenza hemagglutinin stem domain

polypeptides, compositions and vaccines and methods of their use in the field of prevention and/or treatment of influenza. Here we describe additional sequences of stem domain polypeptides derived from the full length HA of Hl l A/Brisbane/59/2007 (SEQ ID NO: 1). The stem domain polypeptides are obtained by site-directed mutation of Hl - mini2-clusterl+5+6-GCN4t2 (SEQ ID NO: 52) and present the broadly influenza neutralizing epitope of CR6261 (Throsby et al, 2009; Ekiert et al 2010) and/or CR9114.

Hl-mini2-clusterl+5+6-GCN4t2 (SEQ ID NO: 52) was derived from the full length HA of HlNl A/Brisbane/59/2007 (SEQ ID NO: 1) by taking the following steps:

Removal of the cleavage site in HA0. Cleavage of wild type HA at this site results in HA1 and HA2. The removal can be achieved by mutation of R to Q at the PI position (see e.g. Sun et al, 2010 for an explanation of the nomenclature of the cleavage site (position 343 in SEQ ID NO: 1).

- Removal of the head domain by deleting amino acids 53 to 320 from SEQ ID NO;

1. The remaining N- and C-terminal parts of the sequence were joined by a four residue flexible linker, GGGG.

Increasing the solubility of the loop (between the A-helix and the CD helix) formed by (the equivalent of) residues 402 to 418 in HI A/Brisbane/59/2007 (SEQ ID NO: 1) in order to both increase the stability of the pre-fusion conformation and to destabilize the post-fusion conformation of the modified HA.

In Hl-mini2-clusterl+5+6-GCN4 (SEQ ID NO: 2) mutations F406S, V409T,

F413G and L416S (numbering refers to SEQ ID NO: 1) were introduced.

Introducing a disulfide bridge between amino acids at position 324 and 436 in HI A/Brisbane/59/2007; this is achieved by introducing mutations R324C and

Y436C. (numbering refers to SEQ ID NO: 1).

- Introducing the GCN4 derived sequence RMKQIEDKIEEIESK (SEQ ID NO:

20), that is known to trimerize, at position 419-433 (numbering refers to SEQ ID

NO: 1).

In certain embodiments, the polypeptides of the invention contain the intracellular sequences of HA and the transmembrane domain. In other embodiments, the sequence of the transmembrane and intracellular domain have been deleted from position (or the equivalent thereof, as determined from sequence alignment) 519, 520, 521, 522, 523, 524, 525, 526, 526, 527, 528, 529, or 530 of HA2 to the C-terminus of HA2 (numbering according to SEQ ID NO: 1) so that a secreted (soluble) polypeptide is produced following expression in cells. The soluble polypeptide can be further stabilized by introducing a sequence known to form trimeric structures, i.e. the foldon sequence AYVRKDGEWVLL (SEQ ID NO: 3), optionally connected through a short linker, as described above. The linker may optionally contain a cleavage site for processing afterwards according to protocols well known to those skilled in the art. To facilitate purification and detection of the soluble form a tag sequence may be optionally added, e.g. a histidine tag (HHHHHHH (SEQ ID NO: 16) or HHHHHH (SEQ ID NO: 15) or a FLAG tag (DYKDDDDK; SEQ ID NO: 22) or combination of these, optionally connected via short linkers. The linker may optionally contain (part of) a proteolytic cleavage site, e.g. LVPRGS (SEQ ID NO: 23) (thrombin) or IEGR (SEQ ID NO: 24) (Factor X) for processing afterwards according to protocols well known to those skilled in the art. The processed proteins are also encompassed in the invention.

An example of such a C-teminal sequence combining FLAG-tag, thrombin cleavage site, foldon, and His sequences is SEQ ID NO: 4 FLAG- thrombin-foldon-His. This sequence was combined with a soluble form of Hl-mini2-clusterl+5+6-GCN4t2 (SEQ ID NO: 51) sequence to create the parental sequence (SEQ ID NO: 156) that was used to create novel polypeptides of the invention by mutagenesis. This sequence does not contain the leader sequence corresponding to amino acids 1-17 of SEQ ID NO: 1 and 2.

The stem domain polypeptides are created by deleting the part of the

hemagglutinin sequence that encodes the head domain of the molecule and reconnecting the N- and C-terminal parts of the sequence on either side of the deletion through a linker as described in PCT/2012/073706 and above. The removal of the head domain leaves part of the molecule that was previously shielded from the aqueous solvent exposed, potentially destabilizing the structure of the polypeptides of the invention. For this reason residues in the B-loop (in particular amino acid residue 406 (F and S in SEQ ID NO: 1 and 2, respectively), 409 (V and T) 413 (F and G) and 416 (L and S) were mutated in various combinations using parental sequence SEQ ID NO: 156 as the starting point. SEQ ID NO: 156 was created from Hl-mini2-clusterl+5+6-GCN4t2 (SEQ ID NO: 52) by removing the leader sequence, and replacing residues 520-565 with a Flag-thrombin- foldon-his sequence (SEQ ID NO: 4).

conformational change that buries the hydrophobic fusion peptide in the interior of the protein. To address this issue some or all of the residues 1337, 1340, F352 and 1353 in SEQ ID NO: 156 were also mutated.

Two different sets of mutant polypeptides are disclosed in Table 9. In all cases these polypeptides contain SEQ ID NO: 20 at position 419-433 (numbering refers to SEQ ID NO: 1). Example 12: Identification, purification and characterization of the trimeric polypeptides of the invention.

Libraries of polypeptides as described in example 1 1 (set 1 and set 2) containing SEQ ID NO: 20 at position 419-433 were created. Single clones into HEK293F cells and screen culture medium for multimers (CR9114 sandwich ELISA), CR6261 binding (ELISA) and protein expression (HTRF assay) were individually transfected. Hits based on CR91 14 sandwich assay, CR91 14, CR6261, and CR8020 ELISA, and HTRF assay were confirmed and ranked.

Multimerization by crosslinking with primary amine (present in Lysine residues) specific crosslinker BS3 followed by SDS-PAGE (see below) was assessed. Because of extensive multimerization, the C-terminal Flag-Foldon-His (FFH) tag sequence was replaced with thrombin cleavage site and his-tag sequence TCShis). Subsequently, multimerization of TCS-his containing sequences (CR9114 sandwich assay, BS3 cross- linking) was re-confirmed, and clones were ranked and selected. Selected clones were expressed, purified and characterized.

The cross-linking assay was performed as follows:

• Add cross-linker BS3 (bis(sulfosuccinimidyl)suberate) directly to culture medium • Incubate for 30 min at room temperature.

• Collect medium and analyze by SDS-PAGE/ Western Blot under reducing (R, 5 mM DTT) and non-reducing (NR) conditions

• Under reducing conditions only BS3-crosslinked species will remain covalently linked

• Detection of mini-HA via Western blotting using a his-tag specific mAb Results:

1. Two libraries of high quality (>90% of ORF correct) containing SEQ ID NO: 20 at position 419-433 and the expected sequence variation (>97% randomization) were succesfully created

2. A total of 10472 clones (5544 and 4928 from set 1 and 2, respectively) were evaluated in the primary screen (Fig 28)

3. Clones exhibiting expression >50 % of FL HA expression and binding signalsto CR6261 >80% of the signals observed for FL HA were considered hits; this procedure yielded 703 hits (596 and 107 from library 1 and 2, respectively)

4. 658 out of 703 hits were retained after the confirmation screen

5. Crosslinking assay of top 20% hits (11 1) indicated the presence of higher order multimers that could potentially interfere with purification oftrimeric species.

6. Top 20% confirmed hits (111) were successfully cloned to replace FFH C- terminus with TCS-his sequence, followed by CR9114 sandwich ELISA and crosslinking assay evaluations

7. Cross-linking assays yielded 9 clones that were considered the most promising trimer candidates (SEQ ID NO: 158 to 166, Table 11). Based on the CR9114 sandwich ELISA (Fig 29) three candidates (2 with TCS-his, 1 with FFH C- terminus ) were selected for expression and purification 8. Two of the selected candidates did not express well and purification was not pursued. Candidate GW1.5E2.FFH (SEQ ID NO: 158) was purified to homogeneity (7.6 mg total protein; purity >95%, HP-SEC), following procedures as described in Example 4. 9. Characterization of GW1.5E2.FFH (SEQ ID NO: 158) by SEC-MALS analysis indicates trimer formation in solution, with 3 Fab fragments of CR9114 or CR6261 binding per trimer (Fig 30 and table below 10). Κ ^ρρ as determined from bilayer interferometry measurements (Octet) is 1 nM for both CR6261 and CR91 14. As expected, binding of CR8020 (negative control) could not be detected by either method.

Conclusion: The non-covalent trimeric polypeptide of the invention (GW1.5E2.FFH, SEQ ID NO: 158) that binds bnAbs CR6261 and CR91 14 with high affinity in a 3: 1 stoichiometry has been identified . Example 13: Protective efficacy of polypeptide of the invention sHl mini-HA GW1.5E2- FFH (SEQ ID NO: 158) in H1N1 A/Brisbane/ 59/07 mouse model

The protective efficacy of sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) adjuvated with Matrix-M in a H1N1 A/Brisbane/59/07 challenge model compared to a PBS control group was determined.

Groups of 10 female BALB/c mice (age 6-8 weeks) were immunized 3 times at a

3 week interval with 30 μg sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) adjuvated with 10 μg Matrix-M. As a positive control for the challenge model CR6261 (15 mg kg) was administered 1 day prior to challenge (n=8), while injection with PBS served as a negative control (n=16). Four weeks after the last immunization mice were challenged with 12.5xLD50 challenge virus and monitored (survival, weight, clinical scores) for 3 weeks.

To verify immunogenicity of sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158), pre-challenge sera (day -1) were tested in ELISA assays for binding to FL HA from H1N1 A/Brisbane/59/07. To determine whether polypeptide of the invention sHl mini- HA GW1.5E2-FFH (SEQ ID NO: 158) induced antibodies bind at close proximity to the CR91 14 epitope, a CR91 14 competition ELISA was performed. Competition data were visualized as '% competition', defined as (A-P)/Axl00), where A is the maximum OD signal of CR91 14 binding to FL HA when no serum is present and P is the OD signal of CR91 14 binding to FL HA in presence of serum at a given dilution or expressed using the slope OD metric to be able to quantify responses; for reference CR91 14 and CR8020 (starting concentration 5 mg/ml) solutions were included.

Results:

Experiment was valid; all mice in the PBS control group (n=16) succumb to infection at or before day 10 post challenge (median 8 days), whereas the positive control group (n=8, 15mg kg CR6261 , 1 day before challenge) is fully protected (pO.001).

- Three immunizations with sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) adjuvated with Matrix-M lead to significant increase in survival proportion (p<0.001), increase in survival time (p<0.001), decrease in bodyweight loss (p<0.001) and reduction in clinical score (p<0.001), compared to the PBS control group (Fig 31).

Pre-challenge IgG antibody titers to H1N1 A Brisbane/59/07 FL HA induced by sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) are significantly higher compared to PBS (p<0.001) (Fig 32A).

- IgG antibody titers to H1N1 A Brisbane/59/07 FL HA plateau after two

immunizations (not shown).

Matrix-M adjuvated polypeptide of the invention sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) induce significantly higher CR91 14 competition titers compared to PBS (pO.001) (Fig 32B).

Conclusion: Matrix-M adjuvated polypeptide of the invention sHl mini-HA GW1.5E2- FFH (SEQ ID NO: 158) confers protection against lethal challenge with H1N1

A/Brisbane/59/07. Example 14: Protective efficacy of polypeptide of the invention sHl mini-HA GW1.5E2- FFH (SEQ ID NO: 158) in a H5N1 A/Hong Kong/156/97 mouse model

The protective efficacy of leading HI mini-HA variants adjuvated with Matrix-M in a H5N1 A/Hong Kong/156/97 challenge model compared to a PBS control group was determined.

Groups of 10 female BALB/c mice (age 6-8 weeks) were immunized 3 times at a 3 week interval with 30 μg polypeptide of the invention sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) adjuvated with 10 μg Matrix-M. As a positive control for the challenge model CR6261 (15 mg/kg) was administered 1 day prior to challenge (n=8), while injection with PBS served as a negative control (n=l 6). Four weeks after the last immunization mice were challenged with 12.5xLD50 challenge virus and monitored (survival, weight, clinical scores) for 3 weeks.

To verify immunogenicity of polypeptide of the invention sHl mini-HA

GW1.5E2-FFH (SEQ ID NO: 158), pre-challenge sera (day -1) were tested in ELISA assays for binding to FL HA from H1N1 A/Brisbane/59/07. To determine whether mini- HA induced antibodies bind at close proximity to the CR91 14 epitope, a CR9114 competition ELISA was performed. Competition data were visualized as '%

competition', defined as (A-P)/Axl00), where A is the maximum OD signal of CR91 14 binding to FL HA when no serum is present and P is the OD signal of CR9114 binding to FL HA in presence of serum at a given dilution or expressed using the slope OD metric to be able to quantify responses; for reference CR9114 and CR8020 (starting concentration 5 μg/ml) solutions were included.

Results:

- Experiment was valid; 15 out of 16 mice in the PBS control group succumb to infection at or before day 9 post challenge (median 9 days), whereas the positive control group (n=8, 15mg/kg CR6261 , 1 day before challenge) is fully protected (pO.001).

Three immunizations polypeptide of the invention sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) adjuvated with Matrix-M lead to significant increase in survival proportion (p<0.001), increase in survival time (p<0.001), decrease in bodyweight loss (p<0.001) and reduction in clinical score (p<0.001), compared to the PBS control group (Fig 33).

Pre-challenge IgG antibody titers to H1N1 A/Brisbane/59/07 FL HA induced by polypeptide of the invention sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) are significantly higher compared to PBS (p<0.001) (Fig 34A).

Matrix-M adjuvated polypeptide of the invention sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) induce significantly higher CR91 14 competition titers compared to PBS (p<0.001) (Fig 34B).

Conclusion: Matrix-M adjuvated polypeptide of the invention sHl mini-HA GW1.5E2- FFH (SEQ ID NO: 158) confers heterosubtypic protection against lethal challenge with H5N1 A/Hong Kong/156/97.

Example 15: Protective efficacy of polypeptide of the invention sHl mini-HA GW1.5E2- FFH (SEQ ID NO: 158) in a H1N1 A/Puerto Rico/8/34 mouse model

The protective efficacy of polypeptide of the invention sHl mini-HA GW1.5E2-

FFH (SEQ ID NO: 158) adjuvated with Matrix-M in a H1N1 A/Puerto Rico/8/1934 challenge model compared to a PBS control group was determined.

Groups of 10 female BALB/c mice (age 6-8 weeks) were immunized 3 times at a 3 week interval with 30 μg polypeptide of the invention sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) adjuvated with 10 μg Matrix-M. As a positive control for the challenge model CR6261 (15 mg/kg) was administered 1 day prior to challenge (n=8), while 3 immunizations with PBS served as a negative control (n=16). Four weeks after the last immunization mice were challenged with 25xLD50 challenge virus and monitored (survival, weight, clinical scores) for 3 weeks.

To verify immunogenicity polypeptide of the invention sHl mini-HA GW1.5E2-

FFH (SEQ ID NO: 158), pre-challenge sera (day -1) were tested in ELISA assay for binding to FL HA from H1N1 A/Brisbane/59/07. To determine whether mini-HA induced antibodies bind at close proximity to the CR9114 epitope, a CR9114 competition ELISA was performed. Competition data were visualized as '% competition', defined as (A-P)/Axl 00), where A is the maximum OD signal of CR9114 binding to FL HA when no serum is present and P is the OD signal of CR91 14 binding to FL HA in presence of serum at a given dilution or expressed using the slope OD metric to be able to quantify responses; for reference CR91 14 and CR8020 (starting concentration 5 μg/ml) solutions were included.

Results

Experiment is valid; all mice in the PBS control group (n=16) succumb to infection at or before day 9 post challenge (median 8 days), whereas the positive control group (n=8, 15mg kg CR6261 , 1 day before challenge) is fully protected (p<0.001).

Three immunizations polypeptide of the invention sHl mini-HA GWl .5E2-FFH (SEQ ID NO: 158), adjuvated with Matrix-M lead to significant increase in survival proportion (p<0.001), increase in survival time (p<0.001), decrease in bodyweight loss (p<0.001) and reduction in clinical score (p<0.001), compared to the PBS control group (Fig. 35).

Pre-challenge IgG antibody titers to HlNl A/Brisbane/59/07 FL HA induced by polypeptide of the invention sHl mini-HA GW1.5E2-FFH (SEQ ID NO: 158) are significantly higher compared to PBS (p<0.001) (Fig. 36A).

Matrix-M adjuvated polypeptide of the invention sHl mini-HA GWl .5E2-FFH (SEQ ID NO: 158) induce significantly higher CR9114 competition titers compared to PBS (p<0.001) (Fig. 36B).

Conclusion: Matrix-M adjuvated polypeptide of the invention sHl mini-HA GWl .5E2- FFH (SEQ ID NO: 158) confers protection against lethal challenge with HlNl A/Puerto Rico/8/34. Table 1. Standard amino acids, abbreviations and properties

Amino Acid 3-Lettei 1 -Letter Side chain Side chain charge (pH 7.4) polarity

alanine Ala A nonpolar Neutral arginine Arg R polar Positive asparagine Asn N polar Neutral aspartic acid Asp D polar Negative

cysteine Cys C nonpolar Neutral glutamic acid Glu E polar Negative

glutamine Gin Q polar Neutral glycine Gly G nonpolar Neutral histidine His H polar positive (10%) neutral(90%) isoleucine He I nonpolar Neutral leucine Leu L nonpolar Neutral lysine Lys K polar Positive methionine Met M nonpolar Neutral phenylalanine Phe F nonpolar Neutral

proline Pro P nonpolar Neutral serine Ser S polar Neutral threonine Thr T polar Neutral tryptophan Trp w nonpolar Neutral tyrosine Tyr Y polar Neutral valine Val V nonpolar Neutral

Table 2. Sequence alignment of HI sequences according to particular embodiments of the invention

1. A/Solomon Islands/6/2003 (HlNl) (SEQ ID NO: 25)

2. A/Brisbane/59/2007 (HlNl) (SEQ ID NO: 1)

3. A/New Caledonia/20/1999(H1N1) (SEQ ID NO: 26)

4. A/California/07/2009 (HlNl) (SEQ ID NO: 27)

5. A/swine/Hubei/Sl/2009(HlNl) (SEQ ID NO: 28)

6. A/swine/Haseluenne/IDT2617/2003 (HlNl) (SEQ ID NO: 29)

7. A/NewYork/8/2006(HlNl) (SEQ ID NO: 30)

8. A/SolomonIslands/3/2006(HlNl) (SEQ ID NO: 31 )

9. A/NewYork/146/2000(HlNl) (SEQ ID NO: 32)

10. A NewYork/653/1996(HlNl) (SEQ ID NO: 33)

11. A/Beijing/262/1995(HlNl) (SEQ ID NO: 34)

12. A/Texas/36/1991 (HlNl) (SEQ ID NO: 35)

13. A/Singapore/6/1986(HlNl) (SEQ ID NO: 36)

14. A/Chile/1/1983(H1N1) (SEQ ID NO: 37)

15. A/Baylor/11515/1982(HlNl) (SEQ ID NO: 38)

16. A/Brazil/I l/1978(H1N1) (SEQ ID NO: 39)

17. A USSR/90/1977(HlNl) (SEQ ID NO: 40)

18. A/NewJersey/8/1976(HlNl) (SEQ ID NO: 41 )

19. A/Denver/1957(H1N1) (SEQ ID NO: 42)

20. A/Albany/4835/1948(HlNl ) (SEQ ID NO: 43)

21. A/FortMonmouth/l/1947(HlNl) (SEQ ID NO: 44)

22. A/Cameron/1946(HlNl) (SEQ ID NO: 45)

23. A/Weiss/1943(H1N1) (SEQ ID NO: 46)

24. A/Iowa/1943(HlNl) (SEQ ID NO: 47)

25. A/Bellamy/1942(H1N1) (SEQ ID NO: 48)

26. A/PuertoRico/8/1934(HlNl) (SEQ ID NO: 49)

27. A/WSN/1933(H1N1) (SEQ ID NO: 50)

28. A/SouthCarolina/l/1918(HlNl) (SEQ ID NO: 51 )

MKVKLLVLLC TFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCL 60 MKVKLLVLLC TFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL ENSHNGKLCL 60 MKAKLLVLLC TFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCL 60 MKAILWLLY TFATANADTL CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDKHNGKLCK 60 MEAKLFVLFC AFTALKADTF CVGYHANYST HTVDTILEKN VTVTHSVNLL ENSHNGKLCS 60 MEAKLFVLFC AFTALKADTI CVGYHANNST DTVDTILEKN VTVTHSINLL ENNHNGKLCS 60 MKVKLLVLLC TFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCL 60 MKVKLLVLLC TFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCL 60 MKAKLLVLLC AFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

10. MKAKLLVLLC AFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

11. MKAKLLVLLC TFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCL 60

12. MKAKLLVLLC AFTATYADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

13. MKAKLLVLLC AFTATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

14. MKAKLLVLLC ALSATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDNHNGKLCK 60

15. MKAKLLVLLC ALSATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

16. MKAKLLVLLC ALSATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

17. MKAKLLVLLC ALSATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

18. MKAKLLVLLC AFTATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

19. MKAKLLILLC ALSATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

20. MKAKLLVLLC ALSATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

21. MKAKLLILLC ALTATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

22. MKAKLLILLC ALSATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

23. MKARLLVLLC ALAATDADTI CIGYHANNST DTVDTILEKN VTVTHSVNLL EDSHNGKLCR 60

24. MKARLLVLLC ALAATDADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60

25. MKARLLVLLC AIAATDADTI CIGYHANNST DTVDTILEKN VTVTHSVNLL EDSHNGKLCR 60 MKANLLVLLC ALAAADAD I CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCR 60 MKAKLLVLLY AFVATDADTI CIGYHANNST DTVDTIFEKN VAVTHSVNLL EDRHNGKLCK 60 MEARLLVLLC AFAATNADTI CIGYHANNST DTVDTVLEKN VTVTHSVNLL EDSHNGKLCK 60 *. *..** .. .. ***. ********** *****..*** *.******** *. ******

LKGIAPLQLG NCSVAGWILG NPECELLISR ESWSYIVEKP NPENGTCYPG HFADYEELRE 120

LKGIAPLQLG NCSVAGWILG NPECELLISK ESWSYIVEKP NPENGTCYPG HFADYEELRE 120

LKGIAPLQLG NCSVAGWILG NPECELLISR ESWSYIVETP NPENGTCYPG YFADYEELRE 120

LRGVAPLHLG KCNIAGWILG NPECESLSTA SSWSYIVETP SSDNGTCYPG DFIDYEELRE 120

LNGKIPLQLG NCNVAGWILG NPKCDLLLTA ΝΞΞΞΥΙΙΕΤΞ KSKNGACYPG EFADYEELKE 120

LNGKAPLQLG NCNVAGWILG NPECDLLLTV DSWSYIIETS NSKNGACYPG EFADYEELRE 120

LKGIAPLQLG NCSVAGWILG NPECELLISK ESWSYIVETP NPENGTCYPG YFADYEELRE 120

LKGIAPLQLG NCSVAGWILG NPECELLISR ESWSYIVEKP NPENGTCYPG HFADYEELRE 120

LKGTAPLQLG NCSIAGWILG NPECESLFSK ESWSYIAETP NPKNGTCYPG YFADYEELRE 120

LKGTAPLQLG NCSVAGWILG NPECESLFSK ESWSYIAETP NPENGTCYPG YFADYEELRE 120

LKGIAPLQLG NCSVAGWILG NPECESLISK ESWSYIVETP NPENGTCYPG YFADYEELRE 120

LKGIAPLQLG NCSVAGWILG NPKCESLFSK ESWSYIAETP NPENGTCYPG YFADYEELRE 120

LKGIAPLQLG NCSIAGWILG NPECESLFSK KSWSYIAETP NSENGTCYPG YFADYEELRE 120

LKGIAPLQLG KCSIAGWILG NPECESLFSK KSWSYIAETP NSENGTCYPG YFADYEELRE 120

LKGIAPLQLG KCNIAGWILG NPECESLFSK KSWSYIAETP NSENGTCYPG YFADYEELRE 120

LKGIAPLQLG NCSIAGWILG NPECESLFSK KSWSYIAETP NSENGTCYPG YFADYEELRE 120

LKGKAPLQLG NCNIAGWVLG NPECESLLSN RSWSYIAETP NSENGTCYPG DFADYEELRE 120

LKGIAPLQLG KCNIAGWILG NPECESLFSK KSWSYIAETP NSENGTCYPG YFADYEELRE 120

LKGIAPLQLG KCNIAGWILG NPECESLLSK RSWSYIAETP NSENGACYPG DFADYEELRE 120

LKGIAPLQLG KCNIAGWILG NPECESLLSE RSWSYIVEIP NSENGTCYPG DFTDYEELRE 120

LKGIAPLQLG KCNIAGWILG NPECESLLSE RSWSYIVETP NSENGTCYPG DFIDYEELRE 120

LKGIAPLQLG KCNIAGWLLG NPECDPLLPV RSWSYIVETP NSENGICYPG DFIDYEELRE 120

LKGIAPLQLG KCNITGWLLG NPECDSLLPA RSWSYIVETP NSENGACYPG DFIDYEELRE 120

LKGIAPLQLG KCNIAGWLLG NPECDLLLTA SSWSYIVETS NSENGTCYPG DFIDYEELRE 120

* . * ***.** * ******* QLSSVSSFER FEIFPKESSW PNHTTT-GVS ASCSHNGESS FYKNLLWLTG KNGLYPNLSK 179 QLSSVSSFER FEIFPKESSW PNHTVT-GVS ASCSHNGESS FYRNLLWLTG KNGLYPNLSK 179 QLSSVSSFER FEIFPKESSW PNHTVT-GVS ASCSHNGKSS FYRNLLWLTG KNGLYPNLSK 179 QLSSVSSFER FEIFPKTSSW PNHDSNKGVT AACPHAGAKS FYKNLIWLVK KGNSYPKLSK 180 QLSTVSSFER FEIFPKAISW PDHDATRGTT VACSHSGVNS FYRNLLSTVK KGNSYPKLSK 180 QLSTVSSFER FEIFPKATSW PNHDTTRGTT ISCSHSGANS FYRNLLWIVK KGNSYPKLSK 180 QLSSVSSFER FEIFPKESSW PNHTVT-GVS ASCSHNGKSS FYRNLLWLTG KNGLYPNLSK 179 QLSSVSSFER FEIFPKESSW PNHTTT-GVS ASCSHNGESS FYKNLLWLTG KNGLYPNLSK 179 QLSSVSSFER FEIFPKDSSW PNHTVTKGVT ASCSHNGKSS FYKNLLWLTE KNGLYPNLSK 180 QLSSVSSFER FEIFPKESSW PNHTVTKGVT ASCSHNGKSS FYKNLLWLTE KNGLYPNLSK 180 QLSSVSSFER FEIFPKESSW PNHTVT-GVT ASCSHNGKSS FYRNLLWLTE KNGLYPNLSN 179 QLSSVSSFER FEIFPKESSW PNHTVTKGVT TSCSHNGKSS FYRNLLWLTK KNGLYPNVSK 180 QLSSVSSFER FEIFPKESSW PNHTVTKGVT ASCSHKGRSS FYRNLLWLTK KNGSYPNLSK 180 QLSSVSSFER FEIFPKESSW PKHNVTKGVT AACSHKGKSS FYRNLLWLTE KNGSYPNLSK 180 QLSSVSSFER FEIFPKESSW PKHSVTRGVT ASCSHKGKSS FYRNLLWLTE KNGSYPNLSK 180 QLSSVSSFER FEIFPKERSW PKHNITRGVT ASCSHKGKSS FYRNLLWLTE KNGSYPNLSK 180 QLSSVSSFER FEIFPKERSW PKHNVTRGVT ASCSHKGKSS FYRNLLWLTE KNGSYPNLSK 180 QLSSVSSFER FEIFPKESSW PNHTVTKGVT ASCSHKGRSS FYRNLLWLTK KNGSYPNLSK 180 QLSSVSSFER FEIFPKERSW PNHTTR-GVT AACPHARKSS FYKNLVWLTE ANGSYPNLSR 179 QLSSVSSFER FEIFPKERSW PKHNITRGVT AACSHKGKSS FYRNLLWLTE KNGSYPNLNK 180 QLSSVSSFER FEIFPKERSW PKHNITRGVT AACSHAGKSS FYKNLLWLTE TDGSYPKLSK 180 QLSSVSSFER FEIFPKGRSW PEHNIDIGVT AACSHAGKSS FYKNLLWLTE KDGSYPNLNK 180 QLSSVSSFER FEIFPKESSW PKHNTARGVT AACSHAGKSS FYRNLLWLTE KDGSYPNLKN 180 QLSSVSSFER FEIFSKESSW PKHTTG-GVT AACSHAGKSS FYRNLLWLTE KDGSYPNLNN 179 QLSSVTSFER FEIFPKETSW PKHNTTKGVT AACSHAGKCS FYRNLLWLTE KDGSYPNLNN 180 QLSSVSSFER FEIFPKESSW PNHNTN-GVT AACSHEGKSS FYRNLLWLTE KEGSYPKLKN 179 27. QLSSVSSLER FEIFPKESSW PNHTFN-GVT VSCSHRGKSS FYRNLLWLTK KGDSYPKLTN 179

28. QLSSVSSFEK FEIFPKTSSW PNHETTKGVT AACSYAGASS FYRNLLWLTK KGSSYPKLSK 180

1. SYANNKEKEV LVLWGVHHPP NIGDQRALYH KENAYVSWS SHYSRKFTPE IAKRPKVRDQ 239

2. SYANNKEKEV LVLWGVHHPP NIGNQKALYH TENAYVSWS SHYSRKFTPE IAKRPKVRDQ 239 3. SYVNNKEKEV LVLWGVHHPP NIGNQRALYH TENAYVSWS SHYSRRFTPE IAKRPKVRDQ 239

4. SYINDKGKEV LVLWGIHHPS TSADQQSLYQ NADAYVFVGS SRYSKKFKPE IAIRPKVRXX 240

5. SYTNNKGKEV LVIWGVHHPP TDSVQQTLYQ NKHTYVSVGS SKYYKRFTPE IVARPKVRGQ 240

6. SYTNNKGKEV LVIWGVHHPP TDSDQQTLYQ NNHTYVSVGS SKYYQRFTPE IVTRPKVRGQ 240

7. SYANNKEKEV LVLWGVHHPP NIGDQRALYH TENAYVSWS SHYSRRFTPE IAKRPKVRDQ 239

8. SYANNKEKEV LVLWGVHHPP NIGDQRALYH KENAYVSWS SHYSRKFTPE IAKRPKVRDQ 239

9. SYVNKKGKEV LVLWGVHHPS NMGDQRAIYH KENAYVSVLS SHYSRRFTPE IAKRPKVRDQ 240

10. SYVNNKEKEV LVLWGVHHPS NIGDQRAIYH TENAYVSWS SHYSRRFTPE ITKRPKVRDQ 240

11. SYVNNKEKEV LVLWGVHHPS NIRDQRAIYH TENAYVSWS SHYSRRFTPE IAKRPKVRGQ 239

12. SYVNNKEKEV LVLWGVHHPS NIGDQRAIYH TENAYVSWS SHYSRRFTPE IAKRPKVRDQ 240

13. SYVNNKEKEV LVLWGVHHPS NIGDQRAIYH TENAYVSWS SHYNRRFTPE IAKRPKVRDQ 240

14. SYVNNKEKEV LVLWGVHHPS NIEDQKTIYR KENAYVSWS SHYNRRFTPE IAKRPKVRNQ 240

15. SYVNDKEKEV LVLWGVHHPS NIEDQKTIYR KENAYVSWS SHYNRRFTPE IAKRPKVRDQ 240

16. SYVNNKEKEV LVLWGVHHPS NIEDQKTIYR KENAYVSWS SNYNRRFTPE IAKRPKVRGQ 240

17. SYVNNKEKEV LVLWGVHHPS NIEDQKTIYR KENAYVSWS SNYNRRFTPE IAERPKVRGQ 240

18. SYVNNKEKEV LVLWGVHHPS NIGDQRAIYH TENAYVSWS SHYNRRFTPE IAKRPKVRDQ 240

19. SYVNNQEKEV LVLWGVHHPS NIEEQRALYR KDNAYVSWS SNYNRRFTPE IAKRPKVRDQ 239

20. SYVNNKEKEV LVLWGVHHPS NIEDQKTLYR KENAYVSWS SNYNRRFTPE IAERPKVRGQ 240

21. SYVNNKEKEV LVLWGVHHPS NIEDQKTLYR KENAYVSWS SNYNRRFTPE IAERPKVRGQ 240

22. SYVNKKEKEV LILWGVHHPP NIENQKTLYR KENAYVSWS SNYNRRFTPE IAERPKVRGQ 240

23. SYVNKKGKEV LVLWGVHHPS SIKEQQTLYQ KENAYVSWS SNYNRRFTPE IAERPKVRDQ 240

24. SYVNKKGKEV LVLWGVHHPS NIKDQQTLYQ KENAYVSWS SNYNRRFTPE IAERPKVRGQ 239

25. SYVNKKGKEV LVLWGVHHPS NIKDQQTLYQ KENAYVSWS SNYNRRFTPE IAERPKVRGQ 240

26. SYVNKKGKEV LVLWGIHHPP NSKEQQNLYQ NENAYVSWT SNYNRRFTPE IAERPKVRDQ 239

27. SYVNNKGKEV LVLWGVHHPS SSDEQQSLYS NGNAYVSVAS SNYNRRFTPE IAARPKVKDQ 239

28. SYVNNKGKEV LVLWGVHHPP TGTDQQSLYQ NADAYVSVGS SKYNRRFTPE IAARPKVRDQ 240

1. EGRINYYWTL LEPGDTIIFE ANGNLIAPRY AFALSRGFGS GI INSNAPMD ECDAKCQTPQ 299 2. EGRINYYWTL LEPGDTIIFE ANGNLIAPRY AFALSRGFGS GI INSNAPMD KCDAKCQTPQ 299 3. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNAPMD ECDAKCQTPQ 299 4. EGRMNYYWTL VEPGDKITFE ATGNLWPRY AFAMERNAGS GIIISDTPVH DCNTTCQTPK 300 5. AGRMNYYWTL FDQGDTITFE ATGNLIAPWH AFALKKGSSS GIMLSDAQVH NCTTKCQTPH 300 6. AGRMNYYWTL LDQGDTITFE ATGNLIAPWH AFALNKGPSS GIMISDAHVH NCTTKCQTPH 300 7. EGRINYYWTL LEPGDTIIFE ANGNLIAPRF AFALSRGFGS GIITSNAPMD ECDAKCQTPQ 299 8. EGRINYYWTL LEPGDTIIFE ANGNLIAPRY AFALSRGFGS GI INSNAPMD ECDAKCQTPQ 299 9. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIIISNASMG ECDAKCQTPQ 300 10. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GI ITSNASMG ECDAKCQTPQ 300 11. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNAPMN ECDAKCQTPQ 299 12. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNASMD ECDAKCQTPQ 300 13. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNASMD ECDAKCQTPQ 300 14. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNASMD ECDAKCQTPQ 300 15. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNVSMD ECDAKCQTPQ 300 16. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNASMD ECDTKCQTPQ 300 17. AGRINYYWTL LEPGDTIIFE ANGNLIAPWH AFALNRGFGS GIITSNASMD ECDTKCQTPQ 300 18. EGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNASMD ECDAKCQTPQ 300 19. SGRMNYYWTL LEPGDTIIFE ATGNLIAPWY AFALSRGPGS GI ITSNAPLD ECDTKCQTPQ 299 20. AGRINYYWTL LEPGDTIIFE ANGNLIAPWH AFALSRGFGS GIITSNASMD ECDTKCQTPQ 300 21. AGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRDFGS GIITSNASMD ECDTKCQTPQ 300 22. AGRINYYWTL LEPGDTIIFE ANGNLIAPWY AFALNRGIGS GIITSNASMD ECDTKCQTPQ 300 23. AGRMNYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNASMH ECDTKCQTPQ 300 24. AGRINYYWTL LKPGDTIMFE ANGNLIAPWY AFALSRGFGS GIITSNASMH ECDTKCQTPQ 299 GRMNYYWTL LEPGDTIIFE ANGNLIAPWY AFALSRGFGS GIITSNASMH ECNTKCQTPQ 300GRMNYYWTL LKPGDTIIFE ANGNLIAPMY AFALRRGFGS GIITSNASMH ECNTKCQTPL 299GRMNYYWTL LEPGDTIIFE ATGNLIAPWY AFALSRGFES GIITSNASMH ECNTKCQTPQ 299GRMNYYWTL LEPGDTITFE ATGNLIAPWY AFALNRGSGS GIITSDAPVH DCNTKCQTPH 300

**.****** ..*** * ** * ***. * ***. * * *** *. . *.. **** AINSSLPFQ NVHPVTIGEC PKYVRSAKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 359AINSSLPFQ NVHPVTIGEC PKYVRSAKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 359AINSSLPFQ NVHPVTIGEC PKYVRSAKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 359AINTSLPFQ NIHPITIGKC PKYVKSTKLR LATGLRNIPS IQSRGLFGAI AGFIEGGWTG 360ALKNNLPLQ NVHLFTIGEC PKYVKSTQLR MATGLRNIPS IQSRGLFGAI AGFIEGGRTG 360ALKSNLPFQ NVHPSTIGEC PKYVKSTQLR MATGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NVHPVTIGEC PKYVRSAKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 359AINSSLPFQ NVHPVTIGEC PKYVRSAKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 359AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNVPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 359AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NIHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NVHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NIHPVTIGEC PKYVRSTKLR MVTGLRNIPS VQSRGLFGAI AGFIEGGWTG 359AINSSLPFQ NIHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NIHPVTIGEC PKYVKSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NIHPFTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWDG 360AINSSLPFQ NIHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPFQ NIHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 359AINSSLPFQ NIHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 360AINSSLPYQ NIHPVTIGEC PKYVRSAKLR MVTGLRNIPS IQSRGLFGAI AGFIEGGWTG 359SINSNLPFQ NIHPVTIGEC PKYVRSTKLR MVTGLRNIPS IQYRGLFGAI AGFIEGGWTG 359AINSSLPFQ NIHPVTIGEC PKYVRSTKLR MATGLRNIPS IQSRGLFGAI AGFIEGGWTG 360

******* ******** VDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 419VDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 419VDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 419VDGWYGYHH QNEQGSGYAA DLKSTQNAID EITNKVNSVI EKMNTQFTAV GKEFNHLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQIAID GI NKANSVI GKMNIQLTSV GKEFNSLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQIAID GI NKVNSII EKMNTQFTSV GKEFNDLEKR 420VDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 419VDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 419IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSII EKMNTQFTAV GKEFNKLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAID GITNKVNSVI EKMNTQFTAV GKEFNKLERR 420MDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 419IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSII EKMNTQFTAV GKEFNKLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLERR 420MDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLEKR 419IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN WITNKVNSVI EKMNTQFTAV GKEFNKLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNKLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNNLEKR 420IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNNLEKR 419IDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNNLEKR 420 MIDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNIQFTAV GKEFNKLEKR 419 MIDGWYGYHH QNEQGSGYAA DQKSTQNAIN GITNKVNSVI EKMNTQFTAV GKEFNNLEKR 419 MIDGWYGYHH QNEQGSGYAA DQKSTQNAID GITNKVNSVI EKMNTQFTAV GKEFNNLERR 420 *.******** ********** * *******. *******.* **** ***** *****.**.*

MENLNKKVDD GFIDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 479 MENLNKKVDD GFIDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 479 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 479 IENLNKKVDD GFLDIWTYNA ELLVLLENER TLDYHDSNVK NLYEKVRSQL KNNAKEIGNG 480 KENLNKTVDD RFLDVWTFNA ELLVLLENQR TLEFHDLNIK SLYEKVKSHL RNNDKEIGNG 480 IENLNKKVDD GFLDVWTYNA ELLILLENER TLDFHDFNVK NLYEKVKSQL RNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 479 MENLNKKVDD GFIDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 479 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDLNVK NLYEKVKNQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKTQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 479 MENLNKKVDD GFLDIWTYNA ELLVLLENGR TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 480 MENLNKKVDD GFMDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKNQL RNNAKELGNG 479 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKNQL RNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKNQL RNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLILLENER TLDFHDSNVK NLYEKVKSQL RNNAKEIGNG 480 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKNQL RNNAKEIGNG 479 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL RNNAKEIGNG 480 MENLNNKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVK NLYEKVKSQL KNNAKEIGNG 479 MENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDLNVK NLYEKVKSQL KNNAKEIGNG 479 IENLNKKVDD GFLDIWTYNA ELLVLLENER TLDFHDSNVR NLYEKVKSQL KNNAKEIGNG 480

.****.**** **.******* ***.**** ****** .

CFEFYHKCND ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 539

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 539

CFEFYHKCDN TCMESVKNGT YDYPKYSEEA KLNREEIDGV KLESTRIYQI LAIYSTVASS 540

CFEFYHKRDN ECLECVKNGT ΥΝΥΡΚΥΞΕΕΞ KFNREEIVGV KLESMGIHQI LAIYSTVASS 540

CFEFYHKCDN ECMESVKNGT ΥΝΥΡΚΥΞΕΕΞ KLNREKIDGV KLESMGVHQI LAIYSTVASS 540

CFEFYHKCND ECMESVKNGT YDYPKYSEES KLNRERIDGV KLESMGVYQI LAIYSTVASS 539

CFEFYHKCND ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 539

CFEFYHKCNN ECMESVKNGT YDYPKYSKES KLNREKIDGV KLESMGVYQI LAIYSTVASS 540

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 540

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 539

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNRGKIDGV KLESMGVYQI LAIYSTVASS 540

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 540

CFEFYHKCDN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYRI LAIYSTVASS 539

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 540

CFEFYHKCNN ECMESVKNGT YDYPKFSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 540

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 540

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIΥΞΤΑΑΞΞ 539

CFEFYHKCNN ECMESVKNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 540 CFEFYHKCDN ECMESVRNGT YDYPKYSEES KLNREKVDGV KLESMGIYQI LAIYSTVASS 539 CFEFYHKCDN ECMESVRNGT YDYPKYSEES KLNREKIDGV KLESMGVYQI LAIYSTVASS 539 CFEFYHKCDD ACMESVRNGT YDYPKYSEES KLNREEIDGV KLESMGVYQI LAIYSTVASS 540

******** . *****.*** *****. . *** **** ****** ***

LVLLVSLGAI SF MCSNGSL QCRICI 565