Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
IMMUNOGENIC PROTEINS AND FRAGMENTS THEREOF FROM ALLERGENIC MITES
Document Type and Number:
WIPO Patent Application WO/2017/203057
Kind Code:
A1
Abstract:
The invention relates to novel immunogenic polypeptides identified in house dust mites and storage mites, which have the potential to be used in allergy immunotherapy, for diagnostic purposes, eventually via production of antibodies binding the polypeptide or for characterising allergen extracts of house dust mites and storage mites.

Inventors:
PETERS BJOERN (US)
LUND GITTE (DK)
CHRISTENSEN LARS HARDER (DK)
STRANZL THOMAS (DK)
SETTE ALESSANDRO (US)
Application Number:
PCT/EP2017/062866
Publication Date:
November 30, 2017
Filing Date:
May 29, 2017
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
ALK-ABELLÓ AS (DK)
LA JOLLA INST ALLERGY & IMMUNOLOGY (US)
International Classes:
A61K39/35; C07K14/435
Domestic Patent References:
WO2015100360A12015-07-02
WO2017055235A12017-04-06
WO2012049310A12012-04-19
WO2005040219A12005-05-06
WO2007031080A12007-03-22
Foreign References:
JP2007244216A2007-09-27
US5925565A1999-07-20
US5935819A1999-08-10
US20050238646A12005-10-27
US20020161201A12002-10-31
Other References:
S. BANERJEE ET AL: "Conversion of Der p 23, a New Major House Dust Mite Allergen, into a Hypoallergenic Vaccine", THE JOURNAL OF IMMUNOLOGY, vol. 192, no. 10, 14 April 2014 (2014-04-14), US, pages 4867 - 4875, XP055400768, ISSN: 0022-1767, DOI: 10.4049/jimmunol.1400064
S. DEAN RIDER ET AL: "Draft genome of the scabies mite", PARASITES & VECTORS, vol. 8, no. 1, 10 December 2015 (2015-12-10), XP055401087, DOI: 10.1186/s13071-015-1198-2
TANG VIVIAN ET AL: "Identification and Characterization of a Group of Polymorphic, Single Domain Peptidoglycan Hydrolases of the NlpC/P60 Superfamily in Dust Mites", FASEB JOURNAL, vol. 29, no. Suppl. 1, April 2015 (2015-04-01), & EXPERIMENTAL BIOLOGY MEETING 2015; BOSTON, MA, USA; MARCH 28 -APRIL 01, 2015, pages 720.2, XP002773204
K. Y. JEONG ET AL: "Immunoglobulin E Reactivity of Recombinant Allergen Tyr p 13 from Tyrophagus putrescentiae Homologous to Fatty Acid Binding Protein", CLINICAL AND VACCINE IMMUNOLOGY, vol. 12, no. 5, 1 May 2005 (2005-05-01), US, pages 581 - 585, XP055401178, ISSN: 1556-6811, DOI: 10.1128/CDLI.12.5.581-585.2005
DATABASE UniProt [online] 20 February 2007 (2007-02-20), "RecName: Full=Superoxide dismutase [Cu-Zn] {ECO:0000256|RuleBase:RU000393}; EC=1.15.1.1 {ECO:0000256|RuleBase:RU000393};", XP002773205, retrieved from EBI accession no. UNIPROT:A2I463 Database accession no. A2I463
DATABASE UniProt [online] 17 October 2006 (2006-10-17), "RecName: Full=Superoxide dismutase [Cu-Zn] {ECO:0000256|RuleBase:RU000393}; EC=1.15.1.1 {ECO:0000256|RuleBase:RU000393};", XP002773206, retrieved from EBI accession no. UNIPROT:Q09JE3 Database accession no. Q09JE3
DATABASE Protein [online] 21 September 2015 (2015-09-21), XP002773207, retrieved from NCBI Database accession no. XP_005494816
DATABASE Protein [online] 18 June 2015 (2015-06-18), XP002773208, retrieved from NCBI Database accession no. XP_012788259
SMITH; WATERMAN, ADV. APPL. MATH, vol. 2, 1981, pages 482
NEEDLEMAN; WUNSCH, J. MOL. BIOL., vol. 48, 1970, pages 443
PEARSON; LIPMAN, PROC. NATL. ACAD. SCI. USA, vol. 85, 1988, pages 2444
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403 - 10
ALTSCHUL ET AL., J. MOL. BIOL., vol. 215, 1990, pages 403
PEARSON ET AL., PROC. NATL. ACAD. SCI. USA, vol. 85, 1988, pages 2444
PEARSON, METHODS MOL. BIOL., vol. 132, 2000, pages 185
SMITH ET AL., J. MOL. BIOL., vol. 147, 1981, pages 195
BOSTICK ET AL., BIOCHEM BIOPHYS RES COMMUN., vol. 304, 2003, pages 320
"Remington: The Science and Practice of Pharmacy 20th ed.,", 2003, MACK PUBLISHING CO.
"Remington's Pharmaceutical Sciences 18th ed.,", 1990, MACK PUBLISHING CO.
"The Merck Index 12th ed.,", 1996, MERCK PUBLISHING GROUP
"Pharmaceutical Principles of Solid Dosage Forms", 1993, TECHNONIC PUBLISHING CO., INC.
ANSEL AD SOKLOSA: "Pharmaceutical Calculations 11th ed.,", 2001, LIPPINCOTT WILLIAMS & WILKINS
POZNANSKY ET AL.: "Drug Delivery Systems", 1980, pages: 253 - 315
III ET AL., PROTEIN ENG, vol. 10, 1997, pages 949 - 57
HOLLIGER; HUDSON, NAT BIOTECHNOL, vol. 23, 2005, pages 1126 - 1136
TRAUGER A. ET AL., SPECTROSCOPY, vol. 16, no. 1, 2002, pages 15 - 28
WELLS W ET AL., JOURNAL OF PROTEOME RESEARCH, vol. 5, no. 3, 2006, pages 651 - 658
COOPER; J. FENG; W. GARRETT, SPECTROSCOPY, vol. 21, no. 9, 2010, pages 1534 - 1546
HAQQANI AS ET AL., METHODS MOL. BIOL., vol. 439, 2008, pages 241 - 56
BRET, COOPER; J. FENG; W. GARRETT, SPECTROSCOPY, vol. 21, no. 9, 2010, pages 1534 - 1546
GOODMAN R. ET AL., CLIN TRANSL ALLERGY., vol. 4, no. 2, 2014, pages 12
HENMAR H ET AL., CLIN EXP IMMUNOL, vol. 153, no. 3, 2008, pages 316 - 23
ISHIHAMA Y; ODA Y; TABATA T; SATO T; NAGASU T; RAPPSILBER J; MANN M.: "Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein", MOL CELL PROTEOMICS, vol. 4, 2005, pages 1265 - 72, XP002494877, DOI: doi:10.1074/mcp.M500061-MCP200
Attorney, Agent or Firm:
INSPICOS P/S (DK)
Download PDF:
Claims:
CLAIMS

1. A polypeptide comprising or consisting of

(a) an amino acid sequence selected from the group consisting of any one of SEQ ID NOs: 31, 291, 1-30, 32-44, 261-290, 292-332, or

(b) an amino acid sequence consisting of at least or exactly 9 contiguous amino acid residues from the amino acid sequence of (a), or

(c) an amino acid sequence having a sequence identity of at least 60% with the amino acid sequence of (a), or

(d) an amino acid sequence having a sequence identity of at least 60% with the amino acid sequence of (b).

2. The polypeptide according to claim 1, wherein the sequence identity in option (c) is at least 65%, such as at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%. 3. The polypeptide according to any one of claims 1 or 2, wherein the polypeptide of option (c) has the same biological activity or functionality as the polypeptide of option (a) for example, the same enzymatic functionality, optionally the same, greater or less ability to elicit, stimulate or induce an immune response (e.g. in vitro T cell proliferation or T cell cytokine production, such as the cytokines, IL-4, IL-5, IL-13 and/or IL-10) in blood from a mite allergic individual; optionally the same, greater or less ability to induce immunological tolerance against mites, a mite allergen or the polypeptide of option (a) and/or to bind or interact with IgE, IgG or IgA antibodies raised against the polypeptide of option (a).

4. The polypeptide according to claim 1, wherein the at least or exactly 9 contiguous amino acid residues of option (b) or (d) constitute at least or exactly or at most 10, at least or exactly or at most 11, at least or exactly or at most 12, at least or exactly or at most 13, at least or exactly or at most 14, at least or exactly or at most 15, at least or exactly or at most 16, at least or exactly or at most 17, at least or exactly or at most 18, at least or exactly or at most 19, at least or exactly or at most 20, at least or exactly or at most 21, at least or exactly or at most 22, at least or exactly or at most 23, at least or exactly or at most 24, at least or exactly or at most 25, at least or exactly or at most 26, at least or exactly or at most 27 at least or exactly or at most 28, at least or exactly or at most 29, at least or exactly or at most 30, at least or exactly or at most 31, at least or exactly or at most 32, at least or exactly or at most 33, at least or exactly or at most 34, at least or exactly or at most 35, at least or exactly or at most 36, at least or exactly or at most 37, at least or exactly or at most 38, at least or exactly or at most 39, at least or exactly or at most 40, at least or exactly or at most 41, at least or exactly or at most 42, at least or exactly or at most 43, at least or exactly or at most 44, at least or exactly or at most 45, at least or exactly or at most 46, at least or exactly or at most 47, at least or exactly or at most 48, at least or exactly or at most 49, at least or exactly or at most 50, at least or exactly or at most 51, at least or exactly or at most 52, at least or exactly or at most 53, at least or exactly or at most 54, at least or exactly or at most 55, at least or exactly or at most 56, at least or exactly or at most 57, at least or exactly or at most 58, at least or exactly or at most 59, at least or exactly or at most 60, at least or exactly or at most 61, at least or exactly or at most 62, at least or exactly or at most 63, at least or exactly or at most 64, at least or exactly or at most 65, at least or exactly or at most 66, at least or exactly or at most 67, at least or exactly or at most 68, at least or exactly or at most 69, at least or exactly or at most 70, at least or exactly or at most 71, at least or exactly or at most 72, at least or exactly or at most 73, at least or exactly or at most 74, at least or exactly or at most 75, at least or exactly or at most 76, at least or exactly or at most 77, at least or exactly or at most 78, at least or exactly or at most 79, at least or exactly or at most 80, at least or exactly or at most 81, at least or exactly or at most 82, at least or exactly or at most 83, at least or exactly or at most 84, at least or exactly or at most 85, at least or exactly or at most 86, at least or exactly or at most 87, at least or exactly or at most 88, at least or exactly or at most 89, at least or exactly or at most 90, at least or exactly or at most 91, at least or exactly or at most 92, at least or exactly or at most 93, at least or exactly or at most 94, at least or exactly or at most 95, at least or exactly or at most 96, at least or exactly or at most 97, at least or exactly or at most 98, at least or exactly or at most 99, at least or exactly or at most 100, at least or exactly or at most 101, at least or exactly or at most 102, at least or exactly or at most 103, at least or exactly or at most 104, at least or exactly or at most 105, at least or exactly or at most 106, at least or exactly or at most 107, at least or exactly or at most 108, at least or exactly or at most 109, at least or exactly or at most 110, at least or exactly or at most 111, at least or exactly or at most 112, at least or exactly or at most 113, at least or exactly or at most 114, at least or exactly or at most 115, at least or exactly or at most 116, at least or exactly or at most 117, at least or exactly or at most 118, at least or exactly or at most 119, at least or exactly or at most 120, at least or exactly or at most 121, at least or exactly or at most 122, at least or exactly or at most 123, or at least or exactly or at most 124 amino acid residues.

5. The polypeptide according to any one of claims 1 or 4, wherein the contiguous amino acid residues of option (b) or (d) commence

• at amino acid residue 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40,42,

43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, or 117 in any one of SEQ ID NOs: 1-44 and 261-332; or

at amino acid residue 118, 119, 120, 121, or 122 in any one of SEQ ID NOs: 1-44, 261-304, 306-318, and 320-332; or

at amino acid residue 123 or 124 in any one of SEQ ID NOs: 2-44, 262-304, 306-318, and 320-332; or

at amino acid residue 125 or 126 in any one of SEQ ID NOs: 4-44, 264-304, 306-318, and 320-332; or

at amino acid residue 127, 128, or 129 in any one of SEQ ID NOs: 5-44, 265-304, 306- 318, and 320-332; or

at amino acid residue 130, 131, 132, 133 , 134, 135, 136, 135, 136, 137, 138, 139, 140, 141, 142 in any one of SEQ ID NOs: 5-44, 265-304, 306-314, 316-318, 320-328, and 330-332; or

at amino acid residue, 143 or 144 in any one of SEQ ID NOs: 5-44, 265-304, 306-313, 316-318, 320-327, and 330-332; or

at amino acid residue 145 and 146 in any one of SEQ ID NOs: 7-44, 267-304, 306-313, 316-318, 320-327, and 330-332; or

at amino acid residue 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, or 163 in any one of SEQ ID NOs: 9-44, 269-304, 306-313, 316- 318, 320-327, and 330-332; or

at amino acid residue 164 in any one of SEQ ID NOs: 10-44, 270-304, 306-313, 316- 318, 320-327, and 330-332; or

at amino acid residue 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178 in any one of SEQ ID NOs: 11-44, 271-304, 306-313, 316-318, 320-327, and 330-332; or

at amino acid residue 179 or 180 in any one of SEQ ID NOs: 11-44 and 271-304, 306- 313, 316-317, 320-327, and 330-331; or

at amino acid residue 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, or 214 in any one of SEQ ID NOs: 13-44, 273-304, 306-313, 316- 317, 320-327, and 330-331; or

at amino acid residue 215, 216, 217, 218, 219, or 220 in any one of SEQ ID NOs: in any one of SEQ ID NOs: 15-44, 275-304, 306-313, 316-317, 320-327, and 330-331; or

at amino acid residue 221 in any one of SEQ ID NOs: 17-44 and 275-304; or at amino acid residue 222, 223, or 224 in any one of SEQ ID NOs: 17, 19-44, 277,

279-304, 306-313, 316-317, 320-327, and 330-331; or

at amino acid residue 225 in any one of SEQ ID NOs: 17, 19-44, 277, 279-304, 306- 313, 317, 320-327, and 331; or at amino acid residue 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, or 244 in any one of SEQ ID NOs: 19-44, 279-304, 306- 313, 317, 320-327, and 331; or

at amino acid residue 245 in any one of SEQ ID NOs: 20-44, 280-304, 306-313, and 320-327; or

at amino acid residue 246, 247, 248, 249, 250, 251, or 252 in any one of SEQ ID NOs: 21-44 and 281-304, 306-313, and 320-327; or

at amino acid residue 253 in any one of SEQ ID NOs: 21-44, 281-304, 306-312, and 320-326; or

at amino acid residue 254 in any one of SEQ ID NOs: 22-44 and 282-304; or at amino acid residue 255, 256, 257, 258, 259, 260, 261, or 262 in any one of SEQ ID

NOs: 23-44, 283-304, 306-312, and 320-326; or

at amino acid residue 263 in any one of SEQ ID NOs: in any one of SEQ ID NOs: 24-44, 284-304, 306-312, and 320-326; or

at amino acid residue 264, 265, 266, or 267 in any one of SEQ ID NOs: 25-44, 285- 304, 306-312, and 320-326; or

at amino acid residue 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301 in any one of SEQ ID NOs: 26-44, 286-304, 306-312, and 320-326; or

at amino acid residue 302 in any one of SEQ ID NOs: 26-44, 286-304, 306-311, and 320-325; or

at amino acid residue, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, or 313 in any one of SEQ ID NOs: 27-44, 287-304, 306-311, and 320-325; or

at amino acid residue 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, or 324 in any one of SEQ ID NOs: 28-44, 288-304, 306-311, and 320-325; or

at amino acid residue 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336,

337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, or

353 in any one of SEQ ID NOs: 30-44, 290-304, 306-311, and 320-325; or

at amino acid residue 354 in any one of SEQ ID NOs: 31-44 291-304, 306-311, and

320-325; or

at amino acid residue 355 in any one of SEQ ID NOs: 32-44 292-304, 306-311, and 320-325; or

at amino acid residue 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, or 421 in any one of SEQ ID NOs: 32-44, 292-304, 307-311, and 321-325; or at amino acid residue 422, 423, 424, 425, or 426 in any one of SEQ ID NOs: 33-44, 293-304, 307-311, and 321-325; or

at amino acid residue 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451 in any one of SEQ ID NOs: 34-44, 294-304, 307-311, and 321-325; or

at amino acid residue 452, 453, or 454 in any one of SEQ ID NOs: 34-44, 294-304, 307-310, and 321-324; or

at amino acid residue 455 in any one of SEQ ID NOs: 36-44, 296-304, 307-310, and 321-324; or

at amino acid residue 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, or 466 in any one of SEQ ID NOs: 37-44, 297-304, 307-310, and 321-324; or

at amino acid residue 467, 468, 469, 470, 471, 472, 473, or 474 in any one of SEQ ID NOs: 38-44, 298-304, 307-310, and 321-324; or

at amino acid residue 475, 476, 477, 478, 479, 480, 481, 482, or 483 in any one of SEQ ID NOs: 38-44, 298-304, 307, 308, 310, 321, 323, and 324; or

at amino acid residue 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, or 501 in any one of SEQ ID NOs: 38-44, 298-304, 307, 308, 321, and 323; or

at amino acid residue 502, 503, 504, 505, 506, 507, 508, 509, 510, or 511 in any one of SEQ ID NOs: 39-44, 299-304, 307, 308, 321, and 323; or

at amino acid residue 512 in any one of SEQ ID NOs: 39, 41-44, 299, 301-304, 307, 308, 321, and 323; or

at amino acid residue 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524,

525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541

542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558

559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573, 574, 575

576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592

593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609

610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626

627, 628, 629, 630, 631, 632, 633, 634, 635, 636, 637, 638, 639, 640, 641, 642, 643

644, 645, 646, 647, 648, 649, 650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660

661, 662, 663, 664, 665, 666, 667, 668, 669, 670, 671, 672, 673, 674, 675, 676, 677

678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 688, 689, 690, 691, 692, 693, 694

695, 696, 697, 698, 699, 700, 701, 702, 703, 704, 705, 706, 707, 708, 709, 710, 711

712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728

729, 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745

746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762

763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779

780, 781, 782, 783, 784, 785, 786, 787, 788, 789, 790, 791, 792, 793, 794, 795, 796 797, 798, 799, 800, 801, 802, 803, 804, 805, 806, 807, 808, 809, 810, 811, 812, 813, 814, 815, 816, 817, 818, 819, 820, 821, 822, 823, 824, 825, 826, 827, 828, 829, 830, 831, 832, 833, 834, 835, 836, 837, 838, 839, 840, 841, 842, 843, 844, 845, 846, 847, 848, 849, 850, 851, 852, 853, 854, 855, 856, 857, 858, 859, 860, 861, 862, 863, 864, 865, 866, 867, 868, 869, 870, 871, 872, 873, 874, 875, 876, or 877 in any one of SEQ

ID NOs: 41-44 301-304, 307, 308, 321, and 323; or

• at amino acid residue 878 or 879 in any one of SEQ ID NOs: 41, 43, 44, 301, 303, 304, 307, 308, 321, and 323; or

• at amino acid residue 880or 881 in any one of SEQ ID NOs: 43, 44, 303, 304, 307, 308, 321, and 323; or

• at amino acid residue, 882, 883, 884, 885, 886, 887, 888, 889, 890, 891, 892, 893, 894, 895, 896, 897, 898, 899, 900, 901, 902, 903, 904, 905, 906, 907, 908, 909, 910, 911, 912, 913, 914, 915, 916, 917, 918, 919, 920, 921, 922, 923, 924, 925, 926, 927, 928, 929, 930, 931, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 946, 947, 948, 949, 950, 951, 952, 953, 954, 955, 956, 957, 958, 959, 960, 961,

962, 963, 964, 965 in any one of SEQ ID NOs: 43, 44, 303, 304, 307, and 321; or

• at amino acid residue, 966 or 967 in any one of SEQ ID NOs: 43, 44, 303, and 304; or at amino acid residue 968, 969, 970, 971, 972, 973, 974, 975, 976, 977, 978, 979, 980, 981, or 982 in SEQ ID NO: 44 or 304; with the proviso that the number, N, of the

commencing amino residue satisfies the formula N≤ L-n+ 1, where L is the number of amino acid residues in the sequence among SEQ ID NOs: 1-44 and 261-332 from which the commencing residue is selected, and n is the number of consecutive amino acid residues.

6. The polypeptide according to any one of claims 1, 4 and 5, wherein the sequence identity in option (d) is at least 60%, such as at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, or at least 99%.

7. The polypeptide according to any one of claims 1, 4-6, wherein the polypeptide of option (d) may have the same biological activity or functionality as the polypeptide of option (b), for example, it may have the same enzymatic functionality, it may have the same, greater or less ability to elicit, stimulate or induce an immune response (e.g. in vitro T cell proliferation or T cell cytokine production (for example the cytokines, IL-4, IL-5, IL-13 and/or IL-10) in blood of an mite allergic individual; to induce immunological tolerance against mites, mite allergens or the polypeptide of option (b); and/or to bind or interact with IgE, IgG or IgA antibodies raised against the polypeptide of option (b).

8. The polypeptide according to any one of claims 1, 4-7, wherein the polypeptide of option (b) and (d) has a length of 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 amino acid residues.

9. The polypeptide according to claim 8, wherein the polypeptide of option (d) has greater or less ability to bind a Class HLA II allele or a group of Class HLA II alleles than the parent polypeptide of option (b), for example, the polypeptide of option (d) may bind to at least 70%, such as at least 75%, 80%, 85%, 90% or 95% of the Class HLA II alleles that the parent polypeptide of option (b) binds to.

10. The polypeptide according to any one of claims 1, 4-9, wherein the consecutive amino acids of option (b) and (d) comprise a T cell epitope, optionally a Th2 cell epitope.

11. The polypeptide according to any one of the preceding claims, which comprises or consists of an amino acid sequence consisting of

a) 9, 10, 11, 12, 13, 14 or 15 consecutive amino acid residues from a parent polypeptide sequence selected from any one of SEQ ID NOs: 45-260 and any one of SEQ ID NOs: 45, 61, 63, 80, 100, 113, 147, 154, 170, 172, 191, 215, 225, 226, 248, and 260, wherein any cysteine residue is/are substituted with a serine residue, an alanine residue or a 2-aminobutyric acid residue, or

b) a variant of 9 consecutive amino acid residues from a parent polypeptide sequence selected from any one of SEQ ID NOs: 45-260, wherein 1 or 2 or 3 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent polypeptide sequence, or

c) a variant of 10 consecutive amino acid residues from a parent polypeptide sequence selected from any one of SEQ ID NOs: 45-260, wherein 1 or 2 or 3 or 4 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent polypeptide sequence, or

d) a variant of 11 consecutive amino acid residues from a parent polypeptide sequence selected from any one of SEQ ID NOs: 45-260, wherein 1 or 2 or 3 or 4 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent polypeptide sequence, or

e) a variant of 12 consecutive amino acid residues from a parent polypeptide sequence selected from any one of SEQ ID NOs: 45-260, wherein 1 or 2 or 3 or 4 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent polypeptide sequence, or

f) a variant of 13 consecutive amino acid residues from a parent polypeptide sequence selected from any one of SEQ ID NOs: 45-260, wherein 1, 2, 3, 4, or 5 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent polypeptide sequence, or

g) a variant of 14 consecutive amino acid residues from a parent polypeptide sequence selected from any one of SEQ ID NOs: 45-260, wherein 1, 2, 3, 4, or 5 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent polypeptide sequence

h) a variant of a parent polypeptide sequence selected from any one of SEQ ID NOs: 45- 260, wherein 1, 2, 3, 4, 5, or 6 amino acids are substituted with a different amino acid in the variant relative to the parent polypeptide sequence. 12. The polypeptide according to claim 11, wherein the parent polypeptide sequence commences at

- residue 1, 2, 3, 4, 5, 6, or 7 in any one of SEQ ID NOs: 45-260 when the peptide is 9 amino acids in length, or

- residue 1, 2, 3, 4, 5, or 6 in any one of SEQ ID NOs: 45-260 when the peptide is 10 amino acids in length, or

- residue 1, 2, 3, 3, 4, or 5 in any one of SEQ ID NOs: 45-260 when the peptide is 11 amino acids in length, or

- residue 1, 2, 3, or 4 in any one of SEQ ID NOs: 45-260 when the peptide is 12 amino acids in length, or

- residue 1, 2, or 3 in any one of SEQ ID NOs: 45-260 when the peptide is 13 amino acids in length, or

- residue 1 or 2 in any one of SEQ ID NOs: 45-260 when the peptide is 14 amino acids in length.

13. The polypeptide according to any one of claims 11 and 12, wherein the polypeptide of option a) to h) comprises a T cell epitope, optionally a Th2 cell epitope.

14. The polypeptide according to any one of claims 11-13, which comprises any one of SEQ ID NOs: 45-260.

15. The polypeptide according to any one of claims 11-14, wherein the polypeptide has a length of 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 amino acid residues.

16. The polypeptide according to any one of the preceding claims, which is able to cross- react with any one of SEQ ID NOs: 1-260 when tested in a T-cell proliferation assay employing T cells from a patient allergic to the mites or the protein allergens.

17. The polypeptide according to any one of the preceding claims, which consists of an amino acid sequence identical with the amino acid sequence of a proteolytic fragment of a protein consisting of an amino acid sequence selected from any one of SEQ ID NOs: 1-44, and 261-332, preferably from any one of SEQ ID NOs: 1-44 and 305-318.

18. The polypeptide according to claim 17, wherein the proteolytic fragment is a tryptic or chymotryptic fragment.

19. The polypeptide according to claim 17 or 18, which includes a mass modifying label.

20. The polypeptide according to any one of the preceding claims, which comprises at least one of the following modifications:

a) N-terminal acetylation or methylation

b) C terminal amidation

c) C-terminal amination

d) addition of one to three charged amino acid residues selected from any one of lysine (K), arginine (R), aspartic acid (D), and glutamic acid (E) in the N- terminal or C-terminal

e) replacement of one or more hydrogen(s) on the side chain amines of arginine and/or lysine with a methylene group

f) glycosylation, and

g) phosphorylation.

21. A composition, such as a pharmaceutical composition, comprising one or more of the polypeptides according to any one of the preceding claims.

22. The pharmaceutical composition according to claim 21, further comprising a pharmaceutically acceptable carrier, excipient, and/or adjuvant, optionally sterile.

23. The pharmaceutical composition according to claim 21 or 22 formulated as a vaccine for parenteral administration.

24. The pharmaceutical composition according to any one of claims 21-23, wherein the pharmaceutical composition is a powder, optionally formulated to be re-dissolved before use.

25. A method of treating allergy in a patient, where signs and/or symptoms of said allergy are elicited in the patient by exposure to house dust mites or storage mites and/or exposure to at least one protein allergen present in house dust mites or storage mites, the method comprising administering, to the patient, a therapeutically effective amount of a polypeptide according to any one of claims 1-20 or the composition according to any one of claims 21-24.

26. The method according to claim 25, wherein exposure of the patient to the polypeptide does not elicit signs or symptoms of allergy in the patient. 27. The method according to claim 25 or 26, wherein exposure of the patient to one or more protein(s) having an amino acid sequence consisting of or comprising an amino acid sequence selected from SEQ ID NOs: 1-44 or 261-304 does not elicit signs of allergy (i.e. IgE-mediated allergy) in the patient.

28. The method according to any one of claims 25-27, wherein treating the allergy comprises or consists of relieving or reducing an immune response triggered by exposure to the mites or the protein allergen.

29. The method according to any one of claims 25-28, wherein treating the allergy comprises or consists of relieving one or more symptoms of an immune response triggered by exposure to the mites or the protein allergen. 30. The method according to any one of claims 25-29, wherein treating the allergy consists of or comprises inducing immunological tolerance against the mites or the protein allergen.

31. The method according to any one of claims 25-30, wherein treating the allergy comprises or consists of relieving one or more symptom(s) associated with allergic rhinitis and/or allergic conjunctivitis and/or allergic asthma and/or allergic eczema (e.g. atopic dermatitis).

32. The method according to any one of claims 25-31, wherein treating the allergy comprises or consists of relieving one or more signs or symptoms associated with allergic rhinitis, such as

- reducing the intensity of itchy nose and/or

- reducing the number of sneezes within a given period (e.g. daily, weekly, monthly) and/or

- reducing the intensity of blocked nose (congestion) and/or

- reducing the amount of nasal fluid and/or

- reducing the eosinophilic count in nasal fluid and/or

- reducing specific IgE antibody level (titre) in nasal fluid or in serum and/or

- reducing basophil histamine release in blood.

33. The method according to any one of claims 25-32, wherein treating the allergy comprises or consists of relieving one or more signs or symptoms associated with allergic conjunctivitis, such as

- reducing the intensity of itchy eyes, redness in the white of the eyes and/or watery eyes; and/or

- reducing the eosinophilic count in conjunctival tissue scrapings; and/or

- reducing specific IgE antibody level (titer) in conjunctival tissue scrapings or in serum; and/or

-reducing basophil histamine release in blood. 34. The method according to any one of claims 25-33, wherein treating the allergy comprises or consists of relieving one or more signs or symptoms associated with allergic asthma, such as

- reducing the intensity and/or number of coughs within a given period (e.g. daily, weekly, monthly); and/or

- reducing the intensity of wheezes; and/or

- improving being short of breath; and/or

- improving lung function; and/or

- reducing specific IgE antibody level (titre) in lung fluid or in serum; and/or

- reducing basophil histamine release in blood. 35. The method according to any one of claims 25-34, wherein treating the allergy comprises or consists of relieving one or more signs or symptoms associated with atopic dermatitis, such as

- reducing itch intensity of the skin; and/or

- reducing eczema score; and/or

- reducing number of (peripheral) blood eosinophils.

36. The method according to any one of claims 25-35, comprising or consisting of reducing the patient's need for concomitant treatment with corticosteroids or HI

antihistamines to reduce, relieve or suppress one or more symptoms of an immune response associated with the allergy. 37. The method according to claim 36, wherein the immune response is clinically presented as atopic dermatitis, urticaria, contact dermatitis, allergic conjunctivitis, allergic rhinitis, allergic asthma, anapylaxis, and/or hay fever.

38. The method according to claim 36 or 37, wherein the method decreases, reduces, suppresses or inhibits atopic dermatitis, urticaria, contact dermatitis, allergic conjunctivitis, allergic rhinitis, allergic asthma, anaphylaxis, and/or hay fever.

39. The method according to any one of claims 25-38, comprising inducing or increasing an IgG antibody response in the patient to a protein allergen of the mites and/or decreasing an IgE antibody response in the patient to a protein allergen of the mites and/or decreasing a T cell response in the patient against a protein allergen of the mites.

40. The method according to any one of claims 25-39, wherein the patient is sensitized to at least one protein allergen of the mites. 41. The method according to any one of claims 25-40, wherein the mites are house dust mites of the genus Dermatophagoides (for example selected from the group consisting of Dermatophagoides pteronyssinus, Dermatophagoides farinae) or of the genus Euroglyphus (for example Euroglyphus maynei), or wherein the mites are storage mites of the genus Glycyphagus, Lepidoglyphus, Tyrophagus, or Blomia (for example Glycyphagus domesticus, Lepidoglyphus destructor, Tyrophagus putrescentiae, or Blomia tropicalis).

42. The method according to any one of claims 25-41, wherein the protein allergen is selected from one or more protein allergens in the groups consisting of

- a group 1 allergen of mites (for example a group 1 allergen of a house dust mite (e.g. Der p

1, Der f 1, or Eur m 1, or a group 1 allergen of a storage mite, e.g. Gly d 1, Lep d 1, Typ p 1 and Bio t 1) and

- a group 2 allergen of mites (for example a group 2 allergen of a house dust mite, e.g. Der p

2, Der f 2 and Eur m 2, and a group 2 allergen of a storage mite, e.g. Gly d 2, Lep d 2, Typ p 2 and Bio t 2).

43. The method according to any one of claims 25-42, wherein the signs or symptoms of allergy not elicited by the polypeptide are one or more signs or symptoms selected from the group consisting of:

- the presence in the patient of specific IgE antibodies that binds to the immunogen;

- a positive skin prick test with the polypeptide; and

- the signs or symptoms defined in any one of claims 31-35. 44. An in vitro method of determining whether T cells of a subject are responsive to one or more of the polypeptides according to any one of claims 1-20 and/or the composition according to any one of claims 21-24, comprising a step of contacting T cells obtained from the subject with said polypeptide(s) or composition and detecting whether the T cells are stimulated.

45. An in vitro method of diagnosing a subject for sensitization to (allergy to) house dust mites or storage mites, comprising a step of contacting T cells obtained from the subject with one or more of the polypeptides according to any one of claims 1-20 and/or the composition according to any one of claims 21-24 and detecting whether the T cells are stimulated.

46. An in vitro method for determining whether a subject has, or is at risk of developing, an allergy to house dust mites or storage mites, comprising a step of contacting T cells obtained from the subject with one or more of the polypeptides according to any one of claims 1-20 and/or the composition according to any one of claims 21-24 and detecting whether the T cells are stimulated.

47. A diagnostic kit comprising one or more of the polypeptides according to any one of claims 1-20 and/or the composition according to any one of claims 21-24.

48. An isolated nucleic acid fragment, which comprises

i) a nucleotide sequence encoding a polypeptide according to any one of claims 1-20, or ii) a nucleotide sequence complementary to the nucleotide sequence in i).

49. The nucleic acid fragment according to claim 48, which is a DNA or an RNA fragment.

50. A vector comprising the nucleic acid fragment according to claim 48, such as a cloning vector or an expression vector.

51. The vector according to claim 50, which comprises in operable linkage and in the 5'-3' direction, an expression control region comprising an enhancer/promoter for driving expression of the nucleic acid fragment defined in claim 48-i), optionally a signal peptide coding sequence, a nucleotide sequence defined in claim 48-i), and optionally a terminator. 52. The vector according to claim 51, wherein the expression control region drives expression in prokaryotic cell such as a bacterium, e.g. in E coli.

53. The vector according to claim any one of claims 50-52, which is capable of autonomous replication.

54. The vector according to any one of claims 50-53, which is capable of being integrated into the genome of a host cell.

55. The vector according to any one of claims 50-54, which is selected from the group consisting of a virus, such as an attenuated virus, a bacteriophage, a plasmid, a

minichromosome, and a cosmid.

56. A cell which is transformed to carry the vector according to any one of claims 50-55.

57. The transformed cell according to claim 56, which is capable of replicating the nucleic acid fragment defined in claim 48-i) and/or which is capable of expressing the nucleic acid fragment defined in claim 48-i).

58. The transformed cell according to claim 56 or 57, which is selected from a prokaryotic cell and a eukaryotic cell.

59. The transformed cell according to any one of claims 56-58, which is a bacterial cell selected from the group consisting of Escherichia (such as E. coli .), Bacillus (e.g. Bacillus subtilis), Salmonella, and Mycobacterium, preferably non-pathogenic, e.g. M. bovis BCG.

60. The transformed cell according to any one of claims 56-59, which is stably transformed by having the nucleic acid defined in claim 48-i) stably integrated into its genome.

61. The transformed cell according to any one of claims 56-60, which secretes or carries on its surface the polypeptide according to any one of claims 1-20.

62. The transformed cell according to claim 61, wherein the cell is a bacterium and secretion is into the periplasmic space.

63. A cell line derived from the transformed cell according to any one of claims 56-62.

64. A method for the preparation of the polypeptide according to any one of claims 1-20, comprising

- culturing a transformed cell according to any of claims 56-62 or the cell line according to claim 63 under conditions that facilitate that the transformed cell expresses the nucleic acid fragment according to claim 48-i) and subsequently recovering said polypeptide, or

- preparing said polypeptide by means of solid or liquid phase peptide synthesis.

65. An isolated polyclonal antibody, which has been raised against the polypeptide according to any one of claims 1-20.

66. A monoclonal antibody or a fragment or analogue thereof, which specifically binds the polypeptide according to any one of claims 1-20 wherein said fragment or analogue comprises at least the variable regions of the antigen binding site of said monoclonal antibody.

67. A method for qualitative or quantitative determination of the presence in a sample of the polypeptide according to any one of claims 1-20, the method comprising any one of the following approaches:

- contacting the sample with an antibody according to any one of claims 65 or 66 and detecting specific binding of material in said sample to said antibody,

- contacting the sample with a system comprising a solid phase with an antibody according to claim 65 or 66 coupled thereto and comprising a labelled polypeptide according to any one of claims 1-20, where said labelled polypeptide specifically binds said antibody, and gauging the degree of competition exerted by material in the sample on the binding between said labelled polypeptide and said antibody,

- contacting the sample with a system comprising 1) a solid phase with a polypeptide according to any one of claims 1-20 coupled thereto and comprising 2) a labelled antibody according to claim 65 or 66, where said polypeptide specifically binds said labelled antibody, and gauging the degree of competition exerted by material in the sample on the binding between said polypeptide and said antibody,

- subjecting polypeptide material from the sample to proteolytic treatment and subjecting the thus obtained material to quantitative MS, optionally using at least one polypeptide according to any one of claims 17-19 as standard calibration peptide.

Description:
IMMUNOGENIC PROTEINS AND FRAGMENTS THEREOF FROM ALLERGENIC MITES FIELD OF THE INVENTION

The present invention relates to the field of medicine, in particular allergy immunotherapy against mite allergy. The present invention deals with a group of immunogenic polypeptides with low IgE antibody reactivity but considerable T cell reactivity in a mite allergic population. The immunogenic polypeptides are conserved across important species of house dust mites as well as storage mites, and may be usable in the field of allergy immunotherapy against mite allergy.

BACKGROUND OF THE INVENTION House dust mites of the genus Dermatophagoides are one of the most frequent indoor allergen sources worldwide and are potent inducers of perennial asthma and rhinitis. Several groups of allergens from the most important species (Dermatophagoides pteronyssinus (Der p) and Dermatophagoides farinae (Der f)) are reported (http\www. allergen. org). The group 1 allergens (e.g. Der p 1 and Der f 1) and the group 2 allergens (e.g. Der p 2 and Der f 2) are considered the clinically most important allergens among house dust mites with IgE binding frequencies of more than 80 percent. Other known allergens from the genus

Dermatophagoides have variable levels of IgE antibody titers, e.g. Der p 4, 5, 7, 8, 10, 11, 13-15, 18, 20, 21 and 23. In some tropical and subtropical regions of the world, the clinically most important mite allergens may be from both house dust mites and storage mites of which storage mites of the genus Blomia (e.g. of the species Blomia tropicalis) may be more clinically important than of the genus Dermatophagoides. While the major allergens of the species Der p and Der f are highly cross-reactive and have sequence identity of above 80- 85%, the sequence identity to the corresponding allergens in storage mite species are much lower (below 40-50%). Allergen-specific immunotherapy (SIT) represents a causative and disease-modifying approach with long-lasting effects with the efficacy of reducing the symptom burden and concomitant medication use. SIT is based on the administration of increasing doses of the disease-eliciting allergens into sensitized subjects in order to achieve a state of clinical tolerance to subsequent exposure. Conventionally, SIT includes subcutaneous injection (SCIT) or sublingual administration (SLIT) of a pharmaceutical formulation of an allergen extract of the disease-eliciting allergen source, e.g. an allergen extract of house dust mite bodies and fecal particles. Conventional SIT may induce severe side-effects in allergic patients, e.g. anaphylaxis, though SLIT has been proven to have a superior safety profile to SCIT. However, the risk of inducing anaphylaxis is still not negligible because the allergen extracts contains considerable amounts of IgE-reactive allergens. This may limit the broad applicability of this treatment approach.

Current SIT products on the market target either house dust mite allergy or storage mite allergy. Thus, patients with dual sensitization to both house dust mite species and storage mite species may not be well treated by current SIT products.

Accordingly, an unmet need exists in the art for allergy immunotherapeutic products with high safety profile and efficacy to both house dust mites and optionally storage mites.

OBJECT OF THE INVENTION It is an object of embodiments of the invention to provide proteins and fragments thereof with low or absent IgE reactivity, but T cell reactivity in a high fraction of a mite allergic population and which have sequences with high sequence identity to proteins present in house dust mites and optionally also storage mites.

SUMMARY OF THE INVENTION The present inventors have identified a number of proteins present in house dust mites. The proteins share the feature of being immunogenic in the sense that they, at least, elicit T cell responses in a high fraction of a mite allergic population, while only a low or insignificant fraction of the same population has raised an IgE antibody immune response against these proteins as such. This renders the use of these proteins and optionally peptides thereof relevant for treatment of allergy, optionally by exploitation of the bystander suppression effect, e.g. as disclosed in WO 2012/049310: effective immunization of a patient to obtain a tolerogenic immune response with a first immunogenic protein, preferably a protein the patient has not raised IgE antibodies against, which is present in a material (e.g. an allergen- source material) which causes allergy in the patient due to the presence of at least one protein allergen (e.g. a protein to which the patient has raised IgE antibodies), followed by later exposure of the patient to both the first protein and the allergen-source material has the consequence that the tolerogenic immune response induced by the first protein suppresses the undesired allergic immune response induced by the protein allergen. So, somewhat paradoxically, immunization with a protein immunogen different from the protein allergen can reduce a later immune response against a protein allergen to which the patient is exposed, provided that this later exposure is accompanied by exposure to the protein immunogen. Thus, in a first aspect the present invention relates to a polypeptide comprising or consisting of

(a) an amino acid sequence selected from the group consisting of any one of SEQ ID NOs: 1- 44 and 261-332, or

(b) an amino acid sequence consisting of at least or exactly 9 contiguous amino acid residues from the amino acid sequence of (a), or

(c) an amino acid sequence having a sequence identity of at least 60% with the amino acid sequence of (a), or

(d) an amino acid sequence having a sequence identity of at least 60% with the amino acid sequence of (b).

In a related second aspect, the present invention relates to a composition, such as a pharmaceutical composition, comprising one or more of the polypeptides of the first aspect of the invention.

A third aspect of the invention relates to a method of treating allergy (i.e. IgE-mediated allergy) in a patient, where signs and/or symptoms of said allergy are elicited in the patient by exposure to house dust mites or storage mites and/or exposure to at least one protein allergen present in house dust mites or storage mites, the method comprising administering, to the patient, a therapeutically effective amount of a polypeptide of the first aspect of the invention or a composition of the second aspect of the invention. Consequently, in related aspects, the invention relates to the polypeptides of the first aspect and/or the composition of the second aspect for use as a pharmaceutical, in particular for use in a method of the third aspect of the invention. Likewise, in related aspects the invention relates to use of a polypeptide of the first aspect of the invention or the composition of the second aspect of the invention in a method of the third aspect of the invention. And, in related aspects, the invention relates to use of the polypeptides of the first aspect of the invention in the preparation of a pharmaceutical composition for use in a method of the second aspect of the invention.

A fourth aspect of the invention relates to an in vitro method of determining whether T cells of a subject are responsive to one or more of the polypeptides of the first aspect of the invention and/or the composition of the second aspect of the invention, comprising a step of contacting T cells obtained from the subject with said one or more polypeptides of the first aspect of the invention and/or the composition of the second aspect of the invention and detecting whether the T cells are stimulated. A fifth aspect of the invention relates to an in vitro method of diagnosing a subject for sensitization or allergy to house dust mites or storage mites, comprising contacting T cells obtained from the subject with one or more of the polypeptides of the first aspect of the invention and/or the composition of the second aspect of the invention and determining whether the T cells are stimulated.

A sixth aspect of the invention relates to an in vitro method for determining whether a subject has, or is at risk of developing, an allergy to house dust mites or storage mites, comprising contacting T cells obtained from the subject with one or more of the polypeptides of the first aspect of the invention and/or the composition of the second aspect of the invention and determining whether the T cells are stimulated.

A seventh aspect relates to an in vitro method of diagnosing a subject for allergy or sensitivity to house dust mites or storage mites, comprising determining the presence of specific IgE against one or more of the polypeptides of the first aspect of the invention and/or the composition of the second aspect of the invention in a biological sample (e.g. serum) obtained from the subject.

An eighth aspect of the invention relates to a diagnostic kit comprising one or more of the polypeptides of the first aspect of the invention and/or the composition of the second aspect of the invention.

A ninth aspect of the invention relates to a nucleic acid fragment, which encodes a polypeptide of the first aspect of the invention.

A tenth aspect of the invention relates to a vector comprising a nucleic acid fragment of the ninth aspect of the invention.

An eleventh aspect of the invention relates to a transformed cell carrying a nucleic acid fragment of the ninth aspect of the invention or a vector of the tenth aspect of the invention. Included in this aspect is also a cell line derived from the transformed cell.

A twelfth aspect of the invention relates to a method of preparing a polypeptide of the first aspect of the invention, the method comprising culturing a transformed cell of the tenth aspect of the invention under conditions that facilitate expression of the nucleic acid fragment of the ninth aspect, and subsequently recovering the expression product (a polypeptide of the second aspect of the invention) from the culture medium. A thirteenth aspect of the invention relates to an antibody (polyclonal, monoclonal) or an antibody fragment or analogue that specifically binds the polypeptide of the first aspect of the invention.

Finally, a fourteenth aspect relates to a method for qualitative or quantitative determination of the presence in a sample of the polypeptide of the first aspect, the method comprising any one of the following approaches:

- contacting the sample with an antibody of the thirteenth aspect and detecting specific binding of material in said sample to said antibody,

- contacting the sample with a system comprising a solid phase with an antibody of the thirteenth aspect coupled thereto and comprising a labelled polypeptide of the first aspect, where said labelled polypeptide specifically binds said antibody, and gauging the degree of competition exerted by material in the sample on the binding between said labelled polypeptide and said antibody,

- contacting the sample with a system comprising 1) a solid phase with a polypeptide of the first aspect coupled thereto and comprising 2) a labelled antibody of the thirteenth aspect, where said polypeptide specifically binds said labelled antibody, and gauging the degree of competition exerted by material in the sample on the binding between said polypeptide and said antibody,

- subjecting polypeptide material from the sample to proteolytic treatment and subjecting the thus obtained material to quantitative MS, optionally using at least one polypeptide described herein as useful as a standard calibration peptide.

DETAILED DISCLOSURE OF THE INVENTION Definitions

The term "antigen" is an agent that is recognized (i.e. bound by) an antibody and/or a T cell receptor. The latter is normally only possible when the antigen is presented in the context of an MHC Class I or II molecule and after being processed by an antigen presenting cell such as a macrophage or a dendritic cell. This means that relatively large polypeptides may be antigens even though they do not directly bind a T cell receptor but since shorter peptides that are products of antigen presenting cell-processing are recognized by T cell receptors, such proteins are nevertheless termed "antigens".

An "immunogen" is a type of antigen, which is capable of eliciting a specific adaptive immune response that targets the antigen, i.e. immunogens are able to induce the production by the animal body of the antibodies and T cells that recognize antigens. This is in contrast to "haptens", which denote antigens that are not themselves capable of inducing an immune response but which are capable of being recognized by antibodies and/or T-cell receptors.

Of particular interest are "protein antigens", "protein immunogens", "polypeptide antigens", "polypeptide immunogens", "peptide antigens", and "peptide immunogens", which are each characterized by comprising or consisting of a protein, polypeptide or peptide, which in itself is an antigen or immunogen.

The terms "protein", "polypeptide", "oligopeptide", and "peptide" are used interchangeably herein if no other characteristics are used to describe these molecules in terms of molecule size or length: where a polypeptide and protein typically is of a larger size (e.g. > 100 amino acid residues), an oligopeptide has between 10 and 100 amino acid residues, and a peptide is an even shorter molecule, the present description and claims will as a rule indicate the relevant length of the proteins, polypeptides, oligopeptides and peptides disclosed herein. These molecules are characterized by being constituted of multiple amino acid residues linked via peptide bonds. Typically all the amino acid residues (except for glycine, which is achiral) are in the L-form (since this allows for processing of the polypeptides by antigen presenting cells), but the presence of D-amino acid residues is not excluded.

A "protein" is also meant to designate a biomolecule comprising or consisting of at least one polypeptide, oligopeptide, or peptide, but which optionally may include other molecular entities, such as prosthetic groups, sugars, lipids, and various other derivatizations of the side groups in the amino acid chain(s). For example, the human adult protein hemoglobin is composed of 4 (2+2) polypeptides (2 identical a chains and 2 identical β chains), which are each tightly associated to a heme group (a prostethetic group).

As used herein an "epitope" refers to a region or part of an antigen, such as a poly(peptide) or protein disclosed herein, that elicits an immune response when administered to a subject. An epitope may be a T cell epitope, i.e., an epitope that elicits, stimulates, induces, promotes, increases or enhances a T cell activity, function or response; for example a Th2 cell epitope. Any peptide or combination of peptides of interest can be analyzed to determine whether they include at least one T cell epitope using any number of assays known in the art (e.g. T cell proliferation assays, lymphokine secretion assays, T cell non-responsiveness studies, etc.).

The term "allergen" refers to an antigen which elicits, induces, stimulates, or enhances an immune response, e.g. Th 2 -immune response, by a cell of the immune system of an exposed animal (e.g., human). An antigen is an allergen when the specific immune response is the development of enhanced sensitivity or a hypersensitivity to the antigen, but the antigen itself is not typically innately harmful. An allergen is therefore a particular type of antigen that can cause development of enhanced or increased sensitivity or hypersensitivity in a subject. For example, an allergen can elicit production of IgE antibodies and histamine release from mast cells or basophil cells in predisposed subjects.

If no other meaning is given specifically, the term "T cell response" refers to induction of cytokines or proliferation of a T cell in response to an immunogen. It may be determined as explained in Example 2. It may in some instances be referred to simply as a "response" to an immunogen, such as a peptide, polypeptide or a protein. The term "allergic response" is intended to refer to the hypersensitive immune reaction to a normally innocuous environmental substance known as an allergen. The most common mechanism of allergic reactions is the binding of IgE to the FceRI on the surface of mast cells and basophils, which in turn causes asthma, hay fever and other common allergic reactions due to release of cytokines, notably histamine. The term "identity" and "identical" and grammatical variations thereof, as used herein, mean that two or more referenced entities are the same (e.g., amino acid sequences). Thus, where two proteins, polypeptides or peptides are identical, they have the same amino acid sequence. The identity can be over a defined area, e.g. over at least 12, 13, 14, 15, 16, 17, 18, 19, 20, or more contiguous amino acids, such as 50, 100, 150, 200 or the entire length of the parent protein, polypeptide or peptide, optionally wherein the alignment is the best fit with gaps permitted.

Identity can be determined by comparing each position in aligned sequences. A degree of identity between amino acid sequences is a function of the number of identical or matching amino acids at positions shared by the sequences, i.e. over a specified region. Optimal pairwise alignment of sequences for comparisons of identity may be conducted using a variety of algorithms, as are known in the art, including the Clustal Omega program available at http://www.ebi.ac.uk/Tools/msa/clustalo/, the local homology algorithm of Smith and Waterman, 1981, Adv. Appl. Math 2: 482, the homology alignment algorithm of Needleman and Wunsch, 1970, 3. Mol. Biol. 48:443, the search for similarity method of Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85: 2444, and the computerized implementations of these algorithms (such as GAP, BESTFIT, FASTA and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, Madison, WI, U.S.A.). Sequence identity may also be determined using the BLAST algorithm, described in Altschul et a/., 1990, J. Mol. Biol. 215:403-10 (using the published default settings). Software for performing BLAST analysis may be available through the National Center for Biotechnology Information (through the internet at http://www.ncbi.nlm.nih.gov/). Such algorithms that calculate percent sequence identity generally account for sequence gaps and mismatches over the comparison region or area. For example, a BLAST (e.g. , BLAST 2.0) search algorithm (see, e.g. , Altschul et al., J. Mol. Biol. 215:403 (1990), publicly available through NCBI) has exemplary search

parameters as follows: Mismatch -2; gap open 5; gap extension 2. For polypeptide sequence comparisons, a BLASTP algorithm is typically used in combination with a scoring matrix, such as PAM100, PAM 250, BLOSUM 62 or BLOSUM 50. FASTA (e.g., FASTA2 and FASTA3) and SSEARCH sequence comparison programs are also used to quantitate the extent of identity (Pearson et al., Proc. Natl. Acad. Sci. USA 85: 2444 (1988); Pearson, Methods Mol. Biol.

132: 185 (2000); and Smith et al., J. Mol. Biol. 147: 195 (1981)). Programs for quantitating protein structural similarity using Delaunay-based topological mapping have also been developed (Bostick et al., Biochem Biophys Res Commun. 304: 320 (2003)). Thus, a polypeptide having an amino acid sequence with at least, for example, 85 percent identity to the sequence with SEQ ID NO: 1, it is intended that the amino acid sequence of the polypeptide, after global pairwise alignment with the sequence SEQ ID NO: 1, may include up to 15 amino acid modifications per each 100 amino acids of the sequence SEQ ID NO: 1. That is to say that to obtain a polypeptide having an amino acid sequence at least 85 percent identical to the sequence SEQ ID NO: 1, up to 15 percent (15 of 100) of the amino acid residues in the subject sequence may be inserted, deleted, or substituted with another amino acid.

As used herein, the term "immune response" includes T cell (cellular) mediated and/or B cell (humoral) mediated immune responses, or both cellular and humoral responses. In particular, the term "immune response" may include an IgE-mediated immune response (i.e. an allergic immune response). Exemplary immune responses include T cell responses, such as Th2 responses resulting in cytokine production and/or cellular cytotoxicity. In addition, the term "immune response" includes responses that are indirectly affected by T cell activation, e.g., antibody production (humoral responses) and activation of cytokine responsive cells, e.g., eosinophils, macrophages. Immune cells involved in the immune response include lymphocytes, such as T cells (CD4+, CD8+, Thl and Th2 cells, memory T cells, regulatory T cells) and B cells; antigen presenting cells (e.g., professional antigen presenting cells such as dendritic cells, macrophages, B lymphocytes, Langerhans cells, and non-professional antigen presenting cells such as keratinocytes, endothelial cells, astrocytes, fibroblasts, oligodendrocytes); natural killer (NK) cells; and myeloid cells, such as macrophages, eosinophils, mast cells, basophils, and granulocytes. A particular immune response is production of immunoglobulin (Ig) isotype antibodies or decreasing IgE antibodies.

Specific embodiments of the invention Embodiments of the first aspect of the invention The polypeptide comprising or consisting of

(a) an amino acid sequence selected from the group consisting of any one of SEQ ID NOs: 1- 44 and 261-332, or

(b) an amino acid sequence consisting of at least or exactly 9 contiguous amino acid residues from the amino acid sequence of (a), or

(c) an amino acid sequence having a sequence identity of at least 60% with the amino acid sequence of (a), or

(d) an amino acid sequence having a sequence identity of at least 60% with the amino acid sequence of (b)

constitutes the first aspect of the invention. In other words, apart from the polypeptides defined by SEQ ID NOs: 1-44 and 261-332, the invention also provides fragments and amino acid sequence variants of these proteins which can be useful in eliciting an immune response such as for example a specific T-cell response and IgG production.

Thus, a first aspect of the invention includes the option that a polypeptide of option (a) comprises an amino acid sequence variant of any one of SEQ ID NOs: 1-44 and 261-332. Hence, the sequence identity specified in option (c) is in some embodiments at least 65%, such as at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, and at least 99%. The variant sequence may have the same biological activity or functionality as the parent polypeptide of option (a). For example, a variant sequence may have the same enzymatic functionality. The variant sequence may optionally have the same, greater or less ability to elicit, stimulate or induce an immune response (e.g. in vitro T cell proliferation or T cell cytokine production, such as the cytokines, IL-4, IL-5, IL-13 and/or IL- 10); to induce immunological tolerance against the original polypeptide and/or to bind or interact with IgE, IgG or IgA antibodies raised against the parent polypeptide.

As mentioned, a first aspect of the invention includes the option (b) that polypeptides are fragments of the polypeptides of option (a) as well as the option (d) which comprises an amino acid sequence variant of polypeptides of option (b) that may still be useful in eliciting an immune response such as for example a specific T-cell response and IgG production. Hence, the sequence identity specified in option (d) is in some embodiments at least 60%, such as at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, and at least 99%. The variant sequence defined in option (d) may have the same biological activity or functionality as the parent sequence defined in option (b). For example, a variant sequence may have the same enzymatic functionality (i.e. ability to act on the same substrate(s)). The variant sequence may optionally have the same, greater or less ability to

- elicit, stimulate or induce an immune response (e.g. effecting in vitro T cell proliferation or T cell cytokine production (for example of the cytokines, IL-4, IL-5, IL-13 and/or IL-10)) in blood from mite allergic individuals;

- to induce immunological tolerance against mites, a mite allergen or the parent polypeptide of option (b); and/or

- to bind or interact with IgE, IgG or IgA antibodies raised against the parent polypeptide. For polypeptides of more limited length, for example in the range of 9-30 amino acids in length, the variant sequence may result in the same, greater or less ability to bind a Class HLA II allele or a group of Class HLA II alleles. For example, a variant sequence may bind to at least 70%, such as at least 75%, 80%, 85%, 90% or 95% of the Class HLA II alleles that the parent polypeptide of option (b) binds to. The ability of the parent polypeptide and the variant sequence to bind HLA Class II alleles may be tested under the same test conditions, for example by use of HLA binding prediction tool or in-vitro HLA binding assay. For example, the binding of polypeptide of the invention may be investigated to one or more of the following Class HLA II alleles: DPA1*02: 01-DPB1*01: 01, DPA1*01 : 03-DPB1*02: 01, DPA1*01 : 03-DPB1*03: 01, DPA1*01 : 03-DPB1*04: 01, DPA1*01 : 03-DPB1*04: 02,

DPA1*02: 02-DPB1*05: 01, DPA1*02: 01-DPB1*14: 01, DQA1*05: 01-DQB1*02: 01,

DQA1*05: 01-DQB1*03: 01, DQA1*03: 01-DQB1*03: 02, DQA1*04: 01-DQB1*04: 02,

DQA1*01 : 01-DQB1*05: 01, DQA1*01 : 02-DQB1*06: 02, DRB1*01 : 01, DRB1*03: 01,

DRB1*04: 01, DRB1*04: 05, DRB1*07: 01, DRB1*09: 01, DRB1*11 : 01, DRB1*12: 01,

DRB1*13: 02, DRB1*15: 01, DRB3*01 : 01, DRB3*02: 02, DRB4*01 : 01 and DRB5*01 : 01. A polypeptide of option (b) and (d) may be of any length. In some embodiments, the polypeptides may be useful for peptide immunotherapy and comprise a limited number of amino acid residues. For example, a polypeptide of option (b) and (d) may consist of 9 to 30 amino acid residues, such as having a length of 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 amino acid residues. As mentioned, such polypeptides of option (b) or (d) may comprise at least or exactly 9 contiguous amino acid residues, such as at least or exactly or at most 10, at least or exactly or at most 11, at least or exactly or at most 12, at least or exactly or at most 13, at least or exactly or at most 14, at least or exactly or at most 15, at least or exactly or at most 16, at least or exactly or at most 17, at least or exactly or at most 18, at least or exactly or at most 19, at least or exactly or at most 20, at least or exactly or at most 21, at least or exactly or at most 22, at least or exactly or at most 23, at least or exactly or at most 24, at least or exactly or at most 25, at least or exactly or at most 26, at least or exactly or at most 27 at least or exactly or at most 28, at least or exactly or at most 29, at least or exactly or at most 30 contiguous amino acid residues. In such embodiments, the consecutive amino acids of option (b) and (d) may comprise a T cell epitope, optionally a Th 2 cell epitope.

In other embodiments, a polypeptide of option (b) or (d) may comprise several amino acid residues. Hence, in option (b) or (d), the at least or exactly 9 contiguous amino acid residues may constitute at least or exactly or at most 31, at least or exactly or at most 32, at least or exactly or at most 33, at least or exactly or at most 34, at least or exactly or at most 35, at least or exactly or at most 36, at least or exactly or at most 37, at least or exactly or at most 38, at least or exactly or at most 39, at least or exactly or at most 40, at least or exactly or at most 41, at least or exactly or at most 42, at least or exactly or at most 43, at least or exactly or at most 44, at least or exactly or at most 45, at least or exactly or at most 46, at least or exactly or at most 47, at least or exactly or at most 48, at least or exactly or at most 49, at least or exactly or at most 50, at least or exactly or at most 51, at least or exactly or at most 52, at least or exactly or at most 53, at least or exactly or at most 54, at least or exactly or at most 55, at least or exactly or at most 56, at least or exactly or at most 57, at least or exactly or at most 58, at least or exactly or at most 59, at least or exactly or at most 60, at least or exactly or at most 61, at least or exactly or at most 62, at least or exactly or at most 63, at least or exactly or at most 64, at least or exactly or at most 65, at least or exactly or at most 66, at least or exactly or at most 67, at least or exactly or at most 68, at least or exactly or at most 69, at least or exactly or at most 70, at least or exactly or at most 71, at least or exactly or at most 72, at least or exactly or at most 73, at least or exactly or at most 74, at least or exactly or at most 75, at least or exactly or at most 76, at least or exactly or at most 77, at least or exactly or at most 78, at least or exactly or at most 79, at least or exactly or at most 80, at least or exactly or at most 81, at least or exactly or at most 82, at least or exactly or at most 83, at least or exactly or at most 84, at least or exactly or at most 85, at least or exactly or at most 86, at least or exactly or at most 87, at least or exactly or at most 88, at least or exactly or at most 89, at least or exactly or at most 90, at least or exactly or at most 91, at least or exactly or at most 92, at least or exactly or at most 93, at least or exactly or at most 94, at least or exactly or at most 95, at least or exactly or at most 96, at least or exactly or at most 97, at least or exactly or at most 98, at least or exactly or at most 99, at least or exactly or at most 100, at least or exactly or at most 101, at least or exactly or at most 102, at least or exactly or at most 103, at least or exactly or at most 104, at least or exactly or at most 105, at least or exactly or at most 106, at least or exactly or at most 107, at least or exactly or at most 108, at least or exactly or at most 109, at least or exactly or at most 110, at least or exactly or at most 111, at least or exactly or at most 112, at least or exactly or at most 113, at least or exactly or at most 114, at least or exactly or at most 115, at least or exactly or at most 116, at least or exactly or at most 117, at least or exactly or at most 118, at least or exactly or at most 119, at least or exactly or at most 120, at least or exactly or at most 121, at least or exactly or at most 122, at least or exactly or at most 123, at least or exactly or at most 124, or at least or exactly or at most 125 contiguous amino acid residues.

The number of contiguous amino acids in option (b) and (d) can be higher for all of SEQ ID NOs: 1-44 and 261-304, 306-318, and 320-332. Another way to phrase this is that for each of SEQ ID NOs: 2-44 and 262-304, the number of the contiguous amino acid residues is at least or exactly or at most N-n, where N is the length of the sequence ID in question and n is any integer between 1 and N-9; that is, the at least 9 contiguous amino acids can be at least any number between 9 and the length of the reference sequence minus one, in increments of one. Consequently:

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 1-44, 261-304, 306-318, and 320- 332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 126, at least or exactly or at most 127, at least or exactly or at most 128, or at least or exactly or at most 129 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 2-44 262-304, 306-318, and 320- 332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 130 or at least or exactly or at most 131 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 4-44, 264-304, 306-318, and 320- 332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 132 or at least or exactly or at most 133 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 5-44, 265-304, 306-318, and 320- 332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 134, at least or exactly or at most 135, or at least or exactly or at most 136 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 5-44, 265-304, 306-314. 316-318, 320-328, and 330-332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 137, at least or exactly or at most 138, at least or exactly or at most 139, at least or exactly or at most 140, at least or exactly or at most 141, at least or exactly or at most 142, at least or exactly or at most 143, at least or exactly or at most 144, at least or exactly or at most 145, at least or exactly or at most 146, at least or exactly or at most 147, at least or exactly or at most 148, or at least or exactly or at most 149 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 5-44 265-304, 306-313, 316-318, 320-327, and 330-332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute, at least or exactly or at most 150, at least or exactly or at most 151 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 7-44, 267-304, 306-313, 316-318, 320-327, and 330-332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 152 or at least or exactly or at most 153 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 9-44, 269-304,306-313, 316-318, 320-327, and 330-332 the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 154, at least or exactly or at most 155, at least or exactly or at most 156, at least or exactly or at most 157, at least or exactly or at most 158, at least or exactly or at most 159, at least or exactly or at most 160, at least or exactly or at most 161, at least or exactly or at most 162, at least or exactly or at most 163, at least or exactly or at most 164, at least or exactly or at most 165, at least or exactly or at most 166, at least or exactly or at most 167, at least or exactly or at most 168, at least or exactly or at most 169, or at least or exactly or at most 170 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 10-44, 270-304, 306-313, 316- 318, 320-327, and 330-332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 171 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 11-44, 271-304, 306-313, 316- 318, 320-327, and 330-332, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 172, at least or exactly or at most 173, at least or exactly or at most 174, at least or exactly or at most 175, at least or exactly or at most 176, at least or exactly or at most 177, at least or exactly or at most 178, at least or exactly or at most 179, at least or exactly or at most 180, at least or exactly or at most 181, at least or exactly or at most 182, at least or exactly or at most 183, at least or exactly or at most 184, or at least or exactly or at most 185 contiguous amino acid residues

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 11-44, 271-304, 306-313, 316- 317, 320-327, and 330-331, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 186, or at least or exactly or at most 187 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 13-44, 273-304, 306-313, 316-

317, 320-327, and 330-331, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 188, at least or exactly or at most 189, at least or exactly or at most 190, at least or exactly or at most 191, at least or exactly or at most 192, at least or exactly or at most 193, at least or exactly or at most 194, at least or exactly or at most 195, at least or exactly or at most 196, at least or exactly or at most 197, at least or exactly or at most 198, at least or exactly or at most 199, at least or exactly or at most 200, at least or exactly or at most 201, at least or exactly or at most 202, at least or exactly or at most 203, at least or exactly or at most 204, at least or exactly or at most 205, at least or exactly or at most 206, at least or exactly or at most 207, at least or exactly or at most 208, at least or exactly or at most 209, at least or exactly or at most 210, at least or exactly or at most 211, at least or exactly or at most 212, at least or exactly or at most 213, at least or exactly or at most 214, at least or exactly or at most 215, at least or exactly or at most 216, at least or exactly or at most 217, at least or exactly or at most 218, at least or exactly or at most 219, at least or exactly or at most 220, or at least or exactly or at most 221 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 15-44, 275-304, 306-313, 316- 317, 320-327, and 330-331, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 222, at least or exactly or at most 223, at least or exactly or at most 224, at least or exactly or at most 225, at least or exactly or at most 226, or at least or exactly or at most 227 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 17-44, 277-304, 306-313, 316- 317, 320-327, and 330-331, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 228 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 17, 19-44, 277, 279-304, 306-313, 316-317, 320-327, and 330-331, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 229, at least or exactly or at most 230, or at least or exactly or at most 231 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 17, 19-44, 277, 279-304, 306-313, 317, 320-327, and 331, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 232 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 19-44, 279-304, 306-313, 317, 320-327, and 331, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 233, at least or exactly or at most 234, at least or exactly or at most 235, at least or exactly or at most 236, at least or exactly or at most 237, at least or exactly or at most 238, at least or exactly or at most 239, at least or exactly or at most 240, at least or exactly or at most 241, at least or exactly or at most 242, at least or exactly or at most 243, at least or exactly or at most 244, at least or exactly or at most 245, at least or exactly or at most 246, at least or exactly or at most 247, at least or exactly or at most 248, at least or exactly or at most 249, at least or exactly or at most 250, or at least or exactly or at most 251 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 20-44, 280-304, 306-313, and 320-327, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 252 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 21-44 and 281-304, 306-313, and 320-327, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 253, at least or exactly or at most 254, at least or exactly or at most 255, at least or exactly or at most 256, at least or exactly or at most 257, at least or exactly or at most 258, or at least or exactly or at most 259 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 21-44 and 281-304, 306-312, and 320-326, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 260 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 22-44, 282-304, 306-312, and 320-326, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 261 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 23-44, 283-304, 306-312, and 320-326, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 262, at least or exactly or at most 263, at least or exactly or at most 264, at least or exactly or at most 265, at least or exactly or at most 266, at least or exactly or at most 267, at least or exactly or at most 268, or at least or exactly or at most 269 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 24-44, 284-304, 306-312, and 320-326, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 270 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 25-44, 285-304, 306-312, and 320-326, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 271, at least or exactly or at most 272, at least or exactly or at most 273, or at least or exactly or at most 274 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 26-44, 286-304, 306-312, and 320-326, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 275, at least or exactly or at most 276, at least or exactly or at most 277, at least or exactly or at most 278, at least or exactly or at most 279, at least or exactly or at most 280, at least or exactly or at most 281, at least or exactly or at most 282, at least or exactly or at most 283, at least or exactly or at most 284, at least or exactly or at most 285, at least or exactly or at most 286, at least or exactly or at most 287, at least or exactly or at most 288, at least or exactly or at most 289, at least or exactly or at most 290, at least or exactly or at most 291, at least or exactly or at most 292, at least or exactly or at most 293, at least or exactly or at most 294, at least or exactly or at most 295, at least or exactly or at most 296, at least or exactly or at most 297, at least or exactly or at most 298, at least or exactly or at most 299, at least or exactly or at most 300, at least or exactly or at most 301, at least or exactly or at most 302, at least or exactly or at most 303, at least or exactly or at most 304, at least or exactly or at most 305, at least or exactly or at most 306, at least or exactly or at most 307, or at least or exactly or at most 308 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 26-44, 286-304, 306-311, and 320-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 309 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 27-44, 287-304, 306-311, and 320-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 310, at least or exactly or at most 311, at least or exactly or at most 312, at least or exactly or at most 313, at least or exactly or at most 314, at least or exactly or at most 315, at least or exactly or at most 316, at least or exactly or at most 317, at least or exactly or at most 318, at least or exactly or at most 319, or at least or exactly or at most 320 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 28-44, 288-304, 306-311, and 320-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 321, at least or exactly or at most 322, at least or exactly or at most 323, at least or exactly or at most 324, at least or exactly or at most 325, at least or exactly or at most 326, at least or exactly or at most 327, at least or exactly or at most 328, at least or exactly or at most 329, at least or exactly or at most 330, or at least or exactly or at most 331 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 30-44, 290-304, 306-311, and 320-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 332, at least or exactly or at most 333, at least or exactly or at most 334, at least or exactly or at most 335, at least or exactly or at most 336, at least or exactly or at most 337, at least or exactly or at most 338, at least or exactly or at most 339, at least or exactly or at most 340, at least or exactly or at most 341, at least or exactly or at most 342, at least or exactly or at most 343, at least or exactly or at most 344, at least or exactly or at most 345, at least or exactly or at most 346, at least or exactly or at most 347, at least or exactly or at most 348, at least or exactly or at most 349, at least or exactly or at most 350, at least or exactly or at most 351, at least or exactly or at most 352, at least or exactly or at most 353, at least or exactly or at most 354, at least or exactly or at most 355, at least or exactly or at most 356, at least or exactly or at most 357, at least or exactly or at most 358, at least or exactly or at most 359, or at least or exactly or at most 360 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 31-44, 291-304, 306-311, and 320-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 361 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 32-44 and 292-304, 306-311, and 320-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 362 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 32-44 and 292-304, 307-311, and 321-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 363, at least or exactly or at most 364, at least or exactly or at most 365, at least or exactly or at most 366, at least or exactly or at most 367, at least or exactly or at most 368, at least or exactly or at most 369, at least or exactly or at most 370, at least or exactly or at most 371, at least or exactly or at most 372, at least or exactly or at most 373, at least or exactly or at most 374, at least or exactly or at most 375, at least or exactly or at most 376, at least or exactly or at most 377, at least or exactly or at most 378, at least or exactly or at most 379, at least or exactly or at most 380, at least or exactly or at most 381, at least or exactly or at most 382, at least or exactly or at most 383, at least or exactly or at most 384, at least or exactly or at most 385, at least or exactly or at most 386, at least or exactly or at most 387, at least or exactly or at most 388, at least or exactly or at most 389, at least or exactly or at most 390, at least or exactly or at most 391, at least or exactly or at most 392, at least or exactly or at most 393, at least or exactly or at most 394, at least or exactly or at most 395, at least or exactly or at most 396, at least or exactly or at most 397, at least or exactly or at most 398, at least or exactly or at most 399, at least or exactly or at most 400, at least or exactly or at most 401, at least or exactly or at most 402, at least or exactly or at most 403, at least or exactly or at most 404, at least or exactly or at most 405, at least or exactly or at most 406, at least or exactly or at most 407, at least or exactly or at most 408, at least or exactly or at most 409, at least or exactly or at most 410, at least or exactly or at most 411, at least or exactly or at most 412, at least or exactly or at most 413, at least or exactly or at most 414, at least or exactly or at most 415, at least or exactly or at most 416, at least or exactly or at most 417, at least or exactly or at most 418, at least or exactly or at most 419, at least or exactly or at most 420, at least or exactly or at most 421, at least or exactly or at most 422, at least or exactly or at most 423, at least or exactly or at most 424, at least or exactly or at most 425, at least or exactly or at most 426, at least or exactly or at most 427, or at least or exactly or at most 428 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 33-44, 293-304, 307-311, and 321-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 429, at least or exactly or at most 430, at least or exactly or at most 431, at least or exactly or at most 432, at least or exactly or at most 433 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 34-44, 294-304, 307-311, and 321-325, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 434, at least or exactly or at most 435, at least or exactly or at most 436, at least or exactly or at most 437, at least or exactly or at most 438, at least or exactly or at most 439, at least or exactly or at most 440, at least or exactly or at most 441, at least or exactly or at most 442, at least or exactly or at most 443, at least or exactly or at most 444, at least or exactly or at most 445, at least or exactly or at most 446, at least or exactly or at most 447, at least or exactly or at most 448, at least or exactly or at most 449, at least or exactly or at most 450, at least or exactly or at most 451, at least or exactly or at most 452, at least or exactly or at most 453, at least or exactly or at most 454, at least or exactly or at most 455, at least or exactly or at most 456, at least or exactly or at most 457, or at least or exactly or at most 458 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 34-44, 294-304, 307-310, and 321-324, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 459, at least or exactly or at most 460, or at least or exactly or at most 461 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 36-44, 296-304, 307-310, and 321-324, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 462 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 37-44, 297-304, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 463, at least or exactly or at most 464, at least or exactly or at most 465, at least or exactly or at most 466, at least or exactly or at most 467, at least or exactly or at most 468, at least or exactly or at most 469, at least or exactly or at most 470, at least or exactly or at most 471, at least or exactly or at most 472, or at least or exactly or at most 473 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 38-44, 298-304, 307-310, and 321-324, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 474, at least or exactly or at most 475, at least or exactly or at most 476, at least or exactly or at most 477, at least or exactly or at most 478, at least or exactly or at most 479, at least or exactly or at most 480, or at least or exactly or at most 481 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 38-44, 298-304, 307, 308, 310, 321, 322, and 324, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 482, at least or exactly or at most 483, at least or exactly or at most 484, at least or exactly or at most 485, at least or exactly or at most 486, at least or exactly or at most 487, at least or exactly or at most 488, at least or exactly or at most 489, or at least or exactly or at most 490 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 38-44, 298-304, 307, 308, 321, and 322, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 491, at least or exactly or at most 492, at least or exactly or at most 493, at least or exactly or at most 494, at least or exactly or at most 495, at least or exactly or at most 496, at least or exactly or at most 497, at least or exactly or at most 498, at least or exactly or at most 499, at least or exactly or at most 500, at least or exactly or at most 501, at least or exactly or at most 502, at least or exactly or at most 503, at least or exactly or at most 504, at least or exactly or at most 505, at least or exactly or at most 506, at least or exactly or at most 507, or at least or exactly or at most 508 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 39-44, 299-304, 307, 308, 321, and 321, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 509, at least or exactly or at most 510, at least or exactly or at most 511, at least or exactly or at most 512, at least or exactly or at most 513, at least or exactly or at most 514, at least or exactly or at most 515, at least or exactly or at most 516, at least or exactly or at most 517, at least or exactly or at most 518 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 39, 41-44, 299, 301-304, 307, 308, 321, and 321, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 519 contiguous amino acid residues. Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 41-44, 301-304, 307, 308, 321, and 322, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 520, at least or exactly or at most 521, at least or exactly or at most 522, at least or exactly or at most 523, at least or exactly or at most 524, at least or exactly or at most 525, at least or exactly or at most 526, at least or exactly or at most 527, at least or exactly or at most 528, at least or exactly or at most 529, at least or exactly or at most 530, at least or exactly or at most 531, at least or exactly or at most 532, at least or exactly or at most 533, at least or exactly or at most 534, at least or exactly or at most 535, at least or exactly or at most 536, at least or exactly or at most 537, at least or exactly or at most 538, at least or exactly or at most 539, at least or exactly or at most 540, at least or exactly or at most 541, at least or exactly or at most 542, at least or exactly or at most 543, at least or exactly or at most 544, at least or exactly or at most 545, at least or exactly or at most 546, at least or exactly or at most 547, at least or exactly or at most 548, at least or exactly or at most 549, at least or exactly or at most 550, at least or exactly or at most 551, at least or exactly or at most 552, at least or exactly or at most 553, at least or exactly or at most 554, at least or exactly or at most 555, at least or exactly or at most 556, at least or exactly or at most 557, at least or exactly or at most 558, at least or exactly or at most 559, at least or exactly or at most 560, at least or exactly or at most 561, at least or exactly or at most 562, at least or exactly or at most 563, at least or exactly or at most 564, at least or exactly or at most 565, at least or exactly or at most 566, at least or exactly or at most 567, at least or exactly or at most 568, at least or exactly or at most 569, at least or exactly or at most 570, at least or exactly or at most 571, at least or exactly or at most 572, at least or exactly or at most 573, at least or exactly or at most 574, at least or exactly or at most 575, at least or exactly or at most 576, at least or exactly or at most 577, at least or exactly or at most 578, at least or exactly or at most 579, at least or exactly or at most 580, at least or exactly or at most 581, at least or exactly or at most 582, at least or exactly or at most 583, at least or exactly or at most 584, at least or exactly or at most 585, at least or exactly or at most 586, at least or exactly or at most 587, at least or exactly or at most 588, at least or exactly or at most 589, at least or exactly or at most 590, at least or exactly or at most 591, at least or exactly or at most 592, at least or exactly or at most 593, at least or exactly or at most 594, at least or exactly or at most 595, at least or exactly or at most 596, at least or exactly or at most 597, at least or exactly or at most 598, at least or exactly or at most 599, at least or exactly or at most 600, at least or exactly or at most 601, at least or exactly or at most 602, at least or exactly or at most 603, at least or exactly or at most 604, at least or exactly or at most 605, at least or exactly or at most 606, at least or exactly or at most 607, at least or exactly or at most 608, at least or exactly or at most 609, at least or exactly or at most 610, at least or exactly or at most 611, at least or exactly or at most 612, at least or exactly or at most 613, at least or exactly or at most 614, at least or exactly or at most 615, at least or exactly or at most 616, at least or exactly or at most 617, at least or exactly or at most 618, at least or exactly or at most 619, at least or exactly or at most 620, at least or exactly or at most 621, at least or exactly or at most 622, at least or exactly or at most 623, at least or exactly or at most 624, at least or exactly or at most 625, at least or exactly or at most 626, at least or exactly or at most 627, at least or exactly or at most 628, at least or exactly or at most 629, at least or exactly or at most 630, at least or exactly or at most 631, at least or exactly or at most 632, at least or exactly or at most 633, at least or exactly or at most 634, at least or exactly or at most 635, at least or exactly or at most 636, at least or exactly or at most 637, at least or exactly or at most 638, at least or exactly or at most 639, at least or exactly or at most 640, at least or exactly or at most 641, at least or exactly or at most 642, at least or exactly or at most 643, at least or exactly or at most 644, at least or exactly or at most 645, at least or exactly or at most 646, at least or exactly or at most 647, at least or exactly or at most 648, at least or exactly or at most 649, at least or exactly or at most 650, at least or exactly or at most 651, at least or exactly or at most 652, at least or exactly or at most 653, at least or exactly or at most 654, at least or exactly or at most 655, at least or exactly or at most 656, at least or exactly or at most 657, at least or exactly or at most 658, at least or exactly or at most 659, at least or exactly or at most 660, at least or exactly or at most 661, at least or exactly or at most 662, at least or exactly or at most 663, at least or exactly or at most 664, at least or exactly or at most 665, at least or exactly or at most 666, at least or exactly or at most 667, at least or exactly or at most 668, at least or exactly or at most 669, at least or exactly or at most 670, at least or exactly or at most 671, at least or exactly or at most 672, at least or exactly or at most 673, at least or exactly or at most 674, at least or exactly or at most 675, at least or exactly or at most 676, at least or exactly or at most 677, at least or exactly or at most 678, at least or exactly or at most 679, at least or exactly or at most 680, at least or exactly or at most 681, at least or exactly or at most 682, at least or exactly or at most 683, at least or exactly or at most 684, at least or exactly or at most 685, at least or exactly or at most 686, at least or exactly or at most 687, at least or exactly or at most 688, at least or exactly or at most 689, at least or exactly or at most 690, at least or exactly or at most 691, at least or exactly or at most 692, at least or exactly or at most 693, at least or exactly or at most 694, at least or exactly or at most 695, at least or exactly or at most 696, at least or exactly or at most 697, at least or exactly or at most 698, at least or exactly or at most 699, at least or exactly or at most 700, at least or exactly or at most 701, at least or exactly or at most 702, at least or exactly or at most 703, at least or exactly or at most 704, at least or exactly or at most 705, at least or exactly or at most 706, at least or exactly or at most 707, at least or exactly or at most 708, at least or exactly or at most 709, at least or exactly or at most 710, at least or exactly or at most 711, at least or exactly or at most 712, at least or exactly or at most 713, at least or exactly or at most 714, at least or exactly or at most 715, at least or exactly or at most 716, at least or exactly or at most 717, at least or exactly or at most 718, at least or exactly or at most 719, at least or exactly or at most 720, at least or exactly or at most 721, at least or exactly or at most 722, at least or exactly or at most 723, at least or exactly or at most 724, at least or exactly or at most 725, at least or exactly or at most 726, at least or exactly or at most 727, at least or exactly or at most 728, at least or exactly or at most 729, at least or exactly or at most 730, at least or exactly or at most 731, at least or exactly or at most 732, at least or exactly or at most 733, at least or exactly or at most 734, at least or exactly or at most 735, at least or exactly or at most 736, at least or exactly or at most 737, at least or exactly or at most 738, at least or exactly or at most 739, at least or exactly or at most 740, at least or exactly or at most 741, at least or exactly or at most 742, at least or exactly or at most 743, at least or exactly or at most 744, at least or exactly or at most 745, at least or exactly or at most 746, at least or exactly or at most 747, at least or exactly or at most 748, at least or exactly or at most 749, at least or exactly or at most 750, at least or exactly or at most 751, at least or exactly or at most 752, at least or exactly or at most 753, at least or exactly or at most 754, at least or exactly or at most 755, at least or exactly or at most 756, at least or exactly or at most 757, at least or exactly or at most 758, at least or exactly or at most 759, at least or exactly or at most 760, at least or exactly or at most 761, at least or exactly or at most 762, at least or exactly or at most 763, at least or exactly or at most 764, at least or exactly or at most 765, at least or exactly or at most 766, at least or exactly or at most 767, at least or exactly or at most 768, at least or exactly or at most 769, at least or exactly or at most 770, at least or exactly or at most 771, at least or exactly or at most 772, at least or exactly or at most 773, at least or exactly or at most 774, at least or exactly or at most 775, at least or exactly or at most 776, at least or exactly or at most 777, at least or exactly or at most 778, at least or exactly or at most 779, at least or exactly or at most 780, at least or exactly or at most 781, at least or exactly or at most 782, at least or exactly or at most 783, at least or exactly or at most 784, at least or exactly or at most 785, at least or exactly or at most 786, at least or exactly or at most 787, at least or exactly or at most 788, at least or exactly or at most 789, at least or exactly or at most 790, at least or exactly or at most 791, at least or exactly or at most 792, at least or exactly or at most 793, at least or exactly or at most 794, at least or exactly or at most 795, at least or exactly or at most 796, at least or exactly or at most 797, at least or exactly or at most 798, at least or exactly or at most 799, at least or exactly or at most 800, at least or exactly or at most 801, at least or exactly or at most 802, at least or exactly or at most 803, at least or exactly or at most 804, at least or exactly or at most 805, at least or exactly or at most 806, at least or exactly or at most 807, at least or exactly or at most 808, at least or exactly or at most 809, at least or exactly or at most 810, at least or exactly or at most 811, at least or exactly or at most 812, at least or exactly or at most 813, at least or exactly or at most 814, at least or exactly or at most 815, at least or exactly or at most 816, at least or exactly or at most 817, at least or exactly or at most 818, at least or exactly or at most 819, at least or exactly or at most 820, at least or exactly or at most 821, at least or exactly or at most 822, at least or exactly or at most 823, at least or exactly or at most 824, at least or exactly or at most 825, at least or exactly or at most 826, at least or exactly or at most 827, at least or exactly or at most 828, at least or exactly or at most 829, at least or exactly or at most 830, at least or exactly or at most 831, at least or exactly or at most 832, at least or exactly or at most 833, at least or exactly or at most 834, at least or exactly or at most 835, at least or exactly or at most 836, at least or exactly or at most 837, at least or exactly or at most 838, at least or exactly or at most 839, at least or exactly or at most 840, at least or exactly or at most 841, at least or exactly or at most 842, at least or exactly or at most 843, at least or exactly or at most 844, at least or exactly or at most 845, at least or exactly or at most 846, at least or exactly or at most 847, at least or exactly or at most 848, at least or exactly or at most 849, at least or exactly or at most 850, at least or exactly or at most 851, at least or exactly or at most 852, at least or exactly or at most 853, at least or exactly or at most 854, at least or exactly or at most 855, at least or exactly or at most 856, at least or exactly or at most 857, at least or exactly or at most 858, at least or exactly or at most 859, at least or exactly or at most 860, at least or exactly or at most 861, at least or exactly or at most 862, at least or exactly or at most 863, at least or exactly or at most 864, at least or exactly or at most 865, at least or exactly or at most 866, at least or exactly or at most 867, at least or exactly or at most 868, at least or exactly or at most 869, at least or exactly or at most 870, at least or exactly or at most 871, at least or exactly or at most 872, at least or exactly or at most 873, at least or exactly or at most 874, at least or exactly or at most 875, at least or exactly or at most 876, at least or exactly or at most 877, at least or exactly or at most 878, at least or exactly or at most 879, at least or exactly or at most 880, at least or exactly or at most 881, at least or exactly or at most 882, at least or exactly or at most 883, or at least or exactly or at most 8834 contiguous amino ack i residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 41, 43,44, 302, 303, and 304, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 885 or at least or exactly or at most 886 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 43-44, 303-304, 307, 308, 321, and 322, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 887 or at least or exactly or at most 888 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 43-44, 303-304, 307, and 321, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 889, at least or exactly or at most 890, at least or exactly or at most 891, at least or exactly or at most 892, at least or exactly or at most 893, at least or exactly or at most 894, at least or exactly or at most 895, at least or exactly or at most 896, at least or exactly or at most 897, at least or exactly or at most 898, at least or exactly or at most 899, at least or exactly or at most 900, at least or exactly or at most 901, at least or exactly or at most 902, at least or exactly or at most 903, at least or exactly or at most 904, at least or exactly or at most 905, at least or exactly or at most 906, at least or exactly or at most 907, at least or exactly or at most 908, at least or exactly or at most 909, at least or exactly or at most 910, at least or exactly or at most 911, at least or exactly or at most 912, at least or exactly or at most 913, at least or exactly or at most 914, at least or exactly or at most 915, at least or exactly or at most 916, at least or exactly or at most 917, at least or exactly or at most 918, at least or exactly or at most 919, at least or exactly or at most 920, at least or exactly or at most 921, at least or exactly or at most 922, at least or exactly or at most 923, at least or exactly or at most 924, at least or exactly or at most 925, at least or exactly or at most 926, at least or exactly or at most 927, at least or exactly or at most 928, at least or exactly or at most 929, at least or exactly or at most 930, at least or exactly or at most 931, at least or exactly or at most 932, at least or exactly or at most 933, at least or exactly or at most 934, at least or exactly or at most 935, at least or exactly or at most 936, at least or exactly or at most 937, at least or exactly or at most 938, at least or exactly or at most 939, at least or exactly or at most 940, at least or exactly or at most 941, at least or exactly or at most 942, at least or exactly or at most 943, at least or exactly or at most 944, at least or exactly or at most 945, at least or exactly or at most 946, at least or exactly or at most 947, at least or exactly or at most 948, at least or exactly or at most 949, at least or exactly or at most 950, at least or exactly or at most 951, at least or exactly or at most 952, at least or exactly or at most 953, at least or exactly or at most 954, at least or exactly or at most 955, at least or exactly or at most 956, at least or exactly or at most 957, at least or exactly or at most 958, at least or exactly or at most 959, at least or exactly or at most 960, at least or exactly or at most 961, at least or exactly or at most 962, at least or exactly or at most 963, at least or exactly or at most 964, at least or exactly or at most 965, at least or exactly or at most 966, at least or exactly or at most 967, at least or exactly or at most 968, at least or exactly or at most 969, at least or exactly or at most 970, at least or exactly or at most 971, or at least or exactly or at most 972 contiguous amino acid residues.

Insofar as embodiment (b) and (d) relate to SEQ ID NOs: 43-44, and 303-304, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 973, or at least or exactly or at most 974 contiguous amino acid residues.

Finally, insofar as embodiment (b) and (d) relate to SEQ ID NOs: 44 and 304, the at least 9 contiguous amino acids referred to in option (b) in the definition of the first aspect of the invention may also constitute at least or exactly or at most 975, at least or exactly or at most 976, at least or exactly or at most 977, at least or exactly or at most 978, at least or exactly or at most 979, at least or exactly or at most 980, at least or exactly or at most 981, at least or exactly or at most 982, at least or exactly or at most 983, at least or exactly or at most 984, at least or exactly or at most 985, at least or exactly or at most 986, at least or exactly or at most 987, at least or exactly or at most 988, or at least or exactly or at most 989 contiguous amino acid residues.

In any one of the embodiments of option (b) and (d) above, the polypeptide of the invention is also one that has at least or exactly 9 contiguous amino acid residues defined for option (b) above in any one of the embodiments and wherein the contiguous amino acid residues commence

• at amino acid residue 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40,42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106,

107, 108, 109, 110, 111, 112, 113, 114, 115, 116, or 117 in any one of SEQ ID NOs: 1-44 and 261-332; or

• at amino acid residue 118, 119, 120, 121, or 122 in any one of SEQ ID NOs: 1-44, 261-304, 306-318, and 320-332; or

· at amino acid residue 123 or 124 in any one of SEQ ID NOs: 2-44, 262-304, 306-318, and 320-332; or

• at amino acid residue 125 or 126 in any one of SEQ ID NOs: 4-44, 264-304, 306-318, and 320-332; or

• at amino acid residue 127, 128, or 129 in any one of SEQ ID NOs: 5-44, 265-304, 306- 318, and 320-332; or

• at amino acid residue 130, 131, 132, 133 , 134, 135, 136, 135, 136, 137, 138, 139, 140, 141, 142 in any one of SEQ ID NOs: 5-44, 265-304, 306-314, 316-318, 320-328, and 330-332; or

• at amino acid residue, 143 or 144 in any one of SEQ ID NOs: 5-44, 265-304, 306-313, 316-318, 320-327, and 330-332; or

• at amino acid residue 145 and 146 in any one of SEQ ID NOs: 7-44, 267-304, 306-313, 316-318, 320-327, and 330-332; or

at amino acid residue 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, or 163 in any one of SEQ ID NOs: 9-44, 269-304, 306-313, 316- 318, 320-327, and 330-332; or

• at amino acid residue 164 in any one of SEQ ID NOs: 10-44, 270-304, 306-313, 316- 318, 320-327, and 330-332; or at amino acid residue 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178 in any one of SEQ ID NOs: 11-44, 271-304, 306-313, 316-318, 320-327, and 330-332; or

at amino acid residue 179 or 180 in any one of SEQ ID NOs: 11-44 and 271-304, 306- 313, 316-317, 320-327, and 330-331; or

at amino acid residue 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, or 214 in any one of SEQ ID NOs: 13-44, 273-304, 306-313, 316- 317, 320-327, and 330-331; or

at amino acid residue 215, 216, 217, 218, 219, or 220 in any one of SEQ ID NOs: in any one of SEQ ID NOs: 15-44, 275-304, 306-313, 316-317, 320-327, and 330-331; or

at amino acid residue 221 in any one of SEQ ID NOs: 17-44 and 275-304; or at amino acid residue 222, 223, or 224 in any one of SEQ ID NOs: 17, 19-44, 277,

279-304, 306-313, 316-317, 320-327, and 330-331; or

at amino acid residue 225 in any one of SEQ ID NOs: 17, 19-44, 277, 279-304, 306- 313, 317, 320-327, and 331; or

at amino acid residue 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, or 244 in any one of SEQ ID NOs: 19-44, 279-304, 306- 313, 317, 320-327, and 331; or

at amino acid residue 245 in any one of SEQ ID NOs: 20-44, 280-304, 306-313, and 320-327; or

at amino acid residue 246, 247, 248, 249, 250, 251, or 252 in any one of SEQ ID NOs: 21-44 and 281-304, 306-313, and 320-327; or

at amino acid residue 253 in any one of SEQ ID NOs: 21-44, 281-304, 306-312, and 320-326; or

at amino acid residue 254 in any one of SEQ ID NOs: 22-44 and 282-304; or at amino acid residue 255, 256, 257, 258, 259, 260, 261, or 262 in any one of SEQ ID

NOs: 23-44, 283-304, 306-312, and 320-326; or

at amino acid residue 263 in any one of SEQ ID NOs: in any one of SEQ ID NOs: 24-44, 284-304, 306-312, and 320-326; or

at amino acid residue 264, 265, 266, or 267 in any one of SEQ ID NOs: 25-44, 285- 304, 306-312, and 320-326; or

at amino acid residue 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301 in any one of SEQ ID NOs: 26-44, 286-304, 306-312, and 320-326; or

at amino acid residue 302 in any one of SEQ ID NOs: 26-44, 286-304, 306-311, and 320-325; or at amino acid residue, 303, 304, 305, 306, 307, 308, 309, 310, 311, 312, or 313 in any one of SEQ ID NOs: 27-44, 287-304, 306-311, and 320-325; or

at amino acid residue 314, 315, 316, 317, 318, 319, 320, 321, 322, 323, or 324 in any one of SEQ ID NOs: 28-44, 288-304, 306-311, and 320-325; or

at amino acid residue 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336,

337, 338, 339, 340, 341, 342, 343, 344, 345, 346, 347, 348, 349, 350, 351, 352, or

353 in any one of SEQ ID NOs: 30-44, 290-304, 306-311, and 320-325; or

at amino acid residue 354 in any one of SEQ ID NOs: 31-44 291-304, 306-311, and

320-325; or

at amino acid residue 355 in any one of SEQ ID NOs: 32-44 292-304, 306-311, and

320- 325; or

at amino acid residue 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, or 421 in any one of SEQ ID NOs: 32-44, 292-304, 307-311, and 321-325; or

at amino acid residue 422, 423, 424, 425, or 426 in any one of SEQ ID NOs: 33-44, 293-304, 307-311, and 321-325; or

at amino acid residue 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451 in any one of SEQ ID NOs: 34-44, 294-304, 307-311, and 321-325; or

at amino acid residue 452, 453, or 454 in any one of SEQ ID NOs: 34-44, 294-304, 307-310, and 321-324; or

at amino acid residue 455 in any one of SEQ ID NOs: 36-44, 296-304, 307-310, and

321- 324; or

at amino acid residue 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, or 466 in any one of SEQ ID NOs: 37-44, 297-304, 307-310, and 321-324; or

at amino acid residue 467, 468, 469, 470, 471, 472, 473, or 474 in any one of SEQ ID NOs: 38-44, 298-304, 307-310, and 321-324; or

at amino acid residue 475, 476, 477, 478, 479, 480, 481, 482, or 483 in any one of SEQ ID NOs: 38-44, 298-304, 307, 308, 310, 321, 323, and 324; or

at amino acid residue 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, or 501 in any one of SEQ ID NOs: 38-44, 298-304, 307, 308, 321, and 323; or

at amino acid residue 502, 503, 504, 505, 506, 507, 508, 509, 510, or 511 in any one of SEQ ID NOs: 39-44, 299-304, 307, 308, 321, and 323; or

at amino acid residue 512 in any one of SEQ ID NOs: 39, 41-44, 299, 301-304, 307, 308, 321, and 323; or at amino acid residue 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524,

• at amino acid residue 878 or 879 in any one of SEQ ID NOs: 41, 43, 44, 301, 303, 304, 307, 308, 321, and 323; or

• at amino acid residue 880or 881 in any one of SEQ ID NOs: 43, 44, 303, 304, 307, 308, 321, and 323; or

• at amino acid residue, 882, 883, 884, 885, 886, 887, 888, 889, 890, 891, 892, 893, 894, 895, 896, 897, 898, 899, 900, 901, 902, 903, 904, 905, 906, 907, 908, 909, 910, 911, 912, 913, 914, 915, 916, 917, 918, 919, 920, 921, 922, 923, 924, 925, 926, 927,

928, 929, 930, 931, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 946, 947, 948, 949, 950, 951, 952, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965 in any one of SEQ ID NOs: 43, 44, 303, 304, 307, and 321; or

• at amino acid residue, 966 or 967 in any one of SEQ ID NOs: 43, 44, 303, and 304; or · at amino acid residue 968, 969, 970, 971, 972, 973, 974, 975, 976, 977, 978, 979,

980, 981, or 982 in SEQ ID NO: 44 or 304.

The possible commencement point in the sequences listed above is of course dependent on the number of contiguous amino acid residues (L) selected: the N-terminal first residue cannot in any case be higher numbered than N-L+ 1, where N is the number of amino acid residues of the sequence among SEQ ID NOs: 1-44 and 261-332 in which the contiguous amino acid residues are found.

As will be apparent from the examples, certain peptides are particularly interesting embodiments of the first aspect of the invention: These embodiments of the first aspect relate to a polypeptide, optionally of 9 to 30 amino acid residues in length, comprise or consist of an amino acid sequence consisting of

• 9, 10, 11, 12, 13, 14 or 15 consecutive amino acid residues from a parent sequence selected from any one of SEQ ID NOs: 45-260 and any one of SEQ ID NOs: 45, 61, 63, 80, 100, 113, 147, 154, 170, 172, 191, 215, 225, 226, 248, and 260, wherein any cysteine residue is/are substituted with a serine residue, an alanine residue or a 2-aminobutyric acid residue, or

• a variant of 9 consecutive amino acid residues from a parent sequence selected from any one of SEQ ID NOs: 45-260, wherein 1 or 2 or 3 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent sequence, or

• a variant of 10 consecutive amino acid residues from a parent sequence selected from any one of SEQ ID NOs: 45-260, wherein 1 or 2 or 3 or 4 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent sequence, or

• a variant of 11 consecutive amino acid residues from a parent sequence selected from any one of SEQ ID NOs: 45-260, wherein 1 or 2 or 3 or 4 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent sequence, or

· a variant of 12 consecutive amino acid residues from a parent sequence selected from any one of SEQ ID NOs: 45-260, wherein 1 or 2 or 3 or 4 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent sequence, or

• a variant of 13 consecutive amino acid residues from a parent sequence selected from any one of SEQ ID NOs: 45-260, wherein 1, 2, 3, 4, or 5 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent sequence, or

• a variant of 14 consecutive amino acid residues from a parent sequence selected from any one of SEQ ID NOs: 45-260, wherein 1, 2, 3, 4, or 5 amino acid residues are substituted with a different amino acid residue in the variant relative to the parent sequence, or • a variant of a parent sequence selected from any one of SEQ ID NOs: 45-260, wherein 1, 2, 3, 4, 5, or 6 amino acids are substituted with a different amino acid in the variant relative to the parent sequence.

In these embodiments, the parent sequence may commence at

- residue 1, 2, 3, 4, 5, 6, or 7 in any one of SEQ ID NOs: 45-260 when the peptide is 9 amino acids in length, or

- residue 1, 2, 3, 4, 5, or 6 in any one of SEQ ID NOs: 45-260 when the peptide is 10 amino acids in length, or

- residue 1, 2, 3, 3, 4, or 5 in any one of SEQ ID NOs: 45-260 when the peptide is 11 amino acids in length, or

- residue 1, 2, 3, or 4 in any one of SEQ ID NOs: 45-260 when the peptide is 12 amino acids in length, or

- residue 1, 2, or 3 in any one of SEQ ID NOs: 45-260 when the peptide is 13 amino acids in length, or

- residue 1 or 2 in any one of SEQ ID NOs: 45-260 when the peptide is 14 amino acids in length.

Thus, in some embodiments of the polypeptide of the first aspect of the invention, the polypeptide comprises or consists of 9 to 15 consecutive amino acid residues of an amino acid sequence set forth in any one of SEQ ID NOs: 45-260 or a variant sequence thereof wherein 1, 2, 3, 4, 5, or 6 amino acids are substituted with a different amino acid in the variant relative to the parent sequence. In such embodiments, the polypeptide may have a length of 9-30 amino acid residues or more, for example 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30, or optionally more amino acid residues. The variant sequence may have the same biological activity or functionality as the parent sequence as defined of polypeptides of option (b) of the first aspect. For example, the variant sequence may result in the same, greater or less ability to bind a Class HLA II allele or a group of Class HLA II alleles or the variant sequence may comprise a T cell epitope, optionally a Th 2 cell epitope. Optionally, the Class HLA II binding is determined with respect to a particular group of Class HLA II alleles, for example one or more or all of the following alleles: DPA1*02: 01-DPB1*01 : 01, DPA1*01 : 03-DPB1*02: 01, DPA1*01 : 03-DPB1*03: 01, DPA1*01 : 03-DPB1*04: 01, DPA1*01 : 03-DPB1*04: 02, DPA1*02: 02-DPB1*05: 01,

DPA1*02: 01-DPB1*14: 01, DQA1*05: 01-DQB1*02: 01, DQA1*05: 01-DQB1*03: 01,

DQA1*03: 01-DQB1*03: 02, DQA1*04: 01-DQB1*04: 02, DQA1*01 : 01-DQB1*05: 01,

DQA1*01 : 02-DQB1*06: 02, DRB1*01 : 01, DRB1*03: 01, DRB1*04: 01, DRB1*04: 05,

DRB1*07: 01, DRB1*09: 01, DRB1*11 : 01, DRB1*12: 01, DRB1*13: 02, DRB1*15: 01,

DRB3*01 : 01, DRB3*02: 02, DRB4*01 : 01 and DRB5*01 : 01. The polypeptide of the first aspect of the invention may in certain embodiments find special use in qualitative or quantitative mass spectrometric determination of polypeptides. In these embodiments, the polypeptide of the invention typically consists of an amino acid sequence identical with the amino acid sequence of a proteolytic fragment of a protein consisting of an amino acid sequence selected from any one of SEQ ID NOs: 1-44, and 261-332 (preferably SEQ ID NOs 1-44 and 305-318, i.e. proteolytic fragments of naturally occurring proteins). Such a proteolytic fragment is typically a tryptic or chymotryptic fragment, but any suitable protease can be used to provide the proteolytic fragment: papain, pepsin, ArgC, LysC, V8 protease, AspN, pronase, and carboxypeptidease C. In certain embodiments, the polypeptide, which has the amino acid sequence of a proteolytic fragment will also include a mass modifying label; see infra for a discussion of labels useful in qMS.

Embodiments of the second aspect of the invention

A composition of the second aspect of the invention comprises one or more of the

polypeptides of the first aspect of the invention. When the composition is for pharmaceutical use, it further comprises a pharmaceutically acceptable carrier, excipient and/or adjuvant, optionally sterile. It will typically be formulated as a vaccine for parenteral or sublingual administration.

Any suitable administration form is useful for the pharmaceutical composition, but one particularly relevant form is a powder, optionally formulated to be re-dissolved before use. Also, fast-dispersing tablets (optionally freeze dried) suitable for sublingual administration or buccal administration are of relevance.

The pharmaceutical composition may be a vaccine, e.g. a product for use in conducting immunotherapy, including but not limited to a vaccine for treating an allergic immune response to mites. The vaccine may be formulated for parenteral administration, such as by subcutaneous, intradermal, transcutaneous administration, e.g. formulated as a powder that optionally may be re-dissolved before use.

A pharmaceutical composition comprises in addition to the peptide combination,

therapeutically inactive ingredients, such as pharmaceutically acceptable or physiologically acceptable excipient(s), carrier(s) and/or adjuvant(s), which are well-known to the person skilled in the art and may include, but are not limited to, solvents, emulsifiers, wetting agents, plasticizers, solubilizers (e.g. solubility enhancing agents), coloring substances, fillers, preservatives, anti-oxidants, anti-microbial agents, viscosity adjusting agents, buffering agents, pH adjusting agents, isotonicity adjusting agents, mucoadhesive substances, and the like. Examples of formulation strategies are well-known to the person skilled in the art.

In some embodiments, the peptide(s) may be formulated (e.g. mixed together) with immune-modifying agents like adjuvants usually applied in immunotherapy products. In some embodiments, the pharmaceutical composition may be formulated for parenteral administration, such as formulated for injection, e.g. subcutaneous and/or intradermal injection. Therefore, in some embodiments, the pharmaceutical composition may be a liquid (i.e. formulated as a liquid), including a solution, a suspension, a dispersion, and a gelled liquid. A liquid pharmaceutical composition may be formed by dissolving a powder, granulate or lyophilizate of a peptide combination described herein in a suitable solvent and then administering to a subject. Suitable solvents may be any solvent having physiologically acceptable properties and able to dissolve the peptide combination in desired concentrations. A desired concentration may depend on the aliquot to be administered (i.e. to be injected) and the desired single dose. It is emphasized that for the purpose of injection the aliquot is in the range of about 10 to 500 microliters, e.g. 50 to 300 microliters or less and a desired single dose is within range of 1 to 1000 nanomoles. Typically the concentration of each peptide is the same, such as in an equimolar concentration, but each peptide of the composition may also be present in different concentrations. Typically, the solvent is an aqueous solution, optionally mixed with other solvents. Thus, a solvent may comprise at least 60% w/w of water, e.g. at least 65% w/w, 70% w/w, 75% w/w, 80% w/w , 85% w/w, 90% w/w or 95% w/w , 99% w/w of water, such as distilled water, such as sterile water. In some embodiments, the solvent is sterile distilled water, e.g. water for injection. An aqueous solution may comprise other solvents than water, for example DMSO (dimethylsulfoxide), glycerol, ethanol, acetonitrile, vegetable or synthetic oils. The pH of the aqueous phase of the solvent may be in a physiological acceptable range, typically in the range of 3 to 9, such as in the range of pH 3 to 8, such as in the range of pH 4 to 8, such as in the range of pH 5 to 8, such as in the range of pH 6 to 8. Thus, the liquid formulation may comprise a pH controlling agent or buffering agent (e.g. citrate buffer, phosphate buffer, acetate buffer), optionally the pH may be adjusted with dilutions of strong base (e.g. sodium hydroxide or the like) and/or dilutions of strong acids (e.g. hydrochloric acid).

Typically, the liquid formulation is isotonic, and optionally sterile. Therefore, in some embodiments, the formulation comprises saline, such as isotonic saline. The liquid may contain additional excipients, such as another solvent, a solubilizing enhancing agent, ionic and non-ionic emulsifiers, a dispersant, a thickener, a preservative, an anti-microbial agent, and/or an antioxidant. Non-limiting illustrative examples of solvents include water, saline, DMSO, glycerol, ethanol, acetonitrile, vegetable or synthetic oils. Typically, the freeze-dried composition may be dissolved before use, for example dissolved in an aqueous, optionally sterile, solution, for example a solution having a pH in the range of 3- 9, such as a pH in the range of 3-8, such as a pH in the range of 4-8. A lyophilizate may contain additional ingredients, e.g. bulking agents and lyoprotectants, buffering,

antioxidants, antimicrobial agents, solubilizers.

A freeze-dried composition may also be formulated into a solid dosage form that is administered for example by the oral route such as by oral mucosa. Thus, in some embodiments, the pharmaceutical composition may be formulated for oral administration, for example for sublingual administration. Therefore, the pharmaceutical composition may be a solid dosage form, such as a freeze-dried solid dosage form, typically a tablet, a capsule or sachet, which optionally may be formulated for fast disintegration. Pharmaceutical formulations and delivery systems appropriate for the compositions, methods and uses of the invention are known in the art (see, e.g., Remington: The Science and Practice of Pharmacy (2003) 20th ed., Mack Publishing Co., Easton, PA; Remington's Pharmaceutical Sciences (1990) 18th ed., Mack Publishing Co., Easton, PA; The Merck Index (1996) 12th ed., Merck Publishing Group, Whitehouse, NJ; Pharmaceutical Principles of Solid Dosage Forms (1993), Technonic Publishing Co., Inc., Lancaster, Pa. ; Ansel ad Soklosa, Pharmaceutical Calculations (2001) 11th ed., Lippincott Williams & Wilkins, Baltimore, MD; and Poznansky et al., Drug Delivery Systems (1980), R. L. Juliano, ed., Oxford, N.Y., pp. 253-315). Peptides may be prone to degradation when exposed to oxygen, for example when exposed to air or solvents containing air. Therefore, in some embodiments, the pharmaceutical composition comprises an inert gas, e.g. argon or nitrogen.

Embodiments relating to the third aspect of the invention

As set forth above, the third aspect relates to a method of treating allergy in a patient, where signs or symptoms of said allergy are elicited in the patient by exposure to house dust mites or storage mites and/or exposure to at least one protein allergen present in house dust mites or storage mites, the method comprising administering, to the patient, a therapeutically effective amount of a polypeptide of the first aspect of the invention, optionally two or more polypeptides of the first aspect of the invention or a composition of the second aspect of the invention.

As discussed above, the inventors have found that the polypeptides of SEQ ID NOs: 1-44 appear to be generally non-allergenic in many patients, thus rendering them highly safe as immunogens used in anti-allergy therapy. It cannot be excluded that occasional patients will be allergic (e.g. have raised IgE antibodies against one or more of the sequences of SEQ ID NOs: 1-44), but it is generally understood that it is advantageous that patients subjected to the immunogens according to the invention are non-allergic towards the protein from which the immunogen is derived, meaning that it is attempted to avoid to treat those patients having detectable IgE levels against the polypeptides of SEQ ID NOs: 1-44 with said polypeptides or a T cell-epitope-containing fragment thereof. Hence, in embodiments of the first aspect, the polypeptide used for the administration is one, wherein exposure of the patient to the polypeptide does not elicit signs or symptoms of allergy in the patient.

Regarding signs or symptoms of allergy, cf. below for further discussion, but it is generally understood that this means signs or symptoms of IgE mediated allergy including that the patient has elicited IgE-antibodies against the polypeptide.

While true prophylaxis of allergy is not excluded when carrying out the method of the third aspect of the invention, it is expected that the method will find particular use in patients that have already experienced mite allergy or have raised IgE antibodies against a mite allergen. Therefore, all embodiments of the third aspect may entail or consist of treating the allergy by relieving or reducing an immune response triggered by exposure to the mites or the protein allergen. Also, treating the allergy can in all embodiments of the third aspect comprise or consist of relieving one or more signs/symptoms of an immune response triggered by exposure to the mites or the protein allergen. Moreover, treating the allergy may in all embodiments of the third aspect consist of or comprise induction of immunological tolerance against the mites or the protein allergen. And treating the allergy may in all embodiments of the third aspect comprise or consist of relieving one or more signs/symptom(s) associated with allergic rhinitis and/or allergic conjunctivitis and/or allergic asthma and/or allergic eczema (e.g. atopic dermatitis).

The signs/symptoms of allergy mentioned above are those typically associated with the allergies treated according to the present invention, typically signs/symptoms may include one or more of the following; itchy running nose, itchy watery eyes, itchy skin and shortness of breath and the patient may experience that the signs/symptoms will to some extent be relieved by treatment with antihistamines or steroids. In a clinical setting, the signs and symptoms may include detectable levels of IgE antibodies against one or more the mites of interest.

In the event that the treatment entails or consists of relieving one or more signs or symptoms associated with allergic rhinitis, the relief is typically

- reduction of the intensity of itchy nose and/or

- reduction of the number of sneezes within a given period (e.g. daily, weekly, monthly) and/or

- reduction of the intensity of blocked nose (congestion) and/or - reduction of the amount of nasal fluid and/or

- reduction of the eosinophilic count in nasal fluid and/or

- reduction of specific IgE antibody level (titre) in nasal fluid or in serum and/or

- reduction of basophil histamine release in blood. It is to be noted that a "sign" of allergy is an objectively observable characteristic of the disease, whereas a "symptom" is the patient's subjective experience(s) relative to the disease. Some signs can be symptoms and vice versa, but if a patient for instance experiences dizziness due to a disease, this can only be categorized as a symptom, because it is not objectively observable by anybody else than the patient. On the other hand, increasing levels of for example IgE-antibodies is a "sign", since it cannot be sensed by the patient but it can be objectively measured in an appropriate assay.

Where treating the allergy comprises or consists of relieving one or more signs or symptoms associated with allergic conjunctivitis, the relief typically comprises

- reducing the intensity of itchy eyes, redness in the white of the eyes and/or watery eyes; and/or

- reducing the eosinophilic count in conjunctival tissue scrapings; and/or

- reducing specific IgE antibody level (titer) in conjunctival tissue scrapings or in serum; and/or

-reducing basophil histamine release in blood. Where treating the allergy comprises or consists of relieving one or more signs or symptoms associated with allergic asthma, the relief typically comprises

- reducing the intensity and/or number of coughs within a given period (e.g. daily, weekly, monthly); and/or

- reducing the intensity of wheezes; and/or

- improving being short of breath; and/or

- improving lung function; and/or

- reducing specific IgE antibody level (titre) in lung fluid or in serum; and/or

- reducing basophil histamine release in blood.

Where treating the allergy comprises or consists of relieving one or more signs or symptoms associated with atopic dermatitis, the relief typically comprises

- reducing itch intensity of the skin; and/or

- reducing eczema score; and/or

- reducing number of (peripheral) blood eosinophils. In all embodiments of the third aspect of the invention, the method may comprise or consist of reducing the patient's need for concomitant treatment with corticosteroids or HI antihistamines to reduce, relieve, or suppress one or more symptoms of an immune response associated with the allergy. In other words, these embodiments have the long term benefit of reducing the patient's need for medication.

As used herein, the term "immunological tolerance" refers to a) a decreased or reduced level of a specific immunological response (thought to be mediated at least in part by antigen- specific effector T lymphocytes, B lymphocytes, antibodies, or a combination thereof); b) a delay in the onset or progression of a specific immunological response; or c) a reduced risk of the onset or progression of a specific immunological response to mites. An increase, improvement, enhancement or induction of "tolerance" may refer to a decrease, reduction, inhibition, suppression, or limiting or controlling or clearing of specific immunological reactivity to an allergen as compared to reactivity to the allergen in a previous exposure to the same allergen. Thus, in certain embodiments, the method comprises inducing

immunological tolerance in a subject to mites, e.g. to an allergen of mites discussed herein to suppress an allergic immune response to the allergen. Immunological tolerance in a subject to an allergen can also be reflected by reducing the occurrence, frequency, severity, progression, or duration of an allergic response of the subject to the allergen. Induction of immune tolerance (also referred to as desensitization), and the relative amount of immune tolerance, can be measured by methods disclosed herein or known to the skilled artisan. For example, induction of immune tolerance can be measured by the modulated lymphokine and/or cytokine level in a subject or animal before versus after administering a peptide combination described herein for the first time. A modulated cytokine level can be an increase of a cytokine level, for instance an increase of a lymphokine and/or cytokine level of at least 1.5, 2, 3, 4, 5, 6, 7, 8, 10, 20, 50 times or more relative to before administering the peptide combination for the first time. Alternatively, modulation can be a decrease of the level of a particular cytokine level, for instance a decrease of the lymphokine and/or cytokine level of at least 1.5, 2, 3, 4, 5, 6, 7, 8, 10, 20, 50 times or more relative to before administering a peptide combination for the first time. The lymphokines/cytokines chosen to be measured can be from any relevant lymphokines/cytokines, such as IL-2, IL-5, IL-4, IL-6, IL-10, IL-12, IL-13, IL-17, TNF-alfa, IFN-gamma, TGF-beta, MCP-1, RANK-L and Flt3L.

Accordingly, the term "inducing immunological tolerance" may include eliciting, stimulating, promoting, increasing or enhancing immunological tolerance. Immunological tolerance may involve modulation of T cell activity, including but not limited to CD4+ T cells, CD8+ T cells, Thl cells, Th2 cells and regulatory T cells (Tregs), and memory T cells, including

inflammatory lymphokines/cytokines produced by T cells. The patients subjected to the treatment of the third aspect of the invention typically present with an immune response clinically presented as atopic dermatitis, urticaria, contact dermatitis, allergic conjunctivitis, allergic rhinitis, allergic asthma, anapylaxis, and/or hay fever. In particular advantageous versions of any one of the embodiments of the third aspect of the invention, the treatment thus decreases, reduces, suppresses or inhibits atopic dermatitis, urticaria, contact dermatitis, allergic conjunctivitis, allergic rhinitis, allergic asthma, anaphylaxis, and/or hay fever.

Without being bound to any theory, it is believed that the method of the third aspect of the invention is capable of increasing an IgG antibody response in the patient to a protein allergen of the mites and/or of decreasing an IgE antibody response in the patient to a protein allergen of the mites and/or of decreasing a T cell response in the patient against a protein allergen of the mites, since each one of these physiological effects have a beneficial effect on the signs and symptoms of allergy. Hence, in advantageous versions of all embodiments of the third aspect of the invention, the method does provide for increasing an IgG antibody response in the patient to a protein allergen of the mites and/or for decreasing an IgE antibody response in the patient to a protein allergen of the mites and/or for decreasing a T cell response in the patient against a protein allergen of the mites.

It will be understood that the patients that are subjected to the method of the third aspect of the invention are typically sensitized to at least one protein allergen of the mites. It is to be understood that such patients may exhibit allergy signs or experience symptoms of allergy, but it is not excluded that "patients" that merely exhibit clinical signs of being sensitized against at least one protein allergy of the mites will also benefit from the treatment.

The allergy treated according to the invention is in all embodiments of the third aspect of the invention allergy towards house dust mites of the genus Dermatophagoides (for example selected from the group consisting of Dermatophagoides pteronyssinus, Dermatophagoides farinae) or of the genus Euroglyphus (for example Euroglyphus maynei), or wherein the mites are storage mites of the genus Glycyphagus, Lepidoglyphus, Tyrophagus, or Blomia (for example Glycyphagus domesticus, Lepidoglyphus destructor, Tyrophagus putrescentiae, or Blomia tropica I is). Consequently, the protein allergen is in all embodiments of the third aspect of the invention selected from one or more protein allergens in the groups consisting of

- a group 1 allergen of mites (for example a group 1 allergen of a house dust mite (e.g. Der p 1, Der f 1, or Eur m 1, or a group 1 allergen of a storage mite, e.g. Gly d 1, Lep d 1, Typ p 1 and Bio t 1) and

- a group 2 allergen of mites (for example a group 2 allergen of a house dust mite, e.g. Der p 2, Der f 2 and Eur m 2, and a group 2 allergen of a storage mite, e.g. Gly d 2, Lep d 2, Typ p 2 and Bio t 2).

As mentioned above, a particular embodiment of the third aspect of the invention entails that a polypeptide of the first aspect or a composition of the second aspect does not elicit signs or symptoms of allergy. These signs and symptoms are in important embodiments selected from the group consisting of:

- the presence in the patient of specific IgE antibodies that binds to the a polypeptide of the first aspect or a composition of the second aspect (e.g. the level of specific IgE is below the detection level when tested in an assay measuring specific IgE (e.g. ImmunoCAP® Specific IgE Blood Test), for example the level is below 0.7 kU/L, when tested by an ImmunoCAP® test;

- serum histamine release induced by a polypeptide of the first aspect or a composition of the second aspect is below the detection level when tested in a basophil activation test (BAT)

- a positive skin prick test with a polypeptide of the first aspect or a composition of the second aspect; and

- the signs or symptoms discussed in detail supra.

In certain embodiments of third aspect of the invention, a polypeptide of the first aspect or a composition of the second aspect is formulated together with a pharmaceutically and immunologically acceptable carrier, vehicle or excipient. When exercising the method of the third aspect of the invention and any embodiment thereof, a polypeptide of the first aspect or a composition of the second aspect may further be formulated together with an immunological adjuvant. Also a polypeptide of the first aspect or a composition of the second aspect may be formulated with a suitable carrier, diluent, or vehicle. It is particularly preferred that a polypeptide of the first aspect or a composition of the second aspect is administered by the parenteral route to the patient, such as via a route of administration selected from any one of subcutaneous, intradermal, epicutaneous, topical, sublingual, buccal, intranasal, respiratory and the intralymphatic route. In particular the sublingual and buccal routes are of interest. A polypeptide of the first aspect or a composition of the second aspect may also be administered to a subject in need thereof by injection, such as by subcutaneous or intradermal administration, but may also include other routes of administration, such as epicutaneous, transcutaneous, topical, rectal, oral, intranasal, respiratory and intralymphatic route of administration. Typically, the subject in need thereof is a human, a pet such as a dog or a cat, a domestic animal such as a horse, or a laboratory animal (a mouse, a guinea pig or a rabbit). The subject may be sensitized to mites (e.g. having specific IgE antibodies against an allergen of mites and/or having a T cell response against an allergen of mites). Therefore, a subject in need thereof may produce specific IgE antibodies or a T cell response against mite allergens.

A polypeptide of the first aspect or a composition of the second aspect may be formulated for injection or for sublingual administration (e.g. a solid dosage form such as a tablet, and in particular a freeze-dried tablet) or is formulated in a composition as described infra for the compositions of the invention. Typically, a polypeptide of the first aspect or a composition of the second aspect is administered several times, i.e. repeatedly, such as in weekly, by-weekly, monthly or quarterly intervals.

As will be understood from the above, the allergy is, according to the third aspect of the invention and any embodiments thereof, preferably treated by immunotherapy. The patient in question need not be human, since many pets suffer from allergy towards the mites discussed above. As such, the patient may be human or a mammal, such as a cat, dog, and a horse.

A pharmacologically effective amount of a single dose of a polypeptide of the first aspect or a composition of the second aspect may be in the range of 1 to 1000 nanomole, for example 1 to 500 nanomole, for example 1 to 250 nanomole, for example 5 to 250 nanomole. Typically, a polypeptide or composition of the invention is administered as a liquid in a volume of about 50 to 150 microliter, such as by intradermal administration.

Embodiments of diagnostic aspects of the invention

The fourth aspect of the invention relates to an in vitro method of determining whether T cells of a subject are responsive to one or more polypeptides of the first aspect or a composition of the second aspect. The method comprises contacting T cells obtained from the subject with said polypeptide(s) or composition(s) and determining whether the T cells are stimulated.

The fifth aspect of the invention relates to an in vitro method of diagnosing a subject for sensitization or allergy to house dust mites or storage mites, comprising contacting T cells obtained from the subject one or more polypeptides of the first aspect or a composition of the second aspect and determining whether the T cells are stimulated. The sixth aspect of the invention relates to an in vitro method for determining whether a subject has, or is at risk of developing, an allergy to house dust mites or storage mites, comprising contacting T cells obtained from the subject with one or more polypeptides of the first aspect or a composition of the second aspect and determining whether the T cells are stimulated.

A number of assay formats are available for the purpose of determining T cell stimulation and are well known for the person skilled. For instance ELISPOT/ Fluorospot, simple proliferation assays as well as the assay disclosed in Example 2 are all useful for the purpose of determining T cell responsiveness. The seventh aspect of the invention relates to an in vitro method of diagnosing a subject for allergy or sensitivity to house dust mites or storage mites, comprising determining the presence of specific IgE against one or more polypeptides of the first aspect or a composition of the second aspect in a biological sample (e.g. serum, plasma or blood) obtained from the subject. Any conventional antibody based immune assay is useful for this purpose and include enzyme linked immune sorbent assays (ELISAs), radioimmune assays (RIAs), immunoblotting techniques, etc. but also cell based assays such as measurement of histamine release induced byan analyte in a basophil activation test (BAT).

The eighth aspect of the invention relates to a diagnostic kit comprising one or more polypeptides of the first aspect or a composition of the second aspect. Such a kit will normally also include necessary detection agents, visualisation means, carriers etc. that enable one or more of the above-described diagnostic assays.

Other aspects of the invention

The ninth aspect of the invention relates an isolated nucleic acid fragment, which comprises i) a nucleotide sequence encoding a polypeptide according to the first aspect of the invention, or

ii) a nucleotide sequence complementary to the nucleotide sequence in i)-v).

A tenth aspect of the invention relates to a vector comprising a nucleic acid sequence of the invention, such as a cloning vector or an expression vector. Such a vector conventionally may include, in operable linkage and in the 5'-3' direction,

- an expression control region comprising an enhancer/promoter for driving expression of the nucleic acid fragment defined in option i) for the nucleic acid fragment of the invention,

- an optional signal peptide coding sequence, - a nucleotide sequence defined in option i) for the nucleic acid of the invention, and

- an optional terminator.

The expression control region may drive expression in prokaryotic cell such as a bacterium, e.g. in E coli, but it may in certain instances be necessary to includes expression control regions suitable for eukaryotic cells and in certain cases this applies in particular to plant cells.

The vector may be capable of autonomous replication and/or it may be capable of being integrated into the genome of a host cell - the latter is of particular of relevance when constructing cells and cell lines that are capable of stable expression of the nucleic acid fragment of the invention.

Suitable vectors are a virus, such as an attenuated virus, a bacteriophage, a plasmid, a minichromosome, and a cosmid.

It will be understood that the nucleic acid fragments of the invention may be used for both production purposes, so such vectors will typically be in the form of cloning vectors or expression vectors.

Such a vector of the invention often comprises in operable linkage and in the 5'-3' direction, an expression control region comprising an enhancer/promoter for driving expression of the nucleic acid, an optional signal peptide coding sequence, a nucleotide sequence of the invention, and optionally a terminator. Hence, such a vector constitutes an expression vector useful for effecting production in cells of a polypeptide of the invention. Since the

polypeptides of the invention are of mite origin, recombinant production has to be effected in host cells that can express the coding nucleic acid. Bacterial host cells may be used in some cases. However, if the vector is to drive expression in eukaryotic cell, the expression control region should be adapted to this particular use. For production purposes it is therefore often convenient that the expression control region drives expression in a prokaryotic cell such as a bacterium, e.g. in E. coli, or in a eukaryotic cell such as a fungal cell, a plant cell, an insect cell, or a mammalian cell.

Also, for production purposes, it is practical that the vector is capable of integrating the nucleic acid into the genome of a selected host cell - this is particularly useful if the vector is use in the production of stably transformed cells, where the progeny will also include the genetic information introduced via the vector. Alternatively, vectors incapable of being integrated into the genome of a piscine host cell are useful in early screening of production cells.

Polypeptides of the invention may as indicated be encoded by a nucleic acid molecule comprised in a vector. A nucleic acid sequence can be "heterologous," which means that it is in a context foreign to the cell in which the vector is being introduced, which includes a sequence homologous to a sequence in the cell but in a position within the host cell where it is ordinarily not found.

Vectors include naked DNAs, RNAs, plasmids, cosmids, viruses (bacteriophage, animal viruses, and plant viruses), and artificial chromosomes (e.g., YACs). One of skill in the art would be well equipped to construct a vector through standard recombinant techniques. In addition to encoding the polypeptides of this invention, a vector of the present invention may encode polypeptide sequences such as a "tag" or immunogenicity enhancing peptide (e.g. an immunogenic carrier or a fusion partner that stimulates the immune system, such as a cytokine or active fragment thereof). Useful vectors encoding such fusion proteins include pIN vectors, vectors encoding a stretch of histidines, and pGEX vectors, for use in generating glutathione S-transferase (GST) soluble fusion proteins for later purification and separation or cleavage.

Vectors of the invention may be used in a host cell to produce a polypeptide of the invention that may subsequently be purified for administration. Expression vectors can contain a variety of "control sequences," which refer to nucleic acid sequences necessary for the transcription and possibly translation of an operably linked coding sequence in a particular host cell. In addition to control sequences that govern transcription and translation, vectors and expression vectors may contain nucleic acid sequences that serve other functions as well and are described infra. 1. Promoters and Enhancers

A "promoter" is a control sequence. The promoter is typically a region of a nucleic acid sequence at which initiation and rate of transcription are controlled. It may contain genetic elements at which regulatory proteins and molecules may bind such as RNA polymerase and other transcription factors. The phrases "operatively positioned," "operatively linked," "under control," and "under transcriptional control" mean that a promoter is in a correct functional location and/or orientation in relation to a nucleic acid sequence to control transcriptional initiation and expression of that sequence. A promoter may or may not be used in conjunction with an "enhancer," which refers to a cis-acting regulatory sequence involved in the transcriptional activation of a nucleic acid sequence.

A promoter may be one naturally associated with a gene or sequence, as may be obtained by isolating the 5' non-coding sequences located upstream of the coding segment or exon. Such a promoter can be referred to as "endogenous." Similarly, an enhancer may be one naturally associated with a nucleic acid sequence, located either downstream or upstream of that sequence. Alternatively, certain advantages will be gained by positioning the coding nucleic acid segment under the control of a recombinant or heterologous promoter, which refers to a promoter that is not normally associated with a nucleic acid sequence in its natural environment. A recombinant or heterologous enhancer refers also to an enhancer not normally associated with a nucleic acid sequence in its natural state. Such promoters or enhancers may include promoters or enhancers of other genes, and promoters or enhancers isolated from any other prokaryotic, viral, or eukaryotic cell, and promoters or enhancers not "naturally occurring," i.e., containing different elements of different transcriptional regulatory regions, and/or mutations that alter expression. In addition to producing nucleic acid sequences of promoters and enhancers synthetically, sequences may be produced using recombinant cloning and/or nucleic acid amplification technology, including polymerase chain reaction in connection with the compositions disclosed herein.

It may be important to employ a promoter and/or enhancer that effectively direct(s) the expression of the DNA segment in the cell type or organism chosen for expression. Those of skill in the art of molecular biology generally know the use of promoters, enhancers, and cell type combinations for protein expression. The promoters employed may be constitutive, tissue-specific, or inducible and in certain embodiments may direct high level expression of the introduced DNA segment under specified conditions, such as large-scale production of recombinant proteins or peptides.

Examples of inducible elements, which are regions of a nucleic acid sequence that can be activated in response to a specific stimulus, include but are not limited to Immunoglobulin Heavy Chain, Immunoglobulin Light Chain, T Cell Receptor, HLA DQa and/or DQ , β- Interferon, Interleukin-2, Interleukin-2 Receptor, MHC Class II 5, MHC Class II HLA-DRa, β- Actin, Muscle Creatine Kinase (MCK), Prealbumin (Transthyretin), Elastase I, Metallothionein (MTII), Collagenase, Albumin, a-Fetoprotein, γ-Globin, β-Globin, c-fos, c-HA-ras, Insulin, Neural Cell Adhesion Molecule (NCAM), al-Antitrypain, H2B (TH2B) Histone, Mouse and/or Type I Collagen, Glucose-Regulated Proteins (GRP94 and GRP78), Rat Growth Hormone, Human Serum Amyloid A (SAA), Troponin I (TN I), Platelet-Derived Growth Factor (PDGF), Duchenne Muscular Dystrophy, SV40, Polyoma, Retroviruses, Papilloma Virus, Hepatitis B Virus, Human Immunodeficiency Virus, Cytomegalovirus (CMV) IE, and Gibbon Ape Leukemia Virus.

Inducible Elements include MT II - Phorbol Ester (TFA)/Heavy metals; MMTV (mouse mammary tumor virus) - Glucocorticoids; β-Interferon - poly(rl)x/poly(rc); Adenovirus 5 E2 - EIA; Collagenase - Phorbol Ester (TPA); Stromelysin - Phorbol Ester (TPA); SV40 - Phorbol Ester (TPA); Murine MX Gene - Interferon, Newcastle Disease Virus; GRP78 Gene - A23187; α-2-Macroglobulin - IL-6; Vimentin - Serum; MHC Class I Gene H-2Kb - Interferon; HSP70 - E1A/SV40 Large T Antigen; Proliferin - Phorbol Ester/TPA; Tumor Necrosis Factor - PMA; and Thyroid Stimulating Hormonea Gene - Thyroid Hormone. Also contemplated as useful in the present invention are the dectin-1 and dectin-2 promoters. Additionally any promoter/enhancer combination (as per the Eukaryotic Promoter Data Base EPDB) could also be used to drive expression of structural genes encoding oligosaccharide processing enzymes, protein folding accessory proteins, selectable marker proteins or a heterologous protein of interest. The particular promoter that is employed to control the expression of peptide or protein encoding polynucleotide of the invention is not believed to be critical, so long as it is capable of expressing the polynucleotide in a targeted cell, preferably a bacterial cell. Where a mammalian cell is targeted, it is preferable to position the polynucleotide coding region adjacent to and under the control of a promoter that is capable of being expressed in a mammalian cell. Generally speaking, such a promoter might include either a bacterial, human or viral promoter.

In various embodiments, the human cytomegalovirus (CMV) immediate early gene promoter, the SV40 early promoter, and the Rous sarcoma virus long terminal repeat can be used to obtain high level expression of a related polynucleotide to this invention. The use of other viral or mammalian cellular or bacterial phage promoters, which are well known in the art, to achieve expression of polynucleotides is contemplated as well.

A specific initiation signal also may be required for efficient translation of coding sequences. These signals include the ATG initiation codon or adjacent sequences. Exogenous

translational control signals, including the ATG initiation codon, may need to be provided. One of ordinary skill in the art would readily be capable of determining this and providing the necessary signals. It is well known that the initiation codon must be "in-frame" with the reading frame of the desired coding sequence to ensure translation of the entire insert. The exogenous translational control signals and initiation codons can be either natural or synthetic and may be operable in bacteria or mammalian cells. The efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements.

In certain embodiments of the invention, the use of internal ribosome entry sites (IRES) elements are used to create multigene, or polycistronic, messages. IRES elements are able to bypass the ribosome scanning model of 5' methylated Cap dependent translation and begin translation at internal sites. IRES elements from two members of the picornavirus family (polio and encephalomyocarditis) have been described, as well an IRES from a mammalian message. IRES elements can be linked to heterologous open reading frames. Multiple open reading frames can be transcribed together, each separated by an IRES, creating

polycistronic messages. By virtue of the IRES element, each open reading frame is accessible to ribosomes for efficient translation. Multiple genes can be efficiently expressed using a single promoter/enhancer to transcribe a single message (see U.S. Patents 5,925,565 and 5,935,819, herein incorporated by reference).

2. Multiple Cloning Sites Vectors can include a multiple cloning site (MCS), which is a nucleic acid region that contains multiple restriction enzyme sites, any of which can be used in conjunction with standard recombinant technology to digest the vector. Frequently, a vector is linearized or fragmented using a restriction enzyme that cuts within the MCS to enable exogenous sequences to be ligated to the vector. Techniques involving restriction enzymes and ligation reactions are well known to those of skill in the art of recombinant technology.

3. Splicing Sites

Most transcribed eukaryotic RNA molecules will undergo RNA splicing to remove introns from the primary transcripts. If relevant in the context of vectors of the present invention, vectors containing genomic eukaryotic sequences may require donor and/or acceptor splicing sites to ensure proper processing of the transcript for protein expression.

4. Termination Signals

The vectors or constructs of the present invention will generally comprise at least one termination signal. A "termination signal" or "terminator" is comprised of the DNA sequences involved in specific termination of an RNA transcript by an RNA polymerase. Thus, in certain embodiments a termination signal that ends the production of an RNA transcript is contemplated. A terminator may be necessary in vivo to achieve desirable message levels. In eukaryotic systems, the terminator region may also comprise specific DNA sequences that permit site-specific cleavage of the new transcript so as to expose a polyadenylation site. This signals a specialized endogenous polymerase to add a stretch of about 200 A residues (poly A) to the 3' end of the transcript. RNA molecules modified with this polyA tail appear to more stable and are translated more efficiently. Thus, in other embodiments involving eukaryotic cells, it is preferred that that terminator comprises a signal for the cleavage of the RNA, and it is more preferred that the terminator signal promotes polyadenylation of the message.

Terminators contemplated for use in the invention include any known terminator of transcription described herein or known to one of ordinary skill in the art, including but not limited to, for example, the bovine growth hormone terminator or viral termination sequences, such as the SV40 terminator. In certain embodiments, the termination signal may be a lack of transcribable or translatable sequence, such as due to a sequence truncation.

5. Polyadenylation Signals In expression, particularly eukaryotic expression, one will typically include a polyadenylation signal to effect proper polyadenylation of the transcript. The nature of the polyadenylation signal is not believed to be crucial to the successful practice of the invention, and/or any such sequence may be employed. Preferred embodiments include the SV40 polyadenylation signal and/or the bovine growth hormone polyadenylation signal, convenient and/or known to function well in various target cells. Polyadenylation may increase the stability of the transcript or may facilitate cytoplasmic transport.

6. Origins of Replication

In order to propagate a vector in a host cell, it may contain one or more origins of replication sites (often termed "on"), which is a specific nucleic acid sequence at which replication is initiated. Alternatively an autonomously replicating sequence (ARS) can be employed if the host cell is yeast.

7. Selectable and Screenable Markers

In certain embodiments of the invention, cells containing a nucleic acid of the present invention may be identified in vitro or in vivo by encoding a screenable or selectable marker in the expression vector. When transcribed and translated, a marker confers an identifiable change to the cell permitting easy identification of cells containing the expression vector. Generally, a selectable marker is one that confers a property that allows for selection. A positive selectable marker is one in which the presence of the marker allows for its selection, while a negative selectable marker is one in which its presence prevents its selection. An example of a positive selectable marker is a drug resistance marker.

Usually the inclusion of a drug selection marker aids in the cloning and identification of transformants, for example, markers that confer resistance to neomycin, puromycin, hygromycin, DHFR, GPT, zeocin or histidinol are useful selectable markers. In addition to markers conferring a phenotype that allows for the discrimination of transformants based on the implementation of conditions, other types of markers including screenable markers such as GFP for colorimetric analysis. Alternatively, screenable enzymes such as herpes simplex virus thymidine kinase (tk) or chloramphenicol acetyltransferase (CAT) may be utilized. One of skill in the art would also know how to employ immunologic markers that can be used in conjunction with FACS analysis. The marker used is not believed to be important, so long as it is capable of being expressed simultaneously with the nucleic acid encoding a protein of the invention. Further examples of selectable and screenable markers are well known to one of skill in the art.

The eleventh aspect of the invention relates to a cell which is transformed so as to carry the vector of the invention - particularly preferred transformed cells are also capable of expressing the nucleic acid fragment of the invention in order to enable production of the polypeptides disclosed herein. The transformed cell may hence be capable of replicating the nucleic acid fragment defined in option i) or ii) of the ninth aspect of the invention and/or capable of expressing said nucleic acid fragment.

Depending on the particular use of the transformed cell it can be of prokaryotic or eukaryotic origin cell. Preferred prokaryotic cells are bacteria selected from the group consisting of Escherichia (such as E. coli .), Bacillus (e.g. Bacillus subtilis), Salmonella, and Mycobacterium, preferably non-pathogenic, e.g. M. bovis BCG. Preferred eukaryotic cells are fungal cells, insect cells, mammalian cells, and plant cells.

For production purposes, it is preferred that the cell is stably transformed by having the nucleic acid defined in option i) or ii) of the ninth aspect of the invention stably integrated into its genome.

Also for production purposes, it is preferred that the transformed cell secretes or carries on its surface the polypeptide disclosed herein - when the cell is a bacterium, it may be advantageous that secretion is into the periplasmic space or into the culture medium. The twelfth aspect of the invention is a cell line derived from a transformed cell of the invention. In particular clonal cell lines are interesting.

The twelfth aspect of the invention relates to a method for the preparation of the polypeptide disclosed herein, comprising

- culturing a transformed cell or cell line of the invention under conditions that facilitate that the transformed cell expresses the nucleic acid fragment according to option i) of the ninth aspect of the invention and subsequently recovering said polypeptide, or

- preparing said polypeptide by means of solid or liquid phase peptide synthesis.

The twelfth aspect may be preceded by steps that include recombinant preparation of the cell or cell line of the invention, i.e. introduction of a vector of the invention into a host cell and propagation and selection of those transformed cells that effectively express the nucleic acid of the invention.

The thirteenth aspect of the invention relates to antibodies that specifically bind and recognize a polypeptide of the first aspect of the invention, in particular the polypeptides having the amino acid sequences set forth in SEQ ID NO: 1-44 and 261-332 (preferably SEQ ID NOs: 1-44 and 305-318).

As such, the antibody may be an isolated polyclonal antibody, which has been raised against the polypeptide of the first aspect of the invention. In this connection "isolated" is intended to mean that the polyclonal antibody is essentially free from antibody species that bind non- specifically to the polypeptide of the first aspect. Another way to phrase this is that the polyclonal antibody of the present invention is essentially free from antibody species that have K D values > 10 "5 for binding to the polypeptide of the first aspect of the invention.

Polyclonal antibodies of the invention can be obtained from any mammalian species of convenience: the antibody can e.g. be isolated from a rabbit, mouse, rat, cat, dog, horse, cow, camel, llama, or even a human being.

Also, the antibody can be a monoclonal antibody or a fragment or analogue thereof, which specifically binds the polypeptide of the first aspect of the invention.

A "fragment or analogue" of a monoclonal antibody comprises at least the antigen-binding or variable regions of the monoclonal antibody. Examples of antibody fragments/analogues include Fab, Fab', F(ab) 2 , F(ab') 2 , F(ab) 3 , Fv (typically the V L and V H domains of a single arm of an antibody), single-chain Fv (scFv), dsFv, Fd fragments (typically the V H and C H 1 domain), and dAb (typically a V H domain) fragments; V H , V L , VhH, and V-NAR domains; minibodies, diabodies, triabodies, tetrabodies, and kappa bodies (see, e.g ., Ill et al ., Protein Eng 1997; 10: 949-57) ; camel or llama IgG; IgNAR; and multispecific antibody fragments formed from antibody fragments, and one or more isolated CDRs or a functional paratope, where isolated CDRs or antigen-binding residues or polypeptides can be associated or linked together so as to form a functional antibody fragment. Various types of antibody fragments have been described or reviewed in, e.g ., Holliger and Hudson, Nat Biotechnol 2005; 23, 1126-1136; WO 2005/040219, and published U.S. Patent Applications 2005/0238646 and 2002/0161201, all of which are incorporated by reference herein.

The monoclonal antibody of the invention, the fragment, or the analogue thereof may also be presented in the form of a "derivative", wherein one or more of the amino acids of the monoclonal antibody, the fragment, or the analogue are chemically modified, e.g. , by alkylation, PEGylation, acylation, ester formation or amide formation or the like, e.g ., for linking the antibody to a second molecule. This includes, but is not limited to, PEGylated antibodies, cysteine-PEGylated antibodies, and variants thereof. Monoclonal antibody are preferably those having high affinity for the polypeptide of the first aspect of the present invention. Typically, high affinities, expressed as a K D of less than 10 "5 are preferred, and even lower K D values are preferred, such as less than 10 "7 , 10 "8 , 10 "9 , 10 "10 , 10 "11 , or 10 "12 .

The fourteenth aspect of the invention relates to methods of detecting, quantitatively or qualitatively, the presence in a sample of a polypeptide of the first aspect of the present invention. For example, the sample may be an allergen extract or an immunotherapy product comprising an allergen extract or a polypeptide of the first aspect. The method may be performed in order to characterise an allergen extract, either qualitatively or quantitatively. Any convenient detection method may be employed . For instance, many such methods (which are by nature qualitative or semi-quantitative) rely on the use of specifically binding antibodies. For instance, detection may entail contacting the sample with an antibody of the thirteenth aspect of the invention and detecting specific binding of material in said sample to said antibody. Such assays may have very simple formats and can e.g . be in the form of agglutination assays or immunoblots (dot blot analysis, quantitative dot blot, Western blot) of any format. To facilitate detection the antibody may be labelled with a radioactive isotope, a component of a ligand/receptor pair, a luminescent or fluorescent label, an enzyme, etc.

Possible formats for use in immune detection are for instance

- contacting the sample with a system comprising a solid phase with an antibody of the thirteenth aspect coupled thereto and comprising a labelled polypeptide (as described above) of the first aspect of the invention, where said labelled polypeptide specifically binds said antibody, and gauging the degree of competition exerted by material in the sample on the binding between said labelled polypeptide and said antibody; hence, this format is in the form of a competitive binding assay where the ability of the sample to out-compete a polypeptide of the first aspect is gauged,

- contacting the sample with a system comprising 1) a solid phase with a polypeptide of the first aspect coupled thereto and comprising 2) a labelled antibody of the thirteenth aspect, where said polypeptide specifically binds said labelled antibody, and gauging the degree of competition exerted by material in the sample on the binding between said polypeptide and said antibody; also this is a competitive assay, but here the ability of the sample to attract labelled antibody is gauged.

The latter immune assays may be put into practice in a number of format known per se in the art: ELISAs, RIAs, etc.

A further possibility is to utilise the polypeptides of the first aspect in similar assay formats but with a view to identifying IgE antibodies in a sample. In such assays (e.g . RAST assays), possible presence of anti-polypeptide IgE is gauged by either indirect assays (competitive assays) or in assays that determine direct binding between polypeptide and antibody.

A further embodiment of the method of the fourteenth aspect of the invention relates to mass spectrometric identification or quantification of a polypeptide of the first aspect of the invention in a sample, for example in an allergen extract or an immunotherapeutic product comprising an allergen extract or a polypeptide of the first aspect. In essences, the polypeptide material of a sample is subjected to proteolytic treatment and the thus obtained material is subsequently subjected to quantitative MS, optionally using at least one polypeptide of the first aspect or a fragment of said polypeptide, which is obtainable by the same proteolytic treatment as the sample, but often produced synthetically. Thus, a further embodiment of the invention relates to a synthetically produced fragment of a polypeptide of the first aspect, which is identical to a fragment produced by proteolytic treatment of said polypeptide. Proteolytic treatment may be performed with trypsin or chymotrypsin or other enzymes known in the art. The synthetically produced fragment may be used in the mass spectrometric identification or quantification of said polypeptides. This method is in particular useful if the polypeptide tested has any one of SEQ ID NO: 1-44, and 261-332, but in particular of the naturally occurring polypeptides having SEQ ID NOs: 1-44 and 305-318.

Methods for qualitative determination for instance involve mass fingerprinting methods as those taught in Trauger A. et al. (2002), Spectroscopy. 16 (1) : 15-28. For relevant teachings pertaining to quantitative determination, reference is made to Wells W et a/. (2006), Journal of Proteome Research. 5 (3) : 651-658, as well as to Bret, Cooper and J. Feng and W. Garrett (2010), Spectroscopy. 21 (9) : 1534-1546, Haqqani AS et a/. (2008), Methods Mol. Biol. 439: 241-56. These references are incorporated by reference herein. If employing labelling of standard peptides for use in qMS, SILAC (stable isotope labeling by amino acids in cell culture), trypsin-catalyzed 18 0 labelling, ICAT (isotope coded affinity tagging), and iTRAQ (isobaric tags for relative and absolute quantitation) are useful. "Semiquantitative" mass spectrometry may be performed without labelling of samples, e.g. with MALDI analysis (in linear mode). The peak intensity, or the peak area, from individual proteins is correlated to the total amount of protein in the sample. Other types of "label-free" quantitative mass spectrometry uses the spectral counts (or peptide counts) of enzyme digested proteins as a means for determining relative protein amounts.

It is however preferred to employ labelled standard peptides in the qMS methods. Reference is generally made to the quantification methods taught in WO 2007/031080. SEQUENCES

The amino acid sequences of the polypeptides of the present invention are set forth in the sequence listing. For ease of reference, the sequences are provided as follows, together with the alternative designation used herein as well as their origin.

SEQ ID NO: 13; Cluster ID (L) 96; Cluster ID (A) 55

Protein name: AOOOl; Species: Dermatophagoides farinae

KKTKDCDVEK PIRECLKNGL LRYSDGQKIN QFPDSIEDLN RACEELKKSE TCARNFIDTC TETSYEKRSL DSLLDGIQRV LKRLCRSQSK KEQLLQNVGC ANSVVQDTKL CLKNYRMLVF AANKLNDKSK IMRILCCKSR KVAPCIGEAM KSKGNAVCSA KNIDYFREMH QNIKAEMTAV VCSDFERDQC ENVEVPAITE AEYKDQNIFN PLRDLYKKVI LA SEQ ID NO: 14; Cluster ID (L) 96; Cluster ID (A) 55

Protein name: AOOOl; Species: Dermatophagoides pteronyssinus

KKSPDCDIER PIRECLKDGL LRYSSGQKIN QFPDTIQDLN RACEELKKSE TCARTFIDTC TESSYEKRSL DSLLDGIQRV MKRLCRSQTK KEKLLENVGC ANSVVQDTKQ CLKNYRMLVF AANKLDNKNK IMRILCCKSR KVAPCIGEAM KAKGTAVCSA KNIDYFKDVH QNIKQEMTAV VCSDFERDQC ENVDVPNISE SEYKDQNIFN PLRDLYKKVI LG

SEQ ID NO: 2; Cluster ID (L) 65; Cluster ID (A) 74

Protein name: A0003; Species: Dermatophagoides farinae MAIDGKYQME SSEHFEEFVK EMGLDVDMTN VDLSKTSTME ICKDGDVYHI KSETAGIAHE IKFKVGEEFE DDMNGHKFKN VVTMECDNKM VQKKTSADGG KVVNWREFT DAGCTVKSTY NTVTWTRVYK RM

SEQ ID NO: 3; Cluster ID (L) 65; Cluster ID (A) 74

Protein name: A0003; Species: Dermatophagoides pteronyssinus

MAIDGKYQME SSEHFEEFVK EMGLDVDMTN VDLSKTSTME ICKDGDVYHI KSETAGIAHE IKFKVGEEFE DDMNGHKFKN VVTMECDNKM VQKKTSADGG KVVNWREFT DAGCTVKSTY NTVTWTRVYK RM

SEQ ID NO: 305

Protein name: A0003; Species: Blomia tropicalis

GKYQLESSEN FDEFLKELGV NFILRNLAKT SKPTIEITLD GDTYTIKTIT TLKTSVITFK IGEEFEESRM DGKTVKTVIT QEGDKLIQVQ QGDKEVKIVR EFTETHLTTI CTVGEITSTR VYKRV

SEQ ID NO: 34; Cluster ID (L) 46; Cluster ID (A) 21

Protein name: A0006; Species: Dermatophagoides farinae

DSNSDTTFIF NGDGCEQNHL FQTRYRPQIQ QLASDVQRI I DHVMSVNESG RTYRQLAEFV DRFGSRLTGT

KNLEDSIDYM IDLLRQEGHD NVHGESVQVP RWTRGNEWAR MIKPREKKLN ILGLGYSEGT NGQTIEAPIV WRNFTELEQ KSRLIPGKIV VYNFHYESYG KQAIYRHSGA SRAAEFGAVA AMIRSLTPFS IDSPHTGMQT

YDVNVTRI PA ISITAEDADL FQRFSDRNEE VIVQIYSENR NEKEQGISRN TVSDIRGEQY PDEIVLVSGH

IDSWDVGQGA LDDGAGSFIS WRALSVIKQL GLRPKRTMRS ILWTGEEFGL IGVYDYVKKH QNELKNYVLA

MESDIGTFTP KGITFSGRNS TSQCTLWEIL QLMHPINATT LTISTEGSDV QAFYENGVPI SSLDTANDKY

FYFHHTQGDT MTVEQSDDLD KCQALWTSIS YALAMLDDRL PR SEQ ID NO: 35; Cluster ID (L) 46; Cluster ID (A) 21

Protein name: A0006; Species: Dermatophagoides pteronyssinus

DSNPGETSIF NGEGCANDQL FQTRIRPQIQ QLASNVQRI I DHVMSANESG RTYRQLAEFV DRFGSRLTGT KNLEDSIDYM IDLLKQEGHD NVHGEPVQVP KWTRGNEWAR MIKPRDKKLN ILGLGYSEGT NGQTIEAPIV WRNFTELEQ KAGLIPGKIV VYNFKYESYG KQAIYRHSGA SRAAKFGAVA AMIRSLTPFS IDSPHTGMQS YDVNVTKI PA ISITTEDADL FQRFSDRNEE VIVQIYSENH NEKDKGISRN TVSDVRGEKY PNEIVLVSGH IDSWDVGQGA SDDGAGAFIS WRALSVIKKL GLRPKRTLRS VLWTGEEFGL IGVYDYIKKH RNELKDYVIA MESDIGTFTP RGITYSGKNS TSQCTLWEIL QLMHPINATT LTISTEGSDV QAFYENGVPI SSLDTANDKY FYFHHTQGDT MTVEQPDDLD KCQALWTSVS YALAMLDDRL SR

SEQ ID NO: 30; Cluster ID (L) 61; Cluster ID (A) 30

Protein name: A0007; Species: Dermatophagoides farinae

MAKFNYLPVD VQEELRNTAN AIVSVGKGIL AADESTGTIG KRFADINVEN VEPNRRAYRQ LLFYSENIEQ YISGVILFDE TVYQKDDNNT PFPELLKKKG IIPGIKVDTG VVTLQGTNGE STTQGLDNLT KRCQEYYNHG CRFAKWRCVL KIGKDEPSAL AILENANVLA RYASCCQQAR IVPIVEPEIL PDGDHDLERC QKVTETVLAA VYKALNDHHV YLEGSLLKPN MVTPGQSCPQ KASPQDIARA TVTALQRTVP AAVPGVVFLS GGQSEEEASV NLNAINQYQG KKPWALSFSY GRALQASALR AWQGKPENIS AGQKEFLQRA KANSLSAQGQ YTGGVVGAAA DQDLFIKDHQ Y

SEQ ID NO: 31; Cluster ID (L) 61; Cluster ID (A) 30

Protein name: A0007; Species: Dermatophagoides pteronyssinus

MAKFNYLPVD VQEELRNTAN AIVSVGKGIL AADESTGTIG KRFADINVEN VEQNRQAYRQ LLFYSEGIEQ

YISGVILFDE TVYQKDDKGV PFPELLKKKG IIPGIKVDTG VVTLQGTNGE STTQGLDNLT KRCQEYYNQG

CRFAKWRCVL KIGQDEPSSL AIVENANVLA RYASCCQQAR IVPIVEPEIL PDGDHNLERC QKVTETVLAA

VYKALNDHHV YLEGTLLKPN MVTPGQSCPQ KASPQEVAQA TVTALQRTVP AAVPGIVFLS GGQSEEEASV

NLNAINQYQG KKPWALSFSY GRALQASALR AWQGKPENIG AGQKELLQRA KANVLAHKGQ YVAGSIPSLA SAKSNFVAQH KY

SEQ ID NO: 306

Protein name: A0007; Species: Blomia tropicalis

MSIIQNLPAD VQEELRKTAN AIVTPGKGIL AADESTGTIG KRFADINVEN VENNRRTYRD LLFSAPDEVN NYISGVILFD ETVYQKNAAG VPFPQVLAKR GIIPGIKVDT GVWLQGTNG ESTTQGLDNL TKRCQAYYEQ GCRFAKWRCV LKIGDNEPSP LAILENANVL ARYASCCQQA RIVPIVEPEI LPDGAHDIER CQKVTEKVLA AVYKALNDHN VFLEGTLLKP NMVTAGQSFA GPKPSPQEVA RATVTALQRT VPAAVPGIVF LSGGQSEEEA SINLNAINQF EGKKPWALSF SYGRALQASV LRAWQGKDEL IAAGQKELVN RSKANSDASL GKYSGGIVGA AGEQDLFIKD HQY

SEQ ID NO: 7; Cluster ID (L) 44; Cluster ID (A) 67

Protein name: A0008; Species: Dermatophagoides farinae

MSANTERTFI MLKPDAVQRG IVGEI IRRFE AKGFKLVAMK FMMASEDLLK KHYADLAARP FFPGLIKYMQ MGPVVPMVWE GLNAVKTGRV MLGETNPAES KPGTIRGDLC IQTGRNI IHG SDSVETAKRE IDLWFRPEEL VDYKPSQYEW VYEN

SEQ ID NO: 8; Cluster ID (L) 44; Cluster ID (A) 67

Protein name: A0008; Species: Dermatophagoides pteronyssinus

MSANTERTFI MLKPDAVQRG IVGEI IRRFE AKGFKLVAMK FMMASEDLLK KHYADLAARP FFPGLIKYMQ MGPVVPMVWE GLNAVKTGRV MLGETNPAES KPGTIRGDLC IQTGRNI IHG SDSVETAKRE IDLWFRPEEL VNYKPSQYEW VYEN

SEQ ID NO: 43; Cluster ID (L) 58; Cluster ID (A) 6

Protein name: A0009; Species: Dermatophagoides farinae

VVIKVENLPE RCDYSQCPKW DPNDINVHLV AHTHDDVGWL KTVEQYYYGL KNDIQRAGVQ YILDTVIEEL

IRNKQRRFIY VEIAFFWKWW QEQDEDQRMI VRELVRTGQL EFINGGWSMP DEAATHYNSL IDQSTWGLRQ

LNDTFGKCGH PKVTWQIDPF GHSREMANLY AQMGYDALFF ARQDYQDREN RMTNRKLEHV WQGSDDLGTA

GDIFTGMMFS GYGPIEFNWD ITNGPEDAVV DNPESEEYNV PDKIRRFVEK AKYFAQYYAT NHFMFPMGTD FQYGDAHTWF KNLDKLIKAV NNAGKGVRAF YSTPSCYARA LYETNRTWTT KTDDFFPYAS DEHAYWTGYF TSRPALKRME RMGNNLLQAC KQLDILAGND GRFEMNITRL REAMGVMQHH DAVTGTEKQH VAFNYAKMLD SAMLQCRHVI SESYRKLFPT QTKEQHEFCP YLNISSCPST EMGESRTIHL YNPLGHRLVN RTIRVPVKDG YYYQVRDQND HSIPAVLISI PEFVRKI PGR KSVATKELVF RVPIIESLGI RRFHMIATKE KQQDSAVEIQ GEKFVGHKGQ RFQLKDGLI I EFDSNGKIAT MIRNNQSISI SNEFRLFHGA DIGRHSGAYI FRPSEQKTFP VTEKMEATLY VDQKFGIVQE VHQQFDSFVG QIIRLDKQGD YVEFDFVVGP I PVDDLIGKE I ITRYNTNLA NDETFFTDSN GRQMLRRRWN YRPSWKYEIE EPVSGNYYPV NSRIAIRDDR KSLQMTIMTD RSQGGSLSPE QINGSVDLMV HRRLLHDDYF GVDEPLNEPG VDGHGIVIRG RHLLLLDTLE KAAEKHRPLA QEMFMEPIIS FTSSMEKNQP IYKGLTKDLP GNVHLLTLEQ WHSKRYLLRL EHFYQRFEDP SLSNPATVSL RHLFQSFEIT AVEELTLGAN QPISALKNRL QYRYIRPLNE QQSSIITDPI IEGENFDIHL EPMQIRTFLI DIKRN SEQ ID NO: 44; Cluster ID (L) 58; Cluster ID (A) 6

Protein name: A0009; Species: Dermatophagoides pteronyssinus

VVIKVENLPE QCDYTQCPKW SKDDINVHLV AHTHDDVGWL KTVEQYYYGL KNDIQRAGVQ YILDTMIEEL

IRNKDRRFIY VEIAFFWKWW QEQNEEQRMI VKELVRTGQL EFINGGWSMP DEAATHYNSL IDQSTWGLRQ

LNDTFGRCGH PKVTWQIDPF GHSKEMANLY AQMGYDALFF ARQDYQDREN RMSNRTLEHV WQGSDDLGEI GDIFTGMMFS GYGPIEFNWD ITNGPEDAVV DNPESEEYNV PDKIRRFVEK AKYFGQFYAT NHFMFPMGTD

FQYTDAHTWF KNLDKLINAV NKAGKGVRAF YSTPSCYAHA LYEQNRTWTT KTDDFFPYAS DEHAYWTGYF

TSRPAIKRME RIGNNLLQAC KQLDVLADNN GRFEMNLTKM REAMGVMQHH DAVTGTEKQH VAFNYAKMLD

SAMLQCRHI I NESYKKLLPK SSTSEHEFCP YLNISSCPTT EMGESRI IYL YNPLGHRLIN HTVRLPIKNG

YYYRIQDQNN QSVPSVLVPI PEFVQKIPGR KSVATKELVF RVPVIEPLGI TTMYMYVDKN EQPNSAIEIK GENPDDNDDK SKWLVLTKNL IVEFYSNGTI SRISIDKLHQ SISISNEFRL YHGAGGTGRH SGAYIFRPNE

QKTFPVTNKI KSTFFIDRKY HIVQEVHQQF DSSFVGQIIR MDKYNDNVEF DFVVGPIPVN DQIGKEI IAS

YKTDLENDET FYTDANGRQM LRRRWNYRPS WKYNVQEPIS GNYYPVNSRI AIRDEKQSLQ MTIMTDRSQG

GSLSPEQING SIDIMIHRRL LHDDYFGVGE ALNEPGVDGH GLVIRGKHLL LLNSIKQSAS EHRPLAQQMF

MEPIISFTSI ESNKQAEKQS NQYIGLNNDL PSNVHLLTLE QWHSKRYLLR LEHFYQSNED TELSKPVKLS LRHLFKSFEI IAVEELTLGA NQPISSLKNR LHYRYNRPLE QRQQQQSSLL LDDPKIEGEN FDIHLSPMQI RTFLIDIKRN

SEQ ID NO: 307

Protein name: A0009; Species: Blomia tropicalis

WIKVENLPA RCDYTKCPKS DPNKINVHLV PHTHDDVGWL KTVEQYYYGS KTYYQKAGVQ YILDSVMNEL IHNKERKFIY VETAFFWKWW MEQDYGMRNI VKELVETGQL EFINAGWSMN DEASTHYNSI IDQMSWGFYR

LQTTFGRCGV PKVAWQIDPF GHSKEQAALF ALMNFDALFF AREDWQEQSH RRKNRTLEHV WQASSDLGKS

ADLFTGMMNF GYGPPQGFNW DLVGGADEPV IDDPESDEYN VPRRVKELID LAKTYQKYYA TNNVMFPMGT

DFQYQDAHIY FKNMDKLIKY VNENSTEVNI FYSTPSCYAK SLKDSGKTFT AKNDDYFPYA SDPHSYWTGY

FTSRPAIKRF ERVGNNYLQV CKQMDTYTGH QATRDRHTTK LREIMGVMQH HDAVSGTEKQ HVAFNYAKHL QSGIESCRKV ISEAYQLLQH PHTKTVQTFC DYLNISSCAI TESGQNFWN IYNPLSKTLK NHPIRLPINS

DKYYNVVDDE GKSVYSELTF IPEYVQAIPE RTTNATTDLV FLAS I PPLGY ASYFVQATTT KSPDSANAVT

VTKITNETRL SSGNFSVVFD STGALSKVEL PSGESIPFKN EFRYYNGAAD NIRASGAYIF RPKEQQTFPF

AKLVSANLLT RTSSGGIVHE VHQKFDSNVE QVIRVLPDSD SIEFEYVVGP I PVKDGIGKE VVLTYETDFK

NNKTFYTDAN GRQMMKRKWD YRPEFKMEVT EPISGNYYPI NSRIYLQDEK KGMQMTILND RSQGGTSPRD GVIEIMVHRR LLHDDGFGVG EALNEPGVDN KGLI IRGRHL VQFSDIKTAA SKHRPKAQQL FMAPVLSFVP

DVSDYETYKR SHLTKYSALI NPLPEQIHLL TLERWMEGHF LLRLEHYFQT NEDAELSKPV TLNLKHMFKS

FKIFEAEELT LGGNQPIFET KHRMKFNYIP VENVTEPPEH SFDPTKLEVK LYPMQIRTFS VRV

SEQ ID NO: 41; Cluster ID (L) 10; Cluster ID (A) 7

Protein name: AOOIO; Species: Dermatophagoides pteronyssinus

LDSDPMKCNS IRNEDRIDCN PDPPISKEIC EQRGCCWNAG NNTDDGNLIS RALPHLGVPS CYYGENYIGY

KIEKIYIKDE DLSMTKLKRV RPSGFPKDIE NVNIEIHQLN DQVLRLKFID ANQKRYEVPT PKLNIPSVSK

SSNSRLYSTE ISGSHLIVRR RETNQSIFDI NLAQMVYSDQ LIHLTSKLPS KYIYGIGEHR EPFRKTTDWK

RYTQWTRDQV PISDHALYGS HPFYMMVENK TKLASGVFLF NSNAMDILTQ PSPAITFRTV GGILDFFIFF GPKPEQVVQQ YHNLIGLPAM PPFWSLGYQQ CRYGYNNFTN LNQTYWRTRQ AGIPMDVQWT DIDMFDSYND

FTYNHKQFKE LPDFIRNVLH KNGQKFI PMF DCGISSGEKA HSYRPYDYGV ELDIFVKNSS KQIFNGKVWN

GKSTVWPDFS HPNATKYWSK MFEEYHKI IE FDGAWIDMNE PSNFYDGQID GCPKTEIENP QYVPGMTDDS

LTLRHKTLCM TARHYNDQLH YNLHNLYGFQ EAIATNEALK TTLNKRPFI I SRSSAPGHGH WASHWDGDVI

SDWSSMRWTI PSILNFNLFG VPMIGADICG FNGDTTVELC RRWYQLGAFY SFVRNHNTDN AIDQDPVALG ETVVRTARSA LTYRYAFLPY LYTLFYNVHQ NGGTVLRPMF FEFPDDDHLY DIETQFMWGD SMLIAPILYP

NQTENKVYLP KGTWHNMRQT FESQGQYFTI KDSLDDINYV FFRSGSIIPI QGPQNNTEMM KSKDFGLVVI

LDSKNPEPYA KGSLYLDSGD SLDPVKKGEY NFYNFEVKNN TLTIESQHLG YQTNQSIIIL EILGIDRKPT

SIIFDGKPYY QFIYTTNNML I IQTKLS IFN DNDKSKKIHY QFEWKFN

SEQ ID NO: 42; Cluster ID (L) 10; Cluster ID (A) 7

Protein name: AOOIO; Species: Dermatophagoides farinae

DSLKCSSIRN EDRIDCNPDP PISKNVCEQR GCCWKTAGND LKNLSSKVLP NLNVPYCYYG ENYIGYKIEK

HSKNLIQLKR NRSSGFARDI ENINIEIHEL NDKVIRLKFI DANKKRYEVP I PKLNLPSTT SSSSSNSRLY

SVELDGSHLI VRRRETNQS I FDINLAYMVY SDQLIHVTSR LPSKYIYGLG EHRAPFRKNT NWKRYTQWTR

DQYPVTDKAL YGNHPFYLTV EDESPKKSAS GVFLFNSNAM DIITQPSPAI TFRTIGGILD FFVFFGPKPE DVISQYQNLI GLPAMPPFWS LGYQQCRYGY NNFTNLNTTY TRNRAVGIPM DVQWTDIDAF NSNNDFTYDH

KRFKELPDFI NNVLHPNGQK FIPMFDCGIS SGEPAGSYKP FDSGVELDVF VKNSSNKIFR GKVWNGKSTV

WPDFSHPNAT EYWMDMFAEY HKTIAFDGAW LDMNEPSNFY NGEEHGCPES EIENPQYVPG MTDDSLTLRH

KTLCMTARHY NDQLHYNLHN LYSLSMAMAT NAALTKLNKR PFIISRATAP GHGHWAYHWN GDILSDWSSM

RWTIPSILNF NMFGIPMVGA DICGFGGNTA EELCIRWYQL GAFYSFARNH NDIHSIDQDP AALGESVIRA ARSSLQYRYR FLAHLYTLFY HVHKNGGTVL RPMFFEFPHD EHTYEIETQF MWGDSVLIAP ILYPNQTQHK

IYLPKGTWYN RKVSFESQGQ YITMNDSYDD IDYVFVRGGS I I PTQEPHDN TELMKTKDFL LIVALDNQTS

YAKGSLYWDS GDSLNPDKTG HYNFYNFDAV NNTLTIQSQW LGYQTTQNIN FINILGVPKL PTSFKLNGHV

SDPRI IRFNY DEQTNILTVE TKLPIYNQDS SSHDRIHYQF EWIME

SEQ ID NO: 308

Protein name: AOOIO; Species: Blomia tropicalis

QCMAIPPNSR IDCNPDPPIS AEVCQSRGCC WMPSSNESSE NMNLLKKNVL PPLNVPYCFF GSDYHGYNVS

NVQTINDNQK VINLQRIRDS GFVNDVKNVR IQIDELSSNV LRIKMIDSDS SRYEVPIPVL NLPKRNEVLE

SLNEKMYQVE MNSTDFMLTV YRAKTKAIVF NVNLGQLIYS NQFIQITNKL ASNFIFGIGE NRESFRKLTN WKRYTLFARD QWPVPDRALY GSHPFYLATE SDNSSHGVFL FNSNAMDIIT QPMPAITYRT IGGILDFFLF LGPTSENVIE QYHQLIGLPT MPAYWTLGFH LSRYGYRNLS NLEKTFRRTR KAEIPFDVQW TDIDMFDSNN DFTYDRKRFD GLPKFIEHLH SINMRFVPMF DCGISSGEHP PQSYLPYKMG LEMNVFVRNG TNQPFEGKVW NSKSTVWPDF THPNATKYWT RQFAEYHKTI QFDGAWIDMN EPSNFLDGAF NGCPTNSTLE TPQYTPGMVE DSLTLNHKTL CMSARHS IGL HYNLHNLYGI SEAIVTKSAL ESVLKRRSFI LSRSTAPGHG HFAAHWDGDI LSDWPSMKWS ISSILNFNIF GVPLIGADIC GFNGNTTIEL CARWHQLGAF YTFVRNHNTD NAIDQDPVAL GPLVVKAAKN ALKLRYALLP YLYTQFYRVH RKGGTILRPL FFEFVHDQVV LEIETQFMWG SSIMVAPALS INETETSVYF PSGTWFHSYN FTRINTIGKF LPQLASFDYP NVYFRAGSI I PTLRPMLTTD ETHSGNFTLL VALSNENGHA EGDLYLDSGD GLDTEVLGHY NLYSFKVEKK ILEIKSSHLG YSTEQMIDNV LILGIDKSPI EIKINGRSMK SWSYSKNKIH INSLNLPLYD LKTIDKSKLI QIHYQIEWV

SEQ ID NO: 39; Cluster ID (L) 64; Cluster ID (A) 16

Protein name: AOOll; Species: Dermatophagoides farinae

KKAPEGCFRA AVLDHVHQTN VRQLSDFAKI IELNFKVYED AAALAKKQGA DIIVFPEDGL IYNIASREKA

DEFASDIPDG ETNACTLETK SVYNRLACLA QKHEIFVVAD LIDRKSCEEL GISNTSDSCP ADKKFLFNTA VLFDRQGKLL GRYHKMHLFG EMTMNI PPKP ELLVIDTELG RLGMQICFDM IFKTPGHFLA EQNKFDTMLF

PTWWFDEAPM LSSSQYQMAW AFGNNVTLLA SNIHRVELGS RGSGIYVGPH QTLATALYDD SVERLVLANV

PIKPRETDKS VCPLDSEIIE VPQQIPIPNS VKYHHLNMNL LDVTLVELSS KDSEFHICYK GVCCQIEYRL

AVKDQPRESW VDRVPLLANM LEYFTPEERY YLMVANRTRP GTYRWTEEIC AVVVCPSSRW NIGKVEKDCS

QFGSNQELNS RFVYAKLRGA FSESTAVYPS AVGPKNQLIN PENKWKYWKV NVPDKPEHFV ELGAKDNPES KAIELSTLAL YGRNYDLDPT YKQKPVPINL

SEQ ID NO: 40; Cluster ID (L) 64; Cluster ID (A) 16

Protein name: AOOll; Species: Dermatophagoides pteronyssinus

KSAPEGCFRA AVLDHVHQTD ARHLSNTAKI IDLNFKVYED AAALAKKQGA DIIVFPENGL IYSILSREKA

DEFASDIPDA EVNACTLDSK FVYNRLACLA QKHQMFVVAD LIDRKSCEEL GINNVSDSCP ADKKFLFNTA VLFDRQGKLL GRYHKMHLFG EISMNPPPKP ELLVIDTELG RLGMQICFDM IFKTPGYLLA QENKFDTMLF

PTWWFDESPM LSSSQYQMAW AFGNNVTLLA SNIHRIEVGS RGSGIYVGPH RTLAAALYDD SVERLVLANV

PIKPKETDQS ACPLDSEIIE VPQQIPIPKS VKYHHQNLNL KDVTLLQLSS NESEVHLCHK GVCCQFEYRL

AMKDQPQESW VDRVPLLANM LHYLTPEERY YLLIANRTRP GAYPWSEEFC AVVVCPSSRW NFGKMQKDCS

KIGSNQELSS RFVHAKLRGK FSEDTAVYPS AVGSKNQLIY PENKWKFWKV NVPNEPEYFI ELGAKDNSES RAMELGALVL YGRNYNRDPR YEQKALPIN

SEQ ID NO: 309

Protein name: AOOll; Species: Blomia tropicalis

GCFRAAVLDH VHQSSRNGGG TKENIKLNLK LYETAAKTAK EQGADI IVFP ENGIVYGIGS RANALKYGEI

LPESKTSMCT DSYASSHPIA YQLACLAKEH QMFVAADMID VQTCQTKSCP IDKKYAFNTA VLFDRNGYLL

GKYHKMHPFG ELQFNVPPKD ELVVIETEIG RLSMQVCFDL IYNKPGVVLA SQDKIDTMLF PTWWFDELPF

LAASQYQMSW AFGNKINLLA SNIHLVAVGS KGSGIFAGGH GQFEVISEPD AKARILVATL PINARSDAQC

SMDSKKIEVP QMVPIPSNVI YNYQMMNLTE NTVKKLDPSM EAISACDGGV CCQLNYQMDQ SSIKSDEEYY LIVTNRTRPG AYPWTEEYCG LVLCPHMTKL DTCKQISSNN PLQTKFLYAK LSGEFSSETH VYPSVIGSEH KVLPKDGGLW TYEDEKTDVG AKKQKFFITF GNKEERKSYT ISTIGLYGRV YARDPPYEQK PL

SEQ ID NO: 22; Cluster ID (L) 103; Cluster ID (A) 19

Protein name: A0012; Species: Dermatophagoides pteronyssinus

QSRDRNNKPY RIVCYWGTWA FYRPGTGKFE AENVNPNLCT HLMYGFAKLQ NNKIALYDPD LDDGDEDWNS GLNWGHGMIR RMVNLRTYNP HLTTMISIGG WNEGSDKYSM MVRDPSSRKI FIQSVLDLLA EFDLDGLDFD WEYPSMKATG DNDRKPGRDE DKEDFITLLR ELHEAFQPHG YLLSSAVSAG KPTIDRAYNI PEVSKYLDFI NLMSYDYHGG WESHTGHNAP LNSYDNANEL DKEFTVTYSV DYWLSHGVDA KN

SEQ ID NO: 38; Cluster ID (L) 103; Cluster ID (A) 19

Protein name: A0012; Species: Dermatophagoides farinae

QSRDRNDKPY RIVCYWGTWA FYRPASGKFQ AENVNPNLCT HIMYGFAKLQ NNKIALYDPD LDDGDEDWNS

GLQWGHGMIR RMVNLRTYNP HLTTMISLGG WNEGSDKYS I MVRDPASRKI FIQSVLHLLA EFDLDGLDFD

WEYPAMQASG DSDRKPGRAE DKEDFVTLLR ELHEAFQPHG YVLSSAVSAG KPTIDRAYNI PEVSKYLDFI

NLMSYDYHGG WESHTGHNAP LNSYKNANEL DKEFTVTYSV EYWLNHGVDP KKLVLGI PLY GRTFTLAGSE HGIGAPTIGK GGESGTITRT IGMLGYNEIC TMIKQGWQLY RDEIERIPYA VHANQWIGYD DRESVNEKLN

LLMAKHLGGA MVWSIDTDDF VGNCVGVKYP LLRS ISKKLN NVDGPDPDIK RYHYHTSTAK PHTDGTTSTH

HDHKTTTTKH HKTTQPHHKT TQPHHTQTIT TTTERPHGKF QCHQAGFFAD PENPRKFHQC VDFGGHLKDY EFMCGEGTHY DEKLHICVR

SEQ ID NO: 310

Protein name: A0012; Species: Blomia tropicalis

DRNKLPHKW CYWGTWAFYR PGSDGKFEAE NINPNLCTHI NYGFAKLVGN KIALFDPDLD TGDEDWASGL TWGHGMIRRL NELRKYNKNL STLISIGGWN EGSNKYSTMV STAGGRSEFV KSVIEFLQKY EFDGLDLDWE YPGMSASGDA DRKPGREQDK ADYIELLKEL RQAFEPHGYI LSAAVSAGAP TIDRAYNVPE VSKHLHFINL MAYDFHGGWD TKTAHNAPLY ALPGAEGIDK EFTVSYAVEY WISKGADPKK LVLGI PLYGR TFTLAGPNHD IGAPVTGHGG QAGPITRLIG MLGYNEICSM VKNGWEIHWN DIQQIPYATH ASQWIGYDNE KSIEKKLDYV HQKNLGGGMV WSIDTDDFSG HCGVKYPLLK TISRRLNNID GPDVVIPRTH ATTPHPDDHD HTTKRPDDPH TDPHTEPHHD KTTSAPNPDG KFQCHSTGFF KDPSDPRKFH QCVDIGNGKL KDYEFNCPLG SHYDEQLHVC V

SEQ ID NO: 36; Cluster ID (L) 40; Cluster ID (A) 23

Protein name: A0013; Species: Dermatophagoides farinae

DTPANCTYED IKGLWLFEES TPINDRTEKC DNGRREYTKK IYVRLDFPNT AVDKFGNVGT WTLIYNQGFE

VI INYRKYFA FSAYERKSNS KVISYCHKTI PGWSHDLLGN NWACYIGHKV NDWNSSPLQK IGSEQFPIKE

HIEQPLYLKN IDLSHALSQN HVDQINSKQK SWKATVYPEM QSKTVEHLIK MAGGEKSRIM SRPKPIRATE

QQRHEARGLP ESFDWRNVDG INYVSPVRNQ GNCGSCYAFA SMAMLEARIR IATNNTAKPV FSPQEWDCS

EYSQGCDGGF GYLIAGKYAQ DFGWEESCY PYKAYTGKCK LDYNTTAKCQ QRTYTIKYNY LGGYFGACNE EAMRIELVKN GPIAVGFEVY KDFMTYRRGI YSHDSDYETE QKVGVEFNPF VLTNHAVLIV GYGRDEKSGE

NYWIVKNSWG EQWGIDGGYF LIRRGTNECG IES IAMAATP IPN SEQ ID NO: 37; Cluster ID (L) 40; Cluster ID (A) 23

Protein name: A0013; Species: Dermatophagoides pteronyssinus

DTPANCTYED IKGLWLFEET EPIKDRWEKC PEHQQQREKY SKKIFIRLDF PNVAVDKFGN IGEWTMIYNQ

GFEVKINYRK YFAFSAYERK SENNVLSYCH KTQPGWSHDV LGNNWACYVG HKVNNWNDDD VSKTTTVGAE KFPVKQHSER ELYLQNINVE HILSQKHIDH LNSQQKSWKA IVYPDLQSKS IEHLIQMAGG RKSRI INRPK

PLRATEQQKQ LARSLPESFD WRNLNGIDYV SPVRDQGKCG SCYTFASMAM LESRIRIQTN NTFKPIFSTQ

EWDCSEYSQ GCDGGFSYLI AGKYAQDFGV IDESCYPYKG VTGKCQNQQN FNQTNEKCKQ RTYTIDYKYV

GGYFGACNEE AMQIELVQNG PIAVGFEVYG DFFGYSEGIY SHQPSNESND QHQQIKAEFN PFEMTNHAVL

IVGYGKDKKT GEKYWIVKNS WGKQWGMDGY FWMRRGTDEC AIESLAMAAT PIPN SEQ ID NO: 311

Protein name: A0013; Species: Blomia tropicalis

DTPANCTYED IRGEWEFHET ERIASRKEVC DDNSVSTTKH TVYLKLEFPN IATDQHGNVG HWTI IYNQGF

EVS INYRKYF AFSLYKQVGK QVTSYCDSTF PGWSHDVLGN NWACFKGRKV NRQQEKSFDE TMINNGKTHT

VQPFLLESVP VNHNLIQMNV NKINMKQSSW KAKFYPHLMN LNTEDLIRMA GGRGSAIVNR PSTVPASEEI KEKVRQLPES FDWRNVNGIN YVSPVRDQGK CGSCYIFSSM AQLEARVRIA TNNSEQPIFS TQEVVDCSKY

SQGCDGGFPY LIAGKYGRDY GVIADECYPY KGKNGKCSLP YNSTGTKCMK RSYTLHYHYV GGYYGGCNEE

LMLLELVKNG PITVGFEVYD DFTSYSGGIY SHDKSKDQWR NGVHFNPFQL TNHAVLIVGY GVDKQSGEKY WIVKNSWGKD WGLDGYFWIK RGNDECGIES LAVSVTPIP

SEQ ID NO: 32; Cluster ID (L) 33; Cluster ID (A) 25

Protein name: A0014; Species: Dermatophagoides farinae

IEQVHISLGT NATEMIVTWT EPQKHTDIDI DAVVYYGRAS SSFDQAAIAK SEHFKDDETK YTTFRALLTG

LESDTRYHYK IQLDDKESS I FAFKTLKLDE NWLPRFAIYG DLGYVNEQSL PYLKKDVEKN MFDVIFHIGD

IAYDLQDENG EVGNNFMRS I ESIASKIPYM TCPGNHERHS NFSHYDSRFS MIGDRSQPNH QDSLDKRINN

HFHSMEIGPA TIIMFSTEYY YYTYYGWEQI ERQYRFLEKE LIRANENRNK RPWI IAMGHR PLYCLKMGDS SCDHQTMERP EIRQGIRMHD QGERQYGLED LFHKYGVDIQ FYGHEHFYAR MFPIYKYQMY KGKQSDNPYD

HADGPIHITT GSAGNKEIHP LFNHLKEWVA HHFYDYGYTR LIFENQYRIR LQQVSDDQHG KVLDEIEIIK SSPQPHWMP

SEQ ID NO: 33; Cluster ID (L) 33; Cluster ID (A) 25

Protein name: A0014; Species: Dermatophagoides pteronyssinus

IEQVHIALGS NETEI IVTWT EPHKHDDKTS DAVVYYGQAK SSFDQKVKAI SEYFKDDKTK YTTYRALLTG

LLPGTEYHYR IQMDDLESS I FEFKTLKTGE ENWLPRFAIY GDLGYVNEQS LPYLKKDVEQ NLFDVIFHIG

DFAYDLNDEH GKVGHHFMRS IEPVASKVAY MTCPGNHERH DNFSHYDSRF SMIGDRSQPI HSDKLNKRLN

NHFHSMTIGP ATI ILFSTEY YYYTKYGWQQ IEHQYRWLEQ ELKRANENRQ KHPWIIVMGH RPLYCLKMGD

DSCDHQTMER KEIRQGIRMH DEGERQYGLE DLFFKYGVDI QFYGHEHFYA RLFPIYKYKM YNGTKSKNPY DHPGAPIHIT TGSAGNKELH PEFNHLNDWV AEHFYDYGYT RLMFEDKYRI RLQQISDDQH GKVLDEIEIV KSSPQPHWMN VEHH SEQ ID NO: 26; Cluster ID (L) 25; Cluster ID (A) 34

Protein name: A0015; Species: Dermatophagoides farinae

SPTS IRTFEE FKRQFNKQYQ SIEHEEIARK NFQETLRYVQ ANQDKAVINE YADLSAEEFA DGYLMNVQDV QDLEAEMDAH KEYFDDPDCK LHGDFNPPKE FDLRPHLTPI KKQIKNCGCC WALSTISCVE TAYLAQKNVS LQLSTQELVN CAKEHGCKKG TVLDGIEYIM ANGTTTEEAC PFISEESTCD QSKKPRYEIS NWCYFKPVED DIRKNLVLRR TSVSVSMNIE NLKAFVHYDG SFVIRENSFP SIGNKSYHAV NIVGFGTKDD IDHWIVRNSW GEKWGDKGYF YVERDINLWG IKDWAFTTIV

SEQ ID NO: 27; Cluster ID (L) 25; Cluster ID (A) 34

Protein name: A0015; Species: Dermatophagoides pteronyssinus

SPTGWNIRTF EQFKIQFNKH YDS IEQEEHA RENFLETLKY VDANPDKAVI NEFADLSAEE FADGYLMSEE SMQDSEQQLK LLRAGYDYHD DPECLFDENL EAPKQVDLRP DLSPIMRQTL HCGCCWAISP ISSAESAYKA RYNVSIQLSV QELVNCAVEH GCEIGKTAIA FNYLVTNGTT TQKAYPYTAK EGACNPPEKP RYTLENWCAY IDPSIKNKNK PDLRKVLAQK RTSITVQISI KNVKAFAHHN GSFI IRENSF PDEGKPSGHA INIVGYGTKD GVDYWIVRNS WSTGWGDKGY FYVERGVNWW GIEEYAFIAT F SEQ ID NO: 312

Protein name: A0015; Species: Blomia tropicalis

IKTFEQFKKV FGKVYRNAEE EARREHHFKE QLKWVEEHNG IDGVEYAINE YSDMSEQEFS FHLSGGGLNF

TYMKMEAAKE PLINTYGSLP QNFDWRQKAR LTRIRQQGAC GSCWAFAAAG VAESLYSIQK QQSIELSEQE

LVDCTYNRYD SSYQCNGCGS GYSTEAFKYM IRTGLVEERN YPYNMRTQWC DPDVEGQRYH VSGYQQLRYQ SSDEDVMYTI QQHGPWIYM HGSNNYFRNL GNGVLRGVAY NDAYTDHAVI LVGWGTVQGV DYWIIRNSWG TGWGNGGYGY VERGHNSLGI NNFVTYATL

SEQ ID NO: 28; Cluster ID (L) 43; Cluster ID (A) 39

Protein name: A0016; Species: Dermatophagoides farinae

MVKIGINGFG RIGRLVLRAA VKKGVEVVAV NDPFLDVKYM VYMFKFDSTH GRYQGEVKEE GGLLVVDGQK IQVFQERNPA DIPWGKVGAD YVVESTGVFT TIEKAKAHLA GGAKKVVISA PSADAPMYVM GVNHDKYDPS

QQIISNASCT TNCLAPLAKV INDKFGIENG LMTTVHAVTA TQKTVDGPSG KMWRDGRGAG QNIIPASTGA

AKAVGKVIPE LNGKLTGMAL RVPVPDVSVV DLTVTLKNPA SYDEIKAAIK AAAESDHWKG ILEYTDEEVV

SSDFISDTHS SIFDAKAGIA LTPTFVKLIA WYDNEFGYSN RVIDLIKYVA SK

SEQ ID NO: 29; Cluster ID (L) 43; Cluster ID (A) 39

Protein name: A0016; Species: Dermatophagoides pteronyssinus

MVKIGINGFG RIGRLVLRAA IKKGVEVAAI NDPFLDVKYM VYMFKFDSTH GRYQGEVKEE GGLLVVDGQK

IQVFQERNPA EIPWGKVGAD YVVESTGVFT TIEKAKAHLA GGAKKVIISA PSADAPMYVM GVNHDKYDPK

QQIISNASCT TNCLAPLAKV INDKFGIENG LMTTVHAITA TQKTVDGPSG KLWRDGRGAG QNIIPASTGA

AKAVGKVIPE LNGKLTGMAL RVPVPDVSVV DLTVTLKNPA SYDEIKAAVK AAAESDHWKG ILEYTDEEVV SSDFISDTHS SIFDAKAGIA LTPTFVKLIA WYDNEFGYSN RVVDLIKYVA SK SEQ ID NO: 21; Cluster ID (L) 13; Cluster ID (A) 49

Protein name: A0017; Species: Dermatophagoides farinae

MSSSSGKKYD FSGKVALVTG SSSGIGAAIA VQFAQYGAKL TITGRDGAAL ESVAKKIEIE SGHQPLQIVG DLLDQSLPAK LINETVSKFG RLDFLVNNAG GSTAHRELND EKLMEAFDKV FALNVRAVLQ LSQLAAIHLE KSKGNIINIS SIVSMKPYGH VYSSSKAALD MITKTLAKEL GLKGVRVNS I NPGPVATGFL RSVGMSATAY TDLADTMINH TLLKFLAQPD EIANLASFLA SDDARNMTGS IVVSDTGSLL V

SEQ ID NO: 25; Cluster ID (L) 13; Cluster ID (A) 49

Protein name: A0017; Species: Dermatophagoides pteronyssinus

MSSSSGKKYD FSGKVALVTG SSSGIGAAIA LQFAQYGAQV TITGRDAAAL ESVAKRIEAE SGHQPLQIVG NLLDQSLPAK LIDGTISKYG RLDFLVNNAG FSTQHRDIHD EKLMEAFDQV YGLNVRAVVQ LSQLAATHLE KSKGNIINIS SNLSMMPVHI IYSSSKAALD MITKTMAMEF GKKGVRVNS I NPGPVATQFM RSLGMPVTFL KENEEFVKEL TLLKFVAQPV EIANLASFLA SDDARNMTGS IVVNDTGSLL APRVDFKKLD EIKKK

SEQ ID NO: 313

Protein name: A0017; Species: Blomia tropicalis

SLTNKKYDFS GKVALVTGSS SGIGAAIAIQ FAQYGAKVTI TGRNAENLDK IAKKIAEVSN GVEALQI IGD

LTIDDSLPKR LIDETVTKFG RLDFLVNNAG GATPQGTLAS PDLLKGFDDV FKLNVRSVIE LTQLAMPHLE

KTKGNI INIS SVASIKPYMV VYSSSKAALD MITKTSALEL GPKGIRVNSI NPGPVVTAFG RSMGVDPSHH

KKMFDSFEKQ MLMERVGQPE DIANLASFLA SDDAINITGS IMVNDSGCLL

SEQ ID NO: 5; Cluster ID (L) 97; Cluster ID (A) 71

Protein name: A0018; Species: Dermatophagoides farinae

MVKAWVLKG EPNVTGTIFF EQQDNGPVKV SGTVQGLKSG LHGFHVHEFG DNTNGCTSAG AHYNPFNKTH GAPADEERHV GDLGNVEAND AGIANVAIED SLISLTGERS IVGRSLWHA DPDDLGRGGH ELSKTTGNAG GRLACGVIGV TK

SEQ ID NO: 6; Cluster ID (L) 97; Cluster ID (A) 71

Protein name: A0018; Species: Dermatophagoides pteronyssinus

MVKAWVLKG DPNVSGTIFF EQQDNGPVKV TGSVQGLKPG LHGFHVHEFG DNTNGCTSAG AHYNPLNKTH GAPNDEERHV GDLGNIEAND KGVANVVIED SLISLTGEKS IVGRSLWHA DPDDLGRGGH ELSKTTGNAG GRLVCGVIGV TK

SEQ ID NO: 314

Protein name: A0018; Species: Blomia tropicalis

KAVVVLKGDS PVSGTIFFEQ KDNGPVSVTG TVNGLTAGDH GFHVHEFGDN TNGCTSAGAH FNPFGKTHGA PADQERHVGD LGNVTADANG VANVNIQDSL ITLEGANTIV GRSLVVHADP DDLGRGGHEL SKTTGNAGGR VACGVIGLTK SEQ ID NO: 1; Cluster ID (L) 75; Cluster ID (A) 75

Protein name: A0019; Species: Dermatophagoides farinae

DGSHIVKAAR SQIGVPYSWG GGGIHGKSKG IGEGANIVGF DCSGLAQYSI YQGTHKTIAR TAAAQYNDNH CHHVAYGSHQ PGDLVFFGNP IYHVGIVSAH GRMVNAPKPG TKVREENIWS YHISHVARCW SEQ ID NO: 4; Cluster ID (L) 75; Cluster ID (A) 75

Protein name: A0019; Species: Dermatophagoides pteronyssinus

QVYCNGAAIV SAARSQIGVP YSWGGGGIHG KSRGIGEGAN TVGFDCSGLA QYSVYQGTHK VLARVASGQY SDPKCHHVAY GSHQPGDLVF FGNPIHHVGI VSAHGRMINA PHTGTNVREE NIWSDHIANV ARCW

SEQ ID NO: 315

Protein name: A0019; Species: Blomia tropicalis

QAMAGGHEIV TAARSQLGVP YSWGGGNWAG KSKGIDSGAH TVGFDCSGLA QYAVYHGTHK KIARVASAQY ADHQCHHVPY AQHLPGDLVF FNDGGS IHHV AIISGKNTMI HAPHTGDHVR EAAVYVKGRM STVQRCF

SEQ ID NO: 15; Cluster ID (L) 31; Cluster ID (A) 59

Protein name: A0020; Species: Dermatophagoides farinae

MSKPTFYFHP FSGPCRTVST VAKILNVEME MKKLDLLTQE HLKPEFLKVN PFHKI PTFVD TDGFTIDESR VIAMYLLQSR KPDSFLYPNN DLKKRTQIDR WLHYDISFAT IISTPMYCKF RGKPVQDHQV EQGKETLKTL DGVMASFGGK FLTGSDQITL ADIAMYFSCN TMEIYSEYFK FDDYPNLKSW YQRVAEALKQ YDTEGEI PKA IEMIKQFAQQ RMAESAKQ

SEQ ID NO: 16; Cluster ID (L) 31; Cluster ID (A) 59

Protein name: A0020; Species: Dermatophagoides pteronyssinus

MSKPIFYYHP FSGPCRTVST VAKILNVDME MKKLDLLTKE HLNPEFLKVN PFHKVPTFVD SDGFVVDESR

VIAMYLVESR KPDSFLYPKN DLKKRIQIDR WLHYDINLST TISAPMFCVF RGHQVQDYQV EQGKETLKTL

DGVMQSFEGK FLTGADQFTL ADIAMYFSLN TMEVYPKYFK FDDYPNLKSW YHRVAEALKQ YDTEGTI PKA IETMKQFIQQ RAAEAEKH SEQ ID NO: 316

Protein name: A0019; Species: Blomia tropicalis

MSKPTLYYMW ESPPCCTVIA IARILNIELD MKHVDLTKKD QNNPEFKKIN PFAIVPTFVE TDGYTLWESR AISTYLVQSR SPDSTLYPGS DLKKRSTIDK FLQYDLGTFN RAIYDVVSEI FKSGKLNEQN I PRLGEVLKT LEETLAANNE SNGGPFITGD DQLTIADISM HFSWTLLSLL PERLIDQSSY PTIRAWNQAV IQALKPYNRD QKFTEAQRRL KAFITMMIES AKN

SEQ ID NO: 19; Cluster ID (L) 105; Cluster ID (A) 50

Protein name: A0022; Species: Dermatophagoides farinae

EWRLVWQDEF NGNQLDLNQW SYEVGGNGWG NNELEFYTYN RTENARIENG NLVIDVRVEN YRERQFTSAR LHTRQAWTYG RFEARARMPY GHNLWPAIWM MPQDSIYGIW AASGEIDIVE YRGDNPDRIE GTAHYGGTWP NHIYSGSGPR SFSVNFSQDF HTFALEWDHK QLRWYMDNQQ YFTLDIDRML WSGKGVNPYT KNGQPFDQPF HWMLNVAVGG NFFGPGPYVT PDQARQWPKH TLEIDYVRVY QQ

SEQ ID NO: 20; Cluster ID (L) 105; Cluster ID (A) 50

Protein name: A0022; Species: Dermatophagoides pteronyssinus

NWQMVWQDEF NGGHLDQNHW EFETGGGGWG NNELEFYTAN RSQNVRVENG HLVIDVRVES YGGRDFTSGR

IHSKQAWAYG KFEARARLPS GHHLWPAIWM FPRDSKYGPW AASGEIDIME YRGDVHDKIE GTIHYGGQWP

NNIYTGSGPH HFNVDFSKDF HNFAVEWDTK EIRWYMDGNK YFSVNIDRNM WSGKGNNPYN KNGQPFDQPF

RWILNVAVGG NFFGPGPYVT PDQARHWQKH TMEIDYVRVY QWR

SEQ ID NO: 317

Protein name: A0022; Species: Blomia tropicalis

NWQLVWSDEF NGNGLDENNW NYQTGCSQQN DELECYTSHR HENVRVENGH LVIEARPEEY QGHHFTSGRL

HGKKAWAYGK FEARAKMPSG HHLWPAIWMM PRDSKYGGWA ASGEIDILEL RGDKPHEIVG TIHYGGSWPN

NIYHGSGERY YQQDFSQDYH TFAVEWDQKE IRWYVDGQHY HTENIDRNMW SGRGNNPYHK NGEPFDQPFY

WILNVAVGGN FFGPGPYVSP AEARNWHKRT MEVDYVRVYQ WR SEQ ID NO: 23; Cluster ID (L) 8; Cluster ID (A) 42

Protein name: A0023; Species: Dermatophagoides farinae

SPAQRPSLRG VTIRNAPFLE EIDGKFKGFI PDLMDAIAEK AGFDYTLYLS PDGRYGNADK EGNVTGMIGE VYNKKADFAA ADLTMTEARE NYITFTEPFM INQLAALIRR EDAEGMNTLE DLVNAGKTQP NHKPI ILGTL RNGATNHFLS KSDDPLAKKM YEQIKANDQS ATTS ISKGIE RVDKQGGYAF IMESSSAEHE IANNCKLTML LDWRNLYPRK YAFALPKDSQ YLQHFNNAIK QLNTEDKIAE LRRKYWSNNC SNTQTKNTGA

SEQ ID NO: 24; Cluster ID (L) 8; Cluster ID (A) 42

Protein name: A0023; Species: Dermatophagoides pteronyssinus

DPVQQRPTLR GVTVRVGPFV KENNGKFEGF IPDLVQAISE KVGFDYTLYL SPDGRYGNVI SDGNVTGMIG EVYNKKADFA AADLTMTEAR ENYITFTEPF MINQLAALIR REDAEGLNTL EDLAKAQETF PKRKRIVLGT LRNGATNYFL SKSDDPLAKK IYEQIKADDQ SVVKSISEGV ERVDKQGGYA FIMESASAEH EIANNCKLTM LLDWRNLFPR KYAFALPKDS PYLEHFNNAI KQLNSEGKIA ELRRKYWANN CAENKTKDDK N

SEQ ID NO: 11; Cluster ID (L) 36; Cluster ID (A) 65

Protein name: A0024; Species: Dermatophagoides farinae

MSISAHGGGL VNGIAGMENK FTVFTSGKPV SGLTVAFEGP TKPEINFNST KDGSVDVGYT PKAGGQYKIH IKYEGKEIVG SPFKCNISGD EATHRKLTEK VKVGGPNINA GKVNQDNQLT IDCKEAGITG GISFAMEGPA KVEVSFRNNN DGTITVIYKP PTPGDYKLHL KFNDIHLPGS PYPIVVAA

SEQ ID NO: 12; Cluster ID (L) 36; Cluster ID (A) 65

Protein name: A0024; Species: Dermatophagoides pteronyssinus

MSISAHGGGL VNGIAGMENK FTVFTSGKPV SGLTVAFEGP TKPEINFNST KDGSVDVGYI PKAGGQYKIH IKYEGKEIVG SPFKCNISGD ESTHRKLTEK VKVGGPNIST GKVNQDNQLT IDCKEAGITG GISFAMEGPA KVEVSFRNNN DGTITVIYKP PTPGDYKLHL KFNDIHLPGS PYPIVVSA

SEQ ID NO: 318

Protein name: A0024; Species: Blomia tropicalis

MSISAHGGGL INGIAGMENK FTVFTSGKPV SGLTVAFEGP TKPDINFNSA KDGSVDVSYT PKAGGMYKIH IKYDGKEIIG SPFKTNITGD EATHRKLTEK VKVGGPNVST GKANADNELT IDCKEAGITG GISFAMEGPA KVEVSFRNNN DGTITWYKP PQNGDYKLHL KFNDIHLPGS PFPIVVS

SEQ ID NO: 17; Cluster ID (L) 104; Cluster ID (A) NA

Protein name: A0025; Species: Dermatophagoides farinae

ESLFIYDDYS CGSYGHDVNE LIEQFQLFKK NEHNQNES IE I IGHFLKKIR EYRVEAIKVM LETDRKLLTL NNSQIILNIQ YQKKKIRCEN LKHLSELLTM HLLAYKQGMF DFAEEIDPDV NFDRQFKNFL DRSSEVMNIN EFSDIEKKWS NSSAKKLLKN DIDGLITALD DLREDFLKNI ILPEFDAQSR YDLYFSIQDQ INIRSTLKLF GTIKMFMKEL LDDLNQPDFE ILY

SEQ ID NO: 18; Cluster ID (L) 104; Cluster ID (A) NA

Protein name: A0025; Species: Dermatophagoides pteronyssinus

QSLFVDYNDY SCGSSQNETN ELIQEFKIFK KNINGNENFK KINDFIEKAR LFRDNAAKQM LEIDQQLLTL

NVIQISQRIK LENNKIQCEK LTKFSELLSM QLLAYEVGMF EFAEEIDPNI DFDRKMKNFL DETSRLFNLA

EFEKLEKKFR NATS IEKLKN YIDGELVALN DYINEFLKDI IMSEFTVQSR YYLNFS IEDQ VQIDSTLMTF SALKILLNDL KDYLEHLDN SEQ ID NO: 9; Cluster ID (L) 102; Cluster ID (A) 62

Protein name: NA; Species: Dermatophagoides farinae

NRVSVGVYYE TICSGCRTHF INAIVPLRQQ LGEYVDIDLV PFGNAHIYSN GPQCQHGALE CYGNAFQACS LDMNGFDTGF KLVECMFRSS YYSNPQYSAK RCAQQLNLNY DQLHSCATGQ KGFELIKVMA RKTPRHNYVP WTTVESRTVD VNVDLVKYIC DNYLNNVPAC N SEQ ID NO: 10; Cluster ID (L) 102; Cluster ID (A) 62

Protein name: NA; Species: Dermatophagoides pteronyssinus

TQRVTVGVYY ETICPGCRSH FIQAIVPLKN QLGQYVNIDL VPFGNAHFYS NGPQCQHGQL ECYGNAFQAC SLDMNGFETA FKLVECMFRS NYFSNPEYSS KQCSQQLNLD YQQLDSCANG QKGLQLIREM ANKTPSHQYV PWTTVQGRFV DGNVDLVDYI CENYLNGVPA CN Each of the above amino acid sequences (SEQ ID NOs: 1-44 and 305-318) can according to the present invention be modified by substituting each cysteine residue with at least either a serine residue, an alanine residue or a 2-aminobutyric acid (also known as a-butyric acid and homoalanine) residue. The sequences of the thus modified variants of SEQ ID NOs: 1-44 and 305-318 are set forth in SEQ ID NOs: 261-304 and 319-332. In embodiments of SEQ ID NOs: 261-304 and 319-332, all cysteine residues in an amino acid sequence are substituted with serine residues. In other embodiments of SEQ ID NOs: 261- 304 and 319-332, all cysteine residues in an amino acid sequence are substituted with an alanine residue. In other embodiments of SEQ ID NOs: 261-304 and 319-332, all cysteine residues in an amino acid sequence are substituted with 2-aminobutyric acid residues.

Further, in a group of embodiments of SEQ ID NOs: 261-304 and 319-332, more than 1 of serine, alanine and 2-aminobutyric acid substitutions can be present in the same amino acid sequence and in some embodiments all 3 substitutions are present in the same amino acid sequence.. SEQ ID NOs: 45-260 refer to 15-mer peptides of the invention that are fragments of proteins of SEQ ID NOs: 1-44:

SEQ ID NO: Source Species Peptide ID # Start pos Sequence

Protein ID

45 A0001 Der f/p 2344 131 IMRILCCKSRKVAPC

46 A0007 Der f/p 1800 4 FNYLPVDVQEELRNT

47 A0007 Der f/p 1805 94 ELLKKKGIIPGIKVD

48 A0007 Der f/p 1804 69 EQYISGVILFDETVY

49 A0007 Der f/p 1806 99 KGIIPGIKVDTGVVT

50 A0008 Der f/p 1360 9 FIMLKPDAVQRGIVG

51 A0008 Der f/p 1361 19 RGIVGEI IRRFEAKG

52 A0008 Der f/p 1362 24 EI IRRFEAKGFKLVA

53 A0008 Der f/p 1363 29 FEAKGFKLVAMKFMM

54 A0008 Der f/p 1364 34 FKLVAMKFMMASEDL

55 A0008 Der f/p 1365 40 KFMMASEDLLKKHYA

56 A0008 Der f/p 1366 49 LKKHYADLAARPFFP

57 A0008 Der f/p 1368 79 WEGLNAVKTGRVMLG

58 A0009 Der f 1710 387 I RLREAMGVMQHHD

59 A0009 Der f/p 1711 405 GTEKQHVAFNYAKML

60 A0009 Der f/p 1712 410 HVAFNYAKMLDSAML

61 A0009 Der f 1714 422 AMLQCRHVI SESYRK

62 A0009 Der p 1715 427 RHI INESYKKLLPKS

63 A0009 Der f 1716 447 EFCPYLNISSCPSTE

64 A0009 Der p 1718 470 LYNPLGHRLINHTVR

65 A0009 Der f 1722 507 LISIPEFVRKIPGRK

66 A0009 Der f 1742 647 IVQEVHQQFDSFVGQ

67 A0009 Der f 1757 778 LMVHRRLLHDDYFGV SEQ ID NO: Source Species Peptide ID # Start pos Sequence

Protein ID

68 A0009 Der f 1758 802 DGHGIVIRGRHLLLL

69 A0009 Der p 1767 842 EPIISFTSIESNKQA

70 A0009 Der f 1776 872 HSKRYLLRLEHFYQR

71 A0009 Der f 1777 878 LLRLEHFYQRFEDPS

72 A0009 Der p 1778 885 KRYLLRLEHFYQSNE

73 A0009 Der f 1775 888 TLEQWHSKRYLLRLE

74 A0009 Der f 1779 897 TVSLRHLFQSFEITA

75 A0009 Der p 1782 910 SLRHLFKSFEI IAVE

76 A0009 Der p 1783 915 FKSFEI IAVEELTLG

77 A0009 Der p 1786 930 ANQPISSLKNRLHYR

78 A0009 Der p 1785 990 ELTLGANQPISSLKN

79 A0010 Der f 305 38 GNDLKNLSSKVLPNL

80 A0010 Der f 307 43 NLSSKVLPNLNVPYC

81 A0010 Der p 306 44 DDGNLISRALPHLGV

82 A0010 Der f 309 63 YIGYKIEKHSKNLIQ

83 A0010 Der p 312 81 DLSMTKLKRVRPSGF

84 A0010 Der p 313 106 IHQLNDQVLRLKFID

85 A0010 Der p 314 111 DQVLRLKFIDANQKR

86 A0010 Der f 315 138 RLYSVELDGSHLIVR

87 A0010 Der f 316 158 QSIFDINLAYMVYSD

88 A0010 Der f 318 168 MVYSDQLIHVTSRLP

89 A0010 Der f 321 193 RAPFRKNTNWKRYTQ

90 A0010 Der p 324 241 TKLASGVFLFNSNAM

91 A0010 Der p 325 246 GVFLFNSNAMDILTQ

92 A0010 Der p 330 281 GPKPEQVVQQYHNLI

93 A0010 Der p 331 286 QVVQQYHNLIGLPAM

94 A0010 Der f 334 313 FTNLNTTYTRNRAVG

95 A0010 Der f 335 348 TTYTRNRAVGIPMDV

96 A0010 Der p 337 361 LPDFIRNVLHKNGQK

97 A0010 Der f 340 428 NATEYWMDMFAEYHK

98 A0010 Der f 341 433 WMDMFAEYHK IAFD

99 A0010 Der f 342 438 AEYHKTIAFDGAWLD

100 A0010 Der f/p 343 487 TLRHKTLCMTARHYN

101 A0010 Der f/p 344 498 RHYNDQLHYNLHNLY

102 A0010 Der f 346 508 QLHYNLHNLYGFQEA SEQ ID NO: Source Species Peptide ID # Start pos Sequence

Protein ID

103 A0010 Der f 345 508 QLHYNLHNLYSLSMA

104 A0010 Der f 347 508 LHNLYSLSMAMATNA

105 A0010 Der f 348 513 SLSMAMATNAALTKL

106 A0010 Der f 350 522 AALTKLNKRPFIISR

107 A0010 Der p 349 526 NEALKTTLNKRPFI I

108 A0010 Der f 351 528 NKRPFIISRATAPGH

109 A0010 Der f 352 548 HWNGDILSDWSSMRW

110 A0010 Der f 353 553 ILSDWSSMRWTIPSI

111 A0010 Der f 354 558 SSMRWTIPSILNFNM

112 A0010 Der p 355 571 PSILNFNLFGVPMIG

113 A0010 Der f 356 593 LCIRWYQLGAFYSFA

114 A0010 Der f 357 598 YQLGAFYSFARNHND

115 A0010 Der p 358 608 AFYSFVRNHNTDNAI

116 A0010 Der f 359 623 LGESVIRAARSSLQY

117 A0010 Der f 360 628 IRAARSSLQYRYRFL

118 A0010 Der f 361 633 SSLQYRYRFLAHLYT

119 A0010 Der f 362 638 RYRFLAHLYTLFYHV

120 A0010 Der f 363 643 AHLYTLFYHVHKNGG

121 A0010 Der p 364 681 DIETQFMWGDSMLIA

122 A0010 Der p 365 686 FMWGDSMLIAPILYP

123 A0010 Der f 366 728 YDDIDYVFVRGGSII

124 A0010 Der p 367 736 DINYVFFRSGSIIPI

125 A0010 Der p 368 741 FFRSGSIIPIQGPQN

126 A0010 Der f 375 816 TQNINFINILGVPKL

127 A0010 Der p 373 816 SQHLGYQTNQSIIIL

128 A0010 Der p 374 821 YQTNQSIIILEILGI

129 A0010 Der f 377 822 INILGVPKLPTSFKL

130 A0010 Der p 378 841 SIIFDGKPYYQFIYT

131 A0010 Der f 380 843 PRI IRFNYDEQTNIL

132 A0010 Der p 379 846 GKPYYQFIYTTNNML

133 A0010 Der p 381 851 QFIYTTNNMLI IQTK

134 A0010 Der p 382 856 TNNMLIIQTKLSIFN

135 A0010 Der p 383 861 IIQTKLSIFNDNDKS

136 A0010 Der p 384 873 DKSKKIHYQFEWKFN

137 A0010 Der f 369 885 HDNTELMKTKDFLLI SEQ ID NO: Source Species Peptide ID # Start pos Sequence

Protein ID

138 A0010 Der p 332 887 YHNLIGLPAMPPFWS

139 A0011 Der f 1859 32 ELNFKVYEDAAALAK

140 A0011 Der f 1860 57 EDGLIYNIASREKAD

141 A0011 Der f 1861 102 KHEIFVVADLIDRKS

142 A0011 Der f/p 1862 132 DKKFLFNTAVLFDRQ

143 A0011 Der f/p 1863 137 FNTAVLFDRQGKLLG

144 A0011 Der f 1865 152 RYHKMHLFGEMTMNI

145 A0011 Der f/p 1866 167 PPKPELLVIDTELGR

146 A0011 Der f/p 1867 172 LLVIDTELGRLGMQI

147 A0011 Der f 1868 185 LGMQICFDMIFKTPG

148 A0011 Der f 1870 212 TWWFDEAPMLSSSQY

149 A0011 Der f/p 1871 222 SSSQYQMAWAFGNNV

150 A0011 Der f 1880 427 ELNSRFVYAKLRGAF

151 A0011 Der f 1881 432 FVYAKLRGAFSESTA

152 A0011 Der f 1882 437 LRGAFSESTAVYPSA

153 A0012 Der f 2421 17 GTWAFYRPASGKFQA

154 A0012 Der f 2422 37 NLCTHIMYGFAKLQN

155 A0012 Der f 2423 42 IMYGFAKLQNNKIAL

156 A0012 Der f 2424 72 LQWGHGMIRRMVNLR

157 A0012 Der f/p 2425 74 GMIRRMVNLRTYNPH

158 A0012 Der f/p 2426 82 MVNLRTYNPHLTTMI

159 A0012 Der f 2428 107 KYSIMVRDPASRKIF

160 A0012 Der f 2429 117 SRKIFIQSVLHLLAE

161 A0012 Der f 2430 122 IQSVLHLLAEFDLDG

162 A0012 Der f 2431 161 KEDFVTLLRELHEAF

163 A0012 Der f 2432 177 QPHGYVLSSAVSAGK

164 A0012 Der f/p 2433 202 EVSKYLDFINLMSYD

165 A0012 Der f/p 2434 207 LDFINLMSYDYHGGW

166 A0012 Der f 2435 242 KEFTVTYSVEYWLNH

167 A0012 Der f 2436 257 GVDPKKLVLGIPLYG

168 A0012 Der f 2437 269 LYGRTFTLAGSEHGI

169 A0012 Der f 2441 347 EKLNLLMAKHLGGAM

170 A0012 Der f 2442 372 GNCVGVKYPLLRS I S

171 A0012 Der f 2443 377 VKYPLLRSISKKLNN

172 A0012 Der f 2445 457 HGKFQCHQAGFFADP SEQ ID NO: Source Species Peptide ID # Start pos Sequence

Protein ID

173 A0013 Der f 1096 13 GLWLFEESTPINDRT

174 A0013 Der f 1097 38 TKKIYVRLDFPNTAV

175 A0013 Der f 1099 63 LIYNQGFEVI INYRK

176 A0013 Der f 1100 68 GFEVI INYRKYFAFS

177 A0013 Der f/p 1101 73 INYRKYFAFSAYERK

178 A0013 Der f 1102 78 YFAFSAYERKSNSKV

179 A0013 Der f 1113 253 AMLEARIRIATNNTA

180 A0013 Der f 1115 287 DGGFGYLIAGKYAQD

181 A0013 Der f 1116 333 TYTIKYNYLGGYFGA

182 A0013 Der f 1117 353 MRIELVKNGPIAVGF

183 A0013 Der f 1118 363 IAVGFEVYKDFMTYR

184 A0013 Der f 1119 368 EVYKDFMTYRRGIYS

185 A0014 Der p 1006 31 DAVVYYGQAKSSFDQ

186 A0014 Der f/p 1009 110 GDLGYVNEQSLPYLK

187 A0014 Der p 1012 155 HFMRS IEPVASKVAY

188 A0014 Der p 1013 186 YDSRFSMIGDRSQPI

189 A0014 Der p 1014 211 NHFHSMTIGPATIIL

190 A0014 Der p 1015 221 ATIILFSTEYYYYTK

191 A0014 Der p 1016 261 KHPWI IVMGHRPLYC

192 A0014 Der p 1017 306 QYGLEDLFFKYGVDI

193 A0014 Der p 1018 311 DLFFKYGVDIQFYGH

194 A0014 Der p 1019 326 EHFYARLFPIYKYKM

195 A0014 Der p 1020 331 RLFPIYKYKMYNGTK

196 A0014 Der p 1021 336 YKYKMYNGTKSKNPY

197 A0014 Der p 1022 371 PEFNHLNDWVAEHFY

198 A0014 Der p 1023 381 AEHFYDYGYTRLMFE

199 A0014 Der p 1024 386 DYGYTRLMFEDKYRI

200 A0014 Der p 1025 391 RLMFEDKYRIRLQQI

201 A0016 Der f/p 1353 3 KIGINGFGRIGRLVL

202 A0017 Der f/p 404 9 YDFSGKVALVTGSSS

203 A0017 Der f 406 29 IAVQFAQYGAKLTIT

204 A0017 Der f 407 69 VGDLLDQSLPAKLIN

205 A0017 Der f 412 124 NVRAVLQLSQLAAIH

206 A0017 Der f 413 129 LQLSQLAAIHLEKSK

207 A0017 Der f 421 179 ELGLKGVRVNS INPG SEQ ID NO: Source Species Peptide ID # Start pos Sequence

Protein ID

208 A0017 Der f 425 219 NHTLLKFLAQPDEIA

209 A0017 Der f 428 247 MTGSIVVSDTGSLLV

210 A0018 Der f 2351 3 KAVVVLKGEPNVTGT

211 A0018 Der p 2352 93 VANVVIEDSLISLTG

212 A0018 Der f 2353 98 IEDSLISLTGERSIV

213 A0018 Der f 2354 103 ISLTGERSIVGRSLV

214 A0018 Der p 2355 108 EKSIVGRSLVVHADP

215 A0019 Der p 2065 3 YCNGAAIVSAARSQI

216 A0019 Der p 2066 8 AIVSAARSQIGVPYS

217 A0019 Der p 2067 53 SVYQGTHKVLARVAS

218 A0019 Der p 2068 86 GDLVFFGNPIHHVGI

219 A0019 Der p 2069 93 NPIHHVGIVSAHGRM

220 A0019 Der p 2070 98 VGIVSAHGRMINAPH

221 A0020 Der f 968 42 LKPEFLKVNPFHKIP

222 A0020 Der f 969 47 LKVNPFHKI PTFVDT

223 A0020 Der f 970 52 FHKIPTFVDTDGFTI

224 A0020 Der f 971 62 DGFTIDESRVIAMYL

225 A0020 Der f 978 157 QITLADIAMYFSCNT

226 A0020 Der f 979 162 DIAMYFSCNTMEIYS

227 A0022 Der p 2481 168 DTKEIRWYMDGNKYF

228 A0022 Der p 2485 238 QKHTMEIDYVRVYQW

229 A0023 Der f 178 7 SLRGVTIRNAPFLEE

230 A0023 Der f/p 179 87 EARENYITFTEPFMI

231 A0023 Der f/p 180 92 YITFTEPFMINQLAA

232 A0023 Der f/p 181 97 EPFMINQLAALIRRE

233 A0023 Der f 182 132 HKPI ILGTLRNGATN

234 A0023 Der f 183 137 LGTLRNGATNHFLSK

235 A0023 Der f 184 187 GYAFIMESSSAEHEI

236 A0023 Der f 185 207 LTMLLDWRNLYPRKY

237 A0023 Der f 186 212 DWRNLYPRKYAFALP

238 A0023 Der f 187 217 YPRKYAFALPKDSQY

239 A0023 Der f 188 228 KDSQYLQHFNNAIKQ

240 A0023 Der f 189 232 LQHFNNAIKQLNTED

241 A0024 Der f/p 1057 19 NKFTVFTSGKPVSGL

242 A0024 Der f/p 1058 129 TGGISFAMEGPAKVE SEQ ID NO: Source Species Peptide ID # Start pos Sequence

Protein ID

243 A0024 Der f/p 1059 154 ITVIYKPPTPGDYKL

244 A0024 Der f/p 1060 164 GDYKLHLKFNDIHLP

245 A0025 Der f 2455 49 IREYRVEAIKVMLET

246 A0025 Der f 2456 54 VEAIKVMLETDRKLL

247 A0025 Der f 2457 64 DRKLLTLNNSQI ILN

248 A0025 Der f 2459 74 QIILNIQYQKKKIRC

249 A0025 Der f 2462 94 LSELLTMHLLAYKQG

250 A0025 Der p 2480 148 GPHHFNVDFSKDFHN

251 A0025 Der f/p 2465 149 WSNSSAKKLLKNDID

252 A0025 Der f/p 2471 204 RSTLKLFGTIKMFMK

253 A0025 Der f/p 2472 209 LFGTIKMFMKELLDD

254 Cluster Der f 18 THFINAIVPLRQQLG

102

255 Cluster Der f 21 AIVPLRQQLGEYVDI

102

256 Cluster Der f 33 EYVDIDLVPFGNAHI

102

257 Cluster Der f 118 TGQKGFELIKVMARK

102

258 Cluster Der f 123 FELIKVMARKTPRHN

102

259 Cluster Der f 138 YVPWTTVESRTVDVN

102

260 Cluster Der f 153 VDLVKYICDNYLNNV

102

"Der f" denotes the species Dermatophagoides farinae

"Der p" denotes the species Dermatophagoides pteronyssinus

"Der f/p" denotes both species.

In each of the above sequences SEQ ID NOs: 45, 61, 63, 80, 100, 113, 147, 154, 170, 172, 191, 215, 225, 226, 248, and 260, cysteine residues (underlined and in bold typeface) may be substituted with serine, alanine or 2-aminobutyric acid; in different embodiments of SEQ ID NOs: 45 and 63, all cysteine residues may be so substituted, either exclusively with serine residues, exclusively with alanine residues, exclusively with 2-aminobutyric acid residues, or with a combination thereof. REFERENCES

Bret, Cooper and J . Feng and W. Garrett (2010), Spectroscopy. 21 (9) : 1534-1546.

Goodman R. et a/, Clin Transl Allergy. 2014; 4(Suppl 2) : P12

Haqqani AS et a/. (2008), Methods Mol . Biol . 439 : 241-56.

Henmar H et al., Clin Exp Immunol 2008; 153(3) : 316-23.

Ishihama Y, Oda Y, Tabata T, Sato T, Nagasu T, Rappsilber J, Mann M. Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein. Mol Cell Proteomics 2005; 4: 1265-72.

Trauger A. et a/. (2002), Spectroscopy. 16 (1) : 15-28.

Wells W et a/. (2006), Journal of Proteome Research. 5 (3) : 651-658.

EXAMPLE 1

This example includes a description of the identification of mite proteins extractable from mite fecal particles and/or mite bodies within a short extraction time upon being treated with neutral buffered aqueous solutions. Contrary to the relative long and more violent extraction conditions usually applied in the preparation of allergen extracts applicable for allergy immunotherapy, the present extraction conditions avoided mechanical manipulation, the extraction time was kept as short as 10 minutes and the extraction media was isotonic phosphate buffer with physiological pH . Using this extraction approach, there was identified HDM proteins releasable immediately and concurrently with known allergens, only. The short extraction time and mild extraction conditions were chosen to mimic the extraction of proteins/allergens potentially taken place on the respiratory mucosal surface in subjects exposed to mites. The identification of co-eluting proteins were then conducted using LC- MS/MS and transcriptomes of the two HDM species Der f and Der p. Homologous proteins to the Der f/Der p sequence were identified using transcriptomes of four other mite/storage mite species; Blomia tropicalis (Bio t), Glycyphagus domesticus (Gly d), Lepidoglyphus destructor (Lep d) and Tyrophagus putrescentiae (Tyr p) .

Preparation of extracts: 10 % (w/v) extracts were made using mite cultures of two different house dust mite species (Der p and Der f) and separately on the body fraction and the fecal fraction of the culture. In details, a sample of about 0.5 g was taken from each of the culture fractions and suspended in 5 ml of Phosphate buffer (PBS pH 7.2: 137 mM NaCI, 2.7 mM KCI, 8.2 mM Na 2 HP0 4 , 1.5 mM KH 2 P0 4 ), and then gently rotated for 10 minutes at room temperature. Larger particles were removed by filtering through a PD10 PE bed-filter followed by removal of smaller particles through a 5 μιη (Millex) + a 0.8 μιη (Millex) filter. Filtered samples were kept on ice.

LC-MS/MS: The four extraction samples were evaporated and 50 μg of each of the dried samples was re-suspended in 5 μΙ water. The samples were then denatured (6 M urea, 0.3 M N H 4 HCO3), reduced (9 mM DTT, 56°C for 15 min), alkylated (17 mM Iodoacetamide), and finally trypsin-digested (5 μg trypsin at 37°C, over night). Resulting peptides were then separated and analysed by liquid chromatography tandem mass spectrometry (LC-MS/MS).

Reverse phase liquid chromatography (Ultimate 3000 RSLC nano, Thermo) was performed using C18 pre- and analytical columns at a flow rate of 300 nl/min. The applied gradient consisted of a 220 min linear increase of solvent B from 4% to 55%, where solvent A = 0.05% v/v formic acid and solvent B = 80% v/v acetonitrile/0.04% v/v formic acid.

Peptides eluting from the LC were sprayed directly into an ESI-QTOF mass spectrometer (MaXis, Bruker). Spectra were acquired in the mass range 50-2200 m/z at 2 Hz and MS/MS sequencing at a spectral rate of 4-16 Hz. Data analysis: Data processing (compound finding and charge deconvolution) was performed using DataAnalysis 4.2 (Bruker). Proteins were identified by searching the MS/MS spectral data against a database (see section below) using MASCOT 2.2 (Matrix Science) and X! Tandem search engines at the following parameters: Enzyme = trypsin, Max missed cleavages= 2, Fixed modifications= carbamidomethyl (C), Variable modifications= oxidation (M), Peptide mass tolerance= 10 ppm, Fragment mass tolerance≤ 0.1 Da. False discovery rate (FDR) was <2% (average of 0.54%).

Database: The database used for protein identification was compiled based on in-house transcriptomes of the two HDM species Der f and Der p as well as in-house transcriptomes of four other mite/storage mite species; Blomia tropicalis (Bio t), Glycyphagus domesticus (Gly d), Lepidoglyphus destructor (Lep d) and Tyrophagus putrescentiae (Tyr p), prepared as follows:

RNA-sequencing of all mite species was performed by UCSD using an Illumina HiSeq 2000. Sequences were assembled into transcripts including isoforms and homologs with Trinity. All transcriptomes were translated into amino acid sequences in all six reading frames. For each of the transcriptome sequences, the longest translated continuous amino acid sequence without an occurring stop codon was included in the compiled transcriptome database for the MASCOT search. A minimum length of 60 amino acids was required. Additional translated sequences from other reading frames were included if the length of the respective sequence was longer than 80% of the previously identified longest translated continuous amino acid sequence.

In addition to these transcript-derived sequences, Swissprot and Trembl sequences from the Acari subclass were also included in the database, as well as all previously identified allergens from Der f and Der p (extracted from allergen.org and allergome.org), and proteins commonly found in proteomics experiments, adding up to a database of a total of 409,187 sequences. Application of an 80% homology filter to respective species of extract origin yielded a total of 87 conserved protein groups and 438 proteins, with each group consisting of 1, 2, or more proteins. A total of 492 sequences were included in the final analysis. These sequences were clustered at a 40% identity threshold using the epitope cluster analysis tool available at IEDB into 96 clusters. Each of the 96 sequence clusters were aligned separately using the MEGA software tool (using ClustalW). Clusters corresponding to known allergens were removed from consideration, leaving a set of remaining clusters, herein named "L" clusters. In another set of analysis, proteins were identified by conservation analysis of each translated sequence against three arachnid proteomes {Ixodes scapularis, Metaseiulus occidentalis, Stegodyphus mimosarum) derived from de novo sequence assembly. Each sequence was aligned against each proteome to identify proteins and known allergens that had > 70% sequence identity over at least 50% of the length of the proteome transcript. Similar analyses were performed for each of the sequences against 1,130 proteins of the aero, bacteria, contact and venom or salivary categories from the Allergen Online Database version 15 (Goodman R. et a/, Clin Transl Allergy. 2014; 4(Suppl 2) : P12). Identified proteins in the samples were clustered according to a sequence homology cut off of ≥ 67 %

(historically cut off distinguishing iso-allergens from two distinct allergen groups), and a representative sequence for each cluster was selected. These clusters were named "A" clusters.

The section headed "Amino Acid Sequences" supra shows representative sequences of 22 proteins found in "L" and "A" clusters (either the Der p or the Der f sequence) and of their homologous sequences detected in either Der p, Der f and Bio t (if detected). Other homologous sequences are also found in the transcriptomes of other mites (Gly d, Lep d and Tyr p), but not reported by their sequence.

Table 1 below shows for each protein ID, the percent amino acid sequence identity between Der f and Der p homologous proteins (column 4), calculated by sequence alignment between the protein first detected in the "L" and "A" clusters (species indicated in column 2) and the homologous sequences from the other house dust mite species (species indicated in column 3) ; the percent amino acid sequence identity between the house dust mite protein and the homologous sequences found in humans (column 5), calculated by sequence alignment between the protein first detected in the "L" and "A" clusters (species indicated in column 2) and the human homolog protein; the percent amino acid sequence identity between the house dust mite protein and the closest homologous sequences found in Bio t by mass spectrometry (column 6), calculated by sequence alignment between the protein first detected in the "L" and "A" clusters (species indicated in column 2) and the Bio t homolog protein.

Table 1

Protein First Homolog % sequence % sequence % sequence

ID detected species identity identity to human identity of closest

in between f/p homolog homolog of Bio t

A0001 Der f Der p 87% No significant Not identified

similarity

A0003 Der f Der p 83% 38% 46%

A0007 Der f Der p 90% 66% 81%

A0009 Der p Der f 80% 43% 57%

A0010 Der f Der p 73% 40% 54%

A0011 Der f Der p 84% 30% 48%

A0012 Der f Der p 92% 38% 65%

A0013 Der f Der p 72% 52% 61%

A0014 Der p Der f 80% 45% Not identified

A0015 Der p Der f 56% 28% 32%

A0016 Der f Der p 97% 74% Not identified

A0017 Der f Der p 76% 35% 66%

A0018 Der f Der p 91% 67% 82%

A0019 Der p Der f 80% <33% 62%

A0020 Der f Der p 83% 30% 44%

A0022 Der p Der f 73% No significant 73%

similarity

A0023 Der f Der p 80% 33% Not identified

A0024 Der f Der p 97% 41% 90%

A0025 Der f Der p 48% No significant Not identified

similarity EXAMPLE 2

This example includes a description of the immunogenicity of the proteins selected in Example 1. Immunogenicity was tested with respect to the ability of the protein or fragments of the proteins (peptides) to · stimulate reactivity of T cells obtained from mite allergic donors;

• activate basophilic cells obtained from mite allergic donors; and/or

• react with specific IgE and IgG antibodies of plasma from mite allergic donors.

Peptide library: Each sequence of an "L" cluster was aligned separately using the MEGA software tool with ClustalW. Fifteen-mer peptides overlapping by 10 amino acids were generated and the last 15-mer peptide was added when the sequence length was not divisible by 5, 14,783 unique peptides remained.

Promiscuous HLA class II binding predictions and pool generation: HLA class II binding predictions optimized for global coverage were performed for the seven class II alleles (HLA- DRB1*03: 01, HLA-DRB1*07: 01, HLA-DRB1*15: 01, HLA-DRB3*01 : 01, HLA-DRB3*02: 02, HLA-DRB4*01 : 01 and HLA-DRB5*01 : 01) using the standalone version of the IEDB class II binding prediction tool. The median consensus percentile rank was estimated from the consensus percentile ranks for the seven alleles. Further, peptides with more than ten overlapping amino acids, which appeared because several occurrences of some sequence regions were repeated multiple times in the same sequence, were eliminated (e.g.

"TLSDYNIQKESTLHLVLRLRGGMQIFVKTLTG" was repeated seven times in one sequence.) Variant peptides were also removed, retaining the better peptide based on the median consensus percentile rank and conservation among the sequences within its respective cluster. Peptides with median consensus percentile rank≤10.0 and conserved in≥35% of sequences in the same cluster were finally selected, also including additional selected peptides chosen to maximize DRB1 allele coverage, for a grand total of 2,589 peptides.

Peptide synthesis: Peptides were purchased from Mimotopes (Clayton, Victoria, Australia) or A and A (San Diego, CA) as crude material on a small (1-mg) scale. Individual peptides were resuspended in DMSO at a final concentration of 40 mg/mL. Peptide "megapools" of 30-65 peptides/ pool were generated. Following lyophilization, each pool was reconstituted in DMSO so that each peptide was present at a concentration of 4 mg/mL. To facilitate deconvolution of positive megapools, each megapool was further broken down in 2-6 "mesopools" (259 mesopools in total), each containing 8-14 peptides. Each mesopool was then deconvoluted to identify individual positive peptides. To avoid dimerization and polymerization of peptides by intra- and intermolecular disulfide bond formation between cysteine residues, this amino acid were in some instances substituted by a serine residue in the peptides. Such peptides are herein marked with an asterisk (*).

Expression of recombinant proteins: Small scale recombinant proteins (>75 % purity, endotoxin level< 10 EU/mg) were expressed in E.coli and/or in insect cells as a custom service by GenScript (NJ, USA) using codon optimized DNA constructs. Selected proteins were further expressed in a human embryonic kidney (HEK293) suspension cell line

(Freestyle™ 293 Expression System, Thermo Fisher, MA, USA), according to the

manufacturer's instructions. Briefly: 30 μg transfection grade, codon optimized plasmids encoding the protein of interest (made as a custom service by Genscript, NJ, USA), was mixed with 60 μΙ 293fectin™, and incubated for 25 min. This mixture was added to 30 ml suspension culture of HEK293 cells with a cell density of 1 Ί0 5 cell/ml. The culture was incubated in 125 ml disposable, polycarbonate, Erlenmeyer flasks with vent caps (Corning, NY) in a 37 °C incubator having a humidified atmosphere with 8 % C0 2 and orbital shaking at 125 rpm for 2-5 days before harvesting. Recombinant proteins secreted into the medium were harvested by sedimentation of the HEK293 cells at 100 g for 5 min. The cell supernatants were subsequently sterilized through a low protein binding Millex-GP 0,45 um filter (Millipore, MA, USA).

Study population: PBMCs from European HDM-allergic individuals were recruited in the Copenhagen region (defined by clinical history of allergy to house dust mite and specific IgE to group 1 and group 2 major allergens from Der p and/or Der f and with measured specific IgE (CAP) >0.7kU/L towards Der p/f 1 or Der p/f 2. In addition, PBMCs from 10 US HDM- allergic individuals were recruited in San Diego (defined by Der p extract IgE titers greater than 0.35 kUA/L). PBMCs were isolated from whole blood by density gradient centrifugation according to manufacturers' instructions (Ficoll-Hypaque, Amersham Biosciences, Uppsala, Sweden). Der p- and Der f-specific extract IgE titers were determined using the ImmunoCAP system (Thermo Fisher, Uppsala, Sweden). In a separate series of experiments, pooled plasma from 10 European and 10 American HDM atopic individuals from the San Diego region, respectively, was utilized to run 2D immunoblots to elucidate IgE and IgG reactivity towards the proteins, which had at least one peptide with positive T cell response. T cell reactivity of protein: T cells reactivity was determined by establishment of HDM specific T-cell lines according to standard methods. In short, PMBCs from HDM allergic donors were cultured for 2-3 weeks in the presence of house dust mite allergen extract. The responses to proteins having SEQ ID NOs 1-44 were assessed by IL-5/IFNg FluoroSPOT (Mabtech FS- 0108-10) according manufacturer protocol, (after 2 weeks) or proliferation in a standard 72 h T cell proliferation assay, as described in Henmar H et al., Clin Exp Immunol 2008; 153(3) : 316-23. (after 3 weeks). In addition these established T cell lines were used for further characterization and epitope mapping.

T cell reactivity of peptides: HDM-specific T cells were expanded in vitro. Briefly, PBMCs from HDM-allergic individuals were stimulated with HDM extract (5 μg/mL) and expanded over 14- 17 days with IL-2 (added every 3 days). Cells were harvested on day 14, restimulated with HDM extract (5 μg/mL), individual peptides (10 μg/mL) or peptide pools (5 μg/mL) and screened for IFN-A/IL-5-production by ELISPOT. Criteria for positivity were 100 or 20 spot forming cells (SFCs) per 10 5 PBMCs for peptide pools or single peptides, respectively, p < 0.05, and a stimulation index > 2. Basophil activation: Basophil Activation Test (BAT) was used as a predictive in vitro assay for indication of safety/immediate hypersensitivity reactions. The BAT assay is a widely used diagnostic test that is also used for evaluation of allergenicity of allergen derived

components. In short: whole blood from HDM allergic donors was stimulated for 1 h with the proteins, and the increased expression of activation markers on the surface of basophils were measured by flow cytometry. The BAT test was carried out using different concentrations of the proteins and the allergens Der p 2 and Der p 1 was used as controls and tested in the same concentration rates.

Determination of IqE and IaG reactivity: Briefly, extracts of Der p and Der f were mixed 1 : 1 and 300 μg of extract proteins was run on 2D gels (3 -10 pH range, 12% 138 (vol/vol) acrylamide) at Applied Biomics. The 2D-immunoblots of the labeled extracts were incubated with either (1) pooled plasma (diluted 1 : 20) from 10 HDM allergic donors recruited in San Diego or (2) pooled sera from 10 HDM allergic donors recruited in Europe (diluted 1 : 33). Blots were incubated with goat anti-human IgE and mouse anti-human IgG (Sigma-Aldrich), and HDM donor antibody reactivity visualized using Cy2-conjugated donkey anti-goat IgG and Cy5-conjugated donkey anti-mouse IgG antibodies (Biotium). In total 237 IgE and/or IgG-reactive protein spots were picked and analyzed by mass spectrometry by searching the MS spectra against a transcriptome sequence database. Using this database, the most likely protein of a given spot was identified. The antibody reactivity of each spot was then determined by visual inspection of the 2D-gel images. We took into account both the reactivities of the San Diego and European pools. If any spot in a given protein was antibody reactive with either cohort, the protein was considered reactive for that antibody. Then, the protein sequences from the proteomic analysis were aligned with the bioinformatically determined peptide clusters.

Table 2 shows the results obtained for the selection of 22 proteins (either the Der p of Der f protein were tested). Notably, a number of the proteins produced a T cell response in many of the tested donors, but none or a low fraction of donors had IgE reactivity towards the proteins.

Table 2

SEQ Cluster Cluster Protein Species aa % T cell % IgG IgE

ID ID (L) ID (A) ID length responding responding reactivity reactivity

NO: name donors to donors in

protein BAT Assay

13 96 55 A0001 Der f 222 6 of 29 0 of 16

14 96 55 A0001 Der p 222 + +

2 65 74 A0003 Der f 132

3 65 74 A0003 Der p 132 1 of 29 0 of 16 - -

34 46 21 A0006 Der f 462

35 46 21 A0006 Der p 462 + -

30 61 30 A0007 Der f 361

31 61 30 A0007 Der p 362 6 of 27 0 of 8 + +

7 44 67 A0008 Der f 154

8 44 67 A0008 Der p 154 - -

43 58 6 A0009 Der f 975

44 58 6 A0009 Der p 990 7 of 24 + +

41 10 7 A0010 Der f 887

42 10 7 A0010 Der p 885 + -

39 64 16 A0011 Der f 520

40 64 16 A0011 Der p 520 15 of 27 0 of 8 - -

22 103 19 A0012 Der f 509

38 103 19 A0012 Der p 262 11 of 27 0 of 8 + -

36 40 23 A0013 Der f 463

37 40 23 A0013 Der p 474 4 of 27 0 of 8 - -

32 33 25 A0014 Der f 429

33 33 25 A0014 Der p 434 10 of 27 1 of 8* - -

26 25 34 A0015 Der f 310

27 25 34 A0015 Der p 321 1 of 24 + +

28 43 39 A0016 Der f 332

29 43 39 A0016 Der p 332 2 of 27 1 of 8* + +

21 13 49 A0017 Der f 261

25 13 49 A0017 Der p 275 3 of 26 1 of 8* - -

5 97 71 A0018 Der f 152

6 97 71 A0018 Der p 152 0 of 27 1 of 8* - -

1 75 75 A0019 Der f 130

4 75 75 A0019 Der p 134 8 of 24 - -

15 31 59 A0020 Der f 228

16 31 59 A0020 Der p 228 3 of 24 + +

19 105 50 A0022 Der f 252

20 105 50 A0022 Der p 253 6 of 24 + + SEQ Cluster Cluster Protein Species aa % T cell % IgG IgE

ID ID (L) ID (A) ID length responding responding reactivity reactivity

NO: name donors to donors in

protein BAT Assay

23 8 42 A0023 Der f 270

24 8 42 A0023 Der p 271 3 of 24 - -

11 36 65 A0024 Der f 188

12 36 65 A0024 Der p 188 5 of 24 - -

17 104 NA A0025 Der f 233

18 104 NA A0025 Der p 233 0 of 24 - -

9 102 62 NA Der f 171

10 102 62 NA Der p 172 + -

Table 3 shows the percentage of donors that produced a response against a peptide together with information about the source protein of the peptide (i.e. the protein with 100% sequence alignment over the peptide sequence). For example peptide with ID No: 2344 derives from protein AOOOl and has 100% sequence alignment with the sequence of AOOOl of Der f as well as the Der p within the stretch of consecutive amino acid residues from position 131 to 145, while peptide with ID NO: 1714 derives specifically from the Der f sequence of protein A0009 and peptide with ID NO: 1715 derives specifically from the Der p sequence of protein A0009. The peptides in the table are identical to the 15-mer peptides having SEQ ID NOs: 45-260 that are detailed in the section supra headed "Amino Acid Sequences".

Table 3

EXAMPLE 3

This example relates to the further testing of immunogenicity of the proteins identified in Example 1. Their ability to react with IgE antibodies in HDM allergic individuals, to stimulate in vitro T cell proliferation of HDM allergic individuals and non-allergics, and to stimulate ex vivo cytokine production of HDM allergic individuals and non-allergics. The following tests were used :

• Basophil Activation Test (BAT) was used as a predictive in vitro assay for indication of safety/immediate hypersensitivity reactions. BAT test was carried out using blood from HDM allergic individuals (n = 14), and by use of different concentrations of test proteins or or the major house dust mite allergens (Der p 1, Der f 1, Der p 2 or Der f 2, e.g. a concentration of 1, 10, 100 or 1000 ng/ml.

• In vitro T cell reactivity determined in T cell lines obtained from HDM allergic individuals (n=30) and non-allergics (n=8) : Determined by establishment of HDM specific T-cell lines according to standard methods. In short, PMBCs from HDM allergic donors were cultured for 3 weeks in the presence of house dust mite allergen extract. The responses to proteins at a concentration of 0.5 ug/ml or 2 ug/ml of the test protein or the major house dust mite allergens (Der p 1, Der f 1, Der p 2 or Der f 2) were assessed by proliferation in a standard 72 h T cell proliferation assay, as described in Henmar H et al., Clin Exp Immunol 2008; 153(3) : 316-23. T cell reactivity was.

Ex vivo stimulation of PBMC cells obtained from mite allergic patients (n = 16) and non- allergics (n = 6) : Determined by measuring the production of the cytokines; IFN-gamma, IL-9, IL-10, IL-17 and IL-31 following stimulation with test protein in concentration up to 10 ug/ml. Freshly isolated PBMC 5xl0 5 /ml were cultured with a test protein for 5 days and cell supernatant were harvested and stored at -80° C. Cytokines of the supernatants were measured using ProcartaPlex Multiplex Immunoassays with MAGPIX Multiplex Reader according manufactory protocol. e 4

A010 were only tested in T-cell lines from 3 allergic donors, all responsive. Comments: Overall, the test proteins identified in Example 1, did only provide a positive BAT test in none or a very few mite allergic individuals, whereas they stimulated T cell proliferation in a larger percentage of the mite allergic individuals. In contrast, the major allergens produce both positive BAT test and stimulates T cell proliferation in a significant larger fraction of the mite allergic individuals.

Table 5

EXAMPLE 4

This example relates to the abundance of the proteins in house dust mite extracts relatively to the abundance of known allergens of house dust mite extracts.

The abundance was determined as follows: MS/MS spectra were searched (via MASCOT search engine, Matrix Science) against an in-house allergen database that included the protein sequences of all novel proteins A0001-A0025 as well as all known HDM allergens group 1-35. The sum of the relative and semi-quantitative Exponentially Modified Protein Abundance Index (emPAI) scores (Ishihama Y et al. 2005) of all hits were set to 100%, and the relative percentage (molar %) of each protein was calculated. The abundance of the known HDM allergens is shown as one pooled result.

Table 6 shows the relative abundance of the novel proteins and known HDM allergens

Table 6

Protein Der p Der p fecals Der p full Der f Der f fecals Der f full bodies Mild extraction extract bodies, Mild Mild extraction extract

Mild extraction (10 min) extraction (10 min)

(10 min) (10 min)

HDM 64,2 88,5 50,1 50,8 47,3 51,2 allergens

A0001 5,0 0,5 4,5 8,0 11,6 9,2

A0003 3,9 <0,0 17,8 19,0 11,9 13,5

A0006 0,6 0,5 0,7 1,2 0,6 0,7

A0007 1,3 0,3 2,1 4,5 2,4 3,5

A0008 2,0 0,8 4,8 4,2 3,4 4,7

A0009 1,0 0,3 0,6 2,3 0,6 0,4

A00010 0,8 0,5 0,6 2,0 0,3 1,1

A00011 <0,0 <0,0 0,2 0,4 <0,0 0,4

A00012 <0,0 <0,0 <0,0 <0,0 <0,0 0,2

A00013 <0,0 <0,0 <0,0 0,4 1,0 <0,0

A00014 4,0 0,5 0,7 0,8 <0,0 0,9

A00015 2,5 0,3 1,4 <0,0 <0,0 <0,0

A00016 0,4 <0,0 2,4 0,6 4,2 4,8

A00017 <0,0 <0,0 <0,0 <0,0 1,2 0,4

A00018 3,7 0,9 4,1 4,8 11,3 2,6

A00019 6,3 5,3 1,8 <0,0 <0,0 <0,0

A00020 <0,0 0,5 1,5 <0,0 1,3 0,9

A00022 4,2 1,0 1,3 <0,0 <0,0 <0,0

A00023 <0,0 <0,0 <0,0 <0,0 1,1 0,8 Protein Der p Der p fecals Der p full Der f Der f fecals Der f full bodies Mild extraction extract bodies, Mild Mild extraction extract

Mild extraction (10 min) extraction (10 min)

(10 min) (10 min)

A00024 <0,0 <0,0 5,3 1,0 1,8 2,9

A00025 <0,0 <0,0 <0,0 <0,0 <0,0 1,9