Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
PROTECTED TETRASACCHARIDES, THEIR PROCESS OF PREPARATION AND THEIR USE AS TRANSGLUCOSYLASE ACCEPTOR SUBSTRATES IN THE CHEMO-ENZYMATIC SYNTHESIS OF SHIGELLA FLEXNERI SPECIFIC OLIGOSACCHARIDES
Document Type and Number:
WIPO Patent Application WO/2019/007999
Kind Code:
A1
Abstract:
The present invention provides protected tetrasaccharides, their process of preparation and their use in the synthesis of oligosaccharides, in particular fragments of O-antigens from Shigella flexneri, for example of serotype la, lb, 2a, 2b, 3a, X, 4a, 4b, 5a, 5b, 7a or 7b.

Inventors:
MULARD LAURENCE (FR)
LE HEIGET GUILLAUME (FR)
HU ZHAOHU (CN)
BAREL LOUIS-ANTOINE (FR)
ANDRE ISABELLE (FR)
MOULIS CLAIRE (FR)
REMAUD-SIMEON MAGALI (FR)
BARBE SOPHIE (FR)
BENKOULOUCHE MOUNIR (FR)
BEN IMEDDOURENE AKLI (FR)
Application Number:
PCT/EP2018/068013
Publication Date:
January 10, 2019
Filing Date:
July 03, 2018
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
PASTEUR INSTITUT (FR)
AGRONOMIQUE INST NAT RECH (FR)
INSTITUT NAT DES SCIENCES APPLIQUEES DE TOULOUSE (FR)
UNIV PARIS 13 PARIS NORD (FR)
CENTRE NAT RECH SCIENT (FR)
International Classes:
C07H15/00; C07H15/10; C12P19/12; C12P19/26
Domestic Patent References:
WO2008155487A22008-12-24
Foreign References:
EP2100966A12009-09-16
Other References:
STÉPHANE SALAMONE ET AL: "Investigation on the Synthesis of Shigella flexneri Specific Oligosaccharides Using Disaccharides as Potential Transglucosylase Acceptor Substrates", JOURNAL OF ORGANIC CHEMISTRY, vol. 80, no. 22, 20 November 2015 (2015-11-20), pages 11237 - 11257, XP055528217, ISSN: 0022-3263, DOI: 10.1021/acs.joc.5b01407
JULIEN BOUTET ET AL: "Detailed Investigation of the Immunodominant Role of O-Antigen Stoichiometric O-Acetylation as Revealed by Chemical Synthesis, Immunochemistry, Solution Conformation and STD-NMR Spectroscopy for Shigella flexneri ?3a", CHEMISTRY - A EUROPEAN JOURNAL, vol. 22, no. 31, 4 July 2016 (2016-07-04), DE, pages 10892 - 10911, XP055529215, ISSN: 0947-6539, DOI: 10.1002/chem.201600567
SCIENCE, vol. 341, no. 6144, 2013, pages 379 - 83
CELL. MOL. LIFE SCI., vol. 73, 2016, pages 2661 - 79
J. AM. CHEM. SOC., vol. 131, 2009, pages 7379 - 89
CHEM. COMMUN., vol. 51, 2015, pages 2581 - 4
CLIN. INFECT. DIS., vol. 59, 2014, pages 933
BIOCHEMISTRY (MOSCOW, vol. 80, 2015, pages 901 - 14
AGOSTON ET AL., TETRAHEDRON: ASYMMETRY, vol. 27, 2016, pages 707 - 728
CHEM. REV., vol. 101, 2001, pages 53 - 80
J. AM. CHEM. SOC., vol. 133, 2011, pages 3724 - 7
TETRAHEDRON: ASYMM., vol. 11, 2000, pages 3647 - 51
CARBOHYDR. RES., vol. 356, 2012, pages 115 - 31
CHEM. ASIAN J., vol. 12, 2017, pages 419 - 39
CHEM. EUR. J., vol. 22, 2016
J. ORG. CHEM., vol. 80, 2015, pages 11237 - 57
J. IMMUNOL., vol. 182, 2009, pages 2241 - 7
BIOCONJUGATE CHEM., vol. 27, 2016, pages 883 - 92
P. G. M. WUTS; T. W. GREENE: "Greene's Protective Groups in Organic Synthesis, 4th ed.", 2006, WILEY- INTERSCIENCE
"Greene's Protective Groups in Organic Synthesis, 5th ed.", 2014, WILEY-INTERSCIENCE
P. MURPHY AND C. VOGEL: "Carbohydrate Chemistry: Proven Synthetic Methods", vol. 4, 2017, article "chap. 39"
TETRAHEDRON LETT., vol. 49, 2008, pages 5339 - 42
"NCBI", Database accession no. CP000574
"GenBank", Database accession no. AHF 19404.1
VUILLEMIN ET AL., J BIOL CHEM., vol. 291, no. 14, 2016, pages 7687 - 702
G. L. MILLER, ANAL. CHEM., vol. 31, 1959, pages 426 - 428
Attorney, Agent or Firm:
GEVERS & ORES (FR)
Download PDF:
Claims:
CLAMS

1. A compound of following formula (¾) :

(TD)xABzC(TD)rR (lo)

wherein:

x and y are 0 or 1 , providing x + y = 1 ;

A is 2)-u-L-Rhap-(l→ ;

B is 2)-a-L-Rha/?-(l-> ;

C is 3)-a-L-Rha ?-( l -> ;

D is 3}-a-D-GlcpN-(l -> or 3)-P-D-GlcpN-(l→ ;

Z is CIAc, BrAc, Ac or 0;

T is a protecting group capable of anchimeric assistance and being preferentially orthogonal to Z, in particular when Z is Ac, or is azide (N3) ;

R is a protecting group compatible with chain elongation into O-antigen fragments, and being orthogonal to T and Z, when Z is CIAc, BrAc or Ac ;

CIAc is C1CH2-C(0)-;

BrAc is BrCH2-C(0)-.

2. A compound of following formula (I) :

(ci3AcD)xABzC(cBAcD)y-All (I)

wherein:

x and y are 0 or 1, providing x + y = 1 ;

A is 2)-a-L-Rha/7-(l-> ;

B is 2)-a-L-Rha^-(l-> ;

C is 3)-a-L-Rha/?-(l - > ;

D is 3)-<x -D-Glc N-(l→ or 3)-p-D-Glc/?N-(l→ ;

Z is CIAc, BrAc, Ac or 0;

All is allyl ;

C13Ac is C13C-C(0)- ;

CIAc is C1CH2-C(0)-;

BrAc is BrCH2-C(0)-.

3. The compound according to claim 1 or 2, wherein:

C13Ac is in position 2D,

Z is in position 2c, and/or All is in position 1& when x = 1 , or in position ID, when y = 1.

4. The compound according to claim 1 or 2, wherein x is 0 and y is 1 , corresponding to the following formula (la):

AB CcijAcD-All (la).

5. The compound according to claim 1 or 2, of one the following formulae:

6. The compound according to claim 1 or 2, wherein x is 1 and y is 0, corresponding to the following formula (lb):

cuAcDAB/C-AU (lb).

7. The compound according to claim 1 or 2, of the following formula:

8. A process of preparation of a compound according to claim 1 or 2, comprising the following steps:

(i) a step of contacting a protected AB/C-triosyl donor with a protected xD-R acceptor to yield a protected ABZCTD-R compound ;

(ii) one or more steps of deprotection of the protected compound obtained in step (i) to give a compound of formula AB/CTD-R.

9. The process according to claim 8, wherein the ABzC-triosyl donor is of formula ABzC-Z', wherein Z' is PTFA or TCA, PTFA representing N-phenyltrifluoroacetimidyl and TCA representing trichloroacetimidyl.

10. The process according to claim 8 or 9, wherein the protected ABzC-triosyl donor is of one of the following formulae:

TES is triethylsilyl ;

Lev is levulinyl ;

Nap is 2-naphytlmethyl ;

PMB is /7«ra-methoxybenzyl.

11. The process according to claim 8, wherein the protected CBACD-A11 acceptor is of the following formula:

12. The process according to claim 8, wherein the protected ABZCCBACD-AII compound is of one of the following formulae:

13. A process of preparation of a compound according to claim 1 , comprising the following steps:

(i) a step of contacting a protected AB/C -R acceptor with a protected jD donor to yield a protected χϋ AB/C-R compound ;

(ii) one or more steps of deprotection of the protected compound obtained in step (i) to give a compound of formula jDABzC-R.

14. The process according to claim 13, wherein the AJ¾C-A11 acceptor is of one of the following formulae:

wherein :

TES is triethylsilyl ;

PMB is /?ara-methoxybenzyl.

15. The process according to claim 13, wherein the CBACD donor is of formula wherein Z' is PTFA or TCA, PTFA representing N-phenyltrifluoroacetirnidyl and TCA representing trichloroacetimidyl.

16. The process according to claim 13, wherein the protected CDACD donor is of one of the following formulae or the corresponding oxazolines : Ph-T-O Ph-^ O

O" o

LevO OTCA Le?0 OPTFA

I3

17. A method of preparation of a saccharide comprising the following steps:

(i) a step of ct-D-glucosylation of a compound of formula (I) according to claim 1 by a sucrose-active enzyme selected from the group consisting of the enzymes of the GUI 3 family, the enzymes of the GH70 family, their variants, peptide fragments of said enzymes or variants, and modified enzymes/peptides derived from said enzymes, variants or fragments, to obtained an a-D-glucosylated (ci3AcD)xABzC(a3AcD)y-All compound;

(ii) optionally, and in particular when y is 1 , a step of acetylation of the 3A position and/or the 6D position to obtained a 3A- and/or 6o-acetylated compound;

(iii) optionally, a step of cleavage of the ClAc or BrAc protecting group borne by C to obtain a 2(-hydroxylated compound;

(iv) optionally, a step of (a) conversion of the ClAc or BrAc protecting group bome by C into an acetyl group (Ac), (b) conversion of the C13Ac masking group bome by D into an acetyl group (Ac), (c) conversion of the ally! protecting group bome by D into a propyl (Pr) moiety followed if needed by the de-O-acylation of the 2c position to obtained either a 2c-acylated or 2c-hydroxylated compound; (v) optionally, a step of chain elongation at one or both of the ends of the compound resulting from the enzymatic a-D-glucosylation of the compound of formula (1) as defined above.

18. The method according to claim 17, wherein said saccharide is a fragment of O- antigens from S. flexneri, in particular of serotypes l a, lb, 2a, 2b, 3a, X, 4a, 4b, 5a, 5b, 7a or

7b.

19. The method according to claim 17, wherein said saccharide comprises one of the following fragments:

-[(( E I→2 ):, (E 1→4)a (M)b AcD)x (L)c (El→4)c- A (El- 3)d (El→4)d- B (E 1→4)e (Ac)zC ((E 1→2)a- ( 1 ->4)a (M)b AcD)y]„-

-[(( E 1 2)a. (El→4)a (M)b AcD)x (L)c (Ε1 ->4)ε· A (El→3)d (El→4)d. B (E I→4)c (Ae)?C

((El→2)a. (El→4)a (M)b AcD)v]n-All

-[((Ε Ι→2)β· (El→4)a (M)b AcD)x (L)c (El→4)c- A (El→3)d (El→4)a- B (E 1→4)e (Ac)zC

((El→2)a. (El→4)a (M)b AcD)y]„-Pr

wherein:

L is El— >3 or Ac in position 3A;

M is E 1 ~->6 or Ac in position 6D;

a is 0 or 1 ,

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

e is 0 or 1 ;

c' is 0 or 1 ;

d and d' are 0 or 1 , providing d + d' = 0 or 1 ;

when L is Ac, c is at each occurrence independently 0 or 1, c being in particular at all occurrences 0 or 1, providing c + c' = 0 or 1 ;

when L is El- 3, c is 0 or 1 , providing c + c' = 0 or 1 ;

when M is Ac, b is at each occurrence independently 0 or 1 , b being in particular at all occurrences 0 or 1 , providing a + b = 0 or 1 ;

when M is El->6, b is 0 or 1, providing a + b = 0 or 1 ;

a, b, c, c', d, d' and e being in particular such as a + b + c + c' + d + d' + e = 1, 2 or 3.

20. The method according to claim 17, wherein said saccharide comprises the following fragment:

-[AB(Ac)ZC(El→6)AcD]n-, or

-[(El→3)AB{Ac)zCAcD]n-, or

-[(El→4)AB(Ac)zCAcD]n-, or

-[A(E l ->4)B(Ae)zCAcD]n-.

21. A compound of one of the following formulae:

((E 1→2)a-(E 1 >4);)(E 1→6)bci3AcD)x(E 1 ->3)C(E 1 ->4)t,A(E 1 ->3)d(E 1→4)d-B(E 1→4)e ciAcC((E 1→2)..(E 1 ->4)a(E 1→6b)Ci3ACD)r All;

((E 1→2)a-(E 1→4)a(E 1 ->6)bCi3AcD)x(E 1→3)C(E 1→4)C-A(E 1→3)d(E 1 ->4)d B(E 1 - 4)eC((E 1→2)a- (E1→4),(E 1 - >6)b C13A D)y-A!];

((E 1→2)a.(E 1→4)a(E 1→6)bCi3 ACD)X(E 1→3)C(E 1 ->4)C-A(E 1→3)d(E 1 ->4)ιΓΒ(Ε 1→4)e

AcC((El→2)a El→4)a(El→6)b

({E 1→2%{E 1 ->4)a( 1 ->6)bC|3AcD)x(E 1→3)C(E 1→4) -A(E 1 ~->3)d(E 1→4)d.B(E 1→4)e

ACC((E 1→2)a E 1→4)a(E 1 - 6)b C13 AcD)y;

((E 1→2)a-(E 1→4)a(E 1→6)bCi3 AcD)x(E 1 ->3)C(E 1→4)C-A(E 1 ->3)d(E 1→4)d-B(E 1 ->4)c

clAcC((El→2)a.(El→4)a(El→6)b ci3AcD)y;

((E 1→2)a (E 1→4)a(E 1→6)bCi3AcD)x(E 1→3)C(E 1 - 4)C.A(E 1 ->3)d(E 1 ->4)d-B(E 1→4)eC((E 1→2)a.

(El→4)a(El→6)b Ci3AcD)y;

((E 1→4)a(E 1→6)bC,3ACD E 1→3)C(E 1 4)C-A(E 1→3)d(E 1→4)d.B(E 1→4)e

ciA=C((El→4)a(El→6b)cl3AcD)y-All;

((E 1 - 4)a(E 1 - 6)b c:,jAcD)x(E 1→3)C(E 1→4)C-A(E 1→3)d(E 1 >4)d B(E 1→4)e C((E 1 >4)a(E 1 >6)b

((E 1 ->4)a(E 1→6)bCi3AcD)x(E 1 >3)C(E 1 ->4)C-A(E 1→3)d(E 1 ->4)d B(E 1 ->4)c AcC((El→4)a(El→6)b

((El→4)a(El→6)bci3AcD)x(El→3)c(El→4)c.A(EI→3)d(El→4)d.B(El→4)e AcC((E l->4)a(El→6)b

(( 1→4)a(E 1→6)W:|3ACD)X(E 1 ->3)C(E 1 - 4)c. A(E 1→3)d(E 1→4)d-B(E 1→4)e

CIACC((E 1→4)a(E 1 ->6)b ci3AcD)y;

((E 1→4)a(E 1 ->6)κ:!3ΑοΟ)Λ(Ε1→3)t(E 1→4)c. A(E 1 - 3)d(E 1 >4)d-B(E 1 ->4)eC((E 1 - 4)a(E 1 ->6)b ci3ACD)y; ((E 1→4)a(E 1 ->6) ci3 ACD)X(E 1→3)c A(E 1→3)d B(E 1→4)e CIACC((E 1→4)a(E 1→6)b Cl3 AcD)r All ; ((E 1→4)A(E 1→6)b ci3AcD)x(E 1→3)c A(E 1→3)d B(E 1→4)c C((E 1→4)a(E 1→6)b a3 AcD)y-All; ((E 1→4).(E 1→6)„ ci3AcD)x(E 1→3% A(E 1→3)d B(E 1→4)E AcC((E 1→4)a(E 1 ->6)b ci3AcD)y-All; ((E 1→4)A(E 1→6)b c!3ACD)x(E 1→3)c A(E 1→3)d B(E 1→4)e AcC((E 1→4)a(E 1 ->6)b CI3 AcD)y; ((E 1→4)A(E 1→6 CD ACDME 1 ->3)C A(El→3)d B(E l→4)t CIACC((E 1 -->4)a(E 1→6)b Ci3AcD)y;

((E 1 ->4)a(E 1→6)b CHACD),(E 1→3)c A(E 1- >3):! B(E 1 >4)c C((E 1 >4)a(E 1→6)b Ci3AcD)y;

ABaAcC(E l→6) c,3 AcD-All;

ABCIACC(E 1 ->6) CI3ACD;

ABACC(E1→6)

ABAcC(El->6) cl3AcD;

(E1→3)ABCLACC CI3ACD-A11;

(El→3)ABCci3AcD-All;

(E l - >3)ABAcCC;3AcD-All;

(El→3)ABAcCci3AcD;

(E1→3)ABCIACCC,3ACD;

(El→4)ABaAcC c;3AcD-All;

(E l >4)ABAcCc,?AcD-All;

(El→4)ABAcCci3ACD;

(El→4)ABCiAcCci3AcD;

(El→4)ABCci3AcD;

A(El→4)BclAcCcl3AcD-All;

A(El ->4)BAt.CC!3AcD-All;

A(E l→4)BAcCci3AcD;

A(El→4)BCLACCcl3AcD;

A compound of one of the following formulae, wherein Z is ClAc, Br Ac

Description:
PROTECTED TETRASACCHARIDES, THEIR PROCESS OF PREPARATION AND THEIR USE AS TRANSGLUCOSYLASE ACCEPTOR SUBSTRATES IN THE CHEMO-ENZYMATIC

SYNTHESIS OF SHIGELLA FLEXNERI SPECIFIC OLIGOSACCHARIDES

The present invention provides protected tetrasaccharides, their process of preparation and their use in the synthesis of oligosaccharides, in particular fragments of O-aritigens from Shigella flexneri, for example of serotype la, lb, 2a, 2b, 3a, X, 4a, 4b, 5a, 5b, 7a or 7b.

Carbohydrates displayed at the surface of cells and pathogens are of great therapeutic potential. On the one hand, the human glycome is being scrutinized in detail, on the other hand increasing knowledge on microbial carbohydrates and carbohydrate binding proteins offers new openings for therapeutic and prophylactic interventions. Among a large diversity of applications, carbohydrates are actively investigated as vaccine components. In this context, synthetic carbohydrates represent an attractive alternative to carbohydrate antigens of biological origin. The licensing of QuimiHib®, over a decade ago, demonstrated feasibility. Other vaccine candidates derived from synthetic oligosaccharides are under development whether targeting infectious diseases or non-transmittable diseases such as cancer.

Owing to an increasing interest in well-defined carbohydrates, progress in synthetic methodologies to complex oligosaccharides evolve rapidly. Reports on the use of scaffolds compatible with customized modifications opening the way to a diversity of targets have emerged, especially related to the synthesis of highly diverse complex N-gl yeans. Chemo- enzymatic strategies, mostly relying on the use of glycosyltransferases at the latest stages of the synthesis, are highly attractive. These two approaches were successfully combined to deliver a small library of N-glycans (Science (2013) 341 (6144), 379-83). Similarly, glycorandomization/glycodiversification find wide applications. Yet, drawbacks in the use of glycosyltransferases include enzyme availability, added to cost and availability of the sugar- nucleotide donors.

An alternative strategy consists in the use of "engineered transglycosidase/low cost donor" systems adapted to the customization of non-natural acceptors for the chemo- enzymatic synthesis of carbohydrates and glycoconjugates (Cell. Mol. Life Sci. (2016) 73, 2661-79). This strategy involves mono- and disaccharide acceptors, demonstrating feasibility for simple non-natural acceptors (J. Am. Client. Soc. (2009) 131 , 7379-89, Ghent, Commiin. (2015) 51 , 2581-4). However, this method provides an access to a limited number of targets only, due to its partially divergent character. Accordingly, it is an object of the present invention to provide versatile core precursors, able to yield a great number of oligosaccharides in a highly efficient divergent maimer.

Another aim of the present invention is to provide a way to a large variety of selected targets in the context of vaccine development against shigellosis. Most of the known 15 5. flexneri serotypes are pathogenic for human and a multivalent vaccine providing broad serotype coverage is required {Clin, Infect. Dis. (2014) 59, 933). Additional 5". flexneri O- antigen diversity has been described {Biochemistry (Moscow) (2015) 80, 901-14).

Inventors have for the first time demonstrated that enzymes are being able to perform the in vitro a-D-glucosylation of the ABCD and DABC tetrasaccharides, corresponding to the backbone repeating unit, and frame-shift thereof, respectively, of S. flexneri O-antigens {Biochemistry (Moscow) (2015) 80, 901-14). Surprisingly, branching sucrases, and more generally glucansucrases, can in particular glucosylate the non-natural ABCIA C CCBA C D-AII tetrasaccharide acceptor, to access S. flexneri specifically ct-D-glucosylated pentasaccharides.

Most of the known S. flexneri O-antigens are defined by a repeating unit encompassing a common (ABCD) n tetrasaccharide backbone. This feature offers major opportunities for the development of a broad serotype coverage vaccine against S. flexneri by use of synthetic carbohydrate haptens.

In contrast to other strategies whereby the design of a n-valent polysaccharide-based vaccine requires the independent preparation of n monovalent polysaccharide antigens and their conversion into immunogens (see for example Prevnar®, Synflorix®, and other licensed polysaccharide-protein conjugate vaccines), synthesis opens the way to a divergent strategy to the target carbohydrate haptens built on a single versatile core precursor inspired from the tetrasaccharide backbone repeat of the O-antigens of interest.

Thus, in one aspect, the present invention relates to a compound of following formula

(¾) :

( T D) x AB z C( T D) r R (lo)

wherein:

x and y are 0 or 1 , providing x + y = 1 ;

A is 2)-a-L-Rha/ -( 1→ ;

B is 2)-a-L-Rha/?-(l→ ;

C is 3)-a-L-Rha/ ? -(l→ ;

D is 3)-a-D-Glc/?N-(l→ ; Z is ClAc, BrAc, Ac or 0;

T is a protecting group capable of anchimeric assistance and being orthogonal to Z, when Z is ClAc, BrAc or Ac, or is azide ( 3 ) ;

R is a protecting group compatible with chain elongation into O-antigen fragments, and being orthogonal to T and Z, when Z is ClAc, BrAc or Ac ;

ClAc is C1CH 2 -C(0)-;

BrAc is BrCH 2 -C(0)-.

In one aspect, the present invention relates to a compound of following formula (¾):

( T D) x ABzC( T D) y -R (Io)

wherein:

x and y are 0 or 1 , providing x + y = 1 ;

A is 2}-a-L-Rha/ -( 1 - ;

B is 2)-a-L-Rhap-(l→ ;

C is 3)-a-L-Rha/?-(l→ ;

D is 3)-a -D-GlcpN-( 1→ or 3)-β-0-01 Ν-( 1→ ;

Z is ClAc, BrAc, Ac or 0;

T is a protecting group capable of anchimeric assistance and being preferentially orthogonal to Z, in particular when Z is Ac, or is azide (N 3 ) ;

R is a protecting group compatible with chain elongation into O-antigen fragments, and being orthogonal to T and Z, when Z is ClAc, BrAc or Ac ;

ClAc is C1C¾-C(0)-;

BrAc is BrCH 2 -C(0)-.

In a particular embodiment:

when x = 0 and y = 1, D is 3)-a-D-Glc/>N-(l→ or 3)-P-r GlcpN-( l→ ;

when x = 1 and y = 0, D is 3)-p-D-Glc/?N-( 1— > ; Regarding formulae (¾) and (I), D is in particular ;

In a particular embodiment, TD means that T is in position 2D.

In a particular embodiment, means that Z is in position 2c-

Orthogonality and orthogonal protecting groups in carbohydrate chemistry are well known from the skilled in the art, and are in particular described in Agoston et al. (Tetrahedron: Asymmetry 27 (2016) 707-728).

R is in particular chosen from allyl (All), wra-methoxyphenyl (PMP), pentenyl (Pent), phenyl (Ph), triisopropylsilyl (TIPS) or /er/-butyldiphenylsilyl (TBDPS) group. R is in particular on the O in position lc, when x = 1, or on the O in position ID, when y = i-

When R is Ph, the atom in position lc, when x = 1, and in position ID, when y = 1, is in fact a S.

T is in particular chosen from tricMoroacetyl (CBAc), 2,2,2-trichloroethoxycarbonyl

(Troc) or allyloxycarbonyl (Alloc).

In another aspect, the present invention relates to a compound of following formula

(I) :

(ci3AcD) x AB z C(ci3AcD) y -All (I)

wherein:

x and y are 0 or 1 , providing x + y = 1 ;

A is 2)-a-L-Rha/?-(l→ ;

B is 2)-(x-L-Rha/?-(l→ ;

C is 3)-a-L-Rha/?-(l-> ;

D is 3)-a-D-Glc/ N-(l -> ;

Z is ClAc, BrAc, Ac or 0 ;

All is allyl ;

CBAc is C1 3 C-C(0)- ;

ClAc is C1CH 2 -C(0)- ;

BrAc is BrCH 2 -C(0)-.

In another aspect, the present invention relates to a compound of following formula

(I):

(ci3AcD) x AB z C(ci3AcD) y -All (I)

wherein:

x and y are 0 or 1, providing x + y = 1 ;

A is 2)-a-L-Rha -(l- ;

C is 3)-<x-i.-Rhap-(l→ ;

D is 3)-a-D-GlcpN-(l→ or 3)-p-D-Glc N-(l→ ;

Z is ClAc, BrAc, Ac or 0 ;

All is allyl ;

CBAc is C1 3 C-C(0)- ;

ClAc is C1CH 2 -C(0)- ; BrAc is BrCH 2 -C(0)-.

In a particular embodiment:

when x = 0 and y = 1, D is 3)-a-D-GlcpN-(l→ or 3)-p-D-GlqpN-(l -> ;

when x = 1 and y = 0, D is 3)-p-D-Glc N-( l→ ;

Compounds of formula (I) offer compatibility with chain elongation into O-antigen fragments, which may require a protecting group able to ensure anchimenc assistance at position 2 of the reducing residue.

The orthogonal protecting group in position 2c enables the 2c-0-acetylation pattern of importance for some serotypes.

Lastly, orthogonal protection at the anomeric position of the reducing residue enables to avoid α/β mixtures while facilitating subsequent chemical modification and in particular chain elongation into O-antigen fragments.

In a particular embodiment, the present invention relates to a compound of formula (I), wherein:

C13 Ac is in position 2D,

Z is in position 2c, and/or

All is in position lc, when x = 1 , or in position I D, when y = 1.

In a particular embodiment, Z is ClAc, corresponding to a compound of formula (CBA C D) X ABC!A C C(CBA C D) V -A11.

In a particular embodiment, Z is BrAc, corresponding to a compound of formula

(ci3AcD) x ABBrAcC(ci3AcD) r All.

In a particular embodiment, Z is Ac, corresponding to a compound of formula (ci3AcD) x AB AC C(ci3A C B) AlL

In a particular embodiment, Z is 0, corresponding to a compound of formula

(c)3AcD) x ABC(ci3AcD) Alh

In a particular embodiment, the present invention relates to a compound of formula (I), wherein x is 0 and y is 1 , corresponding to the following formula (la):

AB z Cci3AcD-AU (la).

In a particular embodiment, said compound is of formula ABCIA C CCBA C D-AU, ABBrAcCcuAcD-All, AB., ..CcBAcD-All or

The compound of formula (I) is in particular of the following formula:

more particularly with Z = H or Z = ClAc as in the following:

In a particular embodiment, the present invention relates to a compound of formula (I), wherein x is 1 and y is 0, corresponding to the following formula (lb):

CBAcDAB/C-All (lb).

In a particular embodiment, said compound is of formula CBA C DABCIA C C-AH, CBAcDABBrAcC-All, C i3 Ac DAB Ac C-Ali or CBA C DABC-AH.

The compound of formula (I) is in particular of the following formula:

more particularly with Z = H or Z = ClAc as in the following:

In another aspect, the present invention relates to a process of preparation of a compound of formula (¾) as defined above, comprising the following steps:

(i) a step of contacting a protected ABzC-triosyl donor with a protected jD-R acceptor to yield a protected ABzCjD-R compound ;

(ii) one or more steps of deprotection of the protected compound obtained in step (i) to give a compound of formula ABzCjD-R.

In another aspect, the present invention relates to a process of preparation of a compound of formula (I) as defined above, comprising the following steps:

(i) a step of contacting a protected ABzC-triosyl donor with a protected CBA C D- M acceptor to yield a protected ABZCCU AC D-AII compound ;

(ii) one or more steps of deprotection of the protected compound obtained in step (i) to give a compound of formula ABZCCDA C D- H.

In a particular embodiment, the ABzC-triosyl donor is of formula AB / C-Z', wherein Z' is PTFA or TCA, PTFA representing N-phenyltrifiuoroacetimidyl and TCA representing trichloroacetimidyl, Z' being more particularly TCA.

In a particular embodiment, the protected ABzC-triosyl donor is of one of the following formulae:

TCA

, and notably

wherein :

TES is triethylsilyl ;

Lev is levulinyl ;

Nap is 2-naphytlmcthyl ;

PMB is /jfzra-rnethoxybenzy! .

By levulinyl (or levuiinoyl) is meant the group CH3-CO-CH2-CH2-CO-.

In the whole description, a wavy bond such as indicates that the corresponding substifuent is in axial and/or in equatorial position.

Thus, a compound containing such a wavy bond exist as a mixture of the alpha and beta anomers, or only as the alpha or beta anomer.

In a particular embodiment, the protected ABzC-triosyl donor is of one of the above- mentioned formulae, wherein at least one of the TES groups or each TES group is independently replaced by a group chosen from TBS (tert-butyldimethylsilyl), TIPS (triisopropylsilyl), PMB, Nap or Lev.

In a particular embodiment, the protected ABzC-triosyl donor is of one of the above- mentioned formulae, wherein at least one of the BDA groups or each BDA (butane 2,3- diacetal) group is independently replaced by a group chosen from the 1 ,2-diacetaI family, and in particular by CDA (cyclohexane- 1 ,2-diacetal). The CDA group is for example described in Chem. Rev. (2001 ) 101 , 53-80.

In a particular embodiment, the protected ABzC-triosyl donor is of one of the above- mentioned formulae, wherein at least one of the Nap groups or each Nap group is independently replaced by a group chosen from TBS, TIPS or PMB.

In a particular embodiment, the protected G3 .\C D-A 11 acceptor is of following formula: CI 3 In. a particular embodiment, the protected ABZCCBA C D-AH compound is of one of the following formulae:

In a particular embodiment, the protected ABZCCBA C D-AII compound is of one of the above-mentioned formulae, wherein at least one of the TES groups or each TES group is independently replaced by a group chosen from TBS, TIPS, PMB, Nap or Lev.

In a particular embodiment, the protected ABZCCBA C D-AII compound is of one of the above-mentioned formulae, wherein at least one of the vicinal Nap/TES pairs or each vicinal Nap/TES pair is independently replaced by a .group chosen from BDA or CDA.

By Nap TES pair, is in particular meant a Nap that is vicinal to a Nap or TES group, as following:

In a particular embodiment, the protected ABZCCBA C D-AII compound is of one of the above-mentioned formulae, wherein at least one of the Nap groups or each Nap group is independently replaced by a group chosen from TBS, TIPS or PMB.

In another aspect, the present invention relates to a process of preparation of a compound of formula (¾) as defined above, comprising the following steps:

(i) a step of contacting a protected ABzC-R acceptor with a protected i D donor to yield a protected xDAB/C-R compound ;

(ii) one or more steps of deprotection of the protected compound obtained in step

(i) to give a compound of formula rDAB/C-R.

In another aspect, the present invention relates to a process of preparation of a compound of formula (I) as defined above, comprising the following steps:

(i) a step of contacting a protected AB/C-A11 acceptor with a protected CBA C D donor to yield a protected CBA C DAB/C-AH compound ; (ii) one or more steps of deprotection of the protected compound obtained to give a compound of formula CBA C DABZC-AII

In particular, the AB C-A11 acceptor is of one of the following formulae:

wherein:

TES is triethylsilyl ;

Lev is levulinyl ;

Nap is 2-naphty!methyl ;

PMB is /?ara-methoxybenzyl.

In a particular embodiment, the ABzC-A!l acceptor compound is of one of the above- mentioned fonnulae, wherein at least one of the TES groups or each TES group is independently replaced by a group chosen from TBS, TIPS, PMB or Nap.

In a particular embodiment, the protected ABZCCBA C D- U compound is of one of the above-mentioned formulae, wherein at least one of the Nap groups or each Nap group is independently replaced by a group chosen from TBS, TIPS or PMB.

In a particular embodiment, the protected ABzC-All acceptor is of one of the above- mentioned formulae, wherein at least one of the vicinal Nap/TES pairs or each vicinal Nap/TES pair is independently replaced by a group chosen from BDA or CD A.

In particular, the CBA C D donor is of formula CBAJD-Z', wherein Z' is PTFA or TCA, PTFA representing Λ'-ph en yl tri fl uoroaceti m i dyl and TCA representing trichloroacetimidyl.

In particular, the protected CBA C D donor is of one of the following formulae or the corresponding oxazolines:

I 3 Ph-Τ O

TESO ' OTCA TESO OPTFA

I 3

and notably:

By "corresponding oxazoline" is meant a group as following:

The 1 ,2-oxazoline may result from intramolecular cyclisation and loss of leaving group at position 1.

In a particular embodiment, the CBA C D donor compound is of one of the above- mentioned formulae, wherein the TES group is independently replaced by a group chosen

When the ABzC-AU acceptor is of formula (a) and the CDA C D donor bears a TES protecting group in position 3D, there is in particular one step (ii) of dcprotection only. In the other cases, there may be two steps of deprotection.

In a particular embodiment, the protected C A C DAB/C-AH compound is of one of the following formulae:

In a particular embodiment, the protected CBA C DABZC-AII compound is of one of the above-mentioned formulae, wherein at least one of the TES groups or each TES group is independently replaced by a group chosen from TBS, TIPS, PMB or Nap.

In a particular embodiment, the protected CBA C DABZC-AU compound is of one of the above-mentioned formulae, wherein at least one of the vicinal Nap/TES pairs or each vicinal Nap/TES pair is independently replaced by a group chosen from BDA or CD A.

In a particular embodiment, the protected CBA C DABZC-AII compound is of one of the above-mentioned formulae, wherein at least one of the TBS groups or each TBS group is replaced by a group chosen from PMB, Nap, Lev, TIPS or Lev.

In a particular embodiment, the protected CBA C DAB/C-AII compound is of one of the above-mentioned formulae, wherein the 4,6-0-benzylidene acetal is replaced by an 4,6-0- isopropylidene acetal.

In another aspect, the present invention relates to a compound as defined by the following formulae, wherein Z is ClAc, Br Ac or Ac : notably , and

I 3 In a particular embodiment, the compound is of one of the above-mentioned formulae, wherein at least one of the vicinal Nap/TES pairs or each vicinal Nap/TES pair is independently replaced by a group chosen from BDA or CDA.

In another aspect, the present invention relates to a method of preparation of a saccharide comprising the following steps:

(i) a step of a-D-glucosylation of a compound of formula (¾) as defined above by a sucrose-active enzyme selected from the group consisting of the enzymes of the GH13 family, the enzymes of the GH70 family, their variants, peptide fragments of said enzymes or variants, and modified enzymes/peptides derived from said enzymes, variants or fragments, to obtained a a-D-glucosylated ( t D) X ABKC(TD) V -R compound;

(ii) optionally, and in particular when y is 1 , a step of acetylation of the 3A position and/or the 6p position to obtained a 3A- and/or δο-0-acetylated compound;

(iii) optionally, a step of cleavage of the ClAc or BrAc protecting group bome by C to obtain a 2c-hydroxylated compound;

(iv) optionally, a step of (a) conversion of the ClAc or BrAc protecting group bome by C into an acetyl group (Ac), (b) conversion of the C13Ac masking group bome by D into an acetyl group (Ac), (c) conversion of the allyl protecting group bome by D into a propyl (Pr) moiety followed if needed by the de-O-acylation of the 2c position to obtained either a 2(-acylated or 2c-hydroxylated compound;

(v) optionally, a step of chain elongation at one or both of the ends of the compound resulting from the enzymatic a-D-glucosylation of the compound of formula (1) as defined above.

In another aspect, the present invention relates to a method of preparation of a saccharide comprising the following steps:

(i) a step of a-D-glucosylation of a compound of formula (I) as defined above by a sucrose-active enzyme selected from the group consisting of the enzymes of the GH13 family, the enzymes of the GH70 family, their variants, peptide fragments of said enzymes or variants, and modified enzymes/peptides derived from said enzymes, variants or fragments, to obtained a α-D-glucosylated (cBAcD) x AB C(c;3AcD)y-All compound;

(ii) optionally, and in particular when y is 1, a step of acetylation of the 3A position and/or the 6D position to obtained a 3A- and/or 6o-O-acetylated compound;

(iii) optionally, a step of cleavage of the ClAc or BrAc protecting group bome by C to obtain a 2c-hydroxylated compound; (iv) optionally, a step of (a) conversion of the ClAc or BrAc protecting group borne by C into an acetyl . group (Ac), (b) conversion of the CBAc masking group borne by D into an acetyl group (Ac), (c) conversion of the ally! protecting group borne by D into a propyl (Pr) moiety followed if needed by the de-O-acylation of the 2c position to obtained either a 2c-acylated or 2c-hydroxylated compound;

(v) optionally, a step of chain elongation at one or both of the ends of the compound resulting from the enzymatic oc-D-glucosylation of the compound of formula (I) as defined above.

For example, step (ii) is a regioselective O-acylation at a primary hydroxyl group as well as at the equatorial hydroxyl group of 1 ,2-cis diol systems, which has been for instance described for polyols including carbohydrates by means of a borinic-acid catalyzed regioselective step (J. Am. Chem. Soc. (2011) 133, 3724-7). Alternatively, methods selective for the O-acylation of primary hydroxyl groups have been described including enzymatic O- acylation (Tetrahedron: Asymm. (2000) 11, 3647-51). Such transformations could also be in particular envisioned using an a-D-glucosylated AB Ac C A cD-Pr substrate (wherein Pr = propyl, Ac ~ acetyl), as discussed below, or an a-D-glucosylated ABCA C D-ΡΓ substrate.

Step (iii) may be achieved using thiourea {Carbohydr. Res. (2012) 356, 1 15-31).

About step (iv), the transformation of an α-D-glucosylated (ci3AcD) x BzC(ci3AcD) y - ll compound into the corresponding ( Ac D) x ABAcC( A cI3y)- r (Pr = propyl, Ac = acetyl) may be achieved by Pd/ ' C- or Pd(OH) 2 -mediated hydrogenation {Chem. Asian J. (2017) 12, 419-39), while the subsequent de-O-acetylation at position 2c is easily achievable by conventional transesterification, for example by use of a methanol ic solution of sodium methoxide {Chem. Eur. J. (2016) 22, 10892 - 91 1).

About step (v), and, for example, taking advantage of the cw-vicinal diol in residue A (not applicable in the case enzymatic a-D-glucosylation would occur at position 3A as in the S. flexneri 3a O-antigen), the 2 A and 3A hydroxyl groups may be masked as an isopropylidene acetal either directly or post selective masking of the primary hydroxyl groups (J. Org. Chem. (2015) 80, 1 1237-57). Following the unmasking of the primary alcohols if needed, the remaining hydroxyl groups may then be masked with permanent protecting groups, preferentially benzyl ethers (J. Org. Chem. (2015) 80, 1 1237-57). Removal of the isopropylidene group under acid hydrolysis conditions, followed by selective masking of the 3 A -OH by use of a protecting group orthogonal to all those in place (PMB or a silyl ether such as TES for example) may provide a pentasaccharide acceptor compatible with chain elongation at the 2 A -OH. Alternatively, levulinylation of the hydroxyl group at position 2 A may provide a folly orthogonally protected pentasaccharide building block, which may be converted into a donor by selective removal of the ally! ether followed by activation of the obtained hemiacetal into an imidate donor (anomeric OPTFA or OTCA substitution) (J, Org. Chem. (2015) 80, 1 1237-57).

Herein, a chemo-enzymatic strategy to customized glycobricks suitable for the efficient synthesis of fragments of a diversity of S. flexneri O-antigens is disclosed. These glycobricks could be assembled into homo-oligomers, therefore providing a novel route to S. flexneri type-specific haptens, in particular through step (v) of chain elongation.

Homo-oligomers correspond to fragments encompassing n repeating units of the O- antigen from a selected S. flexneri serotype. Such a 15mer hapten, corresponding to a three repeating unit fragment of the O-antigen, was for example identified for 5". flexneri 2a (J. Immunol (2009) 182, 2241-7, Bioconjugate Chem. (2016) 27, 883-92). Besides, available data suggest that at least a 15mer, most probably a 20mer, corresponding to three and four repeating unit fragments of the O-antigen would act as a suitable S. flexneri 3 a hapten.

Glycoside hydrolases EC 3.2.1. and EC 2.4.1 are a widespread group of enzymes that hydrolyze the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycoside hydrolases, based on sequence similarity, has led to the definition of > 100 different families. This classification is well known for the skilled in the art and is in particular available on the CAZy (http://www.cazy.org) web site.

By enzymes of the G.H13 family is meant enzymes of glycoside hydrolase family 13 active on sucrose, that is a family of glycoside hydrolases, in particular amylosucrases (subfamily), which are also included in the « glucansucrase » family. This classification is well known for the skilled in the art and is in particular available on the CAZy (GH13.4 http://www.cazy.org/GH13_4.html) web site.

By enzymes of the GH13 family is meant in particular a wild type glycoside hydrolase, more particularly a transglucosylase, even more particularly an amylosucrase (EC 2.4.1.4) or a sucrose hydrolase (EC 3.2.1.-), as described in the patent application EP 2 100 966, even more particularly an amylosucrase from Neisseria polysaccharea, preferably selected from the group consisting of 1G5A, 1ZS2, 1 MVY, 1MW0, 1 S46, UGI, 1MW2, 1MW3, 1MW1 and UG9 proteins as found in the Protein Data Base (PDB, https://www.rcsb.org/) and as described in the patent application EP 2 100 966, or a mutant of the protein, as described in the patent application EP 2 100 966. By enzymes of the GH70 family is meant transglucosylases produced by lactic acid bacteria from, e.g., Streptococcus, Leuconostoc, Weisella or Lactobacillus genera. This classification is well known for the skilled in the art and is in particular available on the CAZy (http://www.cazy.org GH70.html) web site. In particular the enzymes of the GH70 family are branching sucrases and glucansucrases (EC 2.4.1). An enzyme of the GH70 family that can be used in the framework of this invention is an altemansucrase from Leuconostoc citreum, more particularly of strain NRRL B-1355.

In particular, said enzyme is selected from the group consisting of the BRS-B, BRS-B- Dl, BRS-B-D2, BRS-C, BRS-A, BRS-D, BRS-E, GBD-CD2, GBD-CD2 W2135V, GBD- CD2 W2135C-F2136I, GBD-CD2 W2135S-F2136L, GBD-CD2 W2135I-F2136C, GBD-CD2 W2135N-F2136Y, GBD-CD2 W2135N, GBD-CD2 W2135I-F2136Y, GBD-CD2 W2135L, GBD-CD2 W2135C, GBD-CD2 W2135N-F2136H, GBD-CD2 W2135L-F2136L, GBD-CD2 W2135F-F2136I, GBD-CD2 W2135C-F2136N, GBD-CD2 W2135G, GBD-CD2 W2135F, GBD-CD2 F2163G, GBD-CD2 L2166I, GBD-CD2 F2163H, GBD-CD2 F2163G L2166I, GBD-CD2 A2162E F2163L, GBD-CD2 F2163L, GBD-CD2 F2163I-D2164E-L2166I enzymes.

In particular, said enzyme is selected from the group consisting of the BRS-B, BRS-B- Dl , BRS-B-D2, BRS-C, BRS-A, BRS-D, BRS-E, GBD-CD2, GBD-CD2 W2135V, GBD- CD2 W2135C-F2136I, GBD-CD2 W2135S-F2136L, GBD-CD2 W2135I-F2136C, GBD-CD2 W2135C, GBD-CD2 W2135L-F2136L, GBD-CD2 W2135F-F2136I, GBD-CD2 W2135C- F2136N, GBD-CD2 W2135G, GBD-CD2 W2135F, GBD-CD2 F2163G, GBD-CD2 L2166I, GBD-CD2 F2163G L2166I, GBD-CD2 A2162E F2163L, GBD-CD2 F2163L, GBD-CD2 F2163I-D2164E-L21661 enzymes.

These enzymes are described in the art, as mentioned below in the examples.

In particular, said enzyme is a mutant of the BRS-B-D2 enzyme (SEQ ID NO : 4), as defined in the following table:

in particular, said enzyme is BRS-B.

In particular, said enzyme is BRS-B-D2.

In particular, said enzyme is GBD-CD2 F2163G.

In particular, said enzyme is GBD-CD2 W21351-F2136C or GBD-CD2 W2135L- F2136L or GBD-CD2 W2 I35S-F2136L.

In particular, said saccharide is a fragment of O-antigens from S. flexneri, in particular of serotype la, lb, 2a, 2b, 3a, X, 4a, 4b, 5a, 5b, 7a or 7b.

In particular, said saccharide comprises the following fragment:

-[((El→2) a - (El→4), (M) b A cD) x (L) c A (El→3) d B (E l --->4) e (Ac) ,C ((El→2) a . (El→4) a (M) b whcrcin:

L is El- 3 or Ac in position 3A

M is El— >6 or Ac in position 6D;

a and b is 0 or 1, providing a + b = 0 or 1 ;

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

c, d and e are 0 or 1 ;

a, b, c, d and e being in particular such as a + b + c + d + e = l or 2 ;

z is at each occurrence independently 0 or 1, z being in particular at all occurrences 0 or 1 ;

E represents a residue a-D-Glcp- ;

n is an integer superior or equal to 1, in particular comprised between 1 and 10.

In particular, said saccharide comprises the following fragment:

-[((El→2) a . (El→4) a (M) b Ac D) x (L) c (El→4) - A (El→3) d (El >4) d - B (El→4) e {Ac)z C i(El→2) a - (El→4) a ( ) b Ac D) y ] n - wherein:

L is El ->3 or Ac in position 3A;

M is El -»6 or Ac in position 6D;

a is 0 or 1 ,

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

e is 0 or 1 ;

c' is 0 or 1 ;

d and d' are 0 or 1 , providing d + d' = 0 or 1 ;

when L is Ac, c is at each occurrence independently 0 or 1, c being in particular at all occurrences 0 or 1 , providing c + c' = 0 or 1 ;

when L is E l ->3, c is 0 or 1, providing c + c' = 0 or 1 ;

when M is Ac, b is at each occurrence independently 0 or 1, b being in particular at all occurrences 0 or 1 , providing a + b = 0 or 1 ;

when M is E 1— >6, b is 0 or 1 , providing a + b = 0 or 1 ;

a, b, c, c', d, d' and e being in particular such as a + b + c + c' + d + d' + e = 1 , 2 or 3.

z is at each occurrence independently 0 or 1 , z being in particular at all occurrences 0 or 1 ;

E represents a residue a-D-Glcp- ;

n is an integer superior or equal to 1 , in particular comprised between 1 and 10.

In the above and below paragraphs, L and/or M are in particular Ac at at least one occurrence, more particularly at all occurrences.

In. the above and below paragraphs, L and/or M are in particular El - ->3 and El— >6 respectively, at all occurrences.

In particular, said saccharide comprises the following fragment:

-[((El→2) a - (El→4) a (M) b Ac D) x (L) e A (El→3) d B (El→4) e (Α € ((E l - 2) fl - (El→4) a (M) b

Ac D) y ] n -Ail

wherein:

L is El— >3 or Ac in position 3A ;

M is El - >6 or Ac in position 6 D ;

a and b is 0 or 1 , providing a + b = 0 or 1 ;

a' is:

- 0 or 1 when a is 1 ; - 0 when a is 0 ;

c, d and e are 0 or 1 ;

a, b, c, d and e being in particular such as a + b + c + d + e = l or 2 ;

z is at each occurrence independently 0 or 1 , z being in particular at all occurrences 0 or 1 ; E represents a residue a-D-Glcp- ;

n is an integer superior or equal to 1, in particular comprised between 1 and 10.

In particular, said saccharide comprises the following fragment:

-[«E l→2) a - (El→4) a (M) b Ac D) x (L) c (El→4) c . A (El→3) d (El→4) cl - B (E 1→4) e (Ac)/ C

((El ->2) a . (El→4) a (M) b Ac D) y ] n ^All

wherein:

L is E ! -- 3 or Ac in position 3 A;

M is El-»6 or Ac in position 6D;

a is 0 or 1 ,

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

e is 0 or 1 ;

c' is 0 or 1 ;

d and d' are 0 or 1 , providing d + d' = 0 or 1 ;

when L is Ac, c is at each occurrence independently 0 or 1, c being in particular at all occurrences 0 or 1, providing c + c' = 0 or 1 ;

when L is El->3, c is 0 or 1, providing c + c' = 0 or 1 ;

when M is Ac, b is at each occurrence independently 0 or 1, b being in particular at all occurrences 0 or 1, providing a + b = 0 or 1 ;

when M is El— >6, b is 0 or 1, providing a + b = 0 or 1 ;

a, b, c, c', d, d' and e being in particular such as a + b + c + c' + d + d' + e = 1, 2 or 3.

In particular, said saccharide comprises the following fragment:

-[(<E1→2),. (El→4) a (M) b Ac D) x (L) c A (El→3) d B (El→4) e (Ac)z C ((El ->2) a - (El→4). (M) b

wherein:

L is El~>3 or Ac in position 3A ;

M is El— >6 or Ac in position 6D ;

a and b is 0 or 1 , providing a + b = 0 or 1 ; a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

c, d and e are 0 or 1 ;

a, b, c, d and e being in particular such as a + b + c + d + e = 1 or 2 ;

z is at each occurrence independently 0 or 1, z being in particular at all occurrences 0 or 1 ; E represents a residue α-D-Glcp- ;

n is an integer superior or equal to 1, in particular comprised between 1 and 10 ;

Pr is propyl.

In particular, said saccharide comprises the following fragment:

-[((El→2) a - (El→4) a (M) b Ac D) x (L) c (El→4) c . A (El→3) d (El→4) d . B (El→4) e (Ac)z C

((El→2) a . (E l→4) a (M) b Ac D) y ] n -Pr

wherein:

L is El->3 or Ac in position 3 A ;

M. is El→6 or Ac in position 6D;

a is 0 or 1,

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

e is 0 or 1 ;

c' is 0 or 1 ;

d and d' are 0 or 1, providing d + d' = 0 or 1 ;

when L is Ac, c is at each occurrence independently 0 or 1, c being in particular at all occurrences 0 or 1, providing c + c' = 0 or 1 ;

when L is El— >3, c is 0 or 1, providing c + c' = 0 or 1 ;

when M is Ac, b is at each occurrence independently 0 or 1, b being in particular at all occurrences 0 or 1 , providing a + b = 0 or 1 ;

when M is El— >6, b is 0 or 1, providing a + b = 0 or 1 ;

a, b, c, c', d, d' and e being in particular such as a + b + c + c' + d + d' + e = 1, 2 or 3;

Pr is propyl.

In particular, said saccharide comprises the following fragment:

-[((El→2) 0 - (El→4) a (El→6) b Ac D) x (El→3) c A (El→3) d B (El→4) c (Ac)z C ((El→2) a - (El→4) a (El--»6) b Ac D) y ] n - wherein:

a and b is 0 or 1 , providing a + b = 0 or 1 ;

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0;

c, d and e are 0 or 1 ;

a, b, c, d and e being in particular such as a + b + c + d + e = 1 or 2 ;

z is at each occurrence independently 0 or 1 , z being in particular at all occurrences 0 or 1 ;

E represents a residue cx-D-Glc/?- ;

n is an integer superior or equal to 1 , in particular comprised between 1 and 10.

In particular, said saccharide comprises the following fragment:

-[((E 1→2) a - (E 1→4) a (El→6) b Ac D) x (E 1→3) c (El -→4) c - A (El- 3) d (E1→¾ B (El→4) e

(Ac )2 C ((E l→2) a - (El→4) a (El→6) b D) y ]„- wherein:

a is 0 or 1 ,

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

e is 0 or 1 ;

c' is 0 or 1 ;

d and d' are 0 or 1, providing d + d' = 0 or 1 ;

c is 0 or 1 , providing c + c' = 0 or 1 ;

b is 0 or 1 , providing a + b = 0 or 1 ;

a, b, c, c', d, d' and e being in particular such as a + b + c + c' + d + d' + e = l, 2 or 3.

In particular, said saccharide comprises the following fragment:

-[((E 1→2).v (E 1→4) a (Ac 1→6) b Ac D) x (Ac 1→3) c (El→4) c - A (El→3) d (El→4) d . B (El→4) e

(Ac) zC ((El→2) a . (El→4) a (El→6) b D)v] n - wherein:

a is 0 or 1 ,

a' is:

- 0 or I when a is 1 ;

- 0 when a is 0 ;

e is 0 or 1 ; c' is 0 or 1 ;

(1 and d' are 0 or 1, providing d + d' = 0 or 1 ;

c is at each occurrence independently 0 or 1 , c being in particular at all occurrences 0 or 1, providing c + c' = 0 or 1 ;

b is at each occurrence independently 0 or 1 , b being in particular at all occurrences 0 or 1 , providing a + b = 0 or 1 ;

a, b, c, c', d, d' and e being in particular such as a + b + c + c' + d + d' + e = 1, 2 or 3.

In particular, said saccharide comprises the following fragment:

-[((El→2) a - (El→4), (El→6 Ac D)x (Ac 1→3) c (El→4) c . A (El→3) d (El ->4) d . B (El ->4) c {Ac)z C ((El→2) a . (El→4) a (El→6) b D) y ] n - whcrcin:

a is 0 or 1,

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

e is 0 or 1 ;

c' is 0 or 1 ;

d and d' are 0 or 1 , providing d + d' = 0 or 1 ;

c is at each occurrence independently 0 or 1 , c being in particular at all occurrences 0 or 1, providing c + c' = 0 or 1 ;

b is 0 or 1, providing a + b = 0 or 1 ;

a, b, c, c', d, d' and e being in particular such as a + b + c + c' + d + d' + e = 1, 2 or 3.

In particular, said saccharide comprises the following fragment:

-f((E 1→2)a- (E 1→4) a (Ac 1→6) b Ac D) x (E 1→3) c (E 1→4) c - A (El→3) d (E 1→4) d - B (E 1→4) e , Ac)z C ((El→2) a . (El→4) a (Ε\→6 D) y ] n - wherein:

a is 0 or 1,

a' is:

- 0 or 1 when a is 1 ;

- 0 when a is 0 ;

e is 0 or 1 ;

c' is 0 or 1 ;

d and d' are 0 or 1 , providing d + d' = 0 or 1 ; , c is 0 or 1 , providing c + c' = 0 or 1 ;

b is at each occurrence independently 0 or 1 , b being in particular at all occurrences 0 or 1 , providing a + b = 0 or 1 ;

a, b, c, c', d, d * and e being in particular such as a + b + c + c' + d + d' + e = 1, 2 or 3..

By "z is at each occurrence independently 0 or 1" is meant that for each occurrence of the repeating unit repeated n times, z can be independently 0 or 1. In other terms, for n superior or equal to 2, position 2c can be noii-O-acetylated, rally O-acetylated or partially acetylated within said saccharide.

By "when L is Ac, c is at each occurrence independently 0 or 1" is meant that for each occurrence of the repeating unit repeated ri times, c can be independently 0 or 1 , when L is Ac, In other terms, for n superior or equal to 2 » position 3A can be non-O-acetylated, fully O- acetylated or partially acetylated within said saccharide.

By "when M is Ac, b is at each occurrence independently 0 or 1" is meant that for each occurrence of the repeating unit repeated n times, b can be independently 0 or 1, when M is Ac. In. other terms, for n superior or equal to 2, position 6D can be non-O-acetylated, fully O-acetylated or partially acetylated within said saccharide.

More particularly, said saccharide comprises the following fragment:

-[AB (At)2 C(E l→6) A cDj n - In a particular embodiment, the sucrose-active enzyme is selected from the group consisting of the BRS-B, BRS-B-D2, BRS-C, BRS-A, BRS-D, BRS-E, GBD-CD2, GBD- CD2 W2135V, GBD-CD2 W2135C-F2136I, GBD-CD2 W2135S-F2136L, GBD-CD2 W2135I-F2136C, GBD-CD2 W2135C, GBD-CD2 W2135L-F2136L, GBD-CD2 W2135F- F2136I, GBD-CD2 W2135C-F2136N, GBD-CD2 W2135G, GBD-CD2 W2135F, GBD-CD2 F2163G, GBD-CD2 L2166I, GBD-CD2 F2163G L2166I, GBD-CD2 A2162E F2163L, GBD- CD2 F2163L, GBD-CD2 F21631-D2164E-L2166I enzymes, and the BRS-B-D2 M6, BRS-B- D2 M21 , BRS-B-D2 M23, BRS-B-D2 M28, BRS-B-D2 M30, BRS-B-D2 M31 , BRS-B-D2 M34, BRS-B-D2 M35, BRS-B-D2 M40 and BRS-B-D2 M41 enzymes, the enzyme being more particularly BRS-B.

More particularly, said saccharide comprises the following fragment:

-[(El→3)AB ( Ac) Z C A cD]n-

In a particular embodiment, the sucrose-active enzyme is selected from the group consisting of the BRS-B, BRS-B-D2, BRS-C, BRS-A, BRS-D, BRS-E, GBD-CD2, GBD- CD2 W21 35V, GBD-CD2 W2135C-F2136I, GBD-CD2 W2135S-F2136L, GBD-CD2 W2135I-F2136C, GBD-CD2 W2135N-F2136Y, GBD-CD2 W2135N, GBD-CD2 W21351- F2136Y, GBD-CD2 W2135L, GBD-CD2 W2135C, GBD-CD2 W2135N-F2136H, GBD-CD2 W2135L-F2136L, GBD-CD2 W2135F-F2136I, GBD-CD2 W2135C-F2136N, GBD-CD2 W2135G, GBD-CD2 W2135F, GBD-CD2 F2163G, GBD-CD2 L2166I, GBD-CD2 F2163H, GBD-CD2 F2163G L2166I, GBD-CD2 A2162E F2163L, GBD-CD2 F2163L, GBD-CD2 F2163I-D2164E-L21661 enzymes, and the BRS-B-D2 M14, BRS-B-D2 M l 8, BRS-B-D2 M21 , BRS-B-D2 M23, BRS-B-D2 M28, BRS-B-D2 M30, BRS-B-D2 M34, BRS-B-D2 M35, BRS-B-D2 M40 and BRS-B-D2 M41 enzymes, the enzyme being more particularly BRS-B, GBD-CD2 W2135G or GBD-CD2 F2163I-D2164E-L2166I.

More particularly, said saccharide comprises the following fragment:

-[(El→4)AB ( Ac) Z C A cD]n-

In a particular embodiment, the sucrose-active enzyme is selected from the group consisting of the BRS-B, BRS-B-D2, BRS-C, BRS-A, BRS-D, BRS-E, GBD-CD2, GBD- CD2 W2135V, GBD-CD2 W2135C-F2136I, GBD-CD2 W2135S-F2136L, GBD-CD2 W2135I-F2136C, GBD-CD2 W2135N-F2136Y, GBD-CD2 W2135N, GBD-CD2 W2135I- F2136Y, GBD-CD2 W2135L, GBD-CD2 W2135C, GBD-CD2 W2135N-F2136H, GBD-CD2 W2135L-F2136L, GBD-CD2 W2135F-F21361, GBD-CD2 W2135C-F2136N, GBD-CD2 W2135G, GBD-CD2 W2135F, GBD-CD2 F2163G, GBD-CD2 L21661, GBD-CD2 F2163H, GBD-CD2 F2163G L2166I, GBD-CD2 A2162E F2163L, GBD-CD2 F2163L, GBD-CD2 F2163I-D2164E-L2166I enzymes, and the BRS-B-D2 Ml 4, BRS-B-D2 M l 8, BRS-B-D2 M21 , BRS-B-D2 M23, BRS-B-D2 M28, BRS-B-D2 M30, BRS-B-D2 M34, BRS-B-D2 M35, BRS-B-D2 M40 and BRS-B-D2 M41 enzymes, the enzyme being more particularly BRS-B, GBD-CD2 W2135G or GBD-CD2 F2163I-D2164E-L2166I.

More particularly, said saccharide comprises the following fragment:

-[A(El→4)B (A c) Z C A cD] n - In a particular embodiment, the sucrose-active enzyme is selected from the group consisting of the GBD-CD2 W2135S-F2136L, GBD-CD2 W2135I-F2136C, GBD-CD2 W2135L-F2136L enzymes, and the BRS-B-D2 M14, BRS-B-D2 M18, BRS-B-D2 M21, BRS-B-D2 M23, BRS-B-D2 M28, BRS-B-D2 M30, BRS-B-D2 M35, BRS-B-D2 M40 and BRS-B-D2 M41 enzymes.

I another aspect, the present invention relates to a compound of one of the following formulae:

((E 1→2) a .(E 1→4) a (E 1→6) bC i3AcD) x (E 1→3) C (E 1→4) C A(E 1→3) cf (E 1 ->4) d -B(E 1→4) e

CI AC C((E 1 ->2) a .(E 1→4) a (E 1→6 b ) C i3A C D) y » All; (t E I >2) A .(E 1→4) a (E 1→6)bci3AcD) x (E 1→3) C (E 1→4) C -A(E 1→3) d (E 1→4) D -B(E 1 - 4) C C((E 1→2) A - (E 1→4)„(E 1→6) B cD Ac D All;

((E 1 >2),-(E 1 >4) U (E 1→6) bC i3AcD) x (E 1→3) C (E 1→4) C A(E 1 ~>3) d (E 1→4) d -B(E 1→4) E

Ac C((El-→2) 3 .(El→4) a (El→6) b cBAcD) y -AIl;

((E 1→2) A .(E 1→4) a (E 1→6) bC .A )x(E 1→3) C (E 1→4 -A(E 1 ->3) d (E 1→4) D -B(E i >4) e

Ac C((El→2) a .(El→4MEl→6) b c ,3AcD) y ;

((E 1→2) a E 1→4) a (E 1→6) b ci3AcD) x (E 1→3) C (E 1→4) C A(E 1→3)„(E 1 >4) d -B(E 1→4) E

ciAcC((E 1 >2).·(Ε 1 >4) a (E 1→6) b C u Ac D) y ;

((E 1→2) a .(E 1→4) a (E 1→6)bCB A C D) X (E 1 ->3) (E 1→4) C . A(E 1→3) d (E 1→4) D -B(E 1→4),C((E 1→2) a -

(El→4) a (El→6) b C ,3AcD) y ;

((El→4).(E 1→6) bC 3A D) x (E 1→3) S (E i ~>4) C A(E I ->3) d (E 1→4) D .B(E 1→4) E

CIA C C((E 1→4) a (E 1→6 b ) c ,3AcD) r All;

((E 1→4) a (E 1→6) b ci3 A C D) X (E1→3) C (E 1→4) C -A(E 1→3) d (E 1→4) D -B(E 1→4) E C((E 1→4) A (E 1 ->6) b

CBAcD)y-All;

((E 1→4) a (E 1→6)κ;; ,Ι ),(Ε 1→3) C (E 1→4) c A(E 1→3) d (E 1→4) D -B(E 1 ~»4). Ac C((El→4) a (El→6) b

((El→4) a (E 1 >6) bC i AcD)x(E 1→3).(E 1 ->4) c A(E 1→3) d (E 1→4),·Β(Ε 1→4) E Ac C((E 1 - 4) a (E 1 · >6)

((E 1→4) a (E 1→6) bC i3 A C D) X (E 1→3) C (E 1→4) C A(E 1→3) d (E 1 - 4) D -B (E 1→4%

CIA C C((E 1→4) a (E 1→6)„ ci3AcD) y ;

((E 1→4) a (E 1→6)„ c,3AcD) x (E 1→3) C (E 1→4) C -A(E 1→3) D (E 1→4) D .B(E 1→4) e C((E 1→4) a (E 1→6) b

((E 1→4) a (E ! >6 , C!3A C D) x (E 1→3) c A(E 1→3) d B(E 1→4) c CIA.C((E 1→4) a (El→6) B C IM. D) y -All;

((E 1→4) a (E 1→6) b ci3 AcD) x (E 1→3) C A(E 1→3) D B(E I→4) e C((E 1→4) a (E 1→6) b C B Ac D) y - All;

((E 1→4) a (E 1→6) B ci3AcD) x (E l→3) c A(E 1→3) d B(E 1 ->4) e AC C((E 1→4) a (E 1 ->6) b a3 AcD) r All; ((E 1→4) a (E 1→6) b ci3AcD) x (E 1→3) c A(E 1→3) d B(E 1→4) e Ac C((E 1→4) a (E l→6) B a3 Ac D) y ;

((E 1→4) a (E 1→6) B cBAcD) x (E 1→3) c A(E 1→3) d B(E 1→4 C IA C C((E 1→4) a (El→6) b ci3A C D) y ;

((E 1→4) a (E 1 ~>6) B ci3AcD) x (E 1 -->3) c A(E 1→3) d B(E 1 >4) c C((E 1→4) a (E 1→6) B COA C D),;

ABA C C(E1 - 6) CI3A C D-A11;

AB Ac C(El→6) ci3AcD;

ABciAeC(El→6) a3Ac D; ABC(El→6) cBAcD

(El→3)ABci Ac C c ,3AcD-All;

(El→3)ABCci 3Ac D-All;

(E1→3)ABACC c , 3 A O D-A11;

(E1→3)AB A CQ:BACD;

(E1→3).ABCIACCCI3ACD;

(El→3)ABCCI3ACD;

(E1 -->4)ABCIACC C',3ACD-A11;

(El ->4)ABCC,3A C D-A11;

(E 1 ->4)ABACC I3ACD-A11;

(E 1 ->4)ABACCCI3ACD;

(E1→4)ABCIACCCI3ACD;

A(E1→4)B CI ACC c ,3ACD-A11;

A(E 1 ->4)BCCI3ACD-A11;

A(E1→4)BACC C I3ACD-A11;

A(E1->4)B AC C C1 3ACD;

A(E 1 - 4)BCIACCCI3ACD;

In another aspect, the invention concerns an enzyme selected from the group comprising the enzymes BRS-D-2 M6, BRS-D-2 M14, BRS-D-2 M18, BRS-D-2 M21 , BRS-

D-2 M23, BRS-D-2 M28, BRS-D-2 M30, BRS-D-2 M31 , BRS-D-2 M34, BRS-D-2 M35, BRS-D-2 M40, BRS-D-2 M41 as defined above, and their variants.

Synthesis

Compounds of formula I may be obtained thanks to a [3+1] strategy as shown below. For instance, an ABQA C C rhamnotriosyl donor encompassing protecting groups orthogonal to an allyl ether (All), a N-trichloroacetyl (Cl 3 Ac) and chloroacetyl moiety (CiAc), was reacted with a D acceptor. A two-step deprotection process gave the ABCIA C CCUA C D-AU acceptor. Protection and deprotection techniques are for instance described by P. G. M. Wuts and T. W. Greene (Greene's Protective Groups in Organic Synthesis, Fourth Edition; Wiley- Interscience, 2006; or Greene's Protective Groups in Organic Synthesis, fifth Edition; Wiley- Interscience, 2014),

Advantageously, rhamnoses A, B and C are in particular built from a single precursor. A suitable synthetic pathway may be the following:

Definitions

The following terms and expressions contained herein are defined as follows:

As used herein, a range of values in the form "x-y" or "x to y", or "x through y", include integers x, y, and the integers there between. For example, the phrases "1-6", or "1 to 6" or "1 through 6" are intended to include the integers 1 , 2, 3, 4, 5, and 6. Preferred embodiments include each individual integer in the range, as well as any subcombination of integers. For example, preferred integers for "1-6" can include 1 , 2, 3, 4, 5, 6, 1-2, 1-3, 1-4, 1- 5, 2-3, 2-4, 2-5, 2-6, etc.

As used herein, the term (E) means that the saccharide following the term (E) bears a residue a-D-Glc -. For example, A(E)B means that B bears a residue ct-D-Glc -. Furthermore, (E l— 6) AcD means that the residue A C D bears a residue ot-D-Glc/?- linked at its primary hydroxyl group (position 6D-OH).

As used herein, the term "donor" more particularly refers to a mono-, oligo- or polysaccharide bearing a leaving group at the anomeric position.

As used herein, the term "acceptor" more particularly refers to a mono-, oligo- or polysaccharide having at least a free hydroxyl group, in general other than the anomeric hydroxyl, preferably at least the free hydroxyl group corresponding to the elongation site of the growing chain.

A variant is derived from an enzyme of the GH13 or GII70 family, such as amylosucrases in the case of GH13 family, and branching sucrases and glucansucrases in the case of GH70 family, by the introduction of mutations (deletion(s), insertion(s) and/or substitutions(s)) at specific positions in the sequence of said enzyme, while retaining the ability of said enzyme to catalyze ct-D-glucosylation.

In particular, the amino acid sequence of said variant has at least 50% identity, or by order of increasing preference at least 40%, 42%, 45%, 47%, 50%, 52%, 55%, 57%, 60%, 62%, 65%, 70%, 72%, 75%, 77%, 80%, 82%, 85%, 87%, 90%, 92%, 95%, 96%, 97%, 98%, 99% or 100% identity, with the amino acid sequence of the corresponding enzyme of the Gill 3 or GH70 family.

The percent amino acid sequence identity is defined as the percent of amino acid residues in a Compared Sequence that are identical to the Reference Sequence after aligning the sequences and introducing gaps if necessary, to achieve the maximum sequence identity. The Percent identity is then determined according to the following formula: Percent identity = 100 x [1- (C/ ' R)], wherein C is the number of differences between the Reference Sequence and the Compared sequence over the entire length of the Reference sequence, wherein (i) each amino acid in the Reference Sequence that does not have a corresponding aligned amino acid in the Compared Sequence, (ii) each gap in the Reference Sequence, and (iii) each aligned amino acid in the Reference Sequence that is different from an amino acid in the Compared Sequence constitutes a difference; and R is the number amino acids in the Reference Sequence over the length of the alignment with the Compared Sequence with any gap created in the Reference Sequence also being counted as an amino acid.

Unless otherwise specified, the percent of identity between two protein sequences which are mentioned herein is calculated from the BLAST results performed either at the NCBI (http://blast.ncbi.nlm.nih.gov/Blast.cgi) or at the GRYC (http://gryc.inra.fr/) websites using the BlastP program with the default BLOSUM62 parameters as described in Altschul el al. (1997).

By "ABC-triosyl" is meant an ABC rhamnotriosyl, i.e. an ABC triose group.

FIGURES

Figure 1 A presents the HSQC spectrum corresponding to pentasaccharide 1 . Figure IB presents the HMBC spectrum corresponding to pentasaccharide 1 , EXAMPLES Example 1: synthesis of compound of formula (I)

Synthesis of a common precursor (1) to residues A, B ani C

AHyl 4-0-(2-naphtylmethyI)-a-L-rhamnopyranoside (1) Acetyl chloride (50 mL, 0.70 mol, 2.5 equiv.) was added dropwise to allyl alcohol (610 mL) at 0 °C, the solution was stirred for 25 min, then L-rhamnose monohydrate (50 g, 277 ramol) was added. The mixture was heated for 2.5 h at 70 °C then for 15 h at 40 °C. Follow up by TLC (DCM/MeOH 8:2) indicated the total conversion of the starting hemiacetal (Rf 0.2) into a less polar product (Rf 0.7). The bath temperature was cooled to 0 °C and the solution was neutralized by addition of NaHC0 3 (102.5 g). The suspension was filtered over a pad of Celite® and solvents were evaporated and co-evaporated three times with toluene.

The brown oily residue was dissolved in anhydrous acetone (300 mL), then 2,2- dimethoxypropane (100 mL, 0.81 mol, 3.0 equiv.) and PTSA (3.04 g, 16 mmol, 0.05 equiv.) were successively added. The mixture was stirred for 3 h at rt. Follow up by TLC (DCM/MeOH 9: 1) indicated the total conversion of the intermediate allyl glycoside (Rf 0.3) into a less polar product (Rf 0.6). The solution was neutralized by adding Et 3 N (4 mL), solvents were evaporated under reduced pressure. The residue was dissolved in DCM (600 mL) and washed with H?0 (3x300 mL) and brine (200 mL). The organic layer was dried by passing through phase separator filter and concentrated to dryness.

The residue was dissolved in DMF (800 mL) under Ar, the bath temperature was cooled to -5 °C, and NaH (60% oil dispersion, 29.1 g, 0.73 mol, 2.4 equiv.) was added portionwise to this suspension. The mixture was stirred for 2 h at rt, then 2-bromomethylnaphthalene (73.5 g, 0.33 mol, 1.2 equiv.) was added portionwise at -5 °C and the reaction mixture was stirred at rt for 2 h. Follow up by TLC (cyclohexane/EtOAc 7:3) indicated the total conversion of the intermediate alcohol (Rf 0.3) into a less polar product (Rf 0.67). The reaction was quenched at 0 °C by addition of MeOH (50 mL). Solvents were eliminated under reduced pressure and volatiles were co-evaporated with toluene. The residue was taken up in EtOAc (400 mL) and washed with H 2 0 (3 x 300 mL) and brine (150 mL). The organic phase was dried over anhydrous Na 2 S0 4 , filtered, and concentrated to dryness.

The residue was dissolved in 80% aq. AcOH (500 mL) and the solution was stirred for 6 h at 80 °C then over the weekend at it and heating was continued for 5 h at 80 °C. Follow up by TLC (cyclohexane/EtOAc 5:5) indicated the total conversion of the intermediate acetal (Rf 1.0) into a more polar product (Rf 0.2). Solvents were removed under vacuum and traces of AcOH were eliminated by co-evaporation with toluene (3 x 400 mL) to give a brown solid. Filtration over a pad of silica eluting with a 4:1 mixture of cilex/EtOAc then 1 : 1 mixture of cyclohexane/EtOAc then recrystallization in hot cyclohexane afforded the expected diol (65.5 g, 69%) as a pale brown solid. Mother liquors were further purified by flash column chromatography (cyclohexane/EtOAc 100:0 to 50:50) to give an additional amount of expected diol (1 1.6 g). The total yield of diol 1 is 81 % over 4 steps.

Ή NMR (400 MHz, CDC1 3 ) δ 7.87 - 7.78 (m, 411, H ArNap ), 7.52 - 7.44 (m, 3H, H Ar Nap)» 5.89 (dddd, I II, J - 17.2, 10.4, 6.0, 5.2 Hz, CH=C¾), 5.28 (dq app , I II, J = 17.2, 1.5 Hz,

CH=Ci¾), 5.19 (dq ap p, 1H, J - 10.4, 1.5 Hz, CH=G%), 4.95 - 4.86 (m, 2H, H ArNap ), 4.81 (d,

J= 1 .4 Hz, HI, H-l), 4.17 (ddt, I H, J= 12.9, 5.1, 1.5 Hz, 1 H, HAH), 4.02 - 3.93 (m, 3H, H A ii,

H-2, H-3), 3.79 (dq, 1H, J = 9.2, 6.3 Hz, H-5), 3.41 (t app , 1H, J = 9.2 Hz, H-4), 2.45 (brs, 2H,

OH), 1.38 (d, 3H, J= 6.3 Hz, H-6).

i 3 C NMR (100 MHz, CDC1 3 ) δ 133.8 (CH=C¾), 128.4 (CivAr), 128.0 (C 1VAr ), 127.7 (C IVAr ),

126.7-125.8 (7C, C Ar ), 1 17.4 (CH=€H 2 ) » 98.5 (C- l 'J C . H = 170.1 Hz), 75.1 (CH 2Nap ), 71.6,

71.3 (2C, C-2, C-3), 68.0 (C A „), 67.3 (C-5), 18.1 (C-6).

HRMS (ES ): mlz 362.1985 (calcd for C,6H 22 0 5 Na [M+NH 4 ]' mlz 362.1967); mlz 367.1576 (calcd for C 4 3H 5 ,C10 12 Na [M+Naf mlz 367.1521).

Synthesis of the Aamnopyranosyl donors (5 and 5a) used as precursor to residues A and B

[Ir] = [IrH 2 {THF) 2 (PPh 2 Me) 2 ]PF 6

The TES derivative

AIM 4-0-(2-naphthylmethyl)-3-i?-triethylsilyI-a-L-rharnnopyranos ide (2)

To a solution of allyl 4-O-(2-naphthylmethyl)-a-L-rharnnopyranoside (diol 1) (5.0 g, 14.5 mmol) in anhydrous acetonitrile (MeCN, 200 mL) stirred at 0°C were successively added dropwise chlorotriethylsilane (TESC1 99%, 2.9 mL, 17.4 mmol, 1.2 equiv) and NJV- diisopropylethylamine (DIPEA 99.5%, 3.8 mL, 21.8 mmol, 1.5 equiv). After stirring the reaction mixture for 2 h at room temperature, TLC (cyclohexane/EtOAc 8:2) showed complete consumption of the starting material (Rf = 0.13) and the presence of a main product (Rf = 0.70). Methanol (1.2 mL, 29.0 mmol, 0.5 equiv) was added, the mixture was stirred for 15 min, and volatiles were evaporated under reduced pressure. Toluene was added and a precipitate appeared. The suspension was filtered and the filtrate was concentrated to dryness under vacuum. The residue was purified by chromatography eluting from a column of Et N- treated silica gel (toluene/ethyl acetate 95:5) to give compound 2 as a yellow oil (4.9 g,

74%).

Allyl 2-0-levHlnyl-4-0-(2-naphtliylmetliyl)-3-£l-trietliylslyl-e- L-rhamnopyraiiosMe (3) Route 1: To a solution of allyl 4-O-(2-naphthylmethyl)-3-( -tricthylsilyl-a-L- rhamnopyranoside (alcohol 2) (5.0 g, 10.9 mmol) in anhydrous dichloromethane (DCM, 65 mL) stirred at room temperature were successively added 1 ,3-dicyclohexylcarbodiimide (DCC 99%, 7.65 g, 37.1 mmol, 3.4 equiv), 4-dimethylaminopyridine (DMAP, 1.07 g, 8.73 mmol, 0.8 equiv) and levulinic acid (LevOH 98%, 4.69 mL, 45.8 mmol, 4.2 equiv). After stirring the reaction mixture for 2 h at room temperature, TLC (cyclohexane EtOAc 7:3) showed complete consumption of the starting material (Rf = 0.60) and the presence of a more polar compound (Rf = 0.53). The reaction mixture was concentrated under reduced pressure. The crude material was taken in EtOAc (50 mL) and the resulting suspension was filtered on a pad of Celite®. Water (30 mL) was added to the filtrate and the organic layer was washed successively with 10% aqueous copper (II) sulfate (30 mL), water (30 mL), saturated aqueous sodium bicarbonate (30 mL) and brine (30 mL). The organic layer was dried by stirring over .anhydrous sodium sulfate, filtered and concentrated under vacuum. The residue was purified by eluting from a column of Et 3 N-treated silica gel (cyclohexane/EtOAc 9: 1 to 5:5) to give the fully protected 3 as a yellow oil (4.5 g, 74%).

Route 2: To a solution of allyl 4-O-(2-naphthylmethyl)-a-i.-rhamnopyranoside (diol 1) (20.0 g, 58.1 mmol) in anhydrous MeCN (800 mL) stirred at 0°C were successively added dropwise TESC1 (11.70 mL, 69.7 mmol, 1.2 equiv) and DIPEA (15.2 mL, 87.1 mmol, 1.5 equiv). After stirring the reaction mixture for 2 h at room temperature, TLC (cyclohexane/EtOAc 8:2) showed complete consumption of the starting material (Rf = 0.13) and the presence of a main product (Rf = 0.70). Methanol (1.2 mL, 29.0 mmol, 0.5 equiv) was added, the mixture was stirred for 15 min, and volatiles were evaporated under reduced pressure. Toluene was added and a precipitate appeared. The suspension was filtered and the filtrate was concentrated to dryness under vacuum.

To a solution of the crude alcohol 2 in anhydrous DCM (200 mL) stirred at room temperature were successively added DCC (40.74 g, 197.4 mmol, 3.4 equiv), DMAP (5.7 g, 46.5 mmol, 0.8 equiv) and levulinic acid (23.8 mL, 232.3 mmol, 4.0 equiv). After stirring the reaction mixture for 2 h at room temperature, TLC (cyclohexane/EtOAc 7:3) showed complete consumption of the starting material (Rf = 0.60) and the presence of a more polar product (Rf = 0.53). The reaction mixture was concentrated under reduced pressure. The crude material was taken in EtOAc (50 mL) and the resulting suspension was filtered on a pad of Celite®. Water (30 mL) was added to the filtrate and the organic layer was washed successively with 10% aqueous copper (II) sulfate (30 mL), water (30 mL), saturated aqueous sodium bicarbonate (30 mL) and brine (30 mL). The organic layer was dried by stirring over anhydrous sodium sulfate, filtered and concentrated under vacuum. The residue was purified by eluting from a column of Et 3 N-treated silica gel (cyclohexane/ethyl acetate 9: 1 to 5:5) to give the fully protected 3 as a yellow oil with (30,7 g, 95%).

2-0-LevuIInyl-4-0-(2-naplitliyimethyi)-3-0-trIetIiylsIlyl-a p-L-rliamiiopyranose (4) To a solution of allyl 2-0-levulinyl-4-O-(2-naphthylmethyl)-3-0-triethylsilyl-a-L- rhamnopyranoside (allyl glycoside 3) (2.5 g, 4.49 mmol) in anhydrous THF (60 mL) stirred at room temperature was added hydrogen-activated 1 ,5-cyclooctadienebis- (methyldiphenylphosphine)iridium(I)hexafluorophosphate (76.0 mg, 90 μπιοΐ, 0.02 equiv). After stirring the reaction mixture for 2 hours at room temperature, TLC (cyclohexane/EtOAe 7:3) showed complete conversion of the starting material (Rf = 0.53) into a less polar product (Rf = 0.56). Λ-Iodosuccinimide (MS 95%, 1.21 g, 5.39 mmol, 1.2 equiv) in 1 :5 water/THF (30 mL) and then distilled water (40 mL) were added to the mixture stirred at 0°C. After stirring the reaction mixture for 2 h at this temperature, TLC (cyclohexane EtOAc 7:3) showed complete conversion of the intermediate (Rf = 0.56) into a more polar product (Rf = 0.26). 10% aqueous sodium metabisulfite (50 mL) was added, THF was evaporated under reduced pressure and DCM (50 mL) was added. The aqueous layer was extracted twice with DCM (30 mL) and the combined organic phases were washed with saturated aqueous sodium bicarbonate (30 mL) and then brine (30 mL). The organic layer was dried over anhydrous sodium sulfate, filtered and concentrated under vacuum. The residue was purified by chromatography eluting from a column of I¾N-treated silica gel (cyclohexane EtOAc 8:2 to 6:4) to give hemiacetal 4 as a yellow oil (2.1 g, 89%, α/β 8:2). 2-0-Levulinyl-4-0-(2-naphthylmethyl)-3-0-tri^

trichloroacetimidate (5)

Route 1: To a solution of 2-0-levulinyl-4-0-(2-naphthyImethyl)-3-0-triethyisilyl-L- rhamnopyranose (hemiacetal 4) (2.00 g, 3.87 mmol) in anhydrous 1 ,2-dichlorocthane (DCE, 30 mL) stirred at room temperature were successively added trichloroacetonitrile (C1 3 CCN 98%, 1.16 mL, 11.6 mmol, 3.0 equiv) and l,8-diazabicyclo[5A0]undec-7-ene (DBU 98%, 289 L, 1.94 mmol, 0.5 equiv). After stirring the reaction mixture for 2 h at room temperature, TLC (cyclohexane/ethyl acetate 7:3) showed complete consumption of the starting material (Rf = 0.26) into a less polar product (Rf = 0.53). Volatiies were evaporated under reduced pressure. The residue was purified by chromatography eluting from a column of triethylamine-treated silica gel (cyclohexane/ethyl acetate 9: 1 to 5:5) to give donor 5 as a whitish crystalline solid (2.3 g, 90%, α/β 95:5).

Route 2: To a solution of ally! 2-0-levulinyl-4-O-(2-naphthylmethyl)-3-O-triethylsilyl-a-L- rhamnopyranoside (allyl glycoside 3) (15.0 g, 26.9 mmol) in anhydrous THF (300 mL) stirred at room temperature was added hydrogen-activated 1 ,5-cyclooctadienebis- (mcthyldiphenylphosphine)iridiura(I)hexafluorophosphate ([lr], 456 mg, 540 μιτιοΐ, 0.02 EtOAc 7:3) showed complete conversion of the starting material (Rf = 0.53) into a less polar product (Rf = 0.56). NIS (7.27 g, 32.3 mmol, 1.2 equiv) in 1 :5 water/THF (150 mL) and then distilled water (250 mL) were added to the mixture stirred at 0 °C. After stirring the reaction mixture for 2 h at this temperature, TLC (cyclohexanc'EtOAc 7:3) showed complete conversion of the intermediate (Rf = 0.56) into a more polar product (Rf = 0.26). 10% aqueous sodium metabisulfite (100 mL) was added. THF was evaporated under reduced pressure and DCM (500 mL) was added. The aqueous layer was extracted twice with DCM (300 mL) and the combined organic phases were washed with saturated aqueous sodium bicarbonate (200 mL) and then brine (200 mL). The organic layer was dried over anhydrous sodium sulfate, filtered and concentrated under vacuum (m: 14.3 g).

To a solution of the crude material (13.9 g) in anhydrous DCE (210 mL) stirred at room temperature were successively added CI 3 CCN (8.1 mL, 80.8 mmol, 3.0 equiv) and DBU (2.01 mL, 13.5 mmol, 0.5 equiv). After stirring the reaction mixture for 2 h 50 at room temperature, TLC (cyclohexane/EtOAc 7:3) showed complete consumption of the starting material (Rf = 0.26) into a less polar product (Rf = 0.53). Volatiles were evaporated under reduced pressure. The residue was purified by chromatography eluting from a column of Et 3 N-treated silica gel (cyclohexane/EtOAc 9: 1 to 5:5) to give donor 5 as a whitish crystalline solid (15.04 g, 84%, α/β 95:5).

The BDA derivative

L-rhamno

monohydr

[irH 2 (THF) 2 (PPh 2 Me) 2 ]PF 6

Allyl 3,4-0-(2 ',3 '-dimethoxybutaii-2 s ,3 '-diyI)-a-L-rhamnopyranoside (2a)

Acetyl chloride (34 mL, 475 mmol, 2.5 equiv) was added dropwise to allyl alcohol (420 mL) at 0 °C. The solution was stirred for 25 min and L-rhamnose monohydrate (34.3 g, 190 mmol) was added. The mixture was heated for 2.5 h at 70 °C then for 15 h at 40 °C. Follow up by TLC (DCM/MeOH 8:2) indicated the total conversion of L-rhamnose (Rf = 0.2) into a less polar product (Rf 0.7). The bath temperature was cooled to 0 °C and the solution was neutralized by addition of solid NaHC0 3 (102.5 g). The suspension was filtered off a pad of Celite® and solvents were evaporated and co-evaporated three times with toluene.

To a solution of crude allyl rhamnoside (190 mmol) in anhydrous methanol (1.0 L) stirred at room temperature were successively added butan-2,3-dione (18.3 mL, 0.21 mol, 1.1 equiv), trimethyl orthoformate (83 mL, 0.76 mol, 4.0 equiv) and boron tri fluoride etherate (11.7 mL, 95 mrnol, 0.5 equiv). After stirring the reaction mixture for 1.5 h under reflux, a TLC follow up (DCM/MeOH 95:5) showed complete consumption of the starting material (Rf = 0.3) and the presence of a main product (Rf = 0.5). Et 3 was slowly added to the reaction mixture at 0 °C until neutralization and volatiles were evaporated under reduced pressure. The residue was purified by chromatography eluting from a column of silica gel (cyclohexane EtOAc 8:2 to 6:4) to give compound 2a as a brown oil (57.8 g, 96%, 2 isomers 9: 1).

Allyl 2-0-leviilinyl-3,4-0-(2',3'-dimethoxybutaii-2' » 3'-iiyl>a-L- rhamnopyranoside (3a)

To a solution alcohol 2a (27.0 g, 84.8 mrnol) in anhydrous dichlorornethane (650 mL) stirred at rt were successively added DCC (59.5 g, 0.29 mol, 3.4 equiv.), DMAP (8.29 g, 67.9 mrnol, 0.8 equiv.) and levulinic acid (36.5 mL, 0.36 mol, 4.2 equiv.). After stirring the reaction mixture at rt for 2 h, a TLC follow up (DCM/MeOH 98:2) showed complete consumption of the starting material (Rf = 0.25) and the presence of two less polar compounds (Rf = 0.6, 0.65). The reaction mixture was concentrated under reduced pressure. Saturated aqueous NaHC0 3 (300 mL) was added to the reaction mixture. The aqueous layer was extracted once with DCM (300 mL) and the combine organic phases were washed twice with brine (150 mL). The organic layer was dried by stirring over anhydrous sodium sulfate, filtered and concentrated under vacuum. The residue was purified by chromatography eluting from a column of silica gel (DCM MeOH 1 :0 to 9:1) to give the fully protected 3a as a brown oil (37.5 g, 95%, 2 isomers 9: 1).

2-0-Levulinyl-3,4-0-(2%3'-diraethoxybutan-2' '-diyl)-a-L-rhamnopyranosyl

trichloroacetimidate (5a)

To a solution of rhamnoside 3a (37.5 g, 90.0 mrnol) in anhyd. tetrahydrofuran (THF, 500 mL) stirred at rt was added hydrogen-activated 1,5- cyclooctadienebis(methyldiphenylphosphine)iridium(I) hexafluorophosphate (1.5 g, 1.80 mrnol, 0.02 equiv.). After stirring the reaction mixture at room temperature for 2 h, TLC (cyclohexane/EtOAc 5:5) showed complete conversion of the starting material (Rf = 0.8, 0.85) into two closely eluting products (Rf = 0.8, 0.9). N-iodosuccinimide (NTS, 24.3 g, 108 mmol, 1.2 equiv.) in water/THF (1 :5, 250 mL) and then additional water (370 mL) were added to the mixture stirred at 0 °C. After stirring the reaction mixture for 2 h at this temperature, TLC (cyclohexane/EtOAc 5:5) showed complete conversion of the intermediate into two more polar products (Rf = 0.45, 0.5). Saturated aqueous sodium metabisulfite (500 mL) and then ethyl acetate (500 mL) were added. The aq. layer was extracted twice with ethyl acetate (300 mL) and the combined organic phases were washed with saturated aqueous NaHCOs (300 mL) and then brine (300 mL). The organic layer was dried over anhyd. sodium sulfate, filtered and concentrated under vacuum.

To a solution of crude hemiacetal 4a (90.0 mmol) in anhydrous DCE (500 mL) stirred at rt were successively added trichloroacetonitrile (27.1 mL, 0.27 mol, 3.0 equiv.) and DCC (6.7 mL, 45.0 mmol, 0.5 equiv.). After stirring the reaction mixture for 2.5 h at room temperature, TLC (cyclohexane/EtOAc 7:3) showed complete consumption of the starting material (Rf = 0.45, 0.5) into two less polar product (Rf = 0.75, 0.5). Volatiles were evaporated under reduced pressure. The residue was purified by chromatography eluting from a column of Et N-treated silica gel (cyclohexane EtOAc 8:2 to 6:4) to give donor 5a as a yellow to brownish oil (38.6 g, 82%, a only, 2 isomers 9:1).

Synthesis of the glucosamine D acceptor

The known glucosamine D acceptor was obtained following the procedure below, through route A or the improved route B (Carbohydrate Chemistry: Proven Synthetic Methods, Eds P. Murphy and C. Vogel, 2017, vol. 4, chap. 39, in press).

Route A

Route B 88%

Synthesis of the glucosamine D donor.

The known glucosamine D donor 12 was obtained as published {Tetrahedron Lett, (2008) 49, 5339-42). Alternatively, the analogue of donor 12, equipped with a 4,6-O-benzylidene acetal instead of a 4,6-O-isopropylidene acetal could be obtained from the D acceptor 9 according to the procedure described for the conversion of alcohol 10 into donor 12.

It is noteworthy that alcohol 10 can also serve as a suitable acceptor in the synthesis of compounds of the formula (la).

11 84% 12

[ir] = [!r(COD) 2 (PPh 2 Me) 2 ]PF 6

A synthesis of donor 12a, which encompasses a 4,6- -benzylidene acetal and a TBS ether in place of the 4,6-O-isopropylidene acetal and the Lev group, respectively, is exemplified in the following.

ΤΒδθ-^- ^ OPTFA

TBSOTf Ph --Y-o-A 1 [Ir], THF NHC(0)CCI 3

2,6-lutidine 2. aq I 2 , THF, NaHC0 3 12a

Acceptor TBSO^-^-OAII „ + 2: 1

10 DCM. -78 NHC(0)CCI 3 3. PTFA-Ci, K 2 C0 3

92% 11a Acetone

[Ir] = [irH 2 {THF) 2 (PPh 2 Me) 2 ]PF 6 65%

Ailyl 4,6-0-benzylidene-3-0-/e^butyldimethylsilyl-2-deoxy-2-trichl oroacetamido-P-D- glucopyranoside (11a)

To a solution of allyl 4,6-0-benzylidene-2-deoxy-2-trichloroacetamido-P-D-glucopyra noside

(10) (2,0 g, 4.42 mmol) in anhydrous dichloromethane (40 mL), stirred at -78 °C, were successively added 2,6-lutidine (2.06 mL, 17.7 mmol, 4.0 equiv) and

trifluoromethanesulfonate (2.54 mL, 1 1.05 mmol, 2.5 equiv). After stirring the reaction mixture overnight at room temperature, a TLC follow up (toluene/ethyl acetate 8:2) showed complete consumption of the starting material (Rf = 0.25) and the presence of a less polar product (Rf = 0.7). Distilled water (50 mL) and ethyl acetate (200 mL) were added. The aqueous layer was extracted with ethyl acetate (100 mL) and the combined organic phases were washed with water (50 mL), 10% aq, citric acid (50 mL) and then water (50 mL). The organic layer was dried over anhydrous sodium sulfate, filtered and concentrated under vacuum. The residue was purified by chromatography eluting from a column of silica gel (toluene/ethyl acetate 95:5 to 9: 1) to give the fully protected 11a as a white powder (2.32 g, 92 %).

4,6-0-Benzylidene-3-0-fcrz butyldim

glucopyranosyl N-(phenyl)trifluoroacctimidate (12a)

To a solution of glucosaminide 11a (2.27 g, 4.0 mmol) in anhydrous THF (200 mL) stirred at room temperature was added hydrogen-activated 1 ,5- c clooctadienebis(methyldiphenylphosphine)iridium(I)hexafluoro phosphate (170 mg, 0.20 mmol, 0.05 equiv). After stirring the reaction mixture for 4 hours at room temperature, a TLC follow up (toluene/ethyl acetate 9: 1) showed complete conversion of the starting material (Rf = 0.6) into a less polar product (Rf = 0.6). N-Iodosuccinimide (1.35 g, 6.0 mmol, 1.5 equiv) in 1 :5 water THF (56 mL) was then added to the mixture stirred at room temperature. After stirring for 2 h at this temperature, a TLC follow up (toluene/EtOAc 9: 1) showed complete conversion of the intermediate into a more polar product (Rf = 0,2). Saturated aqueous sodium metabisulfite (200 mL) and then ethyl acetate (500 mL) were added. The aqueous layer was extracted twice with ethyl acetate (250 mL) and the combined organic phases were washed with saturated aqueous NaHCO? (200 mL) and then brine (200 mL). The organic layer was dried over anhydrous sodium sulfate, filtered and concentrated under vacuum.

To a solution of crude hemiacetal (4.42 mmol) in anhydrous acetone (36 mL), stirred at room temperature, were successively added A r -(phenyl)trifluoroacetimidyl chloride (1.80 mL, 6.63 mmol, 1.5 equiv) and potassium carbonate (1.53 g, 1 1.05 mmol, 2.5 equiv). After stirring the reaction mixture for 3 hours at room temperature, TLC (toluene EtOAc 8:2) showed complete consumption of the starting material (Rf = 0.2) and the presence of a mixture of less polar products (Rf ~ 0.75 and 0,8). The mixture was filtered off a pad of Celite® and the filtrate was concentrated under reduced pressure. The residue was purified by chromatography eluting from a column of silica gel (toluene/EtOAc 95:5 to 9: 1) to give a 2: 1 mixture of the N-(phenyl)trifluoroacetimidate donor 12a and oxazoline 12b as a brown oil (2.57 g, 65 % over two steps).

Synthesis of the ABoxeC-Z'

Selected examples

Allyl 2-i?-chIoroacetyl-4-0-(2-napht\lraethyl)-a-L-rhamnopyranosid e (13)

Diol 1 (2.0 g, 6.0 mmol) was solubilized in anhydrous MeCN (5.0 mL). To the solution was added trimethylchloroorthoacetate (2.35 mL, 3.0 equiv) and PTSA (90 mg, 0.08 equiv). The solution was stirred at room temperature for 1 hour (reaction followed by TLC Toluene/EtOAc 7:3). To the reaction medium cooled to 0 °C was added a 90% aqueous TFA (3.0 mL) and the reaction mixture was stirred at room temperature for 15 min. Water (20 mL) was added. The product was extracted with DCM (2 x 40 mL). The organic phase was washed with saturated aqueous NaHC0 3 (2 x 25 mL) and brine (25 mL). The aqueous phase was extracted with DCM (2 x 25 mL). The combined organic phases were dried over Na 2 S0 4 , filtered, evaporated and finally co-evaporated with toluene to yield the crude alcohol 13 as a 92:8 mixture of regioisomers. Allyl 2-0-levulinyl-4-0-(2-naphtylmethyl)-3-(7-triethylsilyl-a-L-r hamnopyranosyI- (l→3)-2-0^hloroacetyl-4-0-(2-naphtylmcthvI)-a-L-rharnnopyr anoside (14)

A solution of trichloroacetimidate 5 (5.84 g, 8.83 mmol) and crude acceptor 13 (4.09 g, 1.1 equiv) in toluene (88 mL) containing 4A-MS (1.25 g) was stirred at room temperature for 15 min, then at -60 °C for 15 min. ter/-Butyldimethylsilyl trifluoromethanesulfonate (TBSOTf, 100 μΐ-, 0.05 equiv) was added to reaction mixture stirred at -60 °C and the bath was left to reach -40°C. After stirring for 1 h, Et 3 N was added to the suspension at -35 °C, the mixture was filtered through a pad of Celite®, and the filtrate was concentrated to dryness. Rapid filtration of the residue over silica gel and crystallization in EtOAc/pentane 5:1 (400 mL) gave the fully protected 14 (3.83 g, 72%). Ally! 2-0-levulinyl-4-0-(2-naphtyImcthyl)-3-(9-tricthylsilyI-a-L-r hamnopyranosyI-

(l→2)-4-0-(2-naph<ylmetliyl)-3-0-triethyIsIlyI-a-L-r hamnopyranosyl-(l→3)-2-0- chloroacetyl-4-0-(2-naphtylmethyl)-a-i.-rhamnopyranoside (16)

A solution of trichloroacetimidate 5 (1.04 g, 1.24 equiv) and alcohol 15 (1.0 g, 1.22 mmol) in toluene (30 mL) containing 4A-MS (1.38 g) was stirred at room temperature for 15 min, then at -20 °C for 15 min. TMSOTf (11 μί, 0.05 equiv) was added to reaction mixture stirred at - 20 °C and the bath was left to reach -10 °C. After stirring for 1 h at this temperature, stirring is pursued for 1 h while the bath slowly reached room temperature. Et 3 N was added to the suspension, the mixture was filtered through a pad of Celite®, and the filtrate was concentrated to dryness. Column chromatography gave the fully protected ABQA C C trisaccharide 16 (1.36 g, 85%).

HRMS (ESI+): m/z 1336.6191 (calcd for C73H99CI4NO16S.2 [M+N¾f) found m/z 1336.6171.

2-0-LevuHnyM- -(2-naphtylmethyI)-3-0-triethylsilylHi-L-rhamnopyranosyl-(l 2)-4- 0-(2-naphtyImethyI)-3-0-triethylsily!-a^^

0-(2-naphtyimethyl)-a/p-L-rhamnopyranose (17)

To a solution of the fully protected ABQA C C (1.04 g, 7.6 mmol) in anhydrous THF (35 mL) stirred at room temperature was added hydrogen-activated [Ir] (13.0 mg, 0.02 equiv). After stirring the reaction mixture for 45 min at room temperature, TLC (toluene/EtOAc 9: 1) showed complete conversion of the starting material into a less polar product. The reaction mixture was cooled to 0 °C, NIS (205 mg, 1.2 equiv) in 1 :5 water.THF (17.5 mL) and then distilled water (25 mL) were added to the mixture stirred at 0 °C. After stirring the reaction mixture for 1.5 h at this temperature, TLC (toluene ethyl acetate 9: 1) showed complete conversion of the intermediate into a more polar product. 10% aqueous sodium metabisulfite (50 mL) was added. THF was evaporated under reduced pressure and DCM (100 mL) was added. The aqueous layer was extracted twice with DCM (50 mL) and the combined organic phases were washed with saturated aqueous sodium bicarbonate (50 mL) and then brine (50 mL). The organic layer was dried over anhydrous sodium sulfate, filtered and concentrated under vacuum. The residue was purified by chromatography eluting from a column of Et 3 N- treated silica gel (toluene/ethyl acetate 85: 15 to 0: 100) to give hemiacetal 17 (825 mg, 85%). HRMS (ESI+): m/z 1296.5878 (calcd for C^oH sC NO^Sia [M+NII 4 ] + ) found m/z 1296.5922. 2-0-Levulinyl-4-0-(2-naphtyImethyl)-3-0-^

0-(2-naphtylmethyl)-3-0-triethy!silyl-a-L-rhamnopyranosyl-(l -→3)-2-0-chloroacety (9-(2-naphtylmethyI)-a/ -L-rhamnopvTanosyl trichloroacetimidate (li)

To a solution of the AB C IA C C triose 17 (700 mg, 550 μπιοΐ) in anhydrous DCE (5.0 rriL) stirred at room temperature were successively added CI3CCN (165 pL, 3.0 equiv) and DBU (40 μΐ,, 0,5 equiv). After stirring the reaction mixture for 45 min at room temperature, the same amounts of CCI3CN and DBU were added and the reaction was stirred for 1 h more at room temperature. Volatiles were evaporated under reduced pressure. The residue was purified by chromatography eluting from a column of EtsN-treated silica gel (toluene/EtOAc 8:2 containing 3%o I¾N) to give donor IS (830 mg, 84%).

FIRMS (ESI+): m/z 1439.4974 (calcd for C72¾ 5 Cl 4 N 2 0 16 Si 2 [M+N¾ ) found m/z 1439.4912.

2-0-Levulinyl-4-0-(2-naphtylmethyI)-3-0-triethylsilyl-o-L -rhaninopyranosyl-(l *2)-4- 0-(2-naphtylmethyl) -( riethylsilyl-a-L-rt

0-(2-naphtyimetliyl)-a/p-L-rliaiiinopyraiiosyl Λ-phenyitrifluoroacetimidate (19)

To a solution of the ABCIA C C triose 17 (1.56 g, 1.22 mmol) in acetone (24.4 iiiL) stirred at room temperature were successively added K 2 C0 3 (337 ml, 2.0 equiv) and N- phenyltrifluoroacetimidyl chloride (PTFACl, 290 pL, 1.5 equiv). After stirring the reaction mixture overnight at room temperature, a TLC (cyclofaexane/EtOAc 2:8) indicated that that the starting 17 had been converted to a less polar product. The suspension was filtered over a pad of Celite® and volatiles were evaporated under reduced pressure. The residue was purified by column chromatography to give donor 19 (1.56 g, 88%). ABciAcCcBAcD- ll may also be obtained through the alternative route B as defined below, whereby the protecting group differs from that shown above and are: R 1 = Nap, R 3 , R 4 = R 6 , R 4 = BDA, R 2 = CIAc, R 5 = Lev, R 8 = All, R 9 = C13Ac).

[Ir] = [lrH 2 {THF) 2 (PPh 2 ye) 2 ]PF 5

Selected examples

Allyl 2-(?-levulinyl-3,4-0-(2' '-diniethoxybutan-2'^'-diyl)-a-L-rharanopyranosyl- (l-^3)-2-0-chloroacetyl-4-0-(2-naphtylmethyl)-a-L-rhamnopyra noside (14a)

To a solution of cmde allyl 2-0-chloroacetyl-4-0-(2-methylnaphthyl)- -L-rhamnopyranoside

(5, 6.91 mmol, 1.2 equiv.) in anhyd. diethyl ether (60 mL), stirred at it, were successively added donor 5a (3.0 g, 5.76 mmol) and activated 4A molecular sieves (3.0 g). After stirring for 15 min at rt, the reaction mixture was cooled down to -20 °C and stirred for an additional 15 min at this temperature. Trimethylsilyl trifluoromethanesulfonate (TMSOTf, 0.18 mL, 1.15 mmol, 0.2 equiv.) was then slowly added. After stirring the reaction mixture for 1 h at - 20 °C, TLC (toluene/ethyl acetate 7:3) showed complete consumption of the donor 5a (Rf = 0.4) and the presence of a less polar product (Rf = 0.6). Triethylamine was then added until neutralization. The reaction mixture was stirred for 15 min at -20 °C and then filtered off a pad of Celite®. The filtrate was concentrated under reduced pressure. The residue was purified by chromatography eluting from a column of silica gel (cyclohexane/ethyl acetate 97:3 to 7:3) to give disaccharide 14a as a yellow oil (3.1 g, 69 %).

Aliyl 3,4-0-(2V^'-dirnethoxybutan-2' '-diyl)-a-L-rhamnopyranosyl-(l'^3)-2-0- chIoroacetyI-4-0-(2-naphtyImethyl)-a-L-rhamnopyranoside (15a)

To a solution of disaccharide 14a (1.0 g, 1.28 mmol) in pyridine/acetic acid (3:2, 13 mL), stirred at room temperature, was added hydrazine monohydrate (125 μί, 2.57 mmol, 2.0 equiv.). After stirring the reaction mixture at room temperature for 1 h, a TLC follow up (toluene/ethyl acetate 7:3) showed complete consumption of the starting material (Rf = 0.6) and the presence of a closely eluting product (Rf = 0.6). Saturated aqueous NaHCOi (15 mL) was added and the aqueous layer was extracted twice with dicMoromethane (15 mL) and the combine organic phases were washed once with brine (15 mL). The organic layer was dried over anhydrous sodium sulfate, filtered and concentrated under vacuum. The residue was purified by chromatography eluting from a column of silica gel (toluene/ethyl acetate 9: 1 to 7:3) to give di saccharide 15a as a yellow oil (0.63 g, 72 %).

AIM 3,4-0-(2 ',3 '-dimethoxybutan-2 ',3 '-diyl)-2-0-levulinyl-a-L-rhamnopyranosyl- (1 -»2)-3,4-0-(2' '-dImethoxybutan-2 ',3 '-diyl)-a-L-rhamnopyranosyi-(l ->3)-2-0- chloroacetyl-4-0-(2-naphtylmethyl)-a-L-rhamnopyranoside (16a)

To a solution of alcohol 15a (300 mg, 0.44 mmol) in anhyd. diethyl ether (4.0 mL), stirred at rt, were successively added donor 5 (252 mg, 0.48 mmol, 1.1 equiv.) and activated 4A MS (0.3 g). After stirring the reaction mixture for 15 min at rt, the reaction mixture was cooled down to -20 °C and stirred for an additional 15 min at this temperature. Trimethylsilyl trifluoromethanesulfonate (6.8 μΙ. -5 44 μιιιοΐ, 0.1 equiv.) was then slowly added. After stirring the reaction mixture for 2 h at -20 °C, TLC (toluene/diethyl ether/ethyl acetate 5:4:1) showed complete consumption of disaccharide 7 (Rf = 0.65) and the presence of a less polar product (Rf = 0.7). Triethylamine was then added until neutralization. The reaction mixture was stirred for 15 minutes at -20 °C and then filtered off a pad of Celite®. The filtrate was concentrated under reduced pressure. The residue was purified by chromatography eluting from a column of silica gel (toluene/ethyl acetate 9: 1 to 7:3) to give trisaccharide 16a as a yellow to brown oil (370 mg, 81 %).

Synthesis of ABCIA C CCUA C D-AII

ABc'.AcCcDAeD-All (ABC'D', 21) was obtained following the generic route B (the first steps of route B up to the triosyl donor IS being in particular described above) as defined below. The activation glycosylation and a final two-step orthogonal deprotcction of route B are in particular performed with a rhamnotriosyl donor and a monosaccharide D acceptor 9 as shown in the more detailed scheme, which just follows. In the exemplified synthesis (R 1 = R 4 = Nap, R 3 = R 6 = TES, R 2 = C Ac, R 5 = Lev, R 8 = All, R 9 = C13Ac).

ABciAcCc AeD-All may also be obtained through the alternative route A as defined in the overview scheme below. ABCIA C CCI3A C D-A11 may also be obtained through the alternative route B as defined below, whereby the protecting group differ from that shown above and are: R 1 = PMB, R 3 = R 4 = Nap, R 6 = TES, R 2 = ClAc, R 5 = Lev, R 8 = All, R 9 = CBAc).

Selected examples

Ally! 2-0-levuiinyl-4-0-(2-naphtylinethyI)-3-i?-triethylsilyl-a-L- rhamnopyranosyI-

(l→2)-4-0-(2-naplitylitietliyl)-3-0-triethy!silyI-a-L-r hameopyranosyl-(l→3)-2-0- chloroacetyM-0-(2-naphtylmethyl)-a-L-rhamnopyranosyKl→3)- ,6- ?-benzylidene-2- deoxy-2-N-trichloroacetyl- ^D-glucopyranoside (20)

A solution of the triosyl trichloroacetimidate 18 (924 mg, 649 μηιοΐ) and the D acceptor 9 (440 mg, 1.5 equiv) in toluene DCM (3: 1 , 37 mL) containing 4A-MS (1.91 g) was stirred at room temperature for 15 min, then at -15 °C for 15 min. TBSOTf (22 μΐ., 0.15 equiv) was added to reaction mixture stirred at -15 °C and the bath was left to reach -0 °C in 1.3 h. A follow up by TLC (toluene/EtOAc 9: 1) indicated consumption of the donor. I¾N was added to the suspension, the mixture was filtered through a pad of Celite®, and the filtrate was concentrated to dryness. Column chromatography gave the fully protected ABCIA C CCBA C D tetrasaccharide (20, 932 mg, 84%).

HRMS (ESI+): m/z 1729.6128 (calcd for C 88 Hn3Cl 4 N 2 0 2 ,Si 2 [M+NH 4 ] ') found m/z 1729.6161. Allyl a-L-rhamnopyranosyI-(i→2)-a-L-rhainnopyranosyl-(l→3)-2-i ?-chloroacetyI-a-L- rhamnopyranosyl-(l→3)-2-deoxy-2-A , -trichloroacetyl- , --)-glucopyranoside (21)

To a solution of the alcohol (222 mg, 137 μηιοΐ) - issued from the delevulinylation of the fully protected 20 - in toluene stirred at 0 °C was added TFA to reach a TF A/toluene ratio of 9: 1. The reaction mixture was stirred overnight, at which point a follow up by TLC indicated total consumption of the starting tetrasaccharide and the presence of a major more polar product. Volatiles were coevaporated twice with toluene, then once with acetonitrile and finally with MeOH. The crude material was purified by reverse phase flash chromatography (H 2 0/MeCN ()→50%), then by RP-HPLC (MeCN/¾0 20.5%) to give the target AB C ) A cCc!3 A cD-All 21 following freeze-drying (70 mg, 36%). RP-HPLC (C I 8 ® RP fusion

(4.6x250 mm, 4.0 μηι, 80 A, CH 3 CN in H 2 0 (30% for 4 mill, then 30→4Q% over 7 min, at 1.0 mL.min 5 ), 40 °C, λ: 220 nm) = 7.6 min.

! H NMR (800 MHz, D 2 0), δ 5.84 (m, 1H, -CH=), 5.25 (m, IH, J trans = 17.3 Hz, =CH 2 ), 5.19 (m, 1 H, J cis = 10.5 Hz, =C¾), 5.13 (dd, 1 H, J 2>3 = 3.0 Hz, J 2 ,i = 1.9 Hz, H-2 C ), 5.1 1 (d, IH, J !>2 = 1.3 Hz, H-1 B ), 4.86 (d, IH, J 1>2 = 1.6 Hz, H-l c ), 4.86 (d, IH, J i)2 = 1.5 Hz, II- 1 A ), 4.66 (d, 1H, J 1>2 = 8.0 Hz, H-l D ), 4.28 (m, 1 H, HAH), 4.27 (d, 1H, J gem =15.4 Hz, H OA ), 4.23 (d, I H, J GEM =15.4 Hz, HCIA C ), 4.12 (m, I H, H A1I ), 4.05 (m, IH, H-5 C ), 3.99 (dd, IH, ½ = 33 Hz, J 2;1 = 1.7 Hz, H-2 A ), 3.93 (dd, I H, J 3 , 4 = 9.7 Hz, J 3 , 2 = 3.1 Hz, H-3 C ), 3.89 (dd, 1H, J 2>3 = 4.7 Hz, J 2, i = 1 .7 Hz, H-2 B ), 3.88 (d, 1H, ½ = 2.1 Hz, H-6b D ), 3,86 (m, 1H, H-2 D ), 3.72 (m, 2H, H-3 D) H-3 A ), 3.71 (d, 1H, J 6>5 = 3.3 Hz, H-6a D ), 3.69 (m, 1H, H-5 B ), 3.63 (dd, 1 H, J 3 , 4 = 9.9 Hz, J 3,2 = 3.5 Hz, H-3 B ), 3.62 (m, 1H, H-5 A ), 3.54 (dd, 1H, J 4,5 = 9.9 Hz, J 4>3 = 9.9 Hz, H-4 D ), 3.52 (dd, 1H, J 4>5 = 9.8 Hz, J 4 , 3 = 9.8 Hz, H-4 C ), 3.42 (m, 1H, H-5 D ), 3.38 (dd, 1H, J 4>5 = 9.8 Hz, J 4(3 = 9,8 Hz, H-4 B ), 3.37 (dd, 1H, J 4>5 = 9.8 Hz, ½ = 9.8 Hz, H-4 A ), 1.25 (d, 3H, J 6)5 = 6.2 Hz, H- 6B), 1.20 (d, 3H, ¾ 5 = 6.4 Hz, H-6 A ), 1.1 (d, 3H, J 6 , 5 = 6.4 Hz, H-6 C ).

13 C NMR (800 MHz, D 2 0), δ 168.6 (CO aAc ), 164.7 (CO NHC (0)cc: 3 ), 133.0 (CH= A „), 1 18.7 (=CH 2A11 ), 102,3 (C-1 A ), 100.5 (C- 1 H ), 99.1 (C-1 D ), 98.5 (C-l c ), 91.6 (CC1 3 ), 81.7 (C-3 D ), 78.0 (C-2 B ), 76.1 (C-5 D ), 74.5 (C-3 C ), 73.4 (C-2 C ), 72,2 (C-4 C ), 72.0 (2C, C-4 B , C-4 A ), 70.8 (- CH 2 - Aii ), 70.0 (2C, C-3 A , C-2 A ), 69.9 (C~3 B ), 69.5 (C-5 B ), 69.2 (C-5 C ), 69.1 (C-5 A ), 68.5 (C- 4 D ), 60.7 (C-6 D ), 57.1 (C-2 D ), 40.7 (-CH 2 - aA c), 16.7 (C-6 B ), 16,6 (C-6 A ), 16.2 (C-6 C ).

HRMS (ESI+): m/z 895.1840 (calcd for C 3 :H 47 Cl 4 Oi9NH4 [M+NH 4 ] + ) found m/z 895.1860.

Allyl a-L-rhamnopyranosyI-(l-→2)-a-L-rhamnopyranosyI-(l→3)-2-( -chloroacetyl-a-L- rhamnopyranosyI-(l→3)-2-deoxy-2-A-tr!chIoroacet l- -i>-glucopyranoside (21a)

To a solution of the lightly protected tetrasaccharide ABQA C CCBACD-AH (21, 200 mg, 0.23 mmol) in anhydrous methanol (31 rnL) was added MeONa (25% w/w in MeOH, 126 pL, 0.55 mmol, 2.4 equiv.). After stirring the reaction mixture for 3 h, a TLC follow up (cyclohexane/ethyl acetate 8:2) showed complete consumption of the starting material (Rf = 0.15). Dowex if was then added until neutralization and filtered off, volatiles were evaporated under reduced pressure. The residue was purified by reverse phase chromatography eluting from a C18 column (water/MeCN 1 :0 to 6:4) to give, after lyophilization, tetrasaccharide ABCci 3A cD-All 21a as a white powder (76 mg, 42%). RP- HPLC (CI 8 RP fusion (4.6x250 mm, 4.0 μιη, 80 A, CH 3 CN in H 2 0 (30% for 4 min, then

30→40% over 7 min, at 1.0 mL.min "1 ), 40 °C, λ: 220 nm) = 3.8 min.

HRMS (ESI+): m/z 819.2124 (calcd for C30H 4 6CI3 O18NH4 [M+NH 4 ] ') found m/z 819.2745; m/z 824.1618 (calcd for C 3 oH4 6 Cl 3 NQ ! 8 Na [M+Na] ') found m/z 824.2239.

Synthesis of an alternative B donor (25)

Toluene, 60°C, overnight overnight

22 23

AHyl 3,4-di-0-(2-naphtylmethyl)-a-L-rhamnopyranoside (22)

Dibutyltin oxide (4.0 g, 1.1 equiv) was added to a solution of diol 1 (5.0 g, 15.0 mmol) in anhydrous toluene (100 mL). The mixture was stirred for 2 h at reflux using a Dean-Stark apparatus. After cooling to it, dry CsF (2.2 g, 1.0 equiv), dry tetrabutylammoniun iodide (6.97 g, 1.3 equiv) and 2-naphtylmethyl bromide (3.54 g, 1. 1 equiv.) were successively added. After heated at 60 °C overnight, a TLC control (toluene/EtOAc 8:2) showed the total consumption of starting diol 1. After cooling to 0 °C, salts were removed by filtration over a pad of Celite® and solvents were evaporated under reduced pressure. The crude was purified by flash chromatography to give alcohol 22 (5.02 g, 69%) as a brown oil.

Ή NMR (CDCI3) «5 7.81 (m, 8H, HA T N. p ), 7.50 (m, 6H, H ArNap ), 5.93 (m, 1 11, J mils = 17.1 Hz,

J gem = 1.5 Hz, CH=CH 2 ), 5.31 (m, 1 H, J cb = 10.4 Hz, CH=C¾), 5.22 (m, 1 H, CH=Ci¾), 5.10 (d, 3H, J = 1 1 .2 Hz, CH 2Na p), 4.91 (m, 2H, CH 2Nap , H- l ), 4.21 (m, 1H, HAH), 4.16 (m, 1 H, H- 2), 4.02 (m, 2H, H A! i, H-3), 3.85 (m, 1H, H-5), 3.60 (pt, 1 H, J 3A = J4.5 = 9.3 Hz, H-4), 2.58 (bs, 1H, Jj.oH = 9.6 Hz, OH), 1.40 (d, 3H, J 5i6 = 6.3 Hz, H-6).

HEMS (ESf ): m/z 502.2608 (calcd for C31H36O5N [M+N¾] + m/z 502.2593)

Allyl 2-0-levulinyI-3,4-di-(9-(2-naphtylinethyl)-a-L-rhamnopyranos ide (23)

To a solution of alcohol 22 (5.02 g, 9.0 mmol) in anhydrous DCM (42 mL) stirred at room temperature were successively added DCC (3.16 g, 1 .7 equiv), DMAP (440 mg, 0.4 equiv) and levulinic acid (1,94 mL, 4.2 equiv). After stirring the reaction mixture overnight at room temperature, TLC (toluene/EtOAc 7:3) showed complete consumption of the starting material. The reaction mixture was filtered over a pad of Celite® , and volatiles were evaporated under reduced pressure. The crude material was taken in ethyl acetate (50 mL) and the organic layer was washed thrice with 10% aqueous copper (II) sulfate (30 mL), water (30 mL), saturated aqueous sodium bicarbonate (30 mL) and brine (30 mL). The organic layer was dried by stirring over anhydrous sodium sulfate, filtered and concentrated under vacuum. The crude material was used as such in the next step. 2-0-LevuIinyl-3,4-di-i?-(2-naphtylmeth l)-a-L-rhamnopyranose (24)

To a solution of the fully protected 23 (from 22, 9.0 mmol) in anhydrous THF (50 mL) stirred at room temperature was added hydrogen-activated [Ir] (152 trig, 0.02 equiv) in anhydrous THF (10 mL). After stirring the reaction mixture for 2 hours at room temperature, another 0.02 equiv of hydrogen-activated [Ir] vas added and the reaction mixture was stirred overnight at room temperature. TLC (cyclohexane EtOAc 6:4) showed that the starting allyl rhamnoside had been consumed. Iodine (2.74 g, 1.2 equiv) in water/THF (1 :5 , 72 mL). After stirring the reaction mixture for 1 h at this temperature, TLC (cyclohexane/ ' EtOAe 6:4) showed complete conversion of the intermediate into a more polar product. 10% aqueous sodium metabisulfite was added. THF was evaporated under reduced pressure and DCM (50 mL) was added. The aqueous layer was extracted twice with DCM (30 mL) and the combined organic phases were washed with saturated aqueous sodium bicarbonate and then brine. The organic layer was dried over anhydrous sodium sulfate, filtered and concentrated under vacuum. The crude material was used as such in the next step. 2-0-Levulinyl-3,4-di-0-(2-naphty lrnethyl)-a-L-rhamnopyranosyl trichloroacetimidate

(25)

To a solution of hemiacetal 24 (from 22, 9.0 mmol) in anhydrous DCE (50 mL) stirred at room temperature were successively added CI3CC (2.7 mL, 3.0 equiv) and DBU (670 μί, 0.5 equiv). After stirring the reaction mixture for 2 h at room temperature, TLC (toluene/EtOAc7:3) showed complete consumption of the starting material. Volatiles were evaporated under reduced pressure, The residue was purified by chromatography eluting from a column of EtsN-treated silica gel (toluene/EtOAc 95:5 to 8:2 containing 1¾N 5%o) to give donor 25 as a brown oil (5.05 g, 88% over three steps). Ή NMR (CDCI3) δ 8.67 (s, 1H, H), 7.81 (m, 8H, H^ p ), 7.49 (m, 6H, H ArNap ), 6.22 (d, 1 H, H-l), 5.56 (dd, 1H, J u = 2.1 Hz, J 2 = 3,3 Hz, H-2), 5.14 (d, III, J = 1 1.1 Hz, CH 2Nap ), 4.92 (d, 1H, J= 1 1.4 Hz, CH 2Nap ), 4.87 (d, 1 H, CH 2Nap ), 4.77 (d, 1H, CH 2Nap ), 4.10 (dd, 1 H, H-3), 4.00 (m, 1H, H-5), 3.62 (pt, 1 H, J 4 = J 4 = 9.5 Hz, H-4), 2.78 (m, 4H, CH 2Le v), 2.18 (s, 3H, CHj v), 1.40 (d, 3H, J 5i6 = 6.2 Hz, H-6)

13 C NMR (CDCI3) δ 206.0 (CO Lev ), 171.9 (C0 2Lcv ), 160.1 (NHCO), 135.6433.2 (6C, Qv), 128.2-126.1 (14C, C ArNap ) 5 95.2 (C- l , ! J C . = 179.5 Hz), 79.4 (C-4), 77.1 (C-3), 75.7 (C Nap ), 72.0 (C Nap ), 70.8 (C-5), 67.9 (C-2), 38.0 (CH 2Lev ), 29.8 (CH 3Le v), 28.1 (CH 2L ev), 18.1 (C-6). HRMS (ES ): mlz 703.1744 (calcd for C 35 H 38 C1 3 N0 7 [M+ NH 4 ] + miz 703.1749)

Synthesis of an alternative C acceptor (28)

1 ) AIIOH, AcCl

70 °C, 2.5h then 1 ) PMBCI, NaH

40°C overnight DMF, RT, overnight

2) Me 2 C(OMe) 2 . PTSA 2) 80% aq. AcOH

acetone, RT, 3h 60 °C, 3 d

26 27

0 °C, 10 min 28 Ally! 4-0- 7flra-methoxybenzyI-a-L-rhamnopyranoside (27)

Grade allyl 2,3-O-isopropylidene-a-L-rhamnopyranoside (26, from L-rhamnose, 1 10 mmol) was dissolved in DMF (320 mL) under Argon, the bath temperature was cooled to -5 °C and NaH (60% oil dispersion, 10.6 g, 2.4 equiv.) was added portioewise to this suspension. The mixture was stirred for 2 h at rt, then /wra-methoxybenzyl chloride (17.9 mL, 1.2 equiv.) was added dropwise at -5 °C and the reaction mixture was stirred at rt overnight. Follow up by TLC (toluene/EtOAc 8:2) indicated the total conversion of the intermediate alcohol into a less polar product. The reaction was quenched at 0 °C by addition of MeOH (20 mL). Solvents were eliminated under reduced pressure and volatiles were co-evaporated with toluene. The residue was taken up in EtOAc (200 mL) and washed with H 2 0 (3 x 120 mL) and brine (120 mL). The organic phase was dried over anhydrous Na 2 S0 4 , filtered, and concentrated to dryness to give the fully protected intermediate. The crade intermediate was dissolved in 80% aq. AcOH (200 mL) and the solution was stirred for 3 d at 60 °C. Follow up by TLC (toluene/EtOAc 7:3) indicated the total conversion of the intermediate acetal into a more polar product. Solvents were removed under vacuum and traces of AcOH were eliminated by co-evaporation with cyclohexane (2 x 100 mL) to give a brown solid. Crystallization from cyclohexane and column chromatography of the mother liquor eluting with toluene/EtOAc 8:2 to 6:4) gave diol 28 (27.9 g, 78% over four steps), m.p. = 72 °C (cyclohexane).

Ή NMR (CDCI3) δ 730 (m, 2H, Η ΑΓΡΜ Β), 6,91 (m, 2H, Η ΑΓΡΜ Β), 5.89 (m, 1H, C7/=€H 2 ), 5.29 (m, 1 H, J TRANS =17,2 Hz, J gem = 1.6 Hz, CH=CI¾), 5.20 (m, 1H, J trans = 10.4 Hz, CH=Ci¾) » 4.81 (d, 1H, Ji,2 = L I Hz, H-l), 4.69 (m, 2H, CH 2P MB), 4.17 (m, 1H, H A!! ), 3.97 (m, 3H, H-All, H- 2, H-3), 3.82 (s, 3H, CH 3P MB), 3.75 (m, 111, H-5), 3.35 (pt, 1 H, J ? , 4 = J 4 .5 = 9.2 Hz, H-4), 2.45

(bs, 2H, OH), 136 (d, 3H, J 5 , 6 = 6.3 Hz, H-6).

Allyl 2-<?-chloroacetyI-4-0- >arfl-methoxybenzyl-a-L-rhamnopyranoside (28)

Diol 27 (1.0 g, 3.0 mmol) was solubilized in anhydrous acetonitrile (MeCN, 5 mL). To the solution was added trimethylchloroorthoacetate (1.39 mL, 3.0 equiv) and APTS (59 mg, 0.1 equiv). The solution was stirred at room temperature for 1 hour (reaction followed by TLC toluene/EtOAc 7:3). To the reaction medium cooled to 0 °C was added a 90% aqueous TFA (2.0 mL) and the reaction mixture was stirred at room temperature for 10 min. Water was added until the mixture became completely cloudy. The product was extracted with DCM (2 x 25 mL). The organic phase was washed with saturated aqueous NaHC(¾ (2 x 25 mL) and brine (25 mL). The aqueous phase was extracted with DCM (2 x 12.5 mL). The combined organic phases were dried over Na 2 S0 , filtered, evaporated and finally co-evaporated with toluene to yield alcohol 28 as a mixture of regioisomers. The crade material is used as such in the next step.

Synthesis of COA C DABCL AC C-AII

c DAc D Bci Ac C-All is obtained following route B' as defined below.

In particular, it can be obtained from donor 12 or its 4,6-0-benzylidene analog and the ABC trioside acceptor 29 following conventional delevulinylation at position 2 A of the fully protected precursor 16. A synthesis is highlighted in the scheme below whereby R 1 = R 4 = Nap, R 3 = R 6 = TES, R 2 = ClAc, R 5 = Lev, R 7 = TBS, R 8 « All, R 9 = C13Ac).

Allyl 4,6-0-benzyIidene-3-0-/ r^butyldimethylsilyl-2-deoxy-2-trichioroacetamido- -D- glucopyranosyl-(i 2)-4-0-(2-naphtylmethyl)-a-L-rIiamiiopyranosyI-(l 2)-4-0-(2- naphtylmethyl)-a-L-rliamHopyranosy!-(l 3)-2-0-cliioroacetyl-4-0-(2-napIiiylmetliyl)- a-L-rhamnopyranoside (32) and allyl 4,6-0-beHzyIidene-2-deoxy-2-trkli!oroacetaiiiido- P-D-glucopyranosyl-(l— *2)-4-0-(2-naph^lmethyl)-a-L-rhamnopyranosyi-(l— »2)-4-0-(2- naphtyIraethyl)-a-L-rhamnopyranosyl-(l 3)-2-0-chloroacetyW-0^2-naphtylmethyl)- a-L-rhamnopyranoside (31)

To a solution of crade tetrasacchande 30 (1.48 mmol) in THF/AcOH (4: 1, 74 mL) was slowly added 1 M TBAF in THF (14.8 mL, 14.8 mmol, 10.0 equiv.). After stirring the reaction mixture overnight at rt, were added TBAF (1M solution in THF, 14.8 mL, 14.8 mmol, 10.0 equiv.) and AcOH (14.8 mL). After stirring the reaction mixture for 2 days at rt, a TLC follow up (cyclohexane/ethyl acetate 7:3) showed the presence of a complex mixture of products (Rf = 0.05, 0.25, 0.35, 0.45 and 0.55). Distilled water (20 mL) and toluene (50 mL) were added. The organic layer was washed with satd aq. NaHC0 3 (20 mL) and brine (20 mL). The organic layer was dried over anhydrous sodium sulfate, filtered and concentrated and co-evaporated with toluene under vacuum. The residue was purified by chromatography eluting from a column of silica gel (toluene/ethyl acetate 95:5 to 4:6) to give by order of elutioii diol 32 (820 mg, 37%) and triol 31 (334 mg, 16%).

Ally! 2^eoxy-2-trichIoroacetamido- -D-gIucopyranosyl-(l— » 2)-a-L-rhamnopyranosyl-(l -→2)-a-L-rhanmopyranosyl-(l- 3)-2-0-chIoroacet l-a-L-rhamnopvTanoside (33)

A solution of tetrasaccharide 31 (334 mg, 0.24 mmol) in trifluoroacetic acid / 1 , 1 , 1 ,3,3,3- hexafluoro-2-propanol (9: 1, 3.0 mL) is stirred at rt for 2 h. Toluene (10 mL) was added and volatiles were evaporated and co-evaporated with toluene (five times) under reduced pressure. The residue was dissolved in water (10 mL) and dichloromethane (5 mL), The organic phase was washed with water (10 mL) and the aq. phases were freeze-dried and lyophilized. The residue was purified by reverse phase chromatography eluting from a CI 8 column (¾0/MeCN 0→ 40%) to give the lightly protected tetrasaccharide D 'ABC '-All (33) as a white powder (74 mg, 35%). RP-HPLC (CI 8 ® P fusion (4.6x250 mm, 4.0 μτη, 80 A, C¾CN in H 2 0 (30% for 4 min, then 30→40% over 7 min, at 1.0 mL.mm '), 40 °C, λ: 220 ran) = 10.2 min.

HUMS (ESI+): m/z 900.1394 (calcd for CsiituCLjNGtgNa [M+ af) found m/z 900.1246.

Route B

5 Example 2: Enzymatic a-D-glucosvlation of a compound of formula (I) with branching siicrases

Enzymes

Table 1 presents some enzymes that were used in the context of the present invention.

Osidlc

Enzyme linkage on

Source organism References

accronym natural

acceptor

Leuconostoc US2016136199 (Al ),

BRS-B citreum NRRL-B WO2016146764 (Al )

742 Vuillemin et al. (JBC, 2016)

US2016136199 (A1 ),

Leuconostoc fallax

BRS-C erf ,3 WO2016146764 (Al )

KCTC 3537

Vuillemin et al (JBC, 2016)

Leuconostoc

BRS-E mesenteroides

F I-MG

Leuconostoc US2017152489 (Al),

BRS-A citreum NRRL-B US2017101484 (Al ), ES2398227 (T3)

1299 Vuillemin et al. (JBC, 2016)

US2017152489 (Al ),

Lactobacillus

BRS-D US2017101484 (Al), ES2398227 (T3) kunkei EFB6

a-1,2 Vuillemin et al. (JBC, 2016)

From truncated

US2017152489 (Al),

DSM-E

US2017101484 (Al ), ES2398227 (T3)

GBD-CD2 Leuconostoc

Fabre et al. (J. Bact, 2004), Brison et al. citreum NRRL-B

(JBC, 2012)

1299

GBD-CD2 Yarmick Malbert. Flavonoid

GBD-CD2

W2135V* glucodiversifi cation with engineered

mutants

GBD-CD2 sucrose-active enzymes. Biotechnology. W2135C- INSA Toulouse

F2136I* (PhD, 2014)

GBD-CD2

W2135S-

F2136L*

GBD-CD2

W2135I-

F2136C*

GBD-CD2

W2135N-

F2136Y*

GBD-CD2

W2135N*

GBD-CD2

W2135I-

F2136Y*

GBD-CD2

W2135L*

GBD-CD2

W2135C*

GBD-CD2

W2135N-

F2136H*

GBD-CD2

W2135L-

F2136L*

GBD-CD2

W2135F-

F2136I*

GBD-CD2

W2135C-

F2136N* GBD-CD2

W2135G*

GBD-CD2

W2135F*

GBD-CD2

F2163G*

GBD-CD2

L2166I*

GBD-CD2

F2163H*

GBD-CD2

F2163G

L2166I*

GBD-CD2

A2162E

F2163L*

GBD-CD2

F2163L*

GBD-CD2

F2163I- D2164E-

L2166I*

The mutations are given relatively to the sequence of the GBD-CD2 wild type.

The sequence of said enzymes is as follows:

SEQ ID NO : 1 (BRS-E)

SEQ ID NO : 2 (BRS-A)

SEQ ID NO : 3 (BRS-B-D1)

SEQ ID NO : 4 (BRS-B-D2)

SEQ ID NO : 5 (BRS-C)

SEQ ID NO : 6 (BRS-D) SEQ ID NO : 7 (GBD-CD2).

BRS-E (L. MAHHHHHHVTSLY AGSAAAPFTMQQ NATLQVSPT NDNSVA NTTS Mesenteroides TANVA VDITTD TRDISSAN VN LITNQYKENSNGSWSYYD NGQIV G KFRI-MG) LQTINGNIQYFDSTTGEQVKGQTLTIDGVIYSFD DSGNGTKTEVASLPTTGS

YATKDGSNWQYEDQQQQ PIKGLYTDKG NLRYFNETDG TQVKGTVVSV DNNTYYFDKDSGNGQLVPSVTGGQYGTIQLNNQTVWVYRNANGEIVKGLQ NINGNIQYFDPNTGEQLKGKVATVNGVTYY FEASDGNLVG TVSDGLVTVN GQIQ YFDP ATGEQAKNKQ I VNNVTYYFDDNGYGQYLFTNAILSTTPDAY SAHTQAYNTDQSSFTNVVDGFLTADSWYRPKEV1ADATGS AWQTSSENDY RPIITVWWPNKNVEVNYLKLMQDNDLLSTQ TQFTIFSDQY TLNEAAQAAQ NEIEKRIYREKSTDWLKDLLFEA1 IGDTPSF VKQQFIWNKD SEYQGAGGGN LWSLQGGYLK YVNDSETSWS DSTSRKHDYY EYLLGNDIDN SNPQVIAENI NWLYYLMNFGSLTGNDDTANFDGVRMDAVIYMKGEASTKVYQFLHKSDE L TKNEKIANEH ISIVEDGTDh 1 1 NNSAL1V SKWNENIASS LAKIASGKDT SLEALVKTDE KTSVSRIMNS SVESDVNPNY SMIRSHDRGS QDEVINASKV ANNDQSIALD NINLNQLENG LKLYYEDQAS PTKNYNYYNI PASYALLLSN KDTVPRLYYSDMYQDYDQNPDTPQQYMSKP TIYYSAIDAL LKARIKYVAG GQS AVEKVGDNKDQEVLTSVRYGKNVMTATDTGTVESRTEGMGVIVSN NTKLKLSTTDQIVLHMGAAHANQAYQALMLT DEGIQLYNDNAPVVWTD HNGDLVFNGNDINGQKNTSIKGYFNPQVAGYLAVWVPVGATDTQDARTK ASTNATTDGKVFHSNAALDSNVIYEGFSNFQPIARNHNDFSNVKIAENVDLF KKWGi rSFELAPQYRSADVSDLVGSTFVDVVTKNGYGLSDRYDLGFVTPTK YGSDSDLRNAISSLHAQGIQAMADFVGNQIYALNDGQEVVTAQRSDMFNN TLNNAFGTELYVVNSIGGGKYQAKYGGNYLEEIASLYPDLFTNQDGTKIDIN TKIKQWSAKYMNGTNVLGRGMGYVLKDWNTATYFKLDGEHTVLPAALTL SG LK V ENG VT YYYKNNERQTGTQT VDD VT YFFDPKTG AMKKDYFDFT AD NKVYYYGKGGRADPAFLYKV VSAWSHPQFE K

BRS-A, 1878 MRQKETITRKKLYKSGKSWVAAATAFAVMGVSAVTTVSADTQTPVGTTQS amino acids QQDLTGQRGQDKPTTKEVIDKKEPVPQVSAQNAGDLSADAKTTKADDKQD (L, citreum TQPTNAQLPDQGNKQTNSNSDKGVKESTTAPVKTTDVPSKSVTPETNTSING NRRL B- GQYVEKDGQFVYIDQSGKQVSGLQNIEGHTQYFDPKTGYQTKGELKNIDDN

1299) AYYFDKNSGNGRTFTKISNGSYSEKDG WQYVDSHDKQPVKGLYDVEGN

LQYFDLSTGNQAKHQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQT

TKDKDGNETSYWAYLDNQGNAIKGLNDVNGEIQYFDEHTGEQLKGHTATL

DGTTYYFEGNKGNLVSVVNTAPTGQYKINGDNVYYLDNNNEAIKGLYGIN

GNLNYFDLATGIQLKGQAKNIDGIGYYFDKDTGNGSYQYTLMAPSNKNDY

TQHNVVNNLSESNFKNLVDGFLTAETWYRPAQILSHGTDWVASTDKDFRPL

ITVWWPNKDIQVNYLRLMQNEGVLNQSAVYDLNTDQLLLNEAAQQAQIGI

EKKISQTGNTDWLNNVLFTTHDGQPSFIKQQYLWNSDSEYHTGPFQGGY

LKYQNSDLTPNVNSKYRNADNSLDFLLANDVDNSNPIVQAEDLNWLYYLL

NFGSITTQGKENNSNFDSIRIDAVDFVSNDLIQRTYDYLRAAYGVDKNDKEA

NAHLSLVEAGLDAGTTTIHQDALIESDIREAMKKSLTNGPGSNISLSNLIQDK

EGDKLIADRANNSTENVAIPNYSIIHAHDKDIQDKVGAAITDATGADWTNFT

PEQLQKGLSLYYEDQRKIEKKYNQYNIPSAYALLLTNKDTVPRVYYGDMY

QDDGQYMQKQSLYFDTITALMEARKQFVAGGQTINVDDNGVLTSVRFGKG

AMTANDIGTNETRTQGIGVVIANDPSLKLSKDSKVTLHMGAAHRNQNYRA

LLLTTDNGIDSYSSSKNAPVIKTDDNGDLVFSNQDINDQLNTKVHGFLNSEV

SGYLSAWVPLDATEQQDARTLPSEKSVNDGKVLHSNAALDSNLIYEAFSNF

QPMPTNRNEYTNVVIADKADTFKSWGITSFEMAPQYRSSQDKTFLDSTIDN

GYAFTDRYDLGFEKPTKYGNDEDLRQAIKQLHSSGMQVMADWANQIYNL

PGKEVASTNRVDW GNNLS TPFGTQMYVVNTVGGGKYQNKYGGEFLDKL

KAAYPDIFRSKNYEYDVKNYGGNGTGSVYYTVDSKTRAELDTDTKIKEWS A YMNGT VLGLGMGYVLKDWQTGQYFNVSNQNMKFLLPSDLISNDITV

QLGVPVTD IIFDPASAY MYSNLPEDMQVMDYQDD KSTPSIKPLSSYN

NKQVQVTRQYTDS GVSWNL1TFAGGDLQGQ LWVDSRALTMTPFKTMN

QISFISYANRNDGLFLNAPYQVKGYQLAGMSNQYKGQQVTIAGVANVSGK

DWSLISFNGTQYWIDSQALNTNFTHDMNQKVFVNTTSNLDGLFLNAPYRQP

GY LAGLA NYNNQTVTVSQQYFDDQGTVWSQVVLGGQTVWVDNHALA

QMQVRDTNQQLYVNSNGRNDGLFLNAPYRGQGSQLIGMTADYNGQHVQV

T QGQDAYGAQWRLITL NQQVWVDSRALSTTIMQAMNDDMYVNSSQRT

DGLWLNAPYTMSGA WAGDTRSANGRYVHIS AYSNEVGNTYYLTNLNG

QSTWID RAFTATFDQVVALNATIVARQRPDGMF TAPYGEAGAQFVDYV

TNYNQQTVPVTKQHSDAQGNQWYLATVNGTQYWIDQRSFSPVVT VVDY

QA IVPRTTRDGVFSGAPYGEVNAKLV MATAYQNQVVHATGEYTNASGI

TWSQFALSGQED LWIDKRALQA

BRS-B MEM ETITRKKLYKSG SWVAAATAFAVMGVSAVTTVSADTQTPVGTTQS

QQDLTGQTGQD PTT EVIDK EPVPQVSAQNVGDLSADA TP ADD QD

TQPTNAQLPDQGNKQTNSNSD GVKESTTAPV TTDVPSKSVAPETNTSIN

GGQYVE DGQFVYIDQSG QVSGLQN1EGHTQYFDPKTGYQT GELKNIDD

NAYYFD NSGNGRTFT ISNGSYSEKDGMWQYVDSHDK.QPVKGLYDVEG

NLQYFDLSTGNQAKHQIRSVDGVTYYFDADSGNATAF AVTNGRYAEQTT

KD DGNETSYWAYLDNQGNAI GLNDVNGEIQYFDEHTGEQLKGHTATV

DGTTYYFEGN GNLVSVVNTAPTGQYKiNGDNVYYLDNNNEAI GLYGIN

GNLNYFDLATGIQL GQA NIDGIGYYFDQNNGNGEYRYSLTGPVV DVY

SQHNAVNNLSANNFKNLVDGFLTAETWYRPAQILSHGTDWVASTD DFRP

LITVVVWPNKDIQVNYLKLMQQIGILDNSVVFDTN DQLVLNKGAESAQIGI

EKKVSETGNTDWLNELLFAPNGNQPSF1 QQYLWNVDSEYPGGWFQGGYL

AYQNSDLTPYANTNPDYRTH GLEFLLANDVDNSNPVVQAEQLNWLYYL

MNFGQITANDSNANFDSMRiDAISFVDPQIA KAYDLLDKMYGLTDNEAV

ANQHISrVEAPKGETPITVEKQSALVESNWRDRMKQSLS NATLD LDPDPA

INSLEKLVADDLVNRSQSSD DSSTIPNYSIVHAHDKDIQDTViHIMKIVNNN

PNISMSDFTMQQLQNGL AFYEDQHQSVK Y QYNIPSAYALLLT DTVP

RVFYGDMYQDYGDDLDGGQYMATKSIYYNAIEQMMKARLKYVAGGQIM

AVT I NDGIN DGTNKSGEVLTSVRf G DIMDAQGQGTAESRNQGIGV1V

SNSSGLELKNSDS1TLHMGIAHKNQAYRALMLTND GIVNYDQDNNAPIAW

TNDHGDL1FTNQMINGQSDTAVKGYLNPEVAGYLAVWVPVGANDNQDAR

TVTTNQK TDG VLHTNAALDSKLMYEGFSNFQ MPTRGNQYANVVIT N

IDLF SWGITDFELAPQYRSSDG DITDRFLDSIVQNGYGLSDRYDLGFKTPT YGTDQDLRKAIERLHQAGMSVMADFVANQIYGLHADKEVVSAQHVNIN

GDTKLVVDPRYGTQMTVVNSVGGGDYQAKYGGEYLDTIS LYPGLLLDSN

GQ IDLSTKIKEWSAKYLNGS IPQVGMGYVLKDWNNGQYFHILD EGQY

SLPTQLVSNDPETQIGESVNYKYFIGNSDATYN YH LPNTVSLINSQEGQI

KTQQSGVTSDYEGQQVQVTRQYTDSKGVSWNLITFAGGDLQGQKLWVDS

RALTMTPF TMNQISFISYANRNDGLFLNAPYQVKGYQLAGMSNQYKGQQ

VT1AGVANVSGKDWSLISFNGTQYWIDSQALNTNFTHDMNQ VFVNTTSNL

DGLFLNAPYRQPGYKLAGLAKNYNNQTVTVSQQYFDDQGTVWSEVVLGG

QTVWVDNHALAQMQVSDTSQQLYV SNGRNDGLFLNAPYRGQGSQLIGM

TADYNGQHVQVT QGQDAYGAQWRLlTLNNQQVWVDSRALSniVQAMN

DDMYV SNQRTDGLWLN'APYTMSGAKWAGDTRSANGRYVHIS AYS EV

GNTYYLTNLNGQSTW1DKRAFTATFDQWALNATIVARQRPDGMFKTAPY

GEAGAQFVDYVTNY QQTVPVTKQHSDAQGNQWYLATVNGTQYWIDQRS

FSPVVTKVVDYQAKIVPRTTRDGVFSGAPYGEV A LVNMATAYQNQVV

HATGEYTNASGITWSQFALSGQED LWIDKRALQA

BRS-B-D1 (L. DTQTPVGTTQSQQDLTGQTGQDKPTTKEVIDK EPVPQVSAQNVGDLSADA citreum KTPKADD QDTQPTNAQLPDQGNKQTNSNSD GVKESTTAPV TTDVPSK NRRL B-742) SVAPETNTSINGGQYVE DGQFVYIDQSGKQVSGLQNIEGHTQYFDP TGY

QTKGELK IDDNAYYFDKNSGNGRTFT ISNGSYSE DGMWQYVDSHD Q PVKGLYDVEGNLQYFDLSTGNQAKHQIRSVDGVTYYFDADSGI* CAV TNGRYAEQTTKDKDGNETSYWAYLDNQGNAIKGLNDVNGEIQYFDEHT

GEQLKGHTATVDGTTYYFEGNKGNLVSWNTAPTGQY I GDNVYYLDN EAIKGLYGINGNLNYFDLATGIQLKGQAKNIDGIGYYFDQNNGNGEYRY

SLTGPVVKDVYSQHNAVN LSA NFKNLVDGFLTAETWYRPAQILSHGTD

WVASTDKDFRPL1TVWWPN DIQVNYL LMQQIGILDNSVVFDTNNDQLV

LNKGAESAQIGIE KVSETGNTDWLNELLFAPNGNQPSFIKQQYLWNVDSE

YPGGWFQGGYLAYQNSDLTPYANTNPDYRTHNGLEFLLANDVDNSNPVV

QAEQLNWLYYLMNFGQITANDSNANFDSMRIDAISFVDPQIAKKAYDLLDK

MYGLTDNEAVANQHISIVEAP GETPITVEKQSALVESNWRDRMKQSLSKN

ATLDKLDPDPAINSLEKLVADDLV RSQSSDKDSSTIPNYSrVHAHDKDIQD

TVIHIMKIV N PNISMSDFTMQQLQNGL AFYEDQHQSVKKYNQYNIPSA

YALLLTNKDTVPRVFYGDMYQDYGDDLDGGQYMATKSIYYNAIEQMMKA

RL YVAGGQIMAVTKIKNDGINKDGTNKSGEVLTSVRFGKDIMDAQ

GQGTAESRNQGIGVIVSNSSGLELKNSDSITLHMGIAHKNQAYRALMLTND GIVNYDQDN APIAWTNDHGDLIFTNQMINGQSDTAVKGYLNPEVAGYL

AVWVPVGANDNQDARTVTTNQ NTDG VLHTNAALDS LMYEGFSNFQK

MPTRGNQYANVVITKNIDLFKSWGITDFELAPQYRSSDGKDITDRFLDSIVQ

NGYGLSDRYDLGFKTPT YGTDQDLRKAIERLHQAGMSVMADFVANQIYG

LHADKEVVSAQHVNINGDT LVVDPRYGTQMTVVNSVGGGDYQAKYGG

EYLDTIS LYPGLLLDSNGQKIDLSTK1KEWSAKYLNGSNIPQVGMGYVLKD

WNNGQYFH 1 LD EGQYS LPTQL

BRS-B-D2 (L. MASQYVEKDGQFVYIDQSGKQVSGLQNIEGHTQYFDP TGYQT GEL ID citreum DNAYYFDKNSGNGRTFTKISNGSYSEKDGMWQYVDSHDKQPV GLYDVE NRRL B-742) GNLQYFDLSTGNQA HQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQT

TKD DGNETSYWAYLDNQGNAI GLNDVNGEIQYFDEHTGEQLKGHTATV

DGTTYYFEGN GNLVSVVNTAPTGQYKINGDNVYYLDNKNEAI GLYGIN

GNLNYFDLATGIQLKGQAKNIDGIGYYFDQNNGNGEYRYSLTGPVV DVY

SQHNAVNNLSANNFKNLVDGFLTAETWYRPAQILSHGTDWVASTDKDFRP

LITVWWPNKDIQVNYL LMQQIGILDNSVVFDTNNDQLVLNKGAESAQIG1

EKKVSETGNTDWLNELLFAPNGNQPSFI QQYLWNVDSEYPGGWFQGGYL

AYQNSDLTPYANTNPDYRTH GLEFLLANDVDNSNPVVQAEQLNWLYYL

MNFGQITANDSNANFDSMRIDAISFVDPQIA AYDLLD MYGLTD EAVA

NQHISIVEAPKGETPITVEKQSALVESNWRDRM QSLS NATLD LDPDPA!

NSLE LVADDLVNRSQSSD DSSTIPNYSIVHAHD DIQDTVIHIM IVN NP

NISMSDFTMQQLQNGL AFYEDQHQSVKKYNQYNIPSAYALLLTN DTVP

RVFYGDMYQDYGDDLDGGQYMAT SIYYNAIEQMMKARL YVAGGQIM

AVT I NDGIN DGTN SGEVLTSVRFGKDIMDAQGQGTAESRNQGIGVIV

SNSSGLEL NSDSITLHMGIAH NQAYRALMLTNDKGIVNYDQDNNAPIAW

TNDHGDLIFTNQMINGQSDTAV GYLNPEVAGYLAVWVPVGANDNQDAR

TVTTNQKNTDG VLHTNAALDS LMYEGFSNFQ MPTRGNQYANVVITKN

IDLFKSWGITDFELAPQYRSSDG DITDRFLDSIVQNGYGLSDRYDLGFKTPT

KYGTDQDLRKAIERLHQAGMSVMADFVANQIYGLHADKEVVSAQHVNIN

GDT LVVDPRYGTQMTVVNSVGGGDYQAKYGGEYLDTIS LYPGLLLDSN

GQKIDLSTKI EWSA YLNGSNIPQVGMGYVL DWNNGQYFHILD EGQY

SLPTQLKGGRADPAFLY VVHHHHHH

BRS-C M QQESITRK LYKAGKSWVVAATLFAATLFAAMGAAGATTVASADVQ

(GH Leucono DTVVVTADKNTTDKDKEPI TAGANVVD GVAQTTDTNTTD TIEVG S stoc fallax K VDMSATDKKVTETV SVDTSATDK TTEAVKPVDTNATDKKATEAVKPV CTC3537 1) DTNATDKKTTEAVKPVDTNTTD VTEAIKPVNTNADD TAEPV TISAT

DTVKTIANKQKGATEEQAVITEGHYEAQGDGFVYIT DG QLTGLQNINGN

TQYFDPATGQQL GDIKAVAGTVYYFDKNSGNARVYQKVADGTYSENNE

HWQYIS VDNKPVEGLYNVQGNLQYFDMSTGNQVKNDIRSVDGVTYYFD DSGNGSAFNALSAGEYVE KETDAQGNQNSYWTYSGLDGNPVKGLYDIN

GSLQYFDE NGAQL GGTATV GVTYYFEQD GNLISVV SVESGQY ID NDNVYYID QGNTL GLYAINGQLNYFDMSTGVQLKGASE.NANGVGYYF

DKDKGNGQYQYSLITSTLANAFSKHNAANDYTQSSFTHTVDGFLTADTWY

RPTEIL NGTTWVASTSQDLRPMITVWVVPN NVQLNYL LMQTEGLLDSG

QVYDLNSDQALL QAAQTVQVNIEKRITKAGNSDWLNDLLYNSHGirrPSF

V QQAIW ADSEYHGGWFQGGYLAYRNSDLTPYANSSYRHYTGMEFLLA DVD SNPlVQAEDLNWLYYLMNFGTETGNDPQA FDSrRIDAISFVD QV

A KAYELLHDMYGLSASDAVA HVSiVEASADQTPVITENHDALlESYW

RDTM NSLS DASiDSSAGSLSAMINDGNVDRANDSTTESSIFPNYTlVHAH

D DIQDAVSNVMKIVNNDPSISLDGFTMEQLE GLSAFYADQRSAV QYN

QYNIPSAYAVMLTNKDTVPRTFYGDMYQDDGQYMANKSLYYDAIDTMMK

ARL YVSGGQTMSVT INNANSQKSGEVLTSVRFGKGVMDATDAGSAESR

TQGIGVVVSNSSGLQLNDNDKIVLHMGAAHKNQEYRALMLTTNDGIKSFN

NDEAPINYTDDNGDLIFDGHNIDGQENTAIRGYLNPQVAGYLAVWVPTGA

DDQDARTQPSNEKSTDGKVLHTNAALDSELIYEGFSNFQPMPTT DEYTOV

M1AKNIDLF SWGITNFELAPQYRSSDGKN1 DRF1DSLVQNGYGLSDRY

DLGFETPTKYGTDQDLRTAI TLHQAGMTVMADYVANQIYGLNTSQEVVD

AQRVNSD NAVEVRYGQHLNVVNSIGGGEYQ LYGG YLEILN LYPDLL

VDENGN IDIDT IKQWSAKYLNGSNVTGLGMGYVLKDWSNGQYFNISNT

DGKVMLPEQLVKHMPAVEIGTQTNYTAYISSTIRRDGLYNNMPWGVTATG

QDGNEIKWERQGSTSDYNHQKVQVNRQYVD QGVVWNLIN

FDDKDLWVDSNALVTVNFTSQKPT HFVQFGMRQGKYDGFYLSAPYKQTE

S WVASTRTHQGQLLEVVGQYTTGSGSRKVTWYLVGLDG QVWVDSRAV

GTNFSHKTNiNLLINSATRNDGMYLNAPYGQKGYKRETSSRFYNE LVTVS

QQYYDNKGVIWNL1TLNG KLWVDSRAFATVID VNQSLYINSRNDGMY

LNAPYRAQGA RYASTKTYTGQRVQVTLQRKDTHGVTWYLT VDS QLW

VDSHAFAPTFTRNVSLNVKVNSSKRNDGIYLNAPYGN A RIASTKAYNG

KRVKAS EYKDA GVTWYLVNLNNKQVWIDKRAF

Brs-D (Lb MNINSNER VRF MY SGKQWIVAGLTTAVISIAVYGGSSIANGGIEAKAD kunkei EFB6) AQ AATSSIVNTNNSTNSSNANS1ASLPQNGTYSTNDNGQTW YVSQ DI

QGLY DNNDQLRYFNEYDGTQAKGDIVNVNNDNYYFDKDSGQGHKIDSY

TGGSYSESKVN QDGWIYKSSDNNDVKGVATVDGNIQYFDQNTGLQLKGG

SAQIGGVDYYFDPNKGNLVGKVDQVVNSNDYSDNKLLDSN NVVKGLVV

NNGQLQFFDTSNGNQAKNKQVIANGITYYFDTNGNGQYLFTNTGKSAVDD

rrQRNAANSVNPSDY NVVDGFFTADTWYRP QILDNGlTWRNSNSNELR

PMITAWWPN DVQVNYL LMQNNGLLDKSNSYS1QSDQQTLNQAAQKAQ

VN1E KISQTGNTDWLNDLLFKG GDNPSFV QQYIWSSDSESPWQGDAWF

QGGYLKYGNSVMTPNmSNYRDSNNLFDFLLA DVDNSNPAVQAEDLNW

LYYLTNFGTITANDS ANFDSIRIDAVDFISNDIIQRSYDYLRQ FNL QSDA

NADSHISLVEGGVDAGTTSYSNDGLVEAPFRLDAYPLLHKQDGDVFKNLID

EEDSGIDISNHNGETNTNNTIGGITLSGG PNYSlVHAHDKDVQEKVGQAIiD

TTGI DWTDFTPSQLAQGLETFYNDQRQTVK YNDYNVPSAYAIMLTNKG

TVPRIYYGDMYQDDGQFMQ SLYYDDiANLMTAR YVSGGQSMVDNN

GILTS VRFG GANTVSDSGTEDTRNQGIGLIVGSAPKKVLNDGDTVVLHMG

AAHKNQKYRALMLTTENGIQNYNSDDNAPVAETDDNGDLVFSN DINGQA

NTAI QVANPEVNGYLAAWVPVGASDDQDSRTAPSTSQNNDG VLHENDA

LDSNLIFEGFSNFQPTPTOHDEYANVVIAKNASLFKDWGVTSFEMAPQYRSS

QDHTFVDSTIDNGYAFSDRYDLGFGTPTKYGTDEDLRNAI SLHDNGMQV

MADVVY QLYNLPGQEVVSATRAGVTGNTNALPFGTQLYVVNTIGGGDY

QK YGGAFLNELQEQYPSLF SQKY YYYKNYA GAGPGYLTVNDAER

SDIPYNQPITEWSA YM GTNILGRGMGYVL DW TGDYFKLSGSDSTLPS

SLTY SGWVENPDSTWSYYEKNNIDKLTGSQVINEERVFFD NGIQVKGGW

V NSNGTYSYYD SGNILTGDQLIDGEHFFFDNNGVQV G WI NSDGS

KSYYDSHLG LI TDK VSSNARKKKSKEELLYENAL VLRKDK RLD N TKANIR YNKSLKKYRKA LLAIT RVANAR AI IAK VLSl RKNI

NNE RYY AL EYYVAE SYLKITGNYN KYYYEFD LTP VKWKNIYS YKSRHFT RV I GTLVRVKSIVRSG VARINIGNGHFITSS DFI MF K

GBD-CD2 AQAGHYITKNGNDWQYDTNGELAKGLRQDSNG LRYFDLTTGIQA GQFV (L. citreum TIGQETYYFSKDHGDAQLLPMVTEGHYGTITLKQGQDTKTAWVYRDQNNT NRRL B-12 9 IL GLQNINGTLQFFDPYTGEQLKGGVAKYDDKLFYFESGKGNLVSTVAGD (tronque YQDGHYISQDGQTRYADKQNQLVKGLVTVNGALQYFDNATGNQIKNQQVI depuis DSR- VDGKTYYFDDKGNGEYLFTNTLDMSTNAFSTKNVAFNHDSSSFDHTVDGF E) LTADTWYRP SILANGTTWRDSTD DMRPLITVWWPNKNVQV YLNFM

ANGLLTTAAQYTLHSDQYDLNQAAQDVQVAIERRIASEHGTDWLQKLLFES

QNN PSFV OQFIWN DSEYHGGGDAWFQGGYL YGNNPLTPTTNSDYR

QPGNAFDFLLANDVDNSNPVVQAENLNWLHYLMNFGTITAGQDDANFDSI

R1DAVDFIH DTIQRTYDYLRDAYQVQQSEA ANQHISLVEAGLDAGTSTIH

NDALIESNLREAATLSLTNEPG N PLTNMLQDVDGGTLITDHTQNSTENQ

ATPNYSIIHAHDKGVQE VGAAITDATGADWTNFTDEQLKAGLELFYKD

QRATNKKYNSYNIPSIYALMLTNKDTVPRMYYGDMYQDDGQYMANKSIY

YDALVSLMTARKSYVSGGQTMSVDNHGLL SVRFG DAMTANDLGTSAT

RTEGLGVIIGNDPKLQLNDSDKVTLDMGAAHKNQ YRAVILTTRDGLATFN

SDQAPTAWTNDQGTLTFSNQEINGQDNTQIRGVANPQVSGYLAVWVPVG

ASD QDARTAA 111 b HDG VLMS AALDS LIYEGFS FQP ATTHDELT

NVVIAKNADVFN GITSFEMAPQYRSSGDHTFLDSTIDNGYAFTDRYDLG

FNTPTKYGTDGDLRATIQALHHANMQVMADVVDNQVYNLPGKEVVSATR

AGVYGNDDATGFGTQLYVTNSVGGGQYQEKYAGQYLEALKAKYPDLFEG

KAYDYWY NYANDGSNPYYTLSHGDRESIPADVAIKQWSAKYMNGTNVL

GNGMGYVL DWHNGQYF LDGD STLPQI GEL LEG PIPNPLLGLDSTR

TGHHHHHH

Table 2 presents some mutants of BRS-B-D2 that were used in the context of the present invention.

Mutant Sequence

MASQYVEKDGQFVYIDQSGKQVSGLQNIEGHTQYFDPKTGYQTKGEL NIDDNAYY FDKNSGNGRTFT ISNGSYSEKDGMWQYVDSHDKQPV GLYDVEGNLQYFDLSTG

NQAKHQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTTKDKDGNETSYWAYLD

NQGNAIKGLNDV GEIQYFDEHTGEQL GHTATVDGTTYYFEGNKGNLVSVVNTAP

TGQY INGDNVYYLDN NEAIKGLYGINGNL YFDLATGIQL GQA NIDGIGYYFD

QNNGNGEYRYSLTGPVV DVYSQH AVN LSANNF NLVDGFLTAETWYRPAQII.

SHGTDWVASTD DFRPLITVWWPN DIQVNYLKLMQQIGILDNSVVFDTN DQLVL

N GAESAQIGIE VSETGNTDWLNELLFAPNGNQPSFI QQYLWNVDSEYPGGAFQ

GGYLAYQNSDLTPYANTNPDYRTHNGI VLLANDVDNSNPVVQAEQLNVVLYYEMN

FGQITANDSNANFDSMRIDGLAFVDPQIAKKAYDLLDKMYGLTDNEAVANQHISrVE

AP GETPITVEKQSALVESNWIDRMLQSLS NATLDKLDPDPAINSLE LVADDLVN

RSQSSDKDSSTIPNYSIVHAHDLLLVDTVIHIM IVN NPNISMSDFTMQQLQNGLKA

FYEDQHQSV YNQYNIPSAYALLLTNKDTVPRVFYGDMYQDYGDDLDGGQYMA

T SlYYNAIEQMM ARL YVAGGQIMAVT I NDGINKD iTN SGEVLTSVRFG DI

MDAQGQGTAESRNQGIGVIVSNSSGLEL NSDS1TLHMGIAHKNQAYRALMLTNDK

GIVNYDQD NAPIAWTNDHGDI.IFTNQMINGQSDTAVKGYLNPEVAGYEAVWVPV

GANDNQDARTVTTNQK TDGKVLHTNAALDSKLMYEGFSNFQKMPTRGNQYANV

VIT NIDLF SWGITDFELAPQYRSSDG DITDRFLDSIVQNGYGLSDRYDLGF TPT

YGTDQDLRKAIERLHQAGMSVMADFVANQIYGLHADKEVVSAQHVNINGDT LVV

DPRYGTQMTW SVGGGDYQAKYGGEYLDTISKLYPGLLLDSNGQ IDLSTKIKEW

SA YLNGSNIPQVGMGYVL DWN GQYFHILDKEGQYSLPTQLKGGRADPAFLY

M6 VVHHHHHH MASQYVE DGQFVYIDQSG QVSGLQNIEGHTQYFDP TGYQTKGEL .NIDD AYY FD NSGNGRTFT ISNGSYSE DGMWQYVDSHDKQPVKGLYDVEGNLQYFDLSTG QAKHQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTT D DGNETSYWAYLD NQGNAI GLNDVNGEIQYFDEHTGEQLKGHTATVDGITYYFEGNKGNLVSVVNTAP TGQY INGD VYYLDN NEAIKGLYGINGNLNYFDLATGIQL GQAK IDGIGYYFD Q NGNGEYRYSLTGPVV DVYSQHNAVNNLSANNF NLVDGFLTAETWYRPAQIL SHGTDWVASTD DFRPLITVWWPNKDIQVNYLKLMQQIGILDNSVVFDTNNDQLVL N GAESAQIGIEKKVSETGNTDWLNELLFAPNGNQPSFI QQYLWNVDSEYPGGMM QGGYLAYQNSDLTPYANTNPDYRTFINGV FMLANDVDNSNPVVQAEQLNWLYYL MNFGQ1TANDSNANFDSMRIDGMWLVDPQIA KAYDLLD MYGLTDNEAVANQHI SIVEAP GETP1TVE QSALVES WVDRMLQSLSK.NATLD LDPDPAINSLE LVAD DLV RSQSSD DSSTIP YSIVHAHDLNLVDTVIHIM IVNNNPNISMSDFTivlQQEQ GL AFYEDQHQSVKKYNQY IPSAYALLLTNKDTVPRVFYGDMYQDYGDDLDGGQ Y AT SIYYNAIEQMMKARLKYVAGGQI AVT I NDGI KDGTNKSGEVLTSVRF GKDIMDAQGQGTAESRNQGIGVIVSNSSGLEL NSDSITLHMGIAH NQAYRALMLT NDKGIVNYDQDN APIAWTNDHGDLIFTNQMINGQSDTAV GYLNPEVAGYLAVW VPVGANDNQDARTVTTNQKNTDGKVLHTNAALDS LMYEGFSNFQ MPTRGNQY ANVVITKNIDLFKSWGITDFELAPQYRSSDGKDITDRFLDSIVQNGYGLSDRYDLGF TPTKYGTDQDLR A1E.RLHQAGMSVMADFVANQIYGLHAD EVVSAQHVNINGDT KLVVDPRYGTQMTVV SVGGGDYQA YGGEYLDTIS LYPGLLLDSNGQKIDLSTK IKEWSA YLNGSNIPQVGMGYVL DWNNGQYFHILD EGQYSLPTQLKGGRADPAF

M14 .VVHHHHHH

MASQYVE DGQFVYIDQSGKQVSGLQNIEGHTQYFDP TGYQTKGEL NIDDNAYY

FDKNSGNGRTFT ISNGSYSE DGMWQYVDSHDKQPVKGLYDVEGNLQYFDLSTG

NQA HQIRSVDGVTYYFDADSGNATAF AVTNGRYAEQTTKD DGNETSYWAYLD

NQGNAIKGLNDV GEIQYFDEHTGEQLKGHTATVDGTTYYFEGNKGNLVSVV TAP

TGQY INGDNVYYLDNNNEAIKGLYGINGNLNYFDLATGIQL GQA NIDGIGYYFD

Q NGNGEYRYSLTGPVV DVYSQHNAVNNLSANNFKNLVDGFLTAETWYRPAQIL

SHGTDWVASTDKDFRPLITVWWPN DIQVNYLKLMQQIGILDNSVVFDTNNDQLVL

N GAESAQIGIE VSETGNTDWLNELLFAPNGNQPSFIKQQYLWNVDSEYPGGLFQ

GGYLAYQNSDLTPYANTNPDYRTHNGIRYLLANDVDNSNPVVQAEQLNWLYYLMN

FGQITANDSNANFDSMRIDAPDWVDPQIAKKAYDLLDKMYGLTDNEAVANQHISIV

EAP GETPITVE QSALVESNWIDRMLQSLSKNATLDKLDPDPAINSLEKLVADDLV

NRSQSSD. .DSSTIPNYSIVHAHDINVLDTV1HIMK1VNNNPNISMSDFTMQQLQNGL

AFYEDQHQSVK YNQYNIPSAYALLLTN DTVPRVFYGDMYQDYGDDLDGGQYM

AT SIYYNAIEQMMKARL YVAGGQIMAVTKI NDGINKDGTN SGEVLTSVRFG

DIMDAQGQGTAESR QGIGVIVSNSSGLELKNSDSITLHMGIAFi QAYRAL LTND

KGIVNYDQDNNAPL WTNDHGDLIFTNQMINGQSDTAVKGYLNPEVAGYLAVWVP

VGANDNQDARTVTTNQKNTDG VLHT AALDS LMYEGFSNFQKMPTRGNQYAN

VVITKNIDLFKSWGITDFELAPQYRSSDG DITDRFLDS1VQ GYGLSDRYDLGFKTPT

KYGTDQDLR AIERLHQAGMSVMADFVANQ1YGLHADKEVVSAQHVNINGDTKLV

VDPRYGTQMTW SVGGGDYQAKYGGEYLDTISKLYPGLLLDSNGQ IDLST I E

WSAKYLNGSNIPQVGMGYVLKDWN GQYFHILDKEGQYSLPTQLKGGRADPAFLY

M18 KWHHHHHH

MASQYVEKDGQF iSG QVSGLQNIEGHTQYFDPKTGYQT GEL IDDIs

FDKNSGNGRTFT ISNGSYSE DGMWQYVDSHD QPV GLYDVEGNLQYFDLSTG

NQAKHQIRSVDGVTYYFDADSGNATAF AVTNGRYAEQTTKDKDGNETSYWAYLD

NQGNAI GLNDWGEIQYFDEHTGEQL GHTATVDGTTYYFEGNKGNLVSVVNTAP

TGQY INGD VYYLDNNNEAI GLYGINGNLNYFDLATGIQL GQA NIDGIGYYFD

Q NGNGEYRYSLTGPVV E ) VYSQFINAVNNLSANNF NLVDGFLTAETWYRPAQIL

SHGTDWVASTD DFRPLFrVWWP KDIQV YLKLMQQIGILDNSVVFDTNNDQLVL

N GAESAQIG1E KVSETGNTDWLNELLFAPNGNQPSFI QQYLWNVDSEYPGGLFQ

GGYLAYQNSDLTPYANTNPDYRTHNGLEFLLANDVDNSNPVVQAEQLNWLYYLMN

FGQITANDSNANFDSMRIDAIDFVDPQIA AYDLLDKMYGLTDNEAVANQHISIVE

M21 ADKGETPITVE QSALVESNWWLRMKQSLSKNATLDKLDPDPAINSLEKLVADDLV NRSQSSD DSSTIP YSIVHAHDLDIQETVVHIM IVN NPNISWTDFTMQQLQNGLK

AFYEDQHQSV YNQYNIPSAYALLLTN DTVPRVFYGDMYQDYGDDLDGGQYM

AT SIYYNAIEQMMKARL YVAGGQ1MAVTKIKNDGINKDGTN SGEVLTSVRFG

D1MDAQGQGTAESRNQGIGVIVSNSSGLELKNSDS1TLHMGLAH NQAYRALMLTND

KGIVNYDQDNNAPIAWTNDHGDLIFTNQMI GQSDTAV GYL PEVAGYLAVWVP

VGANDNQDARTVTTNQKNTDG VLHTNAALDSKLMYEGFSNFQKMPTRGNQYAN

VVITKNIDLFKSWGITDFELAPQYRSSDGKDITDRFLDSIVQNGYGLSDRYDLGF TPT YGTDQDLRKAIERLHQAGMSVMADFVANQIYGLHADKEVVSAQHV INGDTKLV

VDPRYGTQMTVV SVGGGDYQAKYGGEYLDTISKLYPGLLLDSNGQKIDLSTKI E

WSAKYLNGSNIPQVGMGYVL DWN GQYFHILDKEGQYSLPTQLKGGRADPAFLY

KVVHHHHIIH

MASQYVEKDGQFVYIDQSG QVSGLQNIEGHTQYFDPKTGYQTKGELKNIDDNAYY

FDKNSGNGRTFTKISNGSYSE DGMWQYVDSHD QPVKGLYDVEGNLQYFDLSTG

NQA HQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTTKDKDGNETSYWAYLD

NQGNAI GLNDVNGEIQYFDEHTGEQLKGHTATVDGTTYYFEGNKGNLVSVVNTAP

TGQY INGDNVYYLDNNNEAIKGLYGINGNLNYFDLATGIQL GQA NIDGIGYYFD

Q NGNGEYRYSLTGPVVKDVYSQHNAV LSANNF NLVDGFLTAETWYRPAQIL

SHGTDWVASTDKDFRPLITVWWPN DIQVNYLKLMQQIGILDNSVVFDTNNDQLVL

N GAESAQIGIEK VSETGNTDWLNELLFAPNGNQPSFI QQYLWNVDSEYPGGPFQ

GGYLAYQNSDLTPYANTNPDYRTHNGLEFLLANDVDNSNPVVQAEQLNWLYYLMN

FGQITANDSNANFDSMRIDAIMFVDPQIA AYDLLDKMYGLTDNEAVANQHiSIVE

AD GETPITVE QSALVESNWLLRM QSLSKNATLDKLDPDPAINSLEKLVADDLVN

RSQSSDKDSSTIPNYSIVHAHD DILETVTHIM IVNN PNISDTDFTMQQLQNGL A

FYEDQHQSV YNQYNIPSAYALLLTNKDTVPRVFYGDMYQDYGDDLDGGQYMA

T SIYYNAIEQMM ARLKYVAGGQIMAVT IKNDGINKDGTN SGEVLTSVRFGKDI

MDAQGQGTAESRNQGIGVIVSNSSGLELKNSDSITLHMGIAH NQAYRALMLTNDK

GIVNYDQDNNAPIAWTNDHGDLIFTNQMINGQSDTAVKGYLNPEVAGYLAVWVPV

GANDNQDARTVTTNQ NTDG VLHTNAALDS LMYEGFSNFQKMPTRGNQYANV

V1T NIDLF SWGITDFELAPQYRSSDG DITDRFLDSIVQNGYGLSDRYDLGFKTPT

YGTDQDLRKAIERLHQAGMSV ADFVANQIYGLIIADKEVVSAQHV INGDTKLVV

DPRYGTQMTVVNSVGGGDYQAKYGGEYLDTIS LYPGLLLDSNGQ IDLST I EW

SA YLNGSN1PQVGMGYVL DWNNGQYFHILD EGQYSLPTQL GGRADPAFLYK

M23 VVHHHHHH

MASQYVEKDGQFVYIDQSG QVSGLQN1EGHTQYFDPKTGYQT GELKNIDDNAYY FDKNSGNGRTFTKISNGSYSE DGMWQYVDSHD QPV GLYDVEGNLQYFDLSTG NQAKHQIRSVDGVTYYFDADSGNATAF AVTNGRYAEQTTKDKDGNETSYWAYLD NQGNAIKGLNDVNGEIQYFDEHTGEQLKGHTATVDGTTYYFEGN GNLVSVVNTAP

TGQY INGDNVYYLDNNNEAIKGLYGI GNLNYFDLATGIQLKGQA NIDGIGYYFD

QN GNGEYRYSLTGPVV DVYSQHNAV NLSANNF NLVDGFLTAETWYRPAQIL

SHGTDWVASTDKDFRPLITVWWPN DIQV YLKLMQQIGILDNSVVFDTNNDQLVL

N GAESAQIGIEK VSETGNTDWLNELLFAPNGNQPSFI QQYLWNVDSEYPGGMF

QGGYLAYQNSDLTPYANTNPDYRTHNGLEFLLANDVDNSNPVVQAEQLNWLYYLM

NFGQITANDSNANFDSMRIDAIMFVDPQIA AYDLLDKMYGLTDNEAVANQHISIV

EDSKGETPITVE QSALVESNWWLRMKQSLSKNATLDKLDPDPAINSLE LVADDLV

NRSQSSDKDSSTIPNYSIVHAHDLDILETVVHIM IVNN PNISWTDFTMQQLQNGLK

AFYEDQHQSV YNQYNIPSAYALLLTNKDTVPRVFYGDMYQDYGDDLDGGQYM

AT SIYY AIEQMMKARL YVAGGQIMAVT IK DGIN DGTN SGEVLTSVRFG

DIMDAQGQGTAESRNQGIGVIVSNSSGLELKNSDSITLHMGIAH NQAYRALMLTND

KGIVNYDQDN APIAWTNDHGDLIFTNQMINGQSDTAVKGYLNPEVAGYLAVWVP

VGANDNQDARTVTTNQKNTDG VLHTNAALDS LMYEGFSNFQKMPTRGNQYAN

WIT NIDLFKSWGITDFELAPQYRSSDG DITDRFLDSIVQNGYGLSDRYDLGFKTPT YGTDQDLRKAIERLHQAGMSVMADFVANQiYGLHAD EVVSAQHVNI GDT LV

VDPRYGTQMTVV SVGGGDYQA YGGEYLD ' HSKLYPGLLLDSNGQ IDLSTKI E

WSAKYLNGSNIPQVGMGYVLKDW GQYFHILDKEGQYSLPTQLKGGRADPAFLY

M28 VVHHHHHH MASQYVE DGQFVYIDQSG QVSGLQNIEGHTQYFDPKTGYQT GELKNIDDNAYY

FD NSGNG TFTKISNGSYSE DGMWQYVDSHD QPVKGLYDVEGNLQYFDLSTG

NQAKHQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTTKD DGNETSYWAYLD QGNAI GLNDVNGEIQYFDEHTGEQLKGHTATVDGITYYFEGNKGNLVSVV TAP

TGQY rNGDNVYYLDNN EAIKGLYGI G L YFDLATGIQL GQAKNIDGIGYYFD

QNNGNGEYRYSLTGPVV DVYSQH AVNNLSANNF NLVDGFLTAETWYRPAQIL

SHGTDWVASTD DFRPLITVWWPN DIQVNYL LMQQIGILDNSVVFDTNNDQLVL

N GAESAQIGlEKKVSETGNTDWLNELLFAPNGNQPSFl QQYLWiWDSEYPGGLFQ

GGYLAYQNSDLTPYANTNPDYRTHNGLEFLLANDVDNSNPVVQAEQLNWLYYLMN

FGQITANDSNANFDSMRIDAlSWVDPQIAK AYDLLDKMYGLTDNEAVANQHISr E

ALKGETPiTVE QSALVESNWMQRM QSLS NATLD LDPDPAINSLE LVADDLV

NRSQSSDKDSSTIPNYSIVHAHDVDrVETVTHMKIVNNNPNISMTDFTMQQLQNGL

AFYEDQHQSV KY QYNIPSAYALLLTNKDTVPRVFYGDMYQDYGDDLDGGQYM

AT SrrYNAlEQMMKARLKYVAGGQIMAVTKIKNDGIN DGTN SGEVLTSVRFGK

DIMDAQGQGTAESRNQGIGVIVSNSSGLEL NSDSITLHMGIAH NQAYRALMLTND

KGIVNYDQDNNAPIAWTNDHGDLIFTNQMINGQSDTAVKGYLNPEVAGYLAVWVP

VGANDNQDARTVTTNQKNTDG VLHTNAALDSKLMYEGFSNFQKMPTRGNQYAN

VV1T NIDLFKSWGITDFELAPQYRSSDG DITDRFLDSWQNGYGLSDRYDLGFKTPT YGTDQDLR AIERLHQAGMSVMADFVANQIYGLHAD EVVSAQHVNINGDTKLV

VDPRYGTQMTVV SVGGGDYQAKYGGEYLDTIS LYPGLLLDS GQ IDLST IKE

WSA YLNGSNIPQVGMGYVL DWWJGQYFHILD EGQYSLPTQL GGRADPAFLY

M30 KVVHHHHHH

MASQYYEKDGQFVYIDQSG QVSGLQN1EGHTQYFDP TGYQTKGELKN1DDNAYY

FD NSGNGRTFT IS GSYSEKDG WQYVDSHDKQPVKGLYDVEGNLQYFDLSTG

NQAKHQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTT DKDGNETSYWAYLD

NQGNAI GLNDVNGEIQYFDEHTGEQLKGHTATVDGTTYYFEGNKGNLVSVV TAP

TGQYKINGDNVYYLDNNNEAI GLYGINGNLNYFDLATGIQLKGQA NIDGIGYYFD

Q G GEYRYSLTGPVVKDVYSQHNAV LSA F NLVDGFLTAET RPAQ1L

SHGTDWVASTDKDFRPLITVWWPN DIQVNYL LMQQIGILDNSVVFDTNNDQLVL

NKGAESAQIGIEKKVSETGNTDWLNELLFAPNGNQPSFI QQYLWNVDSEYPGGLFQ

GGYLAYQNSDLTPYANTNPDYRTFINGLEFLLANDVDNSNPVVQAEQLNWLYYLMN

FGQITANDSNANFDSMRIDAISWVDPQIAK AYDLLDKMYGLTDNEAVANQHISIVE

AL GETPITVE QSALVESNWMERM QSLS NATLD LDPDPAI SLE LVADDLV

NRSQSSDKDSSTIPNYSIVHAHDIDIILTVWH K1VNNNPNISVTDFTMQQLQNGLKA

FYEDQHQSVK YNQY IPSAYALLLT DTVPRVFYGDMYQDYGDDLDGGQYMA

T SIYYNAIEQMMKARL YVAGGQIMAVT I DGIN DGTNKSGEVLTSVRFG D1

MDAQGQGTAESRNQGIGVIVSNSSGLELKNSDSITLHMGIAH NQAYRALMLTND

GIVNYDQDNNAPIAWTNDHGDLIFTNQMI GQSDTAV GYLNPEVAGYLAVWVPV

GANDNQDARTVTTNQ NTDGKVLHTNAALDSKLMYEGFSNFQKMPTRGNQYANV

V1T NIDLF SWG1TDFELAPQYRSSDG D1TDRFLDSIVQNGYGLSDRYDLGF TPTK

YGTDQDLR AIERLHQAGMSVMADFVA QIYGLHAD EVVSAQHV INGDTKLVV

DPRYGTQMTVVNSVGGGDYQAKYGGEYLDTISKLYPGLLLDSNGQ IDLSTKIKEW

SA YLNGSNIPQVGMGYVLKDWNNGQYFHILDKEGQYSLPTQL GGRADPAFLY

M31 VVHHHHHH

MASQYVEKDGQFVYIDQSG QVSGLQNIEGHTQYFDPKTGYQT GEL NIDDNAYY

FDK SGNGRTFI ' ISNGSYSE DGMWQYVDSHDKQPVKGLYDVEG LQYFDLSTG

NQA HQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTTKD DGNETSYWAYLD

NQGNAIKGLNDVNGEIQYFDEHTGEQLKGHTATVDGTTYYFEGN GNLVSVVNTAP

TGQYKINGDNVYYLDNNNEAI GLYGINGNLNYFDLATGIQL GQA NIDGIGYYFD

Q NGNGEYRYSLTGPVVKDVYSQHNAVN LSANNFKNLVDGFLTAETWYRPAQIL

SHGTDWVASTD DFRPLITVWWPN DIQVNYLKLMQQIGE.DNSVVFDTNNDQLVL

NKGAESAQIGIE VSETGNTDWL ELLFAPNGNQPSFIKQQYLWNVDSEYPGGLFQ

GGYLAYQNSDLTPYANTNPDYRTHNGLEFLLANDVDNSNPVVQAEQLNWLYYLMN

FGQITANDSNANFDSMRIDAISWVDPQlA AYDLLDKMYGLTDNEAVANQHISrVE

M34 ALKGETPITVEKQSALVESNWMARMKQSLSKNATLDKLDPDPAI SLEKLVADDLV NRSQSSDKDSSTIPNYSIVHAHDVDILETVVHIM IV NNPNISPTDFTMQQLQNGLK

AFYEDQHQSV Y QYNIPSAYALLLTNKDTVPRVFYGDMYQDYGDDLDGGQYM

ATKSIYYNAIEQMMKARLKYVAGGQIMAVTKIKNDGINKDGTNKSGEVLTSVRFGK

DIMDAQGQGTAESRNQGIGVIVSNSSGLELKNSDSITLHMGIAHKNQAYRALMLTND

KGIVNYDQDNNAPIAWTNDHGDLIFTNQMINGQSDTAV GYLNPEVAGYLAVWVP

VGANDNQDARTVTTNQKNTDGKVLHTNAALDSKLMYEGFSNFQKMPTRGNQYAN

WITKNIDLFKSWGITDFELAPQYRSSDG DITDRFLDSIVQNGYGLSDRYDLGFKTPT

KYGTDQDLRKAIERLHQAGMSVMADFVANQIYGLHADKEVVSAQHVNINGDTKLV

VDPRYGTQMTVVNSVGGGDYQA YGGEYLDTISKLYPGLLLDSNGQ IDLSTKIKE

WSAKYLNGSNIPQVGMGYVLKDW NGQYFHILD EGQYSLPTQL GGRADPAFLY VVHHHHHH

MASQYVEKDGQFVYIDQSGKQVSGLQNIEGHTQYFDPKTGYQTKGEL IDDNAYY FDKNSGNGRTFT ISNGSYSEKDGMWQYVDSHDKQPV GLYDVEGNLQYFDLSTG NQAKHQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTT DKDGNETSYWAYLD NQGNAIKGLNDVNGEIQYFDEHTGEQL GHTATVDGTTYYFEGNKGNLVSVVNTAP

TGQY INGDNVYYLDNNNEAIKGLYGINGNLNYFDLATGIQLKGQAKNIDGIGYYFD

QN GNGEYRYSLTGPVV DVYSQHNAVNNLSA NFKNLVDGFLTAETWYRPAQIL

SHGTDWVASTD DFRPLITVWWPNKDIQVNYLKLMQQIGILDNSVVFDTNNDQLVL

NKGAESAQIGIEK VSETGNTDWLNELLFAPNGNQPSFIKQQYLWNVDSEYPGGAFQ

GGYLAYQNSDLTPYANTNPDYRTH GLEFMLANDVDNSNPVVQAEQL WLYYLM

NFGQITANDSNANFDSMRIDAISFVDPQIAK AYDLLDKMYGLTDNEAVANQHISIV

EAPKGETPITVEKQSALVESNWRDRM QSLSKNATLDKLDPDPAINSLEKLVADDLV

NRSQSSDKDSSTIPNYSIVHAHDADVQVTVIGTM IVNN PNISHTDFTMQQLQNGL

KAFYEDQHQSV YNQYNIPSAYALLLTN DTVPRVFYGDMYQDYGDDLDGGQY

MAT SIYYNAIEQMM ARLKYVAGGQIMAVTKI NDGI DGTNKSGEVLTSVRFG

KDIMDAQGQGTAESRNQG1GVIVSNSSGLEL NSDSITLHMG1AHKNQAYRALMLTN

DKGIVNYDQDNNAP1AWTNDHGDL1FTNQM1NGQSDTAVKGYLNPEVAGYLAVWV

PVGANDNQDARTVTTNQ NTDG VLHTNAALDS LMYEGFSNFQKMPTRGNQYA

NVVIT NIDLF SWGITDFELAPQYRSSDGKDITDRFLDSIVQNGYGLSDRYDLGFKT

PTKYGTDQDLRKAIERLHQAGMSVMADFVANQIYGLHAD EVVSAQHVNINGDT

LVVDPRYGTQMTVVNSVGGGDYQA YGGEYLDTISKLYPGLLLDSNGQKIDLSTKI

KEWSA YLNGSNIPQVGMGYVLKDWNNGQYFHILD EGQYSLPTQL GGRADPAF

M35 LYKVVHHHHHH

MASQYVEKDGQFVYIDQSGKQVSGLQNIEGHTQYFDP TGYQT GELKNIDDNAYY

FDKNSGNGRTFTKISNGSYSEKDGMWQYVDSHDKQPV GLYDVEGNLQYFDLSTG

NQA HQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTTKD DGNETSYWAYLD

NQGNAI GLNDVNGEIQYFDEHTGEQLKGHTATVDGTTYYFEGN GNLVSVV TAP

TGQYKINGDNVYYLDNN EAIKGLYGINGNL YFDLATGIQLKGQA NirXiIGYYFD

QNNGNGEYRYSLTGPVVKDVYSQH AVN LSANNF NLVDGFLTAETWYRPAQIL

SHGTDWVASTDKDFRPLITVWWPN DIQVNYLKLMQQIGILDNSVVFDTNNDQLVL

NKGAESAQIGIEK VSETGNTDWLNELLFAPNGNQPSFIKQQYLW VDSEYPGGDFQ

GGYLAYQNSDLTPYANTNPDYRTHNGLEFLMANDVDNSNPVVQAEQLNWLYYLM

NFGQITANDSNANFDSMRIDAISFVDPQIAKKAYDLLDKMYGLTDNEAVANQHISIV

EAPKGETPITVE QSALVESNWRDRMKQSLSKNATLD LDPDPAI SLEKLVADDLV

NRSQSSD DSSTIPNYSIVHAHDVDITITVVSLM IV NNPNISSTDFTMQQLQNGLKA

FYEDQHQSVK YNQYNIPSAYALLLTN DTVPRVFYGDMYQDYGDDLDGGQYMA

T SIYYNAIEQMMKARL YVAGGQIMAVTKIKNDGIN DGTN SGEVLTSVRFGKDI

MDAQGQGTAESRNQGIGVTVSNSSGLELKNSDSITLHMGIAH NQAYRALMLTNDK

GIVNYDQDNNAPIAWTNDHGDLIFTNQM1 GQSDTAV GYLNPEVAGYLAVWVPV

GANDNQDARTVTTNQ NTDG VLHTNAALDS LMYEGFSNFQKMPTRGNQYA V

VITKNIDLFKSWGITDFELAPQYRSSDGKDITDRFLDSIVQNGYGLSDRYDLGF TPT

YGTDQDLRKAIERLHQAGMSVMADFVANQIYGLHADKEWSAQHVNINGDTKLW

DPRYGTQ TVV SVGGGDYQAKYGGEYLDTIS LYPGLLLDSNGQKIDLSTKi EW

SA YLNGSNIPQVGMGYVL DW NGQYFHILDKEGQYSLPTQLKGGRADPAFLY

M40 WHHHHHH MASQYVEKDGQFVYIDQSGKQVSGLQNIEGHTQYFDPKTGYQTKGELKNIDDNAYY

FDKNSGNGRTFTKISNGSYSE DGMWQYVDSHD QPV GLYDVEGNLQYFDLSTG

NQAKHQIRSVDGVTYYFDADSGNATAFKAVTNGRYAEQTTKDKDGNETSYWAYLD

NQGNAI GLNDVNGEIQYFDEHTGEQL GHTATVDGTTYYFEGN GNLVSVVNTAP

TGQYKINGDNVYYLDNNNEAIKGLYGINGNLNYFDLATGIQLKGQA NIDG1GYYFD

QNNGNGEYRYSLTGPVV DVYSQHNAVNNLSANNF NLVDGFLTAETWYRPAQIL

SHGTDWVASTD DFRPLITVWWPN DIQVNYLKLMQQIGILDNSVVFDTONDQLVL

N GAESAQIGIE KVSETGNTDWLNELLFAPNGNQPSFIKQQYLWNVDSEYPGGLM

QGGYLAYQNSDLTPYANT PDYRTHNGLEFLLANDVDNSNPVVQAEQLNWLYYLM

NFGQITANDSNANFDSMRJDAISFVDPQ1AKKAYDLLD MYGLTDNEAVANQHISIV

EAP GETPITVEKQSALVESNWRDRM QSLS NATLD LDPDPAINSLE LVADDLV

NRSQS SD D S STIPNYS I VHAHDLDLEDT VVSLMKIVNNNPNISMTDFTMQQLQNGL

KAFYEDQHQSVKKYNQY IPSAYALLLT KDTVPRVFYGDMYQDYGDDLDGGQY

MAT SIYYNAIEQMM ARL YVAGGQIMAVTKJKNDGIN DGTN SGEVLTSVRFG

KDIMDAQGQGTAESRNQGIGVIVSNSSGLELKNSDSITLHMGIAHK.NQAYRALMLT N

DKGIVNYDQDNNAP1AWTNDHGDLIFTNQMINGQSDTAVKGYLNPEVAGYLAVWV

PVGANDNQDARTVTTNQKNTDG VLHTNAALDS LMYEGFSNFQKMPTRGNQYA

NVVIT NIDLFKSWGITDFELAPQYRSSDGKDITDRFLDSIVQNGYGLSDRYDLGFKT

PTKYGTDQDLRKAIERLHQAGMSVMADFVANQIYGLHADKEVVSAQHVNINGDTK

LVVDPRYGTQMTVVNSVGGGDYQAKYGGEYLDTISKLYPGLLLDSNGQ IDLSTKI

KEWSA YLNGSNIPQVGMGYVLKDWNNGQYFHILDKEGQYSLPTQLKGGRADPAF

M41 LYKVVHHHHHH

Isolation of brsE gene

The brsE gene was identified in Leiiconostoc mesenteroides KFM-MG genome (NCBI Reference Sequence : CP000574) by performing a nucleotide BLAST against a GH70 a- transglucosyiase encoding gene database. The protein sequence of BRS-E is deposited under the GenBank accession number AHF 19404.1.

Recombinant expression of BRS-E in E. coli

A synthetic brsE gene was designed in order to optimize its expression in E. coli (Biomatik, Cambridge, ON, Canada), and cloned in pET28b vector.

The gene was then amplified by PCR from pET28b/BrsE plasmid DNA template using the forward primer 5 '-atgggctacaaggccgg-3 ' and the reverse primer 5'- accataataatacaccttattatcggc-3'. The PCR product was then inserted into the pENT /D-TOPO vector (Life Technologies). From a positive entry clone, LR recombination (Gateway LR Clonase II enzyme mix, Life technologies) was performed with pET-55-DEST destination vector (Merck Millipore). Expression clones were selected on LB agar plates supplemented with 100 μg ml-1 of ampicillin. Plasmids were then extracted using the GenElute HP Plasmid Miniprep kit (Sigma-Aldrich), verified by restriction analyses and sequenced (GATC Biotech). E, coli TOP 10 competent cells (Life Technologies) were used for all cloning experiments. For enzyme production, E. coli BL21 * DE3 cells were freshly transformed by 55 brsE. Twenty milliliters of LB medium, supplemented with ampicillin (100 μg niL- 1 ), were inoculated with 100 μΐ, of transformation mix and incubated overnight at 37 °C under agitation (200 rpm). Then, Erlenmeyer flasks culture containing a modified ZYM5052 medium with 100 μg mL ampicillin, 0.1% lactose, 0% glucose and 1 % glycerol were inoculated with the starter culture at an OD600nm of 0.05. Cultures were incubated at 21 °C under agitation (150 rpm). After 26-hour incubation, cells were harvested by centrifugation, dispersed in 50 mM sodium acetate buffer (pH 5.75) at a final OD600nm of 80 and disrupted by sonication. The recombinant enzymes were recovered in the soluble fraction after centrifugation (1 1 ,000 g, 30 min, 8°C) of the crude cell extract.

Enzyme production

Cloning of branching sucrase genes in inducible vectors (pET53, pET55 or pBAD49, Life technologies) for heterologous expression in E. coli cells was previously described (Vuillemin et al„ J Biol Chem. 2016; 291 (14):7687-702). E. coli BL21 * DE3 and E. coli BL21 AI cells were freshly transformed by pET53-55/brsB Δ2, brsC, brsD, brsE and pBAD49/brsA, respectively. Twenty milliliters of LB medium, supplemented with ampicillin ( 100 μg mL "1 ), were inoculated with 100 μL· of transformation mix and incubated overnight at 37 °C under agitation (200 rpm).

Enzyme production were performed in Erlenmeyer flasks with modified ZYM5052 medium that contains i) 0% lactose, 0% glucose, 0.5% glycerol and 0.01 % L-arabinose for BRS-A production, ii) 0. 1 % lactose, 0% glucose and 1% glycerol for BRS-B-A2, BRS-C, BRS-D, BRS-E production or iii) 0.75% lactose, 0.05% glucose and 1.5% glycerol for GBD-CD2 (wild type and mutants) production. All culture media were supplemented with ampicillin ( 100 g mL "1 ) and inoculated with the corresponding starter culture at an ODeoonm of 0.05. Cultures were incubated at 21 °C or 23°C under agitation (150 rpm). After 26-hour incubation, cells were harvested by centrifugation, dispersed in 50 mM sodium acetate buffer (pH 5.75) at a final OD 600nm of 80 for BRS-A, BRS-B-A2, BRS-C, BRS-D, BRS-E, and an OD 600nm of 30 for GBD-CD2 and mutants. Cells were disrupted by sonication. The recombinant enzymes were recovered in the soluble fraction after centrifugation (1 1 ,000 g, 30 min, 8 °C) of the crude cell extract.

Enzyme purification by affinity chromatography Recombinant enzymes are produced in fusion with a 6xHis tag allowing purification by affinity chromatography. For that purpose, cells were centrifuged and resuspended in binding buffer (20 mM phosphate sodium buffer, pH 7.4, 500 mM NaCl, 20 mM imidazole, 2.5% (v/v) glycerol) at a final OD 600nni of 200 for BRS-B productions, and 30 for GBD-CD2 and mutants productions. After disruption by sonication, centrifugation (18,000 g, 30 min, 4 °C) and filtration through a 0.22 μηι cartridge, lysates were applied at 10 °C onto a 1 ml HisTrap HP ® column that had been equilibrated with the binding buffer, using an A TAXpress system (GE Healthcare). The proteins were eluted by imidazole gradient from 10 to 500 mM, over 25 minutes. Eluate fractions of 3 mL were desalted onto 10-DG column (Biorad, Hercules, CA, USA), with 50 mM sodium acetate buffer at pH 5.75 with 100 mM NaCl, or purified for a second round by gel-filtration on a Superosel2 resin.

Enzymatic activity assay

One unit of branching sucrase (wild-type and mutants) is defined as the amount of enzyme which catalyzes the production of one micromole of fructose per min, at 30°C, in 50 mM sodium acetate buffer at pH 5.1 or pH 5.75 depending on the enzyme, and from 292 mM sucrose. The enzyme activities were determined by measuring the amount of reducing sugars using the dinitrosalycilic acid (DNS) method (G. L. Miller, Anal Chem. 1959, 31, 426-428). Glucosylation of tetrasaccharide using branching sucrases

Transglucosylation assays were performed at a temperature between 20 to 37°C in 50 mM sodium acetate buffer, pH 5.0 to 6.0, supplemented with 0.05 to 5 U.mL "1 of enzyme, 50 mM to 1 M sucrose, and 10 mM to 100 mM tetrasaccharide of formula (la), in particular ABC; AC CC ! 3 AC .D-A11 (also referred as ABC'D'). Reactions were incubated in glass tubes for 8 to 24 h.

Specifically, the pentasaccharides were produced in the following conditions:

Pentasaccharide 1 (PI) was produced in particular by BRS-B-.A2 in a 500 uL scale reaction using 200 pL of ABC'D' acceptor preparation at 110 g.L "1 , 206 μΐ, of sucrose at 830 g.L "! , 50 μΐ, of sodium acetate buffer 500 mM at pH 5.1, 9.62 μΐ, of purified enzyme BRS-B-A2 at 52 U / mL and H 2 0 to 500 μ

Pentasaccharides 2 (P2) and 2' (2') were produced in particular by GBD-CD2 F2163G in a 2 mL scale reaction using 800 pL of ABC'D' acceptor preparation at 110 g.L "1 , 823 pL of sucrose at 830 g.L "1 , 200 μL· of sodium acetate buffer 500 mM at pH 5.1, 130.7 μΐ. of purified enzyme GBD-CD2 F2163G at 15.3 U / mL and H 2 0 to 2 mL.

Pentasaccharide 3 (P3) was produced in particular by GBD-CD2 W2135S-F2136L or GBD- CD2 W2135I-F2136C in a 2 mL scale reaction using 800 μΐ. of ABC'D' acceptor preparation at 110 g.L "1 , 823 μL· of sucrose at 830 g.L/ 1 200 μL· of sodium acetate buffer 500 mM at pH 5.1, 130.7 L of purified enzyme at 15.3 U / mL and H 2 0 to 2 mL.

Methods for separation, detection and purification of the compounds of Interest:

The presence of residual acceptor ABC'D' and glucosylated products (pentasaccharides PI and P2, P2' and P3) was determined by HPLC-MS (High performance Liquid Chromatography coupled with Mass Spectrometry) using a C18RP Fusion (4 μηι, 80 A, 250 x 4.6 mm) analytical column placed in an oven at 40 °C and eluting with a 20-minute iLO/acetonitrile gradient from 70:30 to 60:40 at a flow of 1 mL.min "1 . Reaction media were diluted 10 times in H 2 0/acetonitrile (70:30, v/v) + 0.08% trifiuoroacetic acid (TFA) before injection of 20 μΐ. samples. UV Detection was carried out at 220 nm wavelength. The mass of the different compounds was determined by mass spectrometry with a 0.5 s full scan (m/z 200 - 1 50) both in positive and negative modes. Needle was set on 3.5 kV, cone on 60 V, and the probe temperature was maintained at 450°C.

Pentasaccharides PI , P2, P2' and P3 were purified by automatic fractionation on an Agilent 1260 Infinity HPLC, during 20 min of separation (same method as above). Several rounds of purification were performed if necessary. Products detected by UV-RI peaks were collected, and reanalyzed by HPLC. Fractions containing single peak products were pooled, concentrated to dryness using a SpeedVac before exchange in D 2 0 for NMR analyses. NMR analyses for pentasaccharides structure elucidation:

The samples were dissolved in DCl-containing D 2 0 at pH 4.9 For NMR studies, the samples were lyophilized three times and dissolved in 180 μΐ. of 99.9% DCl-containing D 2 0.

All NMR spectra were recorded on a Bruker Avance spectrometer operating at a proton frequency of 950 MHz and at a carbon frequency of 238 MHz with a 5 -mm gradient indirect cryoprobe. All spectra were processed and analyzed with Topspin software (Bruker).

! H and °C I D NMR spectra were accumulated at 30 °C, 65536 data points were acquired with 32 and 2048 scans respectively for proton and carbon experiments.

' H- 13 C HSQC (Heteronuclear Single Quantum Coherence spectroscopy), HMBC (I leteronuclear single quantum coherence spectroscopy) and Double Quantum Filtered Correlation SpectroscopY (QDF COSY) experiments were performed at 30 °C. Homo and heteronuclear spectra were recorded under the following experimental conditions: 512 increments of 2048 complex points are acquired with an accumulation of 16 scans. Spectral widths were 16025 Hz for proton dimension and 44267 Hz for carbon dimension.

Results

The structure of Pentasaccharide PI was determined by MR spectroscopy (950MHz), revealing an a- 1 ,6 glucosylation of D', characteristic of 5. flexneri serotype 4a.

The structure of Pentasaccharide P2 was determined by NMR spectroscopy (950MHz), revealing an a- 1 ,3 glucosylation of A, characteristic of S. flexneri serotype 3a.

The structure of Pentasaccharides P2 " and P3 were also determined by NMR spectroscopy (950MHz), revealing respectively an a- 1 ,4 glucosylation o residue A and an a- 1 ,4 glucosylation of residue B.

D