Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
HARNESSING GUT MICROBES FOR GLYCAN DETECTION AND QUANTIFICATION
Document Type and Number:
WIPO Patent Application WO/2023/196992
Kind Code:
A1
Abstract:
The present disclosure relates compositions and methods of engineering microbial strains to detect and quantify glycan molecules.

Inventors:
TOWNSEND II GUY EDMUND (US)
MODESTO JENNIFER L (US)
Application Number:
PCT/US2023/065548
Publication Date:
October 12, 2023
Filing Date:
April 07, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
PENN STATE RES FOUND (US)
International Classes:
C12Q1/02; C12Q1/34; C12N5/10; C12N15/52
Domestic Patent References:
WO2020252370A12020-12-17
WO2021133854A12021-07-01
Foreign References:
US20110281762A12011-11-17
US20200339964A12020-10-29
Attorney, Agent or Firm:
LANIER, J. Gibson et al. (US)
Download PDF:
Claims:
CLAIMS

What is claimed is:

1. A glycan-sensing system comprising an engineered bacteria harboring a reporter plasmid, wherein the reporter plasmid comprises a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter, wherein the glycan-sensing system detects and quantifies a glycan molecule.

2. The system of claim 1 , wherein the glycan molecule comprises chondroitin sulfate, arabinan, heparan sulfate, hyaluronan, fructan, levan, mannan, or derivatives thereof.

3. The system of claim 1 or 2, wherein the glycan molecule is located in the gastrointestinal tract of a mammal.

4. The system of any one of claims 1-3, wherein the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor.

5. The system of any one of claims 1-4, wherein the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin- specific PUL, an arabinogalactan- specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O- glycan(OG)-specific PUL, or variants thereof.

6. The system of any one of claims 1-5, wherein the first glycan specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactanspecific promoter, a dextran-specific promoter, a HA-specific promoter, a HS -specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

7. The system of any one of claims 1-6, further comprising a second PUL.

8. The system of any one of claims 7, wherein the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin- specific PUL, an arabinogalactan- specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O- glycan(OG)-specific PUL, or variants thereof.

9. The system of any one of claims 7 or 8, wherein the first PUL is interchangeable with the second PUL.

10. The system of any one of claims 1-9, further comprising a second glycan-specific promoter.

11. The system of claim 10, wherein the first glycan-specific promoter is interchangeable with the second glycan-specific promoter.

12. The system of any one of claims 10 or 11, wherein the second glycan-specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS- specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

13. The system of any one of claims 1-12, wherein the first or second glycan-specific promoter comprises SEQ TD NO: 74, SEQ TD NO: 75, SEQ ID NO: 76, SEQ ID NO: 262, SEQ TD NO: 263, SEQ ID NO: 264, SEQ ID NO: 265, SEQ ID NO: 266, SEQ ID NO: 267, SEQ ID NO: 268, SEQ ID NO: 269, SEQ ID NO: 270, SEQ ID NO: 271, SEQ ID NO: 272, SEQ ID NO: 273, SEQ ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 277, SEQ ID NO: 278, SEQ ID NO: 279, SEQ ID NO: 280, SEQ ID NO: 281, SEQ ID NO: 282, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 285, SEQ ID NO: 286, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 290, SEQ ID NO: 291, SEQ ID NO: 292, SEQ ID NO: 293, SEQ ID NO: 294, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 299, SEQ ID NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 318, SEQ ID NO: 319, SEQ ID NO: 320, SEQ ID NO: 321, SEQ ID NO: 322, SEQ ID NO: 323, SEQ ID NO: 324, SEQ ID NO: 325, SEQ ID NO: 326, SEQ ID NO: 327, SEQ ID NO: 328, SEQ ID NO: 329, SEQ ID NO: 330, SEQ ID NO: 331, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 334, SEQ ID NO: 335, SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 338, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 342, SEQ ID NO: 343, SEQ ID NO: 344, SEQ ID NO: 345, SEQ ID NO: 346, SEQ ID NO: 347, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 350, SEQ ID NO: 351, SEQ ID NO: 352, SEQ ID NO: 353, or SEQ ID NO: 354.

14. The system of any one of claims 1-13, wherein the reporter plasmid comprises a Bacteroides- optimized lux (Bolux) plasmid.

15. The system of any one of claims 1-14, wherein the engineered bacteria originates from a gastrointestinal bacterium.

16. The system of any one of claims 1-15, wherein the engineered bacteria originates from a Bacteroides bacterium.

17. The system of any one of claims 1-16, wherein the engineered bacteria originates from a Bacteroides thetaiotaomicron (Bt), Bacteroides ovatus (Bo), Bacteroides fragilis (Bf), Bacteroides caccae (Be) or a variant strain thereof.

18. The system of any one of claims 1-17, wherein the engineered bacteria are grown in an anaerobic environment.

19. The system of any one of claims 1-18, wherein the engineered bacteria are grown in a 96- well, a 384-well, or a 1536-well microplate.

20. The system of any one of claims 1-19, wherein the engineered bacteria overexpresses the chondroitin sulfate (CS)-specific PUL when exposed to a CS glycan, or derivatives thereof.

21. The system of any one of claims 1-20, wherein the engineered bacteria overexpresses the levan-specific PUL when exposed to a levan glycan, or derivatives thereof.

22. The system of any one of claims 1-21, wherein the engineered bacteria overexpresses the inulin- specific PUL when exposed to an inulin glycan, or derivatives thereof.

23. The system of any one of claims 1-22, wherein the engineered bacteria overexpresses the arabinogalactan-specific PUL when exposed to an arabinogalactan, or derivatives thereof.

24. The system of any one of claims 1-23, wherein the engineered bacteria overexpresses the dextran-specific PUL when exposed to a dextran glycan, or derivatives thereof.

25. The system of any one of claims 1-24, wherein the engineered bacteria overexpresses the hyaluronan(HA)-specific PUL when exposed to a HA glycan, or derivatives thereof.

26. The system of any one of claims 1-25, wherein the engineered bacteria overexpresses the heparan sulfate (HS)-specific PUL when exposed to a HS glycan, or derivatives thereof.

27. The system of any one of claims 1-26, wherein the engineered bacteria overexpresses the O- glycan(OG)-specific PUL when exposed to an OG glycan mixture, or derivatives thereof.

28. The system of any one of claims 1-27, wherein the engineered bacteria emits a light signal when overexpressing any one PUL of claims 20-27.

29. The system of any one of claims 1-28, wherein the light signal is decreased when the glycan molecule is broken down into a monosaccharide molecule.

30. A method of detecting a glycan, the method comprising: isolating and preserving a tissue sample from a subject, preparing and culturing the glycan-sensing system of any one of claims 1-29 in a microplate in an anaerobic environment, adding the tissue sample into the microplate, and detecting and quantifying a light signal emitted from the system.

31. The method of claim 30, wherein the tissue sample comprises at least one glycan.

32. The method of claim 30 or 31, wherein the light signal is detected and quantified using a microplate reader, a spectrophotometer, or an instrument capable of measuring the light signal.

33. The method of any one of claims 30-32, wherein the tissue sample comprises a tissue biopsy, a blood sample, or a saliva sample.

34. The method of any one of claims 30-33, wherein the method prevents, treats, or reduces a gastrointestinal disease or disorder in the subject.

35. A method of treating or preventing a gastrointestinal disease or disorder in a subject in need thereof, the method comprising: isolating and preserving a tissue sample from a subject, preparing and culturing the glycan-sensing system of any one of claims 1-29 in a microplate in an anaerobic environment, adding the tissue sample into the microplate, detecting and quantifying a light signal emitted from the system, and performing or administering a therapeutic procedure to the subject comprising pathological amounts of a glycan relative to a control tissue comprising normal amounts of the glycan.

36. The method of claim 35, wherein the tissue sample comprises at least one glycan.

37. The method of claim 35 or 36, wherein the light signal is detected and quantified using a microplate reader, a spectrophotometer, or an instrument capable of measuring the light signal.

38. The method of any one of claims 35-37, wherein the tissue sample comprises a tissue biopsy, a blood sample, or a saliva sample.

39. The method of any one of claims 35-38, wherein the therapeutic procedure comprises a surgical procedure, a therapeutic agent, a lifestyle change, or a combination thereof.

40. The method of any one of claims 35-39, wherein the surgical procedure comprises bariatric surgery, esophageal surgery, foregut surgery, hernia surgery, gastroesophageal reflux disease surgery, surgical nutrition, colorectal surgery, gallbladder surgery, liver surgery, bile duct surgery, pancreatic surgery, tumor excision, or combinations thereof.

41. The method of any one of claims 35-40, wherein the therapeutic agent comprises a laxative, an antacid, a proton pump inhibitor, a histamine blocker, an anti-inflammatory agent, a steroid, a vitamin, an intravenous fluid, an antibiotic, a probiotic, or combinations thereof.

42. The method of any one of claims 35-41, wherein the lifestyle change comprises a dietary alteration, an exercise program, a sedentary lifestyle, or combinations thereof.

43. The method of any one of claims 35-42, wherein the gastrointestinal disease or disorder comprises heartburn, irritable bowel syndrome, lactose intolerance, gallstones, cholecystitis, cholangitis, anal fissure, hemorrhoids, proctitis, colon polyps, infective colitis, ulcerative colitis, ischemic colitis, Crohn’s disease, radiation colitis, celiac disease, diarrhea (chronic or acute), constipation (chronic or acute), diverticulosis, diverticulitis, acid reflux (gastroesophageal reflux (GER) or gastroesophageal reflux disease (GERD)), Hirschsprung disease, abdominal adhesions, achalasia, acute hepatic porphyria (AHP), anal fistulas, bowel incontinence, centrally mediated abdominal pain syndrome (CAPS), clostridioides difficile infection, cyclic vomiting syndrome (CVS), dyspepsia, eosinophilic gastroenteritis, globus, inflammatory bowel disease, malabsorption, scleroderma, or volvulus.

44. The method of any one of claims 35-43, wherein the subject is a human.

45. A kit for detecting or purifying a glycan molecule, the kit comprising: an apparatus comprising a binding medium and a surface glycan binding protein (SGBP), wherein the binding medium comprises a matrix of biomolecules, the SGBP comprises a peptide label, and the SGBP is attached to the matrix, a container comprising an equilibration buffer, a container comprising a releasing buffer, and a container comprising a washing buffer.

46. The kit of claim 45, wherein the glycan molecule comprises chondroitin sulfate, arabinan, heparan, hyaluronan sulfate, fructan, levan, mannan, or derivatives thereof.

47. The kit of claim 45 or 46, wherein the SGBP comprises a chondroitin sulfate (CS)-specific SGBP, a levan- specific SGBP, an inulin-specific SGBP, an arabinogalactan-specific SGBP, a dextran-specific SGBP, a hyaluronan(HA)-specific SGBP, a heparan sulfate (HS)-specific SGBP, an O-glycan(OG)-specific SGBP, or variants thereof.

48. The kit of any one of claims 45-47, wherein the matrix comprises a nickel-conjugated agarose polymer.

49. The kit of any one of claims 45-48, wherein the peptide label comprises a hexa-histidine label.

50. The kit of any one of claims 45-49, wherein the releasing buffer comprises a histidine solution.

51. The kit of any one of claims 45-50, wherein the washing buffer comprises a buffered solution comprising water, phosphate buffered saline solution, a tris-buffered saline solution, or variants thereof.

52. The kit of any one of claims 45-51, wherein the kit is combined with the glycan-sensing system of any one of claims 1-29 to detect the glycan molecule.

53. A reporter plasmid comprising a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter.

54. The reporter plasmid of claim 53, further comprising a Bacteroides-optimi^ed lux (Bolux) plasmid.

55. The reporter plasmid of claim 53 or 54, wherein the luciferase reporter cassette comprises one or more genes selected from lux C, lux D, lux E, lux A, and lux B.

56. The reporter plasmid of any of claims 53-55, wherein the luciferase reporter cassette originated from a Photorhabdus luminescens (PI) bacteria.

57. The reporter plasmid of any one of claims 53-56, wherein the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor.

58. The reporter plasmid of any one of claims 53-57, wherein the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin-specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof.

59. The reporter plasmid of any one of claims 53-58, wherein the first glycan specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS- specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

Ill

60. The reporter plasmid of any one of claims 53-59, further comprising a second PUL.

61. The reporter plasmid of any one of claims 53-60, wherein the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin-specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof.

62. The reporter plasmid of any one of claims 53-61, wherein the first PUL is interchangeable with the second PUL.

63. The reporter plasmid of any one of claims 53-62, further comprising a second glycan-specific promoter.

64. The reporter plasmid of any one of claims 53-63, wherein the second glycan-specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS- specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

65. The reporter plasmid of any one of claims 53-64, wherein the first glycan-specific promoter is interchangeable with the second glycan-specific promoter.

66. The reporter plasmid of any one of claims 53-65, wherein the first or second glycan-specific promoter comprises SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 76, SEQ ID NO: 262, SEQ ID NO: 263, SEQ ID NO: 264, SEQ ID NO: 265, SEQ ID NO: 266, SEQ ID NO: 267, SEQ ID NO: 268, SEQ ID NO: 269, SEQ ID NO: 270, SEQ ID NO: 271, SEQ ID NO: 272, SEQ ID NO: 273, SEQ ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 277, SEQ ID NO: 278, SEQ ID NO: 279, SEQ ID NO: 280, SEQ ID NO: 281, SEQ ID NO: 282, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 285, SEQ ID NO: 286, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 290, SEQ ID NO: 291, SEQ ID NO: 292, SEQ ID NO: 293, SEQ ID NO: 294, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 299, SEQ ID NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 318, SEQ ID NO: 319, SEQ ID NO: 320, SEQ ID NO: 321, SEQ ID NO: 322, SEQ ID NO: 323, SEQ ID NO: 324, SEQ ID NO: 325, SEQ ID NO: 326, SEQ ID NO: 327, SEQ ID NO: 328, SEQ ID NO: 329, SEQ ID NO: 330, SEQ ID NO: 331, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 334, SEQ ID NO: 335, SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 338, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 342, SEQ ID NO: 343, SEQ ID NO: 344, SEQ ID NO: 345, SEQ ID NO: 346, SEQ ID NO: 347, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 350, SEQ ID NO: 351, SEQ ID NO: 352, SEQ ID NO: 353, or SEQ ID NO: 354.

67. An engineered bacteria comprising the reporter plasmid of any one of claims 53-66.

68. An engineered bacteria comprising a reporter plasmid comprising a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter.

69. The engineered bacteria of claim 68, further comprising a Bacteroides-optimized lux (Bolux) plasmid.

70. The engineered bacteria of claim 68 or 69, wherein the luciferase reporter cassette comprises one or more genes selected from lux C, lux D, lux E, lux A, and lux B.

71. The engineered bacteria of any of claims 68-70, wherein the luciferase reporter cassette originated from a Photorhabdus luminescens (PI) bacteria.

72. The engineered bacteria of any of claims 68-71, wherein the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor.

73. The engineered bacteria of any of claims 68-72, wherein the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin- specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof.

74. The engineered bacteria of any of claims 68-73, wherein the first glycan specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS- specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

75. The engineered bacteria of any of claims 68-74, further comprising a second PUL.

76. The engineered bacteria of any of claims 68-75, wherein the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin-specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof.

77. The engineered bacteria of any of claims 68-76, wherein the first PUL is interchangeable with the second PUL.

78. The engineered bacteria of any of claims 68-77, further comprising a second glycan-specific promoter.

79. The engineered bacteria of any of claims 68-78, wherein the second glycan-specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS- specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

80. The engineered bacteria of any of claims 68-79, wherein the first glycan-specific promoter is interchangeable with the second glycan-specific promoter.

81. The engineered bacteria of any of claims 68-80, wherein the first or second glycan-specific promoter comprises SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 76, SEQ ID NO: 262, SEQ ID NO: 263, SEQ ID NO: 264, SEQ ID NO: 265, SEQ ID NO: 266, SEQ ID NO: 267, SEQ ID NO: 268, SEQ ID NO: 269, SEQ ID NO: 270, SEQ ID NO: 271, SEQ ID NO: 272, SEQ ID NO: 273, SEQ ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 277, SEQ ID NO: 278, SEQ ID NO: 279, SEQ ID NO: 280, SEQ ID NO: 281, SEQ ID NO: 282, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 285, SEQ ID NO: 286, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 290, SEQ ID NO: 291, SEQ ID NO: 292, SEQ ID NO: 293, SEQ ID NO: 294, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 299, SEQ ID NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 318, SEQ ID NO: 319, SEQ ID NO: 320, SEQ ID NO: 321, SEQ ID NO: 322, SEQ ID NO: 323, SEQ ID NO: 324, SEQ ID NO: 325, SEQ ID NO: 326, SEQ ID NO: 327, SEQ ID NO: 328, SEQ ID NO: 329, SEQ ID NO: 330, SEQ ID NO: 331, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 334, SEQ ID NO: 335, SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 338, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 342, SEQ ID NO: 343, SEQ ID NO: 344, SEQ ID NO: 345, SEQ ID NO: 346, SEQ ID NO: 347, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 350, SEQ ID NO: 351, SEQ ID NO: 352, SEQ ID NO: 353, or SEQ ID NO: 354.

82. The engineered bacteria of any of claims 67-81, wherein the bacteria originates from a gastrointestinal Bacteroides bacterial strain comprising Bacteroides thetaiotaomicron (Bt), Bacteroides ovatus (Bo), Bacteroides fragilis (Bf), Bacteroides caccae (Be), or a variant strain thereof.

83. The engineered bacteria of any of claims 67-82, wherein the bacteria overexpresses a chondroitin sulfate (CS)-specific PUL when exposed to a CS glycan, or derivatives thereof.

84. The engineered bacteria of any of claims 67-82, wherein the bacteria overexpresses a levanspecific PUL when exposed to a levan glycan, or derivatives thereof.

85. The engineered bacteria of any of claims 67-82, wherein the bacteria overexpresses an inulinspecific PUL when exposed to an inulin glycan, or derivatives thereof.

86. The engineered bacteria of any of claims 67-82, wherein the bacteria overexpresses an arabinogalactan-specific PUL when exposed to an arabinogalactan, or derivatives thereof.

87. The engineered bacteria of any of claims 67-82, wherein the bacteria overexpresses a dextranspecific PUL when exposed to a dextran glycan, or derivatives thereof.

88. The engineered bacteria of any of claims 67-82, wherein the bacteria overexpresses a hyaluronan(HA)-specific PUL when exposed to a HA glycan, or derivatives thereof.

89. The engineered bacteria of any of claims 67-82, wherein the bacteria overexpresses a heparan sulfate (HS)-specific PUL when exposed to a HS glycan, or derivatives thereof.

90. The engineered bacteria of any of claims 67-89, wherein the bacteria overexpresses an O- glycan(OG)-specific PUL when exposed to an OG glycan mixture, or derivatives thereof.

91. The engineered bacteria of any one of claims 67-90, wherein the engineered bacteria emits a light signal when overexpressing any one PUL of claims 83-90. 92. A glycan-sensing system comprising the reporter plasmid of any of claims 53-66 and/or the engineered bacteria of any claims 67-91.

Description:
HARNESSING GUT MICROBES FOR GLYCAN DETECTION AND QUANTIFICATION

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH

This invention was made with government support under Grant No. AI149319 awarded by the National Institutes of Health/NIAID. The Government has certain rights in the invention.

RELATED APPLICATION

This PCT application claims priority to, and the benefit of, U.S. Provisional Patent Application No.63/328,555, filed April 07, 2022, entitled “HARNESSING GUT MICROBES FOR GLYCAN DETECTION AND QUANTIFICATION,” which is incorporated by reference herein in its entirety.

REFERENCE TO SEQUENCE LISTING

The sequence listing submitted on April 7, 2023, as an .XML file entitled “11196- 101WOl_Sequence_Listing.xml” created on April 7, 2023, and having a file size of 360,902 bytes is hereby incorporated by reference pursuant to 37 C.F.R. § 1.52(e)(5).

FIELD

The present disclosure relates compositions and methods of engineering microbial strains to detect and quantify glycan molecules.

BACKGROUND

Characterization of glycan molecules present in heterogenous mixtures has generally been performed using lectins, nuclear magnetic resonance, or liquid or gas chromatography (LC or GC, respectively) followed by mass spectrometry methods. Using these methods are proven to be expensive and requires tremendous investments in expertise, instrumentation, and consumable products. Furthermore, these approaches require glycan separation and derivatization prior to implementation, which can result in non-uniform detection and often necessitate tandem parallel approaches to achieve high-confidence results. Detection of glycan molecules in heterogenous mixtures allows for profiling microbes present in a patient, specifically in the gastrointestinal tract of the patient. Further, glycan detection can allow for early detection, treatment, and/or prevention of pathogenic microbes within a patient. Given the limitations described above, there is a need to effectively and efficiently detect and quantify glycans in heterogenous mixtures using improved and optimized methods.

The compositions and methods disclosed herein address these needs.

SUMMARY

The present disclosure provides a system for sensing and detecting glycan molecules using a bacterial reporter plasmid.

In one aspect, one aspect, disclosed herein are reporter plasmids comprising a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter.

Also disclosed herein are reporter plasmids of any preceding aspect, wherein the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor.

In one aspect disclosed herein are reporter plasmids of any preceding aspect, wherein the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin-specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof.

Also disclosed herein are reporter plasmids of any preceding aspect, wherein the first glycan specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran-specific promoter, a HA-specific promoter, a HS-specific promoter, an O-glycan(OG)-specific promoter, or variants thereof including, but not limited to glycan specific promoters comprising the nucleic acids sequence as set forth in SEQ ID NOs 74-76, or SEQ ID NOs: 262-354.

In one aspect disclosed herein are reporter plasmids of any preceding aspect, wherein the reporter plasmid further comprises a second PUL. In some embodiments, the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin- specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof. In some embodiments, the first PUL is interchangeable with the second PUL.

Also disclosed herein are reporter plasmids of any preceding aspect, wherein the reporter plasmid further comprises a second glycan-specific promoter. In some embodiments, the first glycan- specific promoter is interchangeable with the second glycan-specific promoter. In some embodiments, the second glycan-specific promoter comprises a CS-specific promoter, a levanspecific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS-specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

In one aspect, disclosed herein are reporter plasmids of any preceding aspect, wherein the second glycan-specific promoter comprises the nucleic acids sequence as set forth in SEQ ID NOs 74-76, or SEQ ID NOs: 262-354.

Also disclosed herein are reporter plasmids of any preceding aspect, wherein the reporter plasmid comprises a Bmerozrtes-optimized lux (Bolux) plasmid.

Also disclosed herein are engineered bacteria comprising the reporter plasmid of any preceding aspect. For example, disclosed herein are engineered bacteria harboring a reporter plasmid wherein the reporter plasmid comprises a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter. In some aspects, the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor. In some embodiments, the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin-specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof. In some aspects, the first glycan specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran-specific promoter, a HA-specific promoter, a HS-specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

In one aspect, disclosed herein are engineered bacteria of any preceding aspect, wherein the first glycan-specific promoter comprises the nucleic acids sequence as set forth in SEQ ID NOs 74- 76, or SEQ ID NOs: 262-354.

Also disclosed herein are engineered bacteria of any preceding aspect, further comprising a second PUL. In some aspects, the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin- specific PUL, an arabinogalactan-specific PUL, a dextran-specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)- specific PUL, or variants thereof. In some aspects, the first PUL is interchangeable with the second PUL.

In one aspect, disclosed herein are engineered bacteria of any preceding aspect, further comprising a second glycan-specific promoter. In some embodiments, the first glycan-specific promoter is interchangeable with the second glycan-specific promoter. In some aspects, the second glycan-specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin- specific promoter, an arabinogalactan-specific promoter, a dextran-specific promoter, a HA-specific promoter, a HS-specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

Also disclosed herein are engineered bacteria of any preceding aspect, wherein the second glycan-specific promoter comprises the nucleic acids sequence as set forth in SEQ ID NOs 74-76, or SEQ ID NOs: 262-354.

In some aspects, disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria originates from a gastrointestinal bacterium. In some aspects, the engineered bacteria originates from a Bacteroides bacterium including, but not limited to Bacteroides thetaiotaomicron (Bt), Bacteroides ovatus (Bo), Bacteroides fragilis (Bf), Bacteroides caccae (Be) or a variant strain thereof.

Also disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria overexpresses the chondroitin sulfate (CS)-specific PUL when exposed to a CS glycan, or derivatives thereof.

In some aspects, disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria overexpresses the levan- specific PUL when exposed to a levan glycan, or derivatives thereof.

Also disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria overexpresses the inulin-specific PUL when exposed to an inulin glycan, or derivatives thereof.

In some aspects, disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria overexpresses the arabinogalactan- specific PUL when exposed to an arabinogalactan, or derivatives thereof.

Also disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria overexpresses the dextran-specific PUL when exposed to a dextran glycan, or derivatives thereof.

In some aspects, disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria overexpresses the hyaluronan(HA)-specific PUL when the engineered bacteria is exposed to a HA glycan, or derivatives thereof.

Also disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria overexpresses the heparan sulfate (HS)-specific PUL when exposed to a HS glycan, or derivatives thereof.

In some aspects, disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria overexpresses the O-glycan(OG)-specific PUL when exposed to an OG glycan mixture, or derivatives thereof. Also disclosed herein are engineered bacteria of any preceding aspect, wherein the engineered bacteria emits a light signal when overexpressing any PUL of any preceding aspect. In some aspects, the light signal is decreased when the glycan molecule is broken down into a monosaccharide molecule.

In one aspect, disclosed herein are glycan-sensing systems comprising the engineered bacteria or reporter plasmid of any preceding aspect. For example, disclosed herein is a glycan-sensing system comprising an engineered bacteria harboring a reporter plasmid, wherein the reporter plasmid comprises a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter, wherein the glycan-sensing system detects and quantifies a glycan molecule.

In some embodiments, the glycan molecule comprises chondroitin sulfate, arabinan, heparan sulfate, hyaluronan, fructan, levan, mannan, or derivatives thereof. In some embodiments, the glycan molecule is located in the gastrointestinal tract of a mammal.

In some embodiments, the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor.

In some embodiments, the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin- specific PUL, an arabinogalactan-specific PUL, a dextran-specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)- specific PUL, or variants thereof.

In some embodiments, the first glycan specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin- specific promoter, an arabinogalactan-specific promoter, a dextran-specific promoter, a HA-specific promoter, a HS-specific promoter, an O-glycan(OG)- specific promoter, or variants thereof.

In some embodiments, the system further comprises a second PUL. In some embodiments, the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulinspecific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof. In some embodiments, the first PUL is interchangeable with the second PUL.

In some embodiments, the system further comprises a second glycan-specific promoter. In some embodiments, the first glycan-specific promoter is interchangeable with the second glycan- specific promoter. In some embodiments, the second glycan-specific promoter comprises a CS- specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS-specific promoter, an O- glycan(OG)-specific promoter, or variants thereof.

In some embodiments, the first or second glycan- specific promoter comprises the nucleic acids sequence as set forth in SEQ ID NOs 74-76, or SEQ ID NOs: 262-354.

In some embodiments, the reporter plasmid comprises a Bacteroides-optimized lux (Bolux) plasmid.

In some embodiments, the engineered bacteria originates from a gastrointestinal bacterium. In some embodiments, the engineered bacteria originates from a Bacteroides bacterium. In some embodiments, the engineered bacteria originates from a Bacteroides thetaiotaomicron (Bt), Bacteroides ovatus (Bo), Bacteroides fragilis (Bf), Bacteroides caccae (Be) or a variant strain thereof.

In some embodiments, the engineered bacteria are grown in an anaerobic environment. In some embodiments, the engineered bacteria are grown in a 96- well, a 384-well, or a 1536-well microplate.

In some embodiments, the engineered bacteria overexpresses the chondroitin sulfate (CS)- specific PUL when exposed to a CS glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the levan-specific PUL when exposed to a levan glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the inulin- specific PUL when exposed to an inulin glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the arabinogalactan-specific PUL when exposed to an arabinogalactan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the dextran-specific PUL when exposed to a dextran glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the hyaluronan(HA)-specific PUL when the engineered bacteria is exposed to a HA glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the heparan sulfate (HS)- specific PUL when exposed to a HS glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the O-glycan(OG)-specific PUL when exposed to an OG glycan mixture, or derivatives thereof.

In some embodiments, the engineered bacteria emits a light signal when overexpressing any PUL of any preceding aspect.

In some embodiments, the light signal is decreased when the glycan molecule is broken down into a monosaccharide molecule. In one aspect, disclosed herein is a method of detecting a glycan, the method comprising isolating and preserving a tissue sample from a subject, preparing, and culturing the glycan-sensing system of any preceding aspect in a microplate in an anaerobic environment, adding the tissue sample into the microplate, and detecting and quantifying a light signal emitted from the system of any preceding aspect.

In some embodiments, the tissue sample comprises at least one glycan. In some embodiments, the light signal is detected and quantified using a microplate reader, a spectrophotometer, or an instrument capable of measuring the light signal. In some embodiments, the tissue sample comprises a tissue biopsy, a blood sample, or a saliva sample.

In some embodiments, the method prevents, treats, or reduces a gastrointestinal disease or disorder in the subject.

In one aspect, disclosed herein is a method of treating or preventing a gastrointestinal disease or disorder in a subject in need thereof, the method comprising isolating and preserving a tissue sample from a subject, preparing and culturing the glycan-sensing system of any preceding aspect in a microplate in an anaerobic environment, adding the tissue sample into the microplate, detecting and quantifying a light signal emitted from the system, and performing or administering a therapeutic procedure to the subject comprising pathological amounts of a glycan relative to a control tissue comprising normal amounts of the glycan.

In some embodiments, the tissue sample comprises at least one glycan. In some embodiments, the light signal is detected and quantified using a microplate reader, a spectrophotometer, or an instrument capable of measuring the light signal. In some embodiments, the tissue sample comprises a tissue biopsy, a blood sample, or a saliva sample.

In some embodiments, the therapeutic procedure comprises a surgical procedure, a therapeutic agent, a lifestyle change, or a combination thereof.

In some embodiments, the surgical procedure comprises bariatric surgery, esophageal surgery, foregut surgery, hernia surgery, gastroesophageal reflux disease surgery, surgical nutrition, colorectal surgery, gallbladder surgery, liver surgery, bile duct surgery, pancreatic surgery, tumor excision, or combinations thereof. In some embodiments, the therapeutic agent comprises a laxative, an antacid, a proton pump inhibitor, a histamine blocker, an anti-inflammatory agent, a steroid, a vitamin, an intravenous fluid, an antibiotic, a probiotic, or combinations thereof. In some embodiments, the lifestyle change comprises a dietary alteration, an exercise program, a sedentary lifestyle, or combinations thereof.

In some embodiments, the gastrointestinal disease or disorder comprises heartbum, irritable bowel syndrome, lactose intolerance, gallstones, cholecystitis, cholangitis, anal fissure, hemorrhoids, proctitis, colon polyps, infective colitis, ulcerative colitis, ischemic colitis, Crohn’s disease, radiation colitis, celiac disease, diarrhea (chronic or acute), constipation (chronic or acute), diverticulosis, diverticulitis, acid reflux (gastroesophageal reflux (GER) or gastroesophageal reflux disease (GERD)), Hirschsprung disease, abdominal adhesions, achalasia, acute hepatic porphyria (AHP), anal fistulas, bowel incontinence, centrally mediated abdominal pain syndrome (CAPS), clostridioides difficile infection, cyclic vomiting syndrome (CVS), dyspepsia, eosinophilic gastroenteritis, globus, inflammatory bowel disease, malabsorption, scleroderma, or volvulus.

In some embodiments, the subject is a human.

In one aspect, disclosed herein is a kit for detecting or purifying a glycan molecule, the kit comprising an apparatus comprising a binding medium and a surface glycan binding protein (SGBP), wherein the binding medium comprises a matrix of biomolecules, the SGBP comprises a peptide label, and the SGBP is attached to the matrix; a container comprising an equilibration buffer, a container comprising a releasing buffer, and a container comprising a washing buffer.

In some embodiments, the glycan molecule comprises chondroitin sulfate, arabinan, heparan sulfate, hyaluronan, fructan, levan, mannan, or derivatives thereof.

In some embodiments, the SGBP comprises a chondroitin sulfate (CS)-specific SGBP, a levan-specific SGBP, an inulin- specific SGBP, an arabinogalactan-specific SGBP, a dextran-specific SGBP, a hyaluronan(HA)-specific SGBP, a heparan sulfate (HS)-specific SGBP, an O-glycan(OG)- specific SGBP, or variants thereof.

In some embodiments, the matrix comprises a nickel-conjugated agarose polymer. In some embodiments, the peptide label comprises a hexa-histidine label. In some embodiments, the releasing buffer comprises a histidine solution. In some embodiments, the washing buffer comprises a buffered solution comprising water, phosphate buffered saline solution, a tris-buffered saline solution, or variants thereof.

In some embodiments, the kit is combined with the glycan-sensing system of any preceding aspect to detect the glycan molecule.

BRIEF DESCRIPTION OF FIGURES

The accompanying figures, which are incorporated in and constitute a part of this specification, illustrate several aspects described below.

Figures 1A,1B, 1C, ID, IE, IF, 1G, 1H, and II show the (FIG. 1A) PUL-encoded surface glycan binding proteins (SGBPs) sequester target glycans prior to their translocation across the outer membrane by the corresponding SusC channel. Once in the periplasm, the glycan is depolymerized into mono, di-, or oligo-saccharide signatures that bind their cognate sensor, which subsequently activates PUL transcription in the cytoplasm. The sensor ligand is subsequently further degraded into monosaccharide components and transported across the inner membrane, where they enter central metabolism, thereby deactivating the sensor. FIG. IB shows a cartoon depicting the strategy employed in this proposal to harness the PUL-encoded glycan detection machinery to report the presence of distinct glycans. FIG. 1C shows the growth of wild-type (wt, black) or mutant Bt strains deficient for the PUL-encoded susC transporter (ΔBT3332, blue), lyases (ΔBT 3324 ΔBT3350 T2BT4410, purple), sensor (ΔBT3334, red) or glycosyl hydrolase (gh, ΔBT3348, orange) supplied CS as the sole carbon source. n=8, error bars are SEM. FIG. ID shows the fold increase in susC transcript levels were measured by qPCR 120 minutes following the introduction of 0.2% CS to cultures of wild-type Bt or a strain deficient for the corresponding PUL sensor, BT3334. FIG. IE shows the fold change in susC transcript levels were measured by qPCR 120 minutes following the introduction of 0.2, 0.02, or 0.002% CS prepared with galactose to a final carbohydrate content of 0.5%. FIG. IF shows the fold increase in susC transcript levels were measured by qPCR 120 minutes following the introduction of 0.2% levan to cultures of wild-type Bt or a strain deficient for the corresponding PUL sensor, BT1754. FIG. 1G shows the growth of wild-type (wt, black) or mutant Bl strains deficient for the PUL-encoded susC transporter (ΔBT 1763, blue), levanases (ΔBT1760-59 ΔBT3082 ΩBT1765, purple), sensor (ΔBT1754, orange) or fructose importer (ΩBT1758, purple) supplied levan as the sole carbon source. n=8, error bars are SEM. FIGS. 1H and II show the fold change in susC transcript levels were measured by qPCR after (FIG. 1H) 120 or (FIG. II) 60 minutes following the introduction of 0.2, 0.02, or 0.002% levan prepared with galactose to a final carbohydrate content of 0.5%. For FIGS. 1D-1G, n=6, error bars are SEM, p-values were calculated by paired two-tailed Student’s t- test, and R 2 values were computed by Microsoft Excel power trendline functions, ns indicates values < 0.05, * < 0.05, « < 0.01, and *** < 0.001.

Figures 2 A, 2B, and 2C show the construction of a Bacteroides-optimized bioluminescent reporter. FIG. 2A shows the schematic depicting the construction of a bioluminescent reporter that encodes the entire Pl lux cassette under control of the Bt rpoD promoter and rpiL* RBS (top); a Bacteroides optimized lux cassette with rearranged ZMXA-E (indicated by the shaded regions) and Bt intergenic regions from a constitutively expressed Bt operon (BT1160-1155; middle); or pBolux which has BamHI and Spel sites positioned upstream of the Bacteroides optimized lux cassette (bottom) in the multi-copy plasmid pLYLOl. FIG. 2B shows the relative luminescence (solid lines) or growth (dashed lines) from Bt strains harboring an empty vector (black) or plasmids containing either the lux operon from Pl (blue) or the Bacteroides-optimized lux cassette (red) expressed from the Bt rpoD promoter and rpiL* RBS were measured during growth in minimal media containing 0.5% galactose as the sole carbon source. FIG. 2C shows the relative luminescence (solid lines) or growth (dashed lines) of Bt strains harboring empty pBolux (black) or a plasmid with the Bt rpoD promoter cloned into the BamHI and Spel sites (red) during growth in galactose as the sole carbon source. All values in FIGS. 2B and 2C are the mean of 8 biological replicates and error bars are SEM in color-matched shading. Source data are provided as a Source Data file.

Figures 3A, 3B, 3C, 3D, 3E, and 3F show the construction of a glycan-responsive reporter in Bt. FIG. 3A shows the relative luminescence from wild-type Bt strains harboring pBolux (black) or a plasmid including the promoter region preceding the CS-inducible susC gene (P-BT3332, pink lines) following the introduction of CS as the sole carbon source normalized by the relative luminescence of identical cultures supplied galactose. Values are the mean of 12 biological replicates and error is SEM in color-matched shading. FIG. 3B shows the relative luminescence from wild-type Bt (black) or strains lacking a CS-inducible susC (ΔBT3332. blue), 3 CS-specific lyases (ΔBT3324 ΔBT3350 £2BT4410, purple), CS-sensor (ΔBT3334, red), or a glucuronyl hydrolase (ΔBT3348, orange) harboring P-BT3332 following the introduction of an equal mixture of CS and galactose normalized to measurements from identical strains supplied galactose alone. FIG. 3C shows the relative luminescence from wild-type Bl (black) or strains lacking 3 CS-specific lyases (ΔBT3324 /1BT3350 ΩBT441G, purple) or the CS-sensor (ΔBT3334, red) harboring P-BT3332 following the introduction of a mixture of unsulfated CS disaccharide (diOS) and galactose normalized to measurements from identical strains supplied galactose alone. Values are the mean of 6 biological replicates and error is SEM in color-matched shading. FIGS. 3D-3F shows the relative luminescence from wild-type Bt (solid lines) or strains lacking the CS-sensor (ΔBT3334, dashed lines) harboring P-BT3332 following the introduction of a mixture of galactose and either hyaluronic acid (HA, green) or heparin (blue) and galactose normalized to measurements from identical strains supplied galactose alone. For FIGS. 3B and 3D, values are the mean of 8 biological replicates and error is SEM in color-matched shading. Source data are provided as a Source Data file.

Figures 4A, 4B, 4C, 4D, 4E, 4F, 4G, and 4H show the levan-responsive reporter reveals multiple levanases coordinate fructan utilization in Bt. FIG. 4A shows the relative luminescence from wild-type Bt harboring pBolux (black) or a plasmid including the promoter region preceding the levan inducible susC gene (P-BT1763, pink) following the introduction of levan as the sole carbon source and normalized by the relative luminescence of identical cultures supplied galactose. Values are the mean of 12 biological replicates and error is SEM in color-matched shading. FIGS. 4B and 4C show the relative luminescence from wild-type Bt (black) or strains lacking the levan-inducible susC (ΔBT 1763, blue), 4 levan-specific hydrolases (ΔBT1760-1759 ΔBT3082 ΩBT1765, purple), fructan sensor (ΔBT1754, red), or a putative inner membrane fructose transporter (T2BT1758, orange) harboring P-BT1763 were measured following the introduction of an equal mixture of galactose and (FIG. 4B) levan or (FIG. 4C) fructose and normalized by the relative luminescence of identical cultures supplied galactose alone. FIG. 4D shows the growth of wild-type Bl (black) or strains lacking the levan- inducible susC ΔBT 1763, blue), 4 levan-specific hydrolases (ΔBT1760-1759 ΔBT3082 ΩBT1765. purple), fructan sensor (ΔBT 1754. red), or a putative inner membrane fructose transporter (ΩBT1758, orange) were measured during anaerobic culture in minimal media containing fructose as a sole carbon source. FIG. 4E shows the relative luminescence of wild-type Bt or strains lacking all other levanases except BT1760 (ΔBT 1759 ΔBT3082 ΩBT1765. pink), BT1759 (ΔBT 1760 ΔBT 3082 ΩBT1765, teal), BT3082 (ΔBT 1760-59 ΩBT1765, lavender), or BT1765 (ΔBT 1760-59 ΔBT3082, purple) harboring the levan-responsive reporter following the introduction of a mixture of levan and galactose normalized with measurements from identical cultures supplied galactose alone. FIG. 4F- 4H show the relative luminescence of wild-type Bt or strains lacking BT1760 (pink), BT1759 (teal), BT3082 (lavender), or BT1765 (purple) harboring P-BT1763 following the introduction of a mixture of levan and galactose normalized with measurements from identical cultures supplied galactose alone. For FIGS. 4B-4F, values are the mean of 8 biological replicates and error is SEM in color- matched shading. Source data are provided as a Source Data file.

Figures 5A, 5B, 5C, 5D, 5E, and 5F show the species-specific responses that enable PUL reporters to distinguish between compositionally identical yet structurally distinct glycans. FIG. 5A shows the growth of wild-type Bt (gray) and Bo (black) or a strain lacking the Bo inulin sensor (ΔBACO VA_04496, red) were measured during anaerobic culture in minimal media containing inulin as a sole carbon source. FIG. 5B shows the relative luminescence from wild-type Bo harboring y>Bolux (black) or a plasmid including the promoter region preceding the inulin-inducible susC gene (P- BACOVA_04505 , pink) were measured following the introduction of inulin as the sole carbon source and normalized by the relative luminescence from identical cultures supplied galactose. Values are the mean of 12 biological replicates and error is SEM in color-matched shading. FIGS. 4C-4F show the relative luminescence from wild-type Bo harboring P-BACOVA_04505 (black) or an isogenic strain lacking the Bo inulin sensor (ΔBACO VA_04496, red) were measured following the introduction of an equal mixture of galactose and (FIG. 4C) inulin or (FIG. 4D) fructose and normalized by the relative luminescence of identical cultures supplied galactose alone. For FIGS. 4A, 4C, and 4D, values are the mean of 8 biological replicates and error is SEM in color-matched shading. Source data are provided as a Source Data file.

Figures 6A, 6B, 6C, 6D, 6E, and 6F show the PUL-reporters reflect dose-dependent transcription. FIG. 6A shows the relative luminescence from a wild-type Bt strain harboring P-BT3332 following the introduction of 2-fold serial dilutions of 0.4% CS containing galactose to a total carbohydrate content of 0.5% and normalized to identical cultures supplied galactose alone. FIG. 6B shows the relative luminescence from a wild-type Bt strain harboring P-BT1763 following the introduction of 2-fold serial dilutions of 0.4% levan containing galactose to a total carbohydrate content of 0.5% and normalized to identical cultures supplied galactose alone. For FIGS. 6A-6B, values are the mean of 12 biological replicates and error is SEM in color- matched shading. FIG. 6C shows the AUC of response curves measured from wild-type Bt (black, see panel a) or a mutant lacking the glucuronyl hydrolase ( gh, orange, see panel f) harboring P-BT3332 supplied 2-fold dilutions of CS to each strain and normalized by identical cultures supplied galactose alone. FIG. 6D shows the AUC of response curves measured from wild-type Bt (black, see panel b) or a mutant lacking a putative inner membrane transporter harboring -BT1763 supplied 2-fold dilutions of levan to each strain and normalized by identical cultures supplied galactose alone. For FIGS. 6C-6D, values are the mean of 12 biological replicates, error bars are standard deviation, and P-values were computed by 2-way ANOVA with Dunnett correction and *** indicates values < 0.001, ** < 0.01, * < 0.05, and ns > 0.05. FIG. 6E The AUC of response curves measured from wild-type Bt strains harboring either P-BT3332 (open blue squares) supplied mixtures containing 2-fold serial dilutions of levan or -BT1763 (open red circles) supplied 2-fold serial dilutions of CS with galactose to a total carbohydrate content of 0.5% and normalized by identical cultures supplied galactose alone. Values are the mean of 6 biological replicates, error bars represent standard deviation, and P-values were computed using 2- way ANOVA with Tukey’s honest significance test and ns indicates values > 0.05. FIG. 6F shows the relative luminescence from a gA-deficient Bt strain harboring P-BT3332 following the introduction of 2-fold serial dilutions of 0.4% CS containing galactose to a final carbohydrate content of 0.5% and normalized to identical cultures supplied galactose alone. Values are the mean of 12 biological replicates and error is SEM in color-matched shading. Source data are provided as a Source Data file.

Figures 7A, 7B, 7C, 7D, 7E, 7F, 7G, 7H, 71, 7J, 7K, and 7L show the development of a glycan affinity isolation strategy using PUL-reporters. FIG. 7A shows a cartoon depicting an affinity isolation strategy whereby recombinant, PUL-encoded SGBP is immobilized using metal affinity chromatography and binds its target when incubated with heterogenous glycan mixtures, which subsequently co-elutes following the addition of excess histidine. FIGS. 7B-7D show the migration of purified recombinant his-tagged BT3330 and BT1761 proteins were examined by affinity-PAGE using acrylamide gels containing (FIG. 7B) CS, (FIG. 7C) no glycan, or (FIG. 7D) levan. FIGS. 7E- 7F show the fold increases in reporter activity (AUC) from Bt strains harboring either a levan-specific reporter (P-BT1763, black bars) or a CS-specific reporter (P-BT3332, black bars) supplied 8 step- wise eluates from (FIG. 7E) BT3330 or (FIG. 7F) BT1761 pre-incubated with an equal mixture of CS and levan normalized by reporter strains supplied eluates from identical reactions containing E. coli lysates harboring an empty vector. Inset: SDS-PAGE analysis of E1-E8 demonstrating the amount of (FIG. 7E) BT3330 or (FIG. 7F) BT1761 present in each fraction. FIGS. 7G-7I show the pooled, concentrated eluates from (FIG. 7G & FIG. 71) BT333O or (FIG. 7H & FIG. 7J) BT1761 pre- incubated with a CS and levan mixture were supplied Bt strains harboring (FIG. 7G) P-BT3332 or (FIG. 7H) P-BT1763 and compared to standard curves of identical strains supplied 2-fold diluted CS or levan, respectively. Alternatively, the material was analyzed for total glycosaminoglycans (FIG. 71) or fructans (FIG. 7J) using commercially available colorimetric assays according to the manufacturer’s instructions. n=4, error bars are standard deviation; p- values were computed using a paired, two-tailed, Student’s t-test; R 2 values were computed using simple linear regression analysis in Prism and the 95% confidence interval is displayed as a dashed line. FIGS. 7K-7E show the pooled, concentrated BT3330 eluates pre-incubated with either (FIG. 7K) a CS and levan mixture or (FIG. 7E) a 0.1% PMOG solution were supplied to reporter strains differentially activated by CS or PMOGs (see FIGS. 8F&G, respectively). For all panels n=4, error bars are standard deviation, p- values were computed using a two-tailed student’s t-test and ns indicates p- values > 0.05, * < 0.05, **, 0.01, *** < 0.001.

Figures 8A, 8B, 8C, 8D, and 8E show the PUEs are required for glycan-specific growth conditions and transcriptional responses. FIG. 8 A shows growth of wild-type Bt (GT23; black) or strains lacking a CS-inducible susC (ΔBT3332; GT2926; blue), 3 CS-specific lyases (ΔBT3324ΔBT3350 ΩBY4410; GT3086; purple), CS-sensor (ΔB T3334-, GT150; red), or a glucuronyl hydrolyase (ΔBT3348-, VR69; orange_ were measured during anaerobic culture in minimal media containing galactose as a sole carbon source. FIG. 8B shows the growth of wild-type Bt (GT231 black) or strains lacking a levan-inducible susC C8BT1763; GT3196; blue), 4 levanases (ΔBT1760-1759ΔBT3082 Ω2BT1765; GT3348; purple), fructan-sensor (ΔBT1754; GT165; red), or a putative inner membrane fructose importer (91BT1758'. GT3379; orange) were measured during anaerobic culture in minimal media containing galactose as a sole carbon source. For FIGS. 8 A and 8B, values are the mean of 8 biological replicates and error bars are SEM in color-matched shading. FIGS. 8C-8E show the fold increase of (FIGS 8C and 8D) BT3332 or (FIG. 8E) BT1763 mRNA levels in wildtype Bt following the introduction of mixtures containing either 0.2%, 0.02%, or 0.02% CS or levan supplemented with galactose to 0.5% total carbohydrate. The fold increase was calculated as the change in transcript levels between cultures before and after 2 hours or 1 hour following induction of glycan mixtures. Values are average of 6 independent measurements, error bars represent SEM, and P-values were calculated by 2-way ANOVA with Tukey’s honest significance test and *** represents values <0.001, *<0.05, and ns indicates values >0.05. Figures 9A, 9B, 9C, and 9D show the bioluminescence during anaerobic growth across Bacleroides species and growth conditions. FIG. 9A shows the relative luminescence (solid lines) or growth (dashed lines) from Bt strains harboring an empty vector (GT1866; black) or plasmids containing either the lux operon from P. luminescens (GT3137; blue) or the Bacteroides- -optimized lux cassette (GT1541; red) expressed from the Bt rpoD promoter and rpiL* RBS was measured during growth in minimal media containing 0.5% glucose. FIG. 9B shows the relative luminescence of Bt strains harboring empty pBolux (GT1867; dashed lines) or a plasmid with the corresponding rpoD promoter cloned into the BamHI and Spel sites (GT1868; solid lines) during growth in glucose (black), fructose (red), arabinose (purple), or xylose (green) as the sole carbon source. FIG. 9C shows the relative luminescence (solid lines) or growth (dashed lines) of Bo strains harboring empty pBolux (GT3489; black) or a plasmid with the Bo rpoD promoter cloned into the BamHI and Spel sites (GT3490; red) during growth in galactose as the sole carbon source. FIG. 9D shows the relative luminescence of Bo strains harboring empty pBolux (GT3489; dashed lines) or a plasmid with the corresponding rpoD promoter cloned into BamHI and Spel sites (GT3490; solid lines) during growth in glucose (black), fructose (red), arabinose (purple), or xylose (green) as the sole carbon source. For all figures, values are the mean of 8 biological replicates and error is SEM in color-matched shading.

Figures 10A, 10B, 10C, 10D, 10E, and 10F show that the BT3332 promoter confers CS and HA- inducible bioluminescence in pBolux. FIGS. 10A and 10B show the relative luminescence from wild-type Bt (GT1934, black lines) or strains lacking the CS-inducible susC (ΔBT3334; GT2939; blue), 3 CS-specific lyases Δ BT3324 ΔBT3350 Q.BT4410', GT3117; purple), CS-sensor ΔB T3334\ BT2618; red), or a glucuronyl hydrolyase (ΔBT3348-. GT3102; orange) harboring pBolux including the promoter region preceding BT3332 (P-BT3332) following the introduction of an equal mixture of CS and galactose or galactose alone. FIG. 10C shows the relative luminescence from wild-type Bt (GT1934) harboring P-BT3332 following the introduction of galactose alone (black lines) or an equal mixture of galactose and HA (green lines) or HS (blue lines). FIGS. 10D AND 10E show the growth of wild-type Bt (GT23; black) or a strain lacking the CS-sensor ( ΔBT3334', GT150; red) were measured during anaerobic culture in minimal media containing HS or HA as a sole carbon source. FIG. 10F the relative luminescence from a CS-sensor deficient strain (J3BT3334-, GT2618) harboring P-BT3332 following the introduction of galactose alone (black lines) or an equal mixture of galactose and HA (green lines) or HS (blue lines). For all figures, values are the mean of 8 biological replicates, error bars are SEM in color-matched shading.

Figures 11 A, 11B, 11C, 11D, HE, 11F, 11G, and 11H show a fructan-responsive reporter reveal new insights into Bt levan utilization. FIG. HA shows the relative luminescence from wildtype Bt harboring empty pBolux (black) or a plasmid including the region upstream of the levan- inducible susC (P-BT1763, pink) following the introduction of 0.5% levan (solid lines) or 0.5% galactose (dashed lines). Values are the mean of 12 biological replicates, error is SEM in color- matched shading. FIGS. 11B and 11C show the growth of wild-type Bt harboring empty pBolux (black) or P-BT1763 (pink) following the introduction of 0.5% galactose or 0.5% levan. For FIGS. 11 A, 11B, and 11C, values are the mean of 12 biological replicates, error is SEM in color-matched shading. FIGS. 11D and 11F show the growth of wild-type Bt or strains lacking all other levanases except BT1760 (ΔBT1759 ΔBT3082 ΩBT1765: GT3347; pink), BT1759 (ΔBT1760 A3082 (21765: GT3346; teal), BT3082 (ΔBT1760-59 ΩBT1765: gt3401; lavender), or BT1765 (NBT1760-59 A3082: GT33O8; purple) in 0.1% fructose or galactose as a sole carbon source. FIGS. HE and 11G show the growth wild-type Bt (GT23, black) or strains lacking either BT1760 (GT3181; pink), BT1759 (GT3226; teal), BT3082 (GT3303; lavender), or BT1765 (GT; purple) 0.1% fructose or galactose as a sole carbon source. FIG. 11H shows the growth of wild-type Bt (GT2111. Black) or BT1760- deficient strains (GT3215; pink) harboring empty pNBU2 or a plasmid encoding BT1760 (GT3216, green) in 0.1% levan as the sole carbon source.

Figures 12A, 12B, 12C, and 12D show a fructan-responsive PUL in Bo exhibits inulin- inducible activity. FIGS. 12A, 12B, and 12C show the growth of wild-type Bo (ATCC 8483; black) or a strain lacking the Bo inulin sensor (ΔBAC OVA _04496: GT3183; red) were measured during anerobic culture in minimal media containing levan, fructose, or galactose as the sole carbon source. Values are the mean of 8 biological replicates and errors bars are SEM in color-matched shading. FIG. 12D shows the relative luminescence from wild-type Bo harboring empty pBolux (black) or a plasmid that includes the region preceding an inulin-inducible susC (P-BACOVA_04505; pink) were measured following the introduction of 0.5% inulin (solid lines) or 0.5% galactose (dashed lines). Values are the mean of 12 biological replicates and error is SEM in color-matched shading.

Figures 13A, 13B, 13C, 13D, 13E, and 13F show that PUL reporters display concentration dependent responses to target glycans. FIG. 13A shows the relative luminescence from a wild-type Bt strain harboring a P-BT1763 (GT1893) following the introduction of mixtures containing 2-fold serial dilutions of 0.4% levan balanced with galactose to a final carbohydrate content of 0.5% and normalized to identical cultures supplied galactose alone. Values are the mean of 12 biological replicates and error bars are SEM in color-matched shading. FIGS. 13B and 13C show the log 10 AUC responses within the linear range from wild-type Bt (black; FIG. 13B: GT1934; FIG. 13C: GT1893) or mutants defective for PUL-sensor deactivation (orange; FIG. 13B: GT3102; FIG. 13C: GT3393) harboring P-BT3332 or P-BT1763 and supplied 2x serial dilutions of 0.4% CS or levan balanced with galactose to 0.5% total carbohydrate content and normalized to response from identical cultures supplied galactose alone. Values are the mean of 12 biological replicates and error is standard deviation. Solid lines represent the simple linear regression models corresponding to responses from each strain and color-matched dashed lines represents the 95% confidence intervals computed in Prism. FIGS. 13D and 13E show the fold difference between the AUC of response from wild-type Bt strains harboring either P-BT3332 (blue squares) or P-BT1763 (red circles) supplied glycan mixtures containing 2-fold serial dilutions of 0.2% CS or levan, respectively, in the presence or absence of constant 0.2% levan or CS, respectively, and balanced with galactose to 0.5% total carbohydrate normalized by the AUC of responses from identical cultures supplied alone. FIG. 13E shows the AUC of responses from wild-type Bt strains harboring either Y-BT3332 (GT1934; blue squares) or P- BT1763 (GT1893; red circles) supplied glycan mixtures containing 2-fold serial dilutions of 0.2% CS or levan, respectively, in the presence of constant 0.2% levan or CS, respectively, and balanced with galactose to 0.5% total carbohydrate normalized by the AUC of responses from identical cultures supplied galactose alone. Values represent the average of 6 biological replicates and error bars are standard deviation. P-values were calculated with 2-way ANOVA with Tukey’s honest significance test and *** represents values <0.001, **<0.01, *<0.05, and ns >0.05. FIG. 13F shows the relative luminescence from a BT175<S-deficient Bl strain harboring P-BT1763 (GT3393) following the introduction of 2-fold serial dilutions of 0.4% levan containing galactose to a final carbohydrate content of 0.5% and normalized to identical cultures supplied galactose alone. Values are the mean of 12 biological replicates and error bars are SEM in color-matched shading.

Figures 14A, 14B, 14C, 14D, 14E, 14F, 14G, 14H, and 141 show the glycan-responsive reporter strains can indicate target glycan abundance. FIGS. 14A and 14B show the AUC of responses from wild-type Bt strains harboring either a P-BT3332 (GT1934; open bars) or P-BT1763 (GT1893, filled bars) supplied elution fractions from nickel-NTA agarose incubated with E. Coli whole cell lysates from strains containing empty pT7-7 vector, or plasmids engineered to overexpress BT1761 or BT3330 and preOincubated with a mixture of 0.1% of both levan and CS. All elution fractions were supplemented with 0.4% galactose. Values represent an average of 4 total replicates from two independent experiments and error bars are SEM. FIG 14C shows the Coomassie stained SDS-PAGE gels showing the corresponding protein levels for BT3330 (top 2 gels) or BT1761 (bottom 2 gels) in each elution fraction. FIGS. 14D and 14E show the AUC responses from wild-type Bt strains harboring either P-BT3332 (GT1934; open bars) or P-BT1763 (GT1893, filled bars) supplied galactose alone or concentrated material co-purifying with BT3330 or BT1761 supplemented with 0.4% galactose. Values are the average of 4 measurements from 2 independent experiments, error is standard deviation and P-values were computed using 1-way ANOVA with Tukey’s honest significance test and *** represents values <0.001 and ns indicates values >0.05. FIG. 14F shows the AUC of responses from a BT334S-deficient Bt strain harboring P-BT3332 supplied galactose alone (none) or concentrated material co-purifying BT1761 supplemented with 0.4% galactose. FIG. 14G shows identical cultures to those described in FIG. 14F were supplied 2-fold serial dilutions of CS containing galactose to 0.5% total carbohydrate content and all measurements were normalized to responses from cultures supplied galactose alone. FIG. 14H shows the AUG of responses from a BT1758-'-deficient Bt strain harboring B-BT1763 supplied galactose alone or concentrated material co-purifying with BT3330 and supplemented with 0.4% galactose. FIG. 141 shows identical cultures to those described in FIG. 14H were supplied 2-fold dilutions of levan containing galactose to 0.5% total carbohydrate content and all measurements were normalized to responses from cultures supplied galactose alone. For FIGS. 14F-14I, values represent the average of 2 measurements from a single experiment and error bars are standard deviation. For FIGS. 14G-14H, solid lines represent simple linear regression models and color- matched dashed lines are the 95% confidence intervals computed in Prism.

Figures 15A, 15B, 15C, and 15D show a Bacteroides transcriptional reporter functions across many species. Relative luminescence (left axis) and growth (right axis) detected from 4 different wildtype Bacteroides species, (FIG. 15A) Bt, (FIG. 15B) Bo, (FIG. 15C) Bf, or (FIG. 15D) Be harboring a promoter-less Bacteroides-optimized lux cassette control plasmid (pBolux, black lines) or one encoding the constitutively active rpoD promoter (red lines) cultured in minimal media containing 0.5% galactose as the sole carbon source. n=4, error bars are SEM.

Figures 16A, 16B, 16C, and 16D show how PUL-reporters respond to target glycans. FIG. 16A shows the reporter activity over 18 hours in wild-type Bt harboring a promoter- less reporter plasmid (pBohtx, gray) or one with the promoter preceding the CS-specific sttsC gene (P-BT3332, black) in 0.5% CS normalized to identical strains supplied 0.5% galactose. n=6. FIG. 16B shows the normalized reporter activity over time harboring the CS-responsive PUL reporter in wild-type Bt (wt, black) or mutants lacking either the CS-sensor (Δsen, red), CS-specific susC gene (ΔsusC, blue), 3 CS-specific lyases (Δlyase, purple), or a CS-disaccharide specific glucuronyl hydrolase (Δgh, orange) supplied a mixture of 0.2% CS and 0.2% galactose and normalized to identical strains supplied 0.4% galactose. FIG. 16C shows the AUC for all strains in (B) were computed for 18 hours following the introduction of CS. n=8. FIG. 16D shows the AUC computed from normalized reporter activity from wild-type Bt (wt, black) or strains lacking 3 CS-specific lyases (Δlyase, purple) or the CS-specific sensor {Asen, red) harboring the B-BT3332 reporter plasmid and supplied CS-disaccharide. n=6. For all panels, error bars are SEM. For (C-D) p- values were calculated using paired, two-tailed student’s t-tests. ns indicates values > 0.05, * < 0.05, ** < 0.01, and *** < 0.001.

Figures 17A, 17B, 17C, 17D, 17E, and 17F show the species-specific PUL-reporters differentially detect compositionally identical but structurally distinct glycans. FIGS. 17A-17C show the growth of wild-type Bt (black) or Bo (gray) strains or mutants lacking the corresponding fructan PUL-sensor genes, BT1754 (BlAsen, red) or BACOVA_04496 (BoAsen, yellow), respectively, cultured in (FIG. 17A) levan, (FIG. 17B) inulin, or (FIG. 17C) fructose as the sole carbon source. FIGS. 17D-17F show the fructan-responsive reporter activity of the strains described for (FIGS. 17A- 17C) following the introduction of galactose supplemented with equal amounts and (FIG. 17D) fructose, (FIG. 17E) levan, or (FIG. 17F) inulin and normalized to the non-inducing condition (galactose alone). The area under each curve was computed for 18 hours following the introduction of each glycan mixture. For all panels: n=8, error bars are SEM, P-values were calculated by paired, two-tailed Student’s t-test. ns indicates values < 0.05, * < 0.05, ** < 0.01, and *** < 0.001.

Figures 18A, 18B, 18C, 18D, and 18E show that PUL-reporters elicit specific dose-dependent responses to target glycans. FIG. 18A shows the reporter activity from wild-type Bt harboring the levan-reporters supplied 2x dilutions of the corresponding glycan in galactose to a final carbohydrate content of 0.5% and normalized to identical strains supplied galactose alone. FIG. 18B shows the quantified CS-reporter activity normalized to a non-inducing condition (0.5% galactose) following the introduction of 2x serial dilutions of CS (filled circles) or levan (open circles) in galactose to a final glycan concentration of 0.5%. FIG. 18C shows the quantified levan-dependent reporter activity normalized to a non-inducing condition (0.5% galactose) following the introduction of 2x serial dilutions of levan (open circles) or CS (filled circles) in galactose to a final glycan concentration of 0.5%. FIG. 18D shows the quantified CS-reporter (P-BT3332; filled circles) or levan-reporter (P- BT1763; open circles) activity from strains supplied mixtures containing 2x serial dilutions of CS in constant 0.2% levan and galactose to final carbohydrate content of 0.5%. FIG. 18E shows the quantified CS-reporter (P-BT3332; filled circles) or levan-reporter (P-BT1763; open circles) activity from strains supplied mixtures containing 2x serial dilutions of CS in constant 0.2% levan and galactose to final carbohydrate content of 0.5%. For all panels: n=4, error bars are SEM, p-values were calculated using a paired, two-tailed Student’ s t-test. * indicates p-values below 0.05 compared to strains supplied galactose. R 2 values were computed by MS Excel power trendline function.

Figures 19A, 19B, 19C, 19D, and 19E show the mutations disrupting PUL-encoding activities can narrow target specificity or increase reporter sensitivity. FIG. 19A shows the growth of wild-type Bt (black) or a strain lacking 3 CS-specific lyases (purple) in 0.1% HA as a sole carbon source. n=4, error is standard deviation. FIGS. 19B-19C shows the AUC from wild-type Bt (black) or a lyase- deficient mutant (purple) harboring the CS/HA-reporter (P-BT3332 ) following the introduction of 2- fold titrations of (FIG. 19B) CS or (FIG. 19C) HA and normalized to a non-inducing condition (0.5% galactose). n=4, error bars are standard deviation. FIG. 19D shows the reporter activity (AUC) was measured from wild-type Bt (black line) or the g/z-deficient strain (orange) harboring the CS-reporter (P-BT3332) following the introduction of 2x serial dilutions of CS in galactose to a final glycan concentration of 0.5% and normalized to an identical strain supplied galactose alone. FIG. 19E shows the reporter activity (AUC) was measured from wild-type Bt (black line) or a fructose importer- deficient strain (orange) harboring the levan-reporter ( P-BT1763) following the introduction of 2x serial dilutions of levan in galactose to a final glycan concentration of 0.5% and normalized to an identical strain supplied galactose alone. n=6 and error bars are standard deviation.

Figures 20A, 20B, 20C, 20D, 20E, 20F, 20G, 20H, and 201 show the glycan-independent validation of reporter plasmids. FIG. 20 A shows a schematic depicting our approach to validate PUL reporters in the absence of known glycan ligands and identify optimal promoters for sensor/regulator- dependent glycan-induced transcription activation. FIGS. 20B-20I show the fold increase (AUC) in reporter activity in strains over-expressing a constitutively activated form of the PUL regulators: (FIG. 20B) BT2391, (FIG. 20C) BT3269, (FIG. 20D) BT3097, (FIG. 20E) BT3172, (FIG. 20F) BT1278, (FIG. 20G) BT1617, (FIG. 20H) BT3517, or (FIG. 201) BT4137 and harboring promoter- less pBolux or a plasmid containing putative promoter regions from the corresponding PUL. The fold increase in reporter activity was calculated as the AUC from strains harboring each reporter plasmids and over-expressing the indicated regulator normalized by a strain harboring identical reporter plasmids and an empty over-expression vector. n=4, error bars are standard deviation, p- values were calculated using paired, two-tailed student’s t-tests, ns indicates p-values > 0.05, *, < 0.05, and ** < 0.01.

Figures 21A, 21B, 21C, 21D, 21E, 21F, and 21G show a high-throughput Bt PUL-reporter array specifically detects target glycans individually and in biologically derived mixtures. FIG. 21A- 21G show the fold increase in reporter activity (AUC) of strains harboring 95-ditinct PUL reporter plasmids (labeled on the x-axis by the corresponding PUL-encoded susC gene) or the promoter-less pBolux plasmid (1 st bar) following the introduction of a mixture of 0.1% galactose and 0.4% (FIG. 21A) arabinogalactan, (FIG. 21B) levan, (FIG. 21C) dextran, (FIG. 21D) hyaluronan, (FIG. 21E) heparan sulfate, (FIG. 21F) chondroitin sulfate, or (FIG. 21G) PMOGs. FIGS. 21A-21E are from a single experiment, FIG. 21F and FIG. 21G are the average of 3 or 5 independent replicates, respectively. Error bars are standard deviation, p- values were calculated using a paired, two-tailed student’s t-test and * indicates p-values < 0.05, ** < 0.01, and *** < 0.001.

Figures 22A, 22B, and 22C show that PUL-sensors specifically activate target PULs. FIGS. 22A-22B show the bioluminescence from wild-type or BT333 4-def'icient Bt strains harboring P- BT1632, P-BT2818, P-BT3332, or P-BT4662 were measured in response to a mixture of galactose and (FIG. 22A) CS or (FIG. 22B) CS -disaccharide (di0S) and normalized by identical strains supplied galactose. FIG. 22C shows the bioluminescence from Bt strains harboring the reporters described in panels A&B and over-expressing a constitutively active BT3334 protein (BT3334*). Measurements were normalized by values from a strain harboring identical reporters and an empty over-expression vector. n=4, error bars are standard deviation, and p- values were calculated by two-tailed student’s t- test. ns indicates p-values < 0.05, * < 0.05, ** < 0.01, and *** < 0.001.

DETAILED DESCRIPTION

The following description of the disclosure is provided as an enabling teaching of the disclosure in its best, currently known embodiment(s). To this end, those skilled in the relevant art will recognize and appreciate that many changes can be made to the various embodiments of the invention described herein, while still obtaining the beneficial results of the present disclosure. It will also be apparent that some of the desired benefits of the present disclosure can be obtained by selecting some of the features of the present disclosure without utilizing other features. Accordingly, those who work in the art will recognize that many modifications and adaptations to the present disclosure are possible and can even be desirable in certain circumstances and are a part of the present disclosure. Thus, the following description is provided as illustrative of the principles of the present disclosure and not in limitation thereof.

Reference will now be made in detail to the embodiments of the invention, examples of which are illustrated in the drawings and the examples. This invention may, however, be embodied in many different forms and should not be construed as limited to the embodiments set forth herein.

Terminology

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood to one of ordinary skill in the art to which this disclosure belongs. The term “comprising” and variations thereof as used herein is used synonymously with the term “including” and variations thereof and are open, non-limiting terms. Although the terms “comprising” and “including” have been used herein to describe various embodiments, the terms “consisting essentially of’ and “consisting of’ can be used in place of “comprising” and “including” to provide for more specific embodiments and are also disclosed. As used in this disclosure and in the appended claims, the singular forms “a”, “an”, “the”, include plural referents unless the context clearly dictates otherwise.

The following definitions are provided for the full understanding of terms used in this specification.

The terms "about" and "approximately" are defined as being “close to” as understood by one of ordinary skill in the art. In one non-limiting embodiment the terms are defined to be within 10%. In another non-limiting embodiment, the terms are defined to be within 5%. In still another nonlimiting embodiment, the terms are defined to be within 1%.

As used herein, the terms "may," "optionally," and "may optionally" are used interchangeably and are meant to include cases in which the condition occurs as well as cases in which the condition does not occur. Thus, for example, the statement that a formulation "may include an excipient" is meant to include cases in which the formulation includes an excipient as well as cases in which the formulation does not include an excipient.

“Composition” refers to any agent that has a beneficial biological effect. Beneficial biological effects include both therapeutic effects, e.g., treatment of a disorder or other undesirable physiological condition, and prophylactic effects, e.g., prevention of a disorder or other undesirable physiological condition. The terms also encompass pharmaceutically acceptable, pharmacologically active derivatives of beneficial agents specifically mentioned herein, including, but not limited to, a vector, polynucleotide, cells, salts, esters, amides, proagents, active metabolites, isomers, fragments, analogs, and the like. When the term “composition” is used, then, or when a particular composition is specifically identified, it is to be understood that the term includes the composition per se as well as pharmaceutically acceptable, pharmacologically active vector, polynucleotide, salts, esters, amides, proagents, conjugates, active metabolites, isomers, fragments, analogs, etc.

"Comprising" is intended to mean that the compositions, methods, etc. include the recited elements, but do not exclude others. "Consisting essentially of' when used to define compositions and methods, shall mean including the recited elements, but excluding other elements of any essential significance to the combination. Thus, a composition consisting essentially of the elements as defined herein would not exclude trace contaminants from the isolation and purification method and pharmaceutically acceptable carriers, such as phosphate buffered saline, preservatives, and the like. "Consisting of' shall mean excluding more than trace elements of other ingredients and substantial method steps for administering the compositions provided and/or claimed in this disclosure. Embodiments defined by each of these transition terms are within the scope of this disclosure.

By “reduce” or other forms of the word, such as “reducing” or “reduction,” means lowering of an event or characteristic (e.g., tumor growth). It is understood that this is typically in relation to some standard or expected value, in other words it is relative, but that it is not always necessary for the standard or relative value to be referred to. For example, “reduces tumor growth” means reducing the rate of growth of a tumor relative to a standard or a control.

By “prevent” or other forms of the word, such as “preventing” or “prevention,” is meant to stop a particular event or characteristic, to stabilize or delay the development or progression of a particular event or characteristic, or to minimize the chances that a particular event or characteristic will occur. Prevent does not require comparison to a control as it is typically more absolute than, for example, reduce. As used herein, something could be reduced but not prevented, but something that is reduced could also be prevented. Likewise, something could be prevented but not reduced, but something that is prevented could also be reduced. It is understood that where reduce or prevent are used, unless specifically indicated otherwise, the use of the other word is also expressly disclosed.

The term “subject” refers to any individual who is the target of administration or treatment. The subject can be a vertebrate, for example, a mammal. In one aspect, the subject can be human, non-human primate, bovine, equine, porcine, canine, or feline. The subject can also be a guinea pig, rat, hamster, rabbit, mouse, or mole. Thus, the subject can be a human or veterinary patient. The term “patient” refers to a subject under the treatment of a clinician, e.g., physician.

A “control” is an alternative subject or sample used in an experiment for comparison purposes. A control can be "positive" or "negative."

The term “treatment” refers to the medical management of a patient with the intent to cure, ameliorate, stabilize, or prevent a disease, pathological condition, or disorder. This term includes active treatment, that is, treatment directed specifically toward the improvement of a disease, pathological condition, or disorder, and also includes causal treatment, that is, treatment directed toward removal of the cause of the associated disease, pathological condition, or disorder. In addition, this term includes palliative treatment, that is, treatment designed for the relief of symptoms rather than the curing of the disease, pathological condition, or disorder; preventative treatment, that is, treatment directed to minimizing or partially or completely inhibiting the development of the associated disease, pathological condition, or disorder; and supportive treatment, that is, treatment employed to supplement another specific therapy directed toward the improvement of the associated disease, pathological condition, or disorder.

A "promoter," as used herein, refers to a sequence in DNA that mediates the initiation of transcription by an RNApolymerase. Transcriptional promoters may comprise one ormore of a number of different sequence elements as follows: 1) sequence elements present at the site of transcription initiation; 2) sequence elements present upstream of the transcription initiation site and; 3) sequence elements down- stream of the transcription initiation site. The individual sequence elements function as sites on the DNA, where RNA polymerases and transcription factors facilitate positioning of RNA polymerases on the DNA bind.

As used herein, the term “buffer” refers to a solution consisting of a mixture of acid and its conjugate base, or vice versa. The solution is used as a means of keeping the pH at a nearly constant range to be used in a wide variety of chemical and biological applications. “Culture” or “cell culture” is the process by which cells, including bacterial cells, are grown under controlled conditions, generally outside their natural environment. These conditions vary for each cell type, but generally consist of a suitable vessel with a substrate or medium that supplies the essential nutrients (amino acids, carbohydrates, vitamins, minerals), growth factors, hormones, and gases (CO2, 02), and regulates the physio-chemical environment (pH buffer, osmotic pressure, temperature). Most cells require a surface or an artificial substrate to form an adherent culture as a monolayer (one single-cell thick), whereas others can be grown free floating in a medium as a suspension culture.

The term “administer,” “administering”, or derivatives thereof refer to delivering a composition, substance, inhibitor, or medication to a subject or object by one or more the following routes: oral, topical, intravenous, subcutaneous, transcutaneous, transdermal, intramuscular, intrajoint, parenteral, intra-arteriole, intradermal, intraventricular, intracranial, intraperitoneal, intralesional, intranasal, rectal, vaginal, by inhalation or via an implanted reservoir. The term “parenteral” includes subcutaneous, intravenous, intramuscular, intra-articular, intra- synovial, intrasternal, intrathecal, intrahepatic, intralesional, and intracranial injections or infusion techniques.

The term “detect” or “detecting” refers to an output signal released for the purpose of sensing of physical phenomenon. An event or change in environment is sensed and signal output released in the form of light.

A "primer" is a short polynucleotide, generally with a free 3'-OH group that binds to a target or "template" potentially present in a sample of interest by hybridizing with the target, and thereafter promoting polymerization of a polynucleotide complementary to the target. A "polymerase chain reaction" ("PCR") is a reaction in which replicate copies are made of a target polynucleotide using a "pair of primers" or a "set of primers" consisting of an "upstream" and a "downstream" primer, and a catalyst of polymerization, such as a DNA polymerase, and typically a thermally-stable polymerase enzyme. Methods for PCR are well known in the art, and taught, for example in "PCR: A PRACTICAL APPROACH" (M. MacPherson et al., IRL Press at Oxford University Press (1991)). All processes of producing replicate copies of a polynucleotide, such as PCR or gene cloning, are collectively referred to herein as "replication." A primer can also be used as a probe in hybridization reactions, such as Southern or Northern blot analyses. Sambrook et al., supra.

"Transformation" of a cellular organism with DNA means introducing DNA into an organism so that at least a portion of the DNA is replicable, either as an extrachromosomal element or by chromosomal integration. The cell is termed "host cell" and it may be either prokaryotic or eukaryotic. Typical prokaryotic host cells include various strains of E. coli or Bacteroides. The introduced DNA sequence may be from the same species as the host cell of a different species from the host cell, or it may be a hybrid DNA sequence, containing some foreign and some homologous DNA.

The terms “treat,” “treating,” and grammatical variations thereof as used herein, include partially or completely delaying, alleviating, mitigating, or reducing the intensity of one or more attendant symptoms of a disorder or condition and/or alleviating, mitigating, or impeding one or more causes of a disorder or condition. Treatments according to the disclosure may be applied preventively, prophylactically, palliatively, or remedially. Treatments are administered to a subject prior to onset (e.g., before obvious signs of disease]), during early onset (e.g., upon initial signs and symptoms of disease), or after an established development of disease.

The term “antibiotics” refers to a type of antimicrobial substance active against bacteria. These are the most important type of antimicrobial agent for fighting bacterial infections, and antibiotics medications are widely used in the treatment and prevention of such infections. They may either kill or inhibit the growth of bacteria.

Used herein, the term “probiotics” refers to live microorganisms promoted with claims that they provide health benefits when consumed, generally by improving or restoring the gut flora.

As used herein, “normal” refers to a standard of health where no disease, disorder, abnormal condition, mutation, or dysfunction exists within a particular subject or patient. “Normal” can also refer to an average or typical state or condition.

As used herein, “wild-type” refers to the genetic and physical characteristics of the typical form of a species as it occurs in nature. A wild-type or wild type characteristic is conceptualized as a product of the standard “normal” allele at a gene locus, in contrast to that produced by a non-standard “mutant” allele.

As used herein, “preserve,” “preserved,” “preservation,” “preserving” and any grammatical variations thereof as used herein, refers to the act of keeping any object, composition, or compound intact, alive, or free from decomposition/decay.

Reference also is made herein to peptides, polypeptides, proteins, and compositions comprising peptides, polypeptides, and proteins. As used herein, a polypeptide and/or protein is defined as a polymer of amino acids, typically of length>100 amino acids (Garrett & Grisham, Biochemistry, 2nd edition, 1999, Brooks/Cole, 110). A peptide is defined as a short polymer of amino acids, of a length typically of 20 or less amino acids, and more typically of a length of 12 or less amino acids (Garrett & Grisham, Biochemistry, 2nd edition, 1999, Brooks/Cole, 110).

As used herein, “harbor”, “harboring”, and any grammatical variations thereof refers to a subject, including, but not limited to a cell, vector, bacteria, virus, fungi, tissue, or any other organism containing or being the host to another organism, molecule, compound, nucleic acid, protein, or any other composition.

A “cassette” or “gene cassette” refers to a mobile genetic element that generally contains a gene and a recombination site. Gene cassettes can move around within an organism’s genome or can be transferred to another organism in the environment.

“Quantify”, “quantifying”, “quantification”, and any other grammatical variations thereof refer to the process of acquiring numerical values to determine, express, or measure an amount of a substance or signal.

“Anaerobic” refers to living, active, occurring, or existing in the absence of oxygen.

As used herein, the term “apparatus” refers to a technical equipment or machinery needed for a particular activity or purpose. Herein, the apparatus is used to separate biomolecules from one another for further purification and collection. The apparatus can comprise one or multiple components to form a complex structure needed to perform a particular function.

As used herein, a “container” refers to a receptacle or enclosure for holding a product, substance, or composition used in storage, packaging, and transportation, including shipping. The product, substance, or composition are kept inside of the container to protect and prevent contamination from the outside environment. A container is also meant to partially or completely enclose the product, substance, or composition.

Reporter plasmids

As used herein, a “reporter plasmid” refers to a type of expression vector containing a reporter gene, also called “reporter vector”, that can be used to identify and characterize promoter and/or enhancer element functions relative to gene expression. Expression of the reporter gene corresponds to the transcriptional activity of the reporter gene. In general, reporter genes are nucleic acid sequences whose products, or proteins, can be readily assayed, measured, or quantified following transformation or transfection into a host cell. Reporter genes include, but are not limited to luciferase reporter genes (lux or luc), beta-galactosidase reporter gene (lacZ), chloramphenicol acetyltransferase (cat), green fluorescent protein reporter gene (gfp), red fluorescent protein reporter gene (rfp), and derivatives thereof. It should be noted that luciferase reporter gene are the preferred reporter gene because luciferase reporter genes are readily expressed in an anaerobic environment.

The reporter plasmid can comprise any combination of gene elements to allow for transcription including, but not limited to promoter sequences, activator and/or enhanced elements (including, but not limited to transcription factor binding sites and polymerase binding sites), and operator and/or silenced elements (including, but not limited to repressor protein binding sites). Thus in one aspect, disclosed herein are reporter plasmids comprising a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter.

Also disclosed herein are reporter plasmids, wherein the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor.

In one aspect disclosed herein are reporter plasmids, wherein the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin-specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof.

Also disclosed herein are reporter plasmids, wherein the first glycan specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS- specific promoter, an O-glycan(OG)-specific promoter, or variants thereof including, but not limited to glycan specific promoters comprising the nucleic acids sequence as set forth in SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 76, SEQ ID NO: 262, SEQ ID NO: 263, SEQ ID NO: 264, SEQ ID NO: 265, SEQ ID NO: 266, SEQ ID NO: 267, SEQ ID NO: 268, SEQ ID NO: 269, SEQ ID NO: 270, SEQ ID NO: 271, SEQ ID NO: 272, SEQ ID NO: 273, SEQ ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 277, SEQ ID NO: 278, SEQ ID NO: 279, SEQ ID NO: 280, SEQ ID NO: 281, SEQ ID NO: 282, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 285, SEQ ID NO: 286, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 290, SEQ ID NO: 291, SEQ ID NO: 292, SEQ ID NO: 293, SEQ ID NO: 294, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 299, SEQ ID NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 318, SEQ ID NO: 319, SEQ ID NO: 320, SEQ ID NO: 321, SEQ ID NO: 322, SEQ ID NO: 323, SEQ ID NO: 324, SEQ ID NO: 325, SEQ ID NO: 326, SEQ ID NO: 327, SEQ ID NO: 328, SEQ ID NO: 329, SEQ ID NO: 330, SEQ ID NO: 331, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 334, SEQ ID NO: 335, SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 338, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 342, SEQ ID NO: 343, SEQ ID NO: 344, SEQ ID NO: 345, SEQ ID NO: 346, SEQ ID NO: 347, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 350, SEQ ID NO: 351, SEQ ID NO: 352, SEQ ID NO: 353, or SEQ ID NO: 354.

In one aspect disclosed herein are reporter plasmids, wherein the reporter plasmid further comprises a second PUL. In some embodiments, the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin- specific PUL, an arabinogalactan- specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O- glycan(OG)-specific PUL, or variants thereof. In some embodiments, the first PUL is interchangeable with the second PUL.

Also disclosed herein are reporter plasmids, wherein the reporter plasmid further comprises a second glycan-specific promoter. In some embodiments, the first glycan-specific promoter is interchangeable with the second glycan-specific promoter. In some embodiments, the second glycan- specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran-specific promoter, a HA-specific promoter, a HS-specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

In one aspect, disclosed herein are reporter plasmids, wherein the second glycan-specific promoter comprises the nucleic acids sequence as set forth in SEQ ID NOs 74-76, or SEQ ID NOs: 262-354.

Also disclosed herein are reporter plasmids, wherein the reporter plasmid comprises a Bacleroides-optm' uzed lux (Bolux) plasmid.

Engineered bacteria

An engineered bacteria, or a genetically engineered bacteria, are bacteria harboring additional genetic material, usually in the form of a plasmid vector, that allows said bacteria the ability to efficiently express native or foreign proteins for a specific purpose. It should be noted that more than one gene and other genetic material can be inserted into a plasmid, which can then be further integrated into bacteria.

Generating engineered bacteria in the gastrointestinal (GI) microbiome serves as an option for treating multiple diseases by genetically altering the bacteria residing in the GI tract. Genetically altering the GI microbiome allows for molecular profiling of the GI tract, altering gut bacteria to destroy pathogenic bacteria, or using genetically engineered bacteria to modify deficient enzymes or proteins. Specifically, molecular profiling of the GI tract includes detecting and quantifying the presence of glycans, oligosaccharides, monosaccharides, and other biomolecules in the GI tract. Such an approach also indicates the presence of pathogenic microorganisms or deficient enzymes and/or proteins.

Also disclosed herein are engineered bacteria comprising any of the reporter plasmid disclosed herein. For example, disclosed herein are engineered bacteria harboring a reporter plasmid wherein the reporter plasmid comprises a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter. In some aspects, the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor. In some embodiments, the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan- specific PUL, an inulin-specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof. In some aspects, the first glycan specific promoter comprises a CS -specific promoter, a levan-specific promoter, an inulinspecific promoter, an arabinogalactan-specific promoter, a dextran-specific promoter, a HA-specific promoter, a HS-specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

In one aspect, disclosed herein are engineered bacteria, wherein the first glycan-specific promoter comprises the nucleic acids sequence as set forth in SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 76, SEQ ID NO: 262, SEQ ID NO: 263, SEQ ID NO: 264, SEQ ID NO: 265, SEQ ID NO: 266, SEQ ID NO: 267, SEQ ID NO: 268, SEQ ID NO: 269, SEQ ID NO: 270, SEQ ID NO: 271, SEQ ID NO: 272, SEQ ID NO: 273, SEQ ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 277, SEQ ID NO: 278, SEQ ID NO: 279, SEQ ID NO: 280, SEQ ID NO: 281, SEQ ID NO: 282, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 285, SEQ ID NO: 286, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 290, SEQ ID NO: 291, SEQ ID NO: 292, SEQ ID NO: 293, SEQ ID NO: 294, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 299, SEQ ID NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 318, SEQ ID NO: 319, SEQ ID NO: 320, SEQ ID NO: 321, SEQ ID NO: 322, SEQ ID NO: 323, SEQ ID NO: 324, SEQ ID NO: 325, SEQ ID NO: 326, SEQ ID NO: 327, SEQ ID NO: 328, SEQ ID NO: 329, SEQ ID NO: 330, SEQ ID NO: 331, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 334, SEQ ID NO: 335, SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 338, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 342, SEQ ID NO: 343, SEQ ID NO: 344, SEQ ID NO: 345, SEQ ID NO: 346, SEQ ID NO: 347, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 350, SEQ ID NO: 351, SEQ ID NO: 352, SEQ ID NO: 353, or SEQ ID NO: 354.

Also disclosed herein are engineered bacteria, further comprising a second PUL. In some aspects, the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan- specific PUL, an inulin-specific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)- specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof. In some aspects, the first PUL is interchangeable with the second PUL. In one aspect, disclosed herein are engineered bacteria, further comprising a second glycan- specific promoter. In some embodiments, the first glycan-specific promoter is interchangeable with the second glycan-specific promoter. In some aspects, the second glycan-specific promoter comprises a CS -specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactanspecific promoter, a dextran-specific promoter, a HA-specific promoter, a HS -specific promoter, an O-glycan(OG)-specific promoter, or variants thereof.

Also disclosed herein are engineered bacteria, wherein the second glycan-specific promoter comprises the nucleic acids sequence as set forth in SEQ ID NOs 74-76, or SEQ ID NOs: 262-354.

In some aspects, disclosed herein are engineered bacteria, wherein the engineered bacteria originates from a gastrointestinal bacterium. In some aspects, the engineered bacteria originates from a Bacteroides bacterium including, but not limited to Bacteroides thetaiotaomicron (Bt), Bacteroides ovatus (Bo), Bacteroides fragilis (Bf), Bacteroides caccae (Be) or a variant strain thereof.

Also disclosed herein are engineered bacteria, wherein the engineered bacteria overexpresses the chondroitin sulfate (CS)-specific PUL when exposed to a CS glycan, or derivatives thereof.

In some aspects, disclosed herein are engineered bacteria, wherein the engineered bacteria overexpresses the levan-specific PUL when exposed to a levan glycan, or derivatives thereof.

Also disclosed herein are engineered bacteria, wherein the engineered bacteria overexpresses the inulin-specific PUL when exposed to an inulin glycan, or derivatives thereof.

In some aspects, disclosed herein are engineered bacteria, wherein the engineered bacteria overexpresses the arabinogalactan- specific PUL when exposed to an arabinogalactan, or derivatives thereof.

Also disclosed herein are engineered bacteria, wherein the engineered bacteria overexpresses the dextran- specific PUL when exposed to a dextran glycan, or derivatives thereof.

In some aspects, disclosed herein are engineered bacteria, wherein the engineered bacteria overexpresses the hyaluronan(HA)- specific PUL when the engineered bacteria is exposed to a HA glycan, or derivatives thereof.

Also disclosed herein are engineered bacteria, wherein the engineered bacteria overexpresses the heparan sulfate (HS)-specific PUL when exposed to a HS glycan, or derivatives thereof.

In some aspects, disclosed herein are engineered bacteria, wherein the engineered bacteria overexpresses the O-glycan(OG)-specific PUL when exposed to an OG glycan mixture, or derivatives thereof.

Also disclosed herein are engineered bacteria, wherein the engineered bacteria emits a light signal when overexpressing any PUL. In some aspects, the light signal is decreased when the glycan molecule is broken down into a monosaccharide molecule. Glycan-sensing systems and compositions

Glycans are polymer compounds comprising several smaller monosaccharide molecules usually linked together by glycosidic bonds. Glycan can also exist as homopolymers or heteropolymer, wherein the monosaccharide units can also be linked in a linear or branched formation. Common types of glycans include, but are not limited to O-linked glycans, N-linked glycans, and glycosaminoglycans. Glycans are substantive components of the gastrointestinal (GI) tract system of eukaryotic organisms. Microorganisms that reside in the GI tract are adept at consuming glycans and other polysaccharide molecules establishing a symbiotic relationship between gut microorganisms and the host. Fluctuations in the abundance of glycans create a diverse and heterogeneous environment in which the gut microorganisms can strive and proliferate. Thus, glycans shape and influence the composition and abundance of gut microbiome. Further, abnormalities in the gut microbiome glycan composition can also implicate presence of diseases and/or disorder. Given the influence of glycans in the GI tract, there is a need to develop compositions, systems, and methods of detecting homogenous or heterogenous mixtures of glycans for profiling, treating, preventing, reducing, and/or decreasing GI diseases and/or disorders in a subject.

In one aspect, disclosed herein is a glycan-sensing system comprising any of the reporter plasmids disclosed herein and/or any of the engineered bacteria disclosed herein.

In one aspect, disclosed herein is a glycan-sensing system comprising an engineered bacteria harboring a reporter plasmid, wherein the reporter plasmid comprises a luciferase reporter cassette, a first polysaccharide utilization locus (PUL), and a first glycan-specific promoter, wherein the glycan- sensing system detects and quantifies a glycan molecule.

As used herein, a “PUL” refers to a co-regulated group of genes that sense the presence of surrounding nutrients and enable digestion of glycans and other polysaccharides molecules. The GI microbiome comprises numerous PULs that enable bacterial species to recognize and consume glycan molecules. It should be noted that a PUL can be glycan specific.

In some embodiments, the glycan molecule comprises chondroitin sulfate, arabinan, heparan sulfate, hyaluronan, fructan, levan, mannan, or derivatives thereof. In some embodiments, the glycan is an N-linked glycan. In some embodiments, the glycan is an O-linked glycan. In some embodiments, the glycan is a branched glycan. In some embodiments, the glycan is a linear glycan. In some embodiments, the glycan molecule comprises pectic galactan, starch, glucan, galactomannan, glucomannan, homogalacturonan, xyloglucan, dermatan, xylan, rhamnogalacturonan, or derivative thereof. In some embodiments, the glycan molecule is located in the gastrointestinal tract of a mammal. The genes within a PUL can encode a glycan utilization system, including, but not limited to a starch utilization system (Sus system). The gene products of the glycan utilization system generally localize to the outer membrane or periplasm of a bacterium, and function to sequentially bind to the glycan on the bacterial surface, degrade it into oligosaccharides, and transport the oligosaccharides into the periplasmic space for further degradation into monosaccharides. In some embodiments, the first PUL encodes any combination of proteins comprising a surface glycan binding protein (SGBP), an outer membrane channel, a digestive enzyme, a glycan importer, or a glycan sensor. In some embodiments, the SGBP comprises a SusD, SusG, SusE, or Sus F protein. In some embodiments, the outer membrane channel comprises a Sus C channel protein. In some embodiments, the digestive enzyme comprises a lyase, an amylase, or an enzyme capable or degrade glycans, carbohydrates, or polysaccharides. In some embodiments, the digestive enzyme comprises a Sus A or SusB enzyme. In some embodiments, glycan importer is located on the inner membrane of a bacterial cell. In some embodiments, glycan sensor comprises a SusR protein.

In some embodiments, the first PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulin- specific PUL, an arabinogalactan-specific PUL, a dextran-specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)- specific PUL, or variants thereof.

In some embodiments, the first glycan specific promoter comprises a CS-specific promoter, a levan-specific promoter, an inulin- specific promoter, an arabinogalactan-specific promoter, a dextran-specific promoter, a HA-specific promoter, a HS-specific promoter, an O-glycan(OG)- specific promoter, or variants thereof.

In some embodiments, the system further comprises a second PUL. In some embodiments, the second PUL comprises a chondroitin sulfate (CS)-specific PUL, a levan-specific PUL, an inulinspecific PUL, an arabinogalactan-specific PUL, a dextran- specific PUL, a hyaluronan(HA)-specific PUL, a heparan sulfate (HS)-specific PUL, an O-glycan(OG)-specific PUL, or variants thereof. In some embodiments, the first PUL is interchangeable with the second PUL.

In some embodiments, the system further comprises a second glycan-specific promoter. In some embodiments, the first glycan-specific promoter is interchangeable with the second glycan- specific promoter. In some embodiments, the second glycan-specific promoter comprises a CS- specific promoter, a levan-specific promoter, an inulin-specific promoter, an arabinogalactan-specific promoter, a dextran- specific promoter, a HA-specific promoter, a HS-specific promoter, an O- glycan(OG)-specific promoter, or variants thereof.

In some embodiments, the first or second glycan-specific promoter comprises SEQ ID NO: 74, SEQ ID NO: 75, SEQ ID NO: 76, SEQ ID NO: 262, SEQ ID NO: 263, SEQ ID NO: 264, SEQ ID NO: 265, SEQ ID NO: 266, SEQ ID NO: 267, SEQ ID NO: 268, SEQ ID NO: 269, SEQ ID NO: 270, SEQ ID NO: 271, SEQ ID NO: 272, SEQ ID NO: 273, SEQ ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 277, SEQ ID NO: 278, SEQ ID NO: 279, SEQ ID NO: 280, SEQ ID NO: 281, SEQ ID NO: 282, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 285, SEQ ID NO: 286, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 290, SEQ ID NO: 291, SEQ ID NO: 292, SEQ ID NO: 293, SEQ ID NO: 294, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 299, SEQ ID NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 318, SEQ ID NO: 319, SEQ ID NO: 320, SEQ ID NO: 321, SEQ ID NO: 322, SEQ ID NO: 323, SEQ ID NO: 324, SEQ ID NO: 325, SEQ ID NO: 326, SEQ ID NO: 327, SEQ ID NO: 328, SEQ ID NO: 329, SEQ ID NO: 330, SEQ ID NO: 331, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 334, SEQ ID NO: 335, SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 338, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 342, SEQ ID NO: 343, SEQ ID NO: 344, SEQ ID NO: 345, SEQ ID NO: 346, SEQ ID NO: 347, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 350, SEQ ID NO: 351, SEQ ID NO: 352, SEQ ID NO: 353, or SEQ ID NO: 354.

In some embodiments, the luciferase reporter cassette comprises one or more genes selected from lux C, lux D, lux E, lux A, and lux B. In some embodiments, the luciferase reporter cassette originates from a Photorhabdus bacteria. In some embodiments, the luciferase reporter cassette originates from a Photorhabdus luminescens (PI) bacteria. In some embodiments, the gene products of the luciferase reporter cassette emit a light signal.

In some embodiments, the reporter plasmid comprises a Bacteroides-optimized lux (Bolux) plasmid.

In some embodiments, the engineered bacteria originates from a gastrointestinal bacterium. In some embodiments, the engineered bacteria originates from a Bacteroides bacterium. In some embodiments, the engineered bacteria originates from a Bacteroides thetaiotaomicron (Bt), Bacteroides ovatus (Bo), Bacteroides fragilis (Bf), Bacteroides caccae (Be) or a variant strain thereof. In some embodiments, the engineered bacteria originates from a Bacteroides acidifaciens (Ba), Bacteroides barnesiaes (Bb), Bacteroides caecicola, Bacteroides caecigallinarum, Bacteroides cellulosilyticus, Bacteroides cellulosolvens, Bacteroides ciarus, Bacteroides coagulans, Bacteroides coprocola, Bacteroides coprophilus, Bacteroides coprosuis, Bacteroides dorei, Bacteroides eggerthii, Bacteroidesgracilis, Bacteroides faecichinchillae, Bacteroides faecis, Bacteroides finegoldii, Bacteroides fluxus, Bacteroides galacturonicus, Bacteroides gallinaceum, Bacteroides gallinarum, Bacteroides goldsteinii, Bacteroides graminisolvens, Bacteroides helcogene, Bacteroides inleslinalis, Bacteroides lull, Bacteroides massiliensis, Bacteroides nordii, Bacteroides oris, Bacteroides paurosaccharolyticus, Bacteroides plebeius, Bacteroides polypragmatus, Bacteroides proprionicifaciens, Bacteroides putredinis, Bacteroides pyogenes, Bacteroides reticulotermitis, Bacteroides rodentium, Bacteroides salanitronis, Bacteroides salyersiae, Bacteroides sartorii, Bacteroides sedmenti, Bacteroides stercoris, Bacteroides suis, Bacteroides tectus, Bacteroides uniformis, Bacteroides vulgatus, or Bacteroides xylanisolvens.

In some embodiments, the engineered bacteria are grown in an anaerobic environment. In some embodiments, the engineered bacteria are grown in a 96- well, a 384-well, or a 1536-well microplate.

In some embodiments, the engineered bacteria overexpresses the chondroitin sulfate (CS)- specific PUL when exposed to a CS glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the levan-specific PUL when exposed to a levan glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the inulin- specific PUL when exposed to an inulin glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the arabinogalactan-specific PUL when exposed to an arabinogalactan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the dextran-specific PUL when exposed to a dextran glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the hyaluronan(HA)-specific PUL when the engineered bacteria is exposed to a HA glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the heparan sulfate (HS)- specific PUL when exposed to a HS glycan, or derivatives thereof.

In some embodiments, the engineered bacteria overexpresses the O-glycan(OG)-specific PUL when exposed to an OG glycan mixture, or derivatives thereof.

In some embodiments, the engineered bacteria emits a light signal when overexpressing any PUL disclosed herein.

In some embodiments, the light signal is decreased when the glycan molecule is broken down into a monosaccharide molecule.

Methods of detecting glycan molecules

In one aspect, disclosed herein is a method of detecting a glycan, the method comprising isolating and preserving a tissue sample from a subject, preparing, and culturing the glycan-sensing system disclosed herein in a microplate in an anaerobic environment, adding the tissue sample into the microplate, and detecting and quantifying a light signal emitted from the system.

In one aspect, disclosed herein is a method of detecting a glycan, the method comprising isolating and preserving a tissue sample from a subject, preparing, and culturing any of the engineered bacteria disclosed herein in a microplate in an anaerobic environment, adding the tissue sample into the microplate, and detecting and quantifying a light signal emitted from the engineered bacteria.

In one aspect, disclosed herein is a method of detecting a glycan, the method comprising isolating and preserving a tissue sample from a subject, preparing any of the reporter plasmids disclosed herein in a microplate in an anaerobic environment, adding the tissue sample into the microplate, and detecting and quantifying a light signal emitted from the reporter plasmid.

In some embodiments, the tissue sample comprises at least one glycan. In some embodiments, the light signal is detected and quantified using a microplate reader, a spectrophotometer, or an instrument capable of measuring the light signal. In some embodiments, the tissue sample comprises a tissue biopsy, a blood sample, or a saliva sample. In some embodiments, the tissue sample comprises a tissue biopsy from the GI tract, including, but not limited to a tongue scraping, buccal swab, an esophageal tissue, a stomach tissue, an intestinal tissue (including the small intestines and/or the large intestines), a liver tissue, a gallbladder tissue, a pancreatic tissue, a colorectal tissue,

In some embodiments, the method prevents, treats, or reduces a gastrointestinal disease or disorder in the subject. In some embodiments, the gastrointestinal disease or disorder comprises heartbum, irritable bowel syndrome, lactose intolerance, gallstones, cholecystitis, cholangitis, anal fissure, hemorrhoids, proctitis, colon polyps, infective colitis, ulcerative colitis, ischemic colitis, Crohn’s disease, radiation colitis, celiac disease, diarrhea (chronic or acute), constipation (chronic or acute), diverticulosis, diverticulitis, acid reflux (gastroesophageal reflux (GER) or gastroesophageal reflux disease (GERD)), Hirschsprung disease, abdominal adhesions, achalasia, acute hepatic porphyria (AHP), anal fistulas, bowel incontinence, centrally mediated abdominal pain syndrome (CAPS), clostridioides difficile infection, cyclic vomiting syndrome (CVS), dyspepsia, eosinophilic gastroenteritis, globus, inflammatory bowel disease, malabsorption, scleroderma, or volvulus.

Methods of treating or preventing gastrointestinal diseases or disorders

In one aspect, disclosed herein is a method of treating or preventing a gastrointestinal disease or disorder in a subject in need thereof, the method comprising isolating and preserving a tissue sample from a subject, preparing and culturing any of the glycan-sensing systems disclosed herein in a microplate in an anaerobic environment, adding the tissue sample into the microplate, detecting and quantifying a light signal emitted from the system, and performing or administering a therapeutic procedure to the subject comprising pathological amounts of a glycan relative to a control tissue comprising normal amounts of the glycan.

In one aspect, disclosed herein is a method of treating or preventing a gastrointestinal disease or disorder in a subject in need thereof, the method comprising isolating and preserving a tissue sample from a subject, preparing and culturing any of the engineered bacteria disclosed herein in a microplate in an anaerobic environment, adding the tissue sample into the microplate, detecting and quantifying a light signal emitted from the engineered bacteria, and performing or administering a therapeutic procedure to the subject comprising pathological amounts of a glycan relative to a control tissue comprising normal amounts of the glycan.

In one aspect, disclosed herein is a method of treating or preventing a gastrointestinal disease or disorder in a subject in need thereof, the method comprising isolating and preserving a tissue sample from a subject, preparing any of the reporter plasmids disclosed herein in a microplate in an anaerobic environment, adding the tissue sample into the microplate, detecting and quantifying a light signal emitted from the reporter plasmid, and performing or administering a therapeutic procedure to the subject comprising pathological amounts of a glycan relative to a control tissue comprising normal amounts of the glycan.

In some embodiments, the tissue sample comprises at least one glycan. In some embodiments, the tissue sample comprises 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22,

23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49,

50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76,

77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102,

103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122,

123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142,

143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162,

163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182,

183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202,

203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222,

223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242,

243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262,

263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282,

283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302,

303, 304, 305, 306, 307, 308, 309, 310, 311, 312, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322,

323, 324, 325, 326, 327, 328, 329, 330, 331, 332, 333, 334, 335, 336, 337, 338, 339, 340, 341, 342,

343, 344, 345, 346, 347, 348, 349, 350, 351, 352, 353, 354, 355, 356, 357, 358, 359, 360, 361, 362, 363, 364, 365, 366, 367, 368, 369, 370, 371, 372, 373, 374, 375, 376, 377, 378, 379, 380, 381, 382, 383, 384, 385, 386, 387, 388, 389, 390, 391, 392, 393, 394, 395, 396, 397, 398, 399, 400, 401, 402, 403, 404, 405, 406, 407, 408, 409, 410, 411, 412, 413, 414, 415, 416, 417, 418, 419, 420, 421, 422, 423, 424, 425, 426, 427, 428, 429, 430, 431, 432, 433, 434, 435, 436, 437, 438, 439, 440, 441, 442, 443, 444, 445, 446, 447, 448, 449, 450, 451, 452, 453, 454, 455, 456, 457, 458, 459, 460, 461, 462, 463, 464, 465, 466, 467, 468, 469, 470, 471, 472, 473, 474, 475, 476, 477, 478, 479, 480, 481, 482, 483, 484, 485, 486, 487, 488, 489, 490, 491, 492, 493, 494, 495, 496, 497, 498, 499, 500, 501, 502, 503, 504, 505, 506, 507, 508, 509, 510, 511, 512, 513, 514, 515, 516, 517, 518, 519, 520, 521, 522, 523, 524, 525, 526, 527, 528, 529, 530, 531, 532, 533, 534, 535, 536, 537, 538, 539, 540, 541, 542, 543, 544, 545, 546, 547, 548, 549, 550, 551, 552, 553, 554, 555, 556, 557, 558, 559, 560, 561, 562, 563, 564, 565, 566, 567, 568, 569, 570, 571, 572, 573, 574, 575, 576, 577, 578, 579, 580, 581, 582, 583, 584, 585, 586, 587, 588, 589, 590, 591, 592, 593, 594, 595, 596, 597, 598, 599, 600, 601, 602, 603, 604, 605, 606, 607, 608, 609, 610, 611, 612, 613, 614, 615, 616, 617, 618, 619, 620, 621, 622, 623, 624, 625, 626, 627, 628, 629, 630, 631, 632, 633, 634, 635, 636, 637, 638, 639, 640, 641, 642, 643, 644, 645, 646, 647, 648, 649, 650, 651, 652, 653, 654, 655, 656, 657, 658, 659, 660, 661, 662, 663, 664, 665, 666, 667, 668, 669, 670, 671, 672, 673, 674, 675, 676, 677, 678, 679, 680, 681, 682, 683, 684, 685, 686, 687, 688, 689, 690, 691, 692, 693, 694, 695, 696, 697, 698, 699, 700, 701, 702, 703, 704, 705, 706, 707, 708, 709, 710, 711, 712, 713, 714, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728, 729, 730, 731, 732, 733, 734, 735, 736, 737, 738, 739, 740, 741, 742, 743, 744, 745, 746, 747, 748, 749, 750, 751, 752, 753, 754, 755, 756, 757, 758, 759, 760, 761, 762, 763, 764, 765, 766, 767, 768, 769, 770, 771, 772, 773, 774, 775, 776, 777, 778, 779, 780, 781, 782, 783, 784, 785, 786, 787, 788, 789, 790, 791, 792, 793, 794, 795, 796, 797, 798, 799, 800, 801, 802, 803, 804, 805, 806, 807, 808, 809, 810, 811, 812, 813, 814, 815, 816, 817, 818, 819, 820, 821, 822, 823, 824, 825, 826, 827, 828, 829, 830, 831, 832, 833, 834, 835, 836, 837, 838, 839, 840, 841, 842, 843, 844, 845, 846, 847, 848, 849, 850, 851, 852, 853, 854, 855, 856, 857, 858, 859, 860, 861, 862, 863, 864, 865, 866, 867, 868, 869, 870, 871, 872, 873, 874, 875, 876, 877, 878, 879, 880, 881, 882, 883, 884, 885, 886, 887, 888, 889, 890, 891, 892, 893, 894, 895, 896, 897, 898, 899, 900, 901, 902, 903, 904, 905, 906, 907, 908, 909, 910, 911, 912, 913, 914, 915, 916, 917, 918, 919, 920, 921, 922, 923, 924, 925, 926, 927, 928, 929, 930, 931, 932, 933, 934, 935, 936, 937, 938, 939, 940, 941, 942, 943, 944, 945, 946, 947, 948, 949, 950, 951, 952, 953, 954, 955, 956, 957, 958, 959, 960, 961, 962, 963, 964, 965, 966, 967, 968, 969, 970, 971, 972, 973, 974, 975, 976, 977, 978, 979, 980, 981, 982, 983, 984, 985, 986, 987, 988, 989, 990, 991, 992, 993, 994, 995, 996, 997, 998, 999, 1000, or more glycans. In some embodiments, the tissue sample comprises N-linked glycans, O-linked glycans, or combinations thereof. In some embodiments, the t issue sample comprises branched glycans or linear glycans, or combinations thereof. In some embodiments, the tissue sample comprises one or any combination of glycans comprising chondroitin sulfate, arabinan, heparan sulfate, hyaluronan, fructan, levan, mannan, pectic galactan, starch, glucan, galactomannan, glucomannan, homogalacturonan, xyloglucan, dermatan, xylan, rhamnogalacturonan, or derivatives thereof.

In some embodiments, the light signal is detected and quantified using a microplate reader, a spectrophotometer, or an instrument capable of measuring the light signal. In some embodiments, the tissue sample comprises a tissue biopsy, a blood sample, or a saliva sample. In some embodiments, the tissue sample comprises a tissue biopsy, a blood sample, or a saliva sample. In some embodiments, the tissue sample comprises a tissue biopsy from the GI tract, including, but not limited to a tongue scraping, buccal swab, an esophageal tissue, a stomach tissue, an intestinal tissue (including the small intestines and/or the large intestines), a liver tissue, a gallbladder tissue, a pancreatic tissue, a colorectal tissue,

In some embodiments, the therapeutic procedure comprises a surgical procedure, a therapeutic agent, a lifestyle change, or a combination thereof.

In some embodiments, the surgical procedure comprises bariatric surgery, esophageal surgery, foregut surgery, hernia surgery, gastroesophageal reflux disease surgery, surgical nutrition, colorectal surgery, gallbladder surgery, liver surgery, bile duct surgery, pancreatic surgery, tumor excision, or combinations thereof. In some embodiments, the therapeutic agent comprises a laxative, an antacid, a proton pump inhibitor, a histamine blocker, an anti-inflammatory agent, a steroid, a vitamin, an intravenous fluid, an antibiotic, a probiotic, or combinations thereof. In some embodiments, the lifestyle change comprises a dietary alteration, an exercise program, a sedentary lifestyle, or combinations thereof.

In some embodiments, the gastrointestinal disease or disorder comprises heartbum, irritable bowel syndrome, lactose intolerance, gallstones, cholecystitis, cholangitis, anal fissure, hemorrhoids, proctitis, colon polyps, infective colitis, ulcerative colitis, ischemic colitis, Crohn’s disease, radiation colitis, celiac disease, diarrhea (chronic or acute), constipation (chronic or acute), diverticulosis, diverticulitis, acid reflux (gastroesophageal reflux (GER) or gastroesophageal reflux disease (GERD)), Hirschsprung disease, abdominal adhesions, achalasia, acute hepatic porphyria (AHP), anal fistulas, bowel incontinence, centrally mediated abdominal pain syndrome (CAPS), clostridioides difficile infection, cyclic vomiting syndrome (CVS), dyspepsia, eosinophilic gastroenteritis, globus, inflammatory bowel disease, malabsorption, scleroderma, or volvulus.

In some embodiments, the subject is a human.

Kits of detecting a glycan molecule In one aspect, disclosed herein is a kit for detecting or purifying a glycan molecule, the kit comprising an apparatus comprising a binding medium and a surface glycan binding protein (SGBP), wherein the binding medium comprises a matrix of biomolecules, the SGBP comprises a peptide label, and the SGBP is attached to the matrix; a container comprising an equilibration buffer, a container comprising a releasing buffer, and a container comprising a washing buffer.

In some embodiments, the glycan molecule comprises chondroitin sulfate, arabinan, heparan sulfate, hyaluronan, fructan, levan, mannan, or derivatives thereof.

In some embodiments, the SGBP comprises a chondroitin sulfate (CS)-specific SGBP, a levan-specific SGBP, an inulin- specific SGBP, an arabinogalactan-specific SGBP, a dextran-specific SGBP, a hyaluronan(HA)-specific SGBP, a heparan sulfate (HS)-specific SGBP, an O-glycan(OG)- specific SGBP, or variants thereof.

In some embodiments, the matrix comprises a nickel-conjugated agarose polymer. In some embodiments, the matrix comprises a stationary or porous composition of biomolecules, including, but not limited to a metal ion (including, but not limited to cobalt, nickel, and copper), an antibody, a protein, a nucleic acid, a hormone, and metabolite capable of reversibly binding a target composition, compound, or molecule.

In some embodiments, the SGBP is directly conjugated to a peptide label or the SGBP is conjugated to the peptide label by a linker. It is contemplated that the location and placement of the peptide label on the SGBP is not limited. In some embodiments, the peptide label comprises a hexahistidine label. In some embodiments, the releasing buffer comprises a histidine solution. In some embodiments, the releasing buffer comprises a buffer with a high salt concentration. In some embodiments, the releasing buffer comprises a buffer with a low salt concentration. In some embodiments the releasing buffer comprises a buffer with a low pH. In some embodiments, the releasing buffer comprises a buffer with a high pH. In some embodiments, the releasing buffer is a buffer comprising an ionic charge. In some embodiments, the releasing buffer comprises a buffer with a neutral ionic charge. In some embodiments, the releasing buffer comprises a buffer with a neutral pH. In some embodiments, the washing buffer comprises a buffered solution comprising water, phosphate buffered saline solution, a tris-buffered saline solution, or variants thereof.

In some embodiments, the kit is combined with any of the glycan-sensing systems disclosed herein to detect the glycan molecule.

In some embodiments, the kit is combined with any of the engineered bacteria disclosed herein the glycan molecule.

In some embodiments, the kit is combined with any of the reporter plasmids disclosed herein to detect the glycan molecule. EXAMPLES

The following examples are set forth below to illustrate the compositions, devices, methods, and results according to the disclosed subject matter. These examples are not intended to be inclusive of all aspects of the subject matter disclosed herein, but rather to illustrate representative methods and results. These examples are not intended to exclude equivalents and variations of the present invention which are apparent to one skilled in the art.

Example 1: HARNESSING GUT MICROBES FOR GLYCAN DETECTION AND QUANTIFICATION

The mammalian gut microbiota is a critical human health factor and a premier target for therapeutic modulation. The gut microbiota composition is influenced by the host diet, intestinal mucosa, and co-resident microbes, which all contribute to the availability of complex oligo- and polysaccharides, collectively referred to herein as glycans. Glycans are highly abundant macromolecules found in every domain of life, mediate critical cellular processes, and exhibit extraordinary structural diversity conferred by unique non-linear arrangements, glycosidic linkages, and monosaccharide components. Altering glycan availability in the intestine modulates the gut microbiota by favoring the expansion of microbial populations that can consume individual glycan structures, which typically requires differentially available enzymatic activities. The mammalian intestine can exhibit disease- associated alterations in mucosal glycosylation influencing the abundance of gut microbial populations. Thus, evaluating glycans associated with intestinal diseases reveal disease biomarkers, therapeutic targets, or prebiotic compounds. However, unlike nucleic acids and proteins, glycan structural complexity limits high-throughput examinations of heterogenous glycan mixtures using individual methodologies frequently requiring parallel tandem approaches to comprehensively examine complex heterogenous glycan mixtures. Therefore, new tools and approaches are necessary to efficiently survey mixtures for unique glycan structures that could modulate microbial abundance and activity in the gut.

Microbes have evolved systems dedicated to glycan recognition and consumption from a myriad of biological sources. Members of the gut microbiota encode vast repertoires of glycan utilization machinery conferring unparalleled access to the diverse substrates present in the intestinal lumen and facilitate gut colonization. For example, a dominant and abundant bacterial group present in the mammalian gut, called the Bacteroides, encode discreet, tightly linked, coordinately regulated gene clusters called polysaccharide utilization loci (PULs) that mediate the consumption of structurally distinct glycans. Many Bacteroides species encode vast PUL libraries endowing these organisms with the ability to consume structurally diverse complex polysaccharides derived from the host diet, co-resident microbes, and the intestinal mucosa to successfully thrive in the mammalian gut. Most PULs are transcriptionally regulated such that individual genes are expressed at low constitutive levels until encountering their target glycans (FIG. 1A), resulting in rapid and dramatic increases in corresponding PUL transcripts (FIG. IB). These responses are directed by sensor proteins that recognize mono-, di-, or oligo-saccharide signatures of their corresponding target glycan(s) and subsequently increase PUL transcript levels in a rapid and dramatic fashion (FIGS. 1 A and IB). The ability of PUL sensors to direct PUL transcription changes in response to glycan availability demonstrates their use as biosensors to be employed to surveille heterogenous mixtures.

Additionally, PULs encode proteins possessing glycan binding, transport, and depolymerization activities that exhibit substrate specificity and confer species-specific access to discreet glycan subsets (FIG. 1 A). For example, B. thetaiotaomicron (Bt) and a closely related species, B. ovatus (Bo) both encode similar polyfructan-specific PULs that confer discreet target glycan specificities. The Bt fructan PUL facilitates the consumption of the P-2,6-linked poly-fructan, levan, which is inaccessible to Bo, while the Bo fructan PUL facilitates utilization of the P-2,1 -linked polyfructan, inulin, which is inaccessible to Bl. Species specific poly-fructan utilization is mediated independently of the fructan PUL-sensor, which activates PUL transcription in both Bt and Bo following detection of monomeric fructose. Thus, species-specific differences in target glycan utilization can rely on PUL-encoded factors that distinguish between compositionally similar yet structurally distinct glycans. Moreover, the specificity exhibited by SGBPs (Fig. 1A) demonstrates their use as affinity purification reagents to isolate PUL-target glycans from complex heterogenous mixtures aiding in their structural characterization However, the paucity of specific, sensitive, and cost-effective glycan detection methods has hindered the development of such approaches.

Bacteroides PUL sensors are described to be harnessed as tools to detect their corresponding target glycans by examining changes in PUL transcription following the introduction of various biological samples. However, examining PUL transcription in response to glycan mixtures has relied on molecular approaches such a qPCR, microarrays, and RNAseq to detect transcriptional changes in Bacteroides species. Furthermore, the anaerobic growth requirements of Bacteroides species have previously limited the implementation of traditional transcriptional reporters such as GFP and luciferase to ex vivo measurements at discreet times. Herein, a Bacteroides-optiraized luciferase reporter is generated to accurately reflect transcription during growth in anaerobic culture and is readily quantifiable. Reporter plasmids containing PUL promoters can sensitively and specifically indicate the presence of target glycans and these responses require PUL-encoded transport, degradation, and detection machinery in multiple Bacteroides species. Moreover, it was determined that target glycans elicit dose-dependent responses from their corresponding PUL reporters that can be used to quantify glycan abundance as accurately as commercially available kits. Finally, an affinity purification strategy was developed using PUL-encoded SGBPs to isolate individual target glycans from heterogenous mixtures by employing PUL-reporters as glycan detection and quantification tools. This microbial glycomics toolkit is readily scalable and will enable efficient characterization of unknown PUL ligands, identify previously concealed genetic determinants governing microbial glycan utilization, and ultimately reveal the glycomic interface between mammals and their gut microbiotas.

Results

Bacteroides PUL sensors promote dose-dependent transcriptional responses to target glycans. Bacteroides species frequently encode many PULs (Bt and Bo encode 88 and 112, respectively, each putatively dedicated to the utilization of distinct glycan substrates. PULs include a .susCD-encoded complex necessary for the translocation of a target glycans across the outer membrane (FIG. 1 A) and the levels of .sus CD-containing transcripts increase rapidly and dramatically following exposure to their cognate target glycan (FIG. IB). For example, transcripts corresponding to the susC gene, BT3332, located within a Bt PUL required for chondroitin sulfate (CS) utilization (FIG. 1C) increase 92-fold after 120 minutes following the addition of CS to the media (FIG. ID). Similarly, transcripts corresponding to another susC gene, BT1763, located within the Bt PUL necessary for levan utilization (FIG. IE), increase 530-fold following the addition of levan (FIG. IF). These increases in susC transcript levels require sensor proteins that often bind target glycan-derived ligands in the periplasm and subsequently direct transcriptional responses in the cytoplasm (FIG. 1A). Accordingly, Bt mutants lacking either the CS or levan PUL sensors, are unable to grow on their corresponding target glycans as sole carbon sources (FIGS. 1C and IE, respectively) or increase PUL transcription (FIGS. ID and IF, respectively), but exhibit no growth defects on galactose (FIGS. 8 A and 8B, respectively).

Interestingly, BT1763 and BT3332 transcript levels exhibited corresponding decreases when Bt was supplied 10-fold dilutions of levan or CS (FIGS. 1G and 8C, respectively) indicating that Bacteroides PUL transcriptional responses are concentration-dependent. However, the relationship between .susC-transcripl and glycan concentrations detected at 120 minutes was not apparent after 60 minutes following glycan addition (FIGS. 8D and 8E), demonstrating the dynamic nature of PUL transcription. These results show that PUL sensors promote dose-dependent transcriptional responses over time, which are harnessed to report unknown target glycan abundances by measuring changes in corresponding susC-transcript levels. Construction of a Bacteroides-optimized bioluminescent transcriptional reporter. Quantifying susC transcription by qPCR or transcriptomics is expensive and inefficient, requiring kinetic sampling to accommodate variations in PUL transcriptional responses. Therefore, a transcriptional reporter was engineered that accurately reflects transcript levels in Bacteroides species over time during anaerobic growth without additional oxygenation. Previously constructed transcriptional reporters in Bt require terminal measurements following cell lysis limiting their applications for kinetic measurements and fluorophores such as GFP require exposure to oxygen for proper maturation or exhibit high background fluorescence. In contrast, the production of bacterial LuxCDABE proteins and biosynthesis of the luciferase long-chain aldehyde substrate can be achieved anaerobically although luciferase activity is oxygen dependent. It was contemplated that /m-mediatcd bioluminescence could function in Bacteroides species during growth under anaerobic conditions when expressed from Bacteroides promoters and ribosomal binding sites, which differ from those typically found in Firmicutes and Proteobacteria. Therefore, the Photorhabdus luminescens (Pl) luxCDΔBE cassette was engineered in the multi-copy plasmid, pLYLOl, preceded by the Bt rpoD promoter (BT1311) and an optimized Bacteroides ribosome binding site, rpiL*, that facilitates high levels of gene expression (FIG. 2A). A wild-type Bt strain harboring the Pl lux cassette exhibited detectable luminescence during anaerobic growth in minimal media containing galactose (FIG. 2B) or glucose (FIG. 9A) and grew similarly to a strain harboring the empty vector under identical conditions (FIGS. 2B and 9A, respectively).

To further optimize luciferase activity, the lux operon was re-organized to luxCDEΔB and exchanged the Pl intergenic regions for the Bt intergenic regions from BT1160-1155 (FIG. 2A), which increased luminescence output approximately 10-fold over the Pl cassette without altering growth kinetics (FIGS. 2B and 9A). A Bt strain harboring the Bacteroides-opdvaizod lux cassette exhibited consistently higher relative luminescence than a strain expressing the Pl cassette during logarithmic growth (FIGS. 2B and 9 A) and remained readily detectable even in late stationary phase over 48 hours (FIG. 2B). A plasmid containing tandem BamHI and Spel restriction sites upstream of rpiL*- luxCDEΔB (pBolux, FIG. 2A) were constructed for efficient cloning of promoters. A Bt strain harboring pBolux exhibited similar sustained luminescence over 18 hours in galactose (FIG. 2C), glucose, fructose, arabinose, or xylose as sole carbon sources (FIG. 9B). Introduction of the Bt rpoD promoter into the BamHI and Spel sites increased activity 10-fold independently of carbon source (FIGS. 2C and 9B). Introduction of pBolux into Bo produced similar results across all conditions and a plasmid preceded by the region upstream of the Bo rpoD gene (BACOVAJJ0615) increased luminescence approximately 10-fold (FIGS. 9C and 9D). These data indicate that pBolux can reflect promoter-dependent transcription in multiple Bacteroides species during anaerobic culture with minimal cost to bacterial growth.

Engineering a glycan responsive PUL reporter. To determine if pBolux can accurately indicate changes in PUL transcription in response to target glycans, the 300 bp region was introduced immediately upstream of the susC gene from the CS PUL (P-BT3332)' . Luminescence (FIG. 3A) and growth (FIG. 3B) was measured in a wild-type Bt strain harboring P-BT3332 or promoter- less pBolux for 18 hours following the introduction of CS or galactose as the sole carbon source. After only 2 hours, the relative luminescence from a strain harboring P-BT3332 was 46-fold greater and reached a peak activity of 65 -fold by 7 hours following the introduction of CS compared to cultures supplied galactose alone (FIGS. 3A and 3C). Conversely, a strain harboring pBolux exhibited identical luminescence in CS relative to galactose until 6 hours where it increased 7-fold over 18 hours (FIG. 3C). P-BT3332 -driven reporter increases imposed no detectable growth alterations indicating that increased luminescence carries minimal fitness cost (FIG. 3B).

To examine whether genes encoded within the CS PUL (BT3328-BT3334 & BT4410-BT4411 ) are required for the observed P-BT5552-specific luminescence increases in CS, reporter activity in mutant Bt strains deficient for CS transport, depolymerization, and detection activities necessary for growth on CS (FIG. 1C) were compared following the introduction of a mixture of 0.2% CS and 0.2% galactose (FIG. 10A) (galactose was added as a supplemental carbon source to support growth and luciferase activity) or galactose alone (FIG. 10B). A mutant lacking the CS-sensor, BT3334, exhibited no increased luminescence over 18 hours following the introduction of CS (FIG. 3D). The reduced bioluminescence exhibited by this strain was not the consequence of disabling growth on CS because a mutant lacking a glucuronyl hydrolase, BT3348 (gh), is unable to utilize CS (FIG. 1C), but exhibits sensor-dependent PUL transcription and increased reporter activity similar to wild-type Bt (FIG. 3D). These data show that disabling genes necessary to generate the PUL-sensor ligand, the 6- or unsulfated CS-derived disaccharide (diOS), in the periplasm prevent sensor-dependent reporter activity increases. Consistent with this notion, a mutant lacking the CS PUL-encoded susC transporter, BT3332, exhibited a dramatic reduction in reporter activity with a peak activity of 3.1 -fold increase at 10 hours putatively by reducing CS importation from the extracellular environment (FIG. 3D). Furthermore, disabling 3 CS-specific polysaccharide lyases necessary for the conversion of CS- polymers into diOS (BT3324, BT3350, and BT4410) also dramatically reduced reporter activity in response to CS relative to wild-type Bt, peaking at 3.3-fold after 1 hour following the introduction of CS (FIG. 3D). However, supplying diOS increased luminescence 39.6-fold by 16 hours in the lyase- deficient but not the sensor-deficient mutant (FIG. 3E). Collectively, these data indicate that a strain harboring P-BT3332 can indicate the presence of CS in a manner dependent on PUL-encoded glycan importation, depolymerization, and detection activities.

To determine whether increased activity from a strain containing P-BT3332 were limited to CS-PUL substrates, luminescence was examined following the addition of a structurally similar but compositionally distinct glycosaminoglycan, heparin sulfate (HS, FIG. IOC), which is consumed by Bt independently of the CS-PUL (FIG. 10D). A strain harboring the CS-reporter exhibited a dramatically lower activity following the addition of HS compared to the addition of CS (FIG. 3F), peaking at 2.6-fold after an hour. Conversely, the addition of hyaluronic acid (HA), which also activates sensor-dependent CS-PUL gene expression required for HA utilization (FIG. 10E), increased reporter activity similarly to CS in wild-type Bt but not the sensor-deficient mutant (FIGS. 3F and 10F). Thus, the Bacteroides-optimized lux reporter can specifically indicate the presence of target glycans by reflecting PUL sensor-dependent responses to glycan-derived ligands.

A fructan-responsive reporter reveals new insights into Bt levan utilization. Bt consumption of the polyfructan, levan, requires a distinct PUL encoded by BT1754-BT1765 and includes an unlinked but co-regulated exo-levanase, BT3082. A Bl strain harboring pBolux containing the region preceding the corresponding susC gene (P-BT1763) exhibits 35-fold increased activity 5.5 hours after the introduction of levan compared to galactose (FIGS. 4A and 11 A). Levan-dependent reporter activity requires the BT1763 promoter region because a strain harboring the promoter-less control plasmid, pBolux, exhibits no change after 18 hours following the introduction of levan (FIGS. 4 A and 11 A). Differences in reporter-activity cannot be attributed to plasmid-dependent growth effects because both strains exhibit identical growth kinetics in either galactose (FIG. 11B) or levan (FIG. 11C). Reporter activation requires levan-derived fructose detection by the PUL sensor protein, BT1754, because a mutant lacking BT1754 displays only 1.4-or 1.7-fold increased reporter activity following the introduction of either levan (FIG. 4B) or fructose (FIG. 4C), respectively. Moreover, a strain lacking the corresponding susC gene exhibited a 3 -hour delay before reporter activity increased following the addition of levan (FIG. 4B) and 60-hour lag before detectable growth on levan as the sole carbon source (FIG. IE). Conversely, identical strains exhibit reporter activity (FIG. 3C) and growth kinetics (FIG. 3D) resembling wild-type Bt when supplied fructose, confirming that the fructan-specific SusC is dispensable for transport of monomeric fructose and internalizes levanderived fructo-oligosaccharides. Consistent with this notion, it was determined that inactivation of 4 levanases (BT1760, BT1759, BT1765 and BT3083) reduced levan-dependent reporter activity to levels similar to the /?T/754-dcil'icicnt strain (FIG. 4B) and abolished utilization of levan as the sole carbon source (FIG. IE). Strains harboring only a single levanase, generated by disabling various combinations of all other 3 levanases, exhibited reporter activity increases in levan after a 4-hour delay (FIG. 4E) and grow on levan as the sole carbon source (FIG. 4F) albeit at dramatically reduced rates and maximum cell densities compared to wild-type Bl. Unexpectedly, all strains lacking the exolev anase, BT1765, exhibited reduced maximum cell densities during growth in fructose as the sole carbon source (FIGS. 11D and HE), and this was specific because /?77765-del'icient strains grew similarly to wild-type Bt \n galactose (FIGS. HF and 11G). Finally, reduced levan-dependent reporter activity in any mutant is not due to diminished growth because a mutant deficient for the inner membrane fructose importer, BT1758, cannot grow on levan (FIG. IE) or fructose (FIG. 4D) as sole carbon sources but exhibit reporter activity resembling wild-type Bt in response to either carbohydrate (FIG. 4B and 4C, respectively). Collectively, these data demonstrate that P-BT1763 containing strains indicate the presence of levan-derived fructose in a manner requiring PUL-encoded transport, depolymerization and detection proteins independently of its utilization.

These results reveal that Bt employs multiple levanases to consume levan and demonstrate that three distinct exo-levanases can function independently of the endo-levanase, BT1760, contrasting previous conflicting reports asserting this gene was either essential or dispensable for growth on levan. To examine the role of BT1760 in levan utilization, a BT1760-deficient strain was constructed, which exhibited reduced growth on levan as the sole carbon source (FIG. 4G) and delayed reporter activation in response to levan compared to wild-type Bt (FIG. 4H). This mutant exhibited growth and reporter activity resembling wild-type Bt when complemented in trans (FIG. 11H) indicating that this BT1760 mutation did not disrupt expression of downstream genes required for levan utilization (FIG. IE). Furthermore, a mutant encoding only BT1760 and lacking all known exo-levanases (ΔBT 1759, ΔBT3082 ΩBT1765). exhibited the greatest delay in reporter activity increases (FIG. 4E) and achieved the lowest maximum growth on levan as the sole carbon source (FIG. 4F) compared to any other combination of levanase mutations. Collectively, these data demonstrate that exo-levanases function independently of BT1760 to liberate fructose from levan highlighting that Bt levan utilization is not completely understood. Finally, these results illustrate how the pBolux reporter plasmid can be implemented to genetically dissect the contributions of individual PUL-encoded activities to target glycan consumption.

Species-specific reporter responses distinguish between compositionally identical glycans. Bacteroides species can differentially consume structurally distinct glycans comprised of identical monosaccharide components. This is exemplified by the fructan PUL, which is conserved across several species and confers Bt the ability to consume the P(2,6)-linked polyfructan, levan (FIG. IE), but not the P(2,l)-linked polyfructan, inulin (FIG. 5 A). Conversely, a similar PUL in Bo (BACOVA_04496-BACOVA_04507) confers inulin (FIG. 5A) but not levan utilization (FIG. 12A). Bo inulin utilization requires the PUL-sensor protein, BACOVA_04496, because a mutant lacking the corresponding gene is unable to grow on inulin as a sole carbon source (FIG. 5A) and exhibits reduced growth rates on fructose (FIG. 12B) but not galactose (FIG. 12C), showing that the Bo fructan PUL sensor detects monomeric fructose similarly to the Bt fructan PUL sensor. To examine how similar PULs encoded by closely related species can confer access to compositionally identical yet structurally distinct glycans, a Bo strain was generated harboring a reporter plasmid containing the region upstream of the corresponding susC gene (P-BACOVA_04505). This strain exhibits dramatically increased activity following the addition of inulin compared to galactose, in contrast to a promoter-less control plasmid (FIGS. 5B and 12D). Inulin-dependent reporter activity increases in Bo requiring the fructan PUL sensor because no change in reporter activity was observed when the sensor mutant was supplied inulin (FIG. 5C) or fructose (FIG. 5D). Collectively, these results indicate that the Bo fructan PUL sensor responds to monomeric fructose in a manner similar to the Bt sensor although each PUL confers access to structurally distinct glycans. Thus, identical PUL sensor specificities are employed to mediate utilization of distinct fructans showing that other PUL-encoded products facilitate species-specific growth on inulin and levan.

Interestingly, the addition of levan elicits detectable reporter activity increases in wild-type Bo harboring P-BACOVA_04505 compared to a strain lacking the corresponding PUL-sensor (FIG. 5E) although Bo is unable to grow on levan as a sole carbon source (FIG. 12A). Similarly, wild-type Bt harboring P-BT1763 exhibited detectable reporter activity increases when supplied inulin compared to a sensor-deficient strain (FIG. 5F), showing that Bo and Bt can derive fructose from levan and inulin, respectively, in quantities insufficient to support growth. It is contemplated that these responses reflect slow, non-specific, and extracellular liberation of fructose from non-target fructans because a Bt strain lacking the fructan- PUL encoded susC gene, BT1763, exhibits identical reporter activity relative to the wild-type strain following the addition of fructose (FIG. 4C) or inulin (FIG. 5F). Although inulin-dependent reporter activity was substantially reduced in a strain lacking 4 levanases, luminescence levels were greater than a senior-deficient mutant suggesting Bt possesses unknown inulin-degrading activities that can likely liberate fructose monomers slowly and non- specifically compared to Bo (FIG. 5F). Collectively, these results demonstrate how PUL reporters exhibit highly sensitive responses in distinct gut Bacteroides species and can detect the presence of compositionally identical but structurally distinct glycans independently of their utilization.

PUL reporters exhibit linear dose-dependent responses. To determine if dose-dependent PUL transcriptional responses elicited by target glycans (FIGS. 1G and 8C) are reflected by the corresponding PUL-reporter activity, 2-fold serial dilutions of either CS or levan were supplied to Bt strains harboring either the CS- or fructan-responsive reporter plasmids described above. A wild-type Bt strain containing P-BT3332 exhibited increased reporter activity when supplied between 0.0001% (1 pg/mL) and 0.4% (4 mg/mL) CS relative to identical cultures supplied only galactose (FIG. 6A). Similarly, a wild-type Bl strain containing -BT1763 displayed concentration-dependent reporter activity increases when supplied between 0.0016% (16 pg/mL) and 0.4% levan (FIG. 13A). Quantifying the area under each curve (AUC) revealed that the CS- and levan-reporter responses increase linearly within these ranges (FIGS. 6B and 6C) and the logarithm of corresponding AUC values follows a linear regression model with an R 2 of 0.95 and 0.97, respectively, when supplied their target ligands (FIGS. 13B and 13C). Reporter strains respond specifically to their corresponding target glycan because neither the CS- or levan-responsive strains exhibited significantly increased luminescence when supplied with any concentration of levan or CS, respectively (FIG. 6D). These data show that the linear, concentration-dependent reporter activity increases exhibited by engineered Bt strains in response to their target glycans could be used to estimate target glycan abundance in an unknown sample concentration. However, bacterial species including Bt exhibit hierarchical glycan utilization, facilitating the prioritized expression of genes necessary for the consumption of moreover less-preferred substrates, hindering target-glycan detection in PUL-reporter strains. However, the CS-reporter strain produced similar dose-dependent responses in the presence of levan (FIG. 13D) and exhibited an identical linear range of detection relative to those collected in the absence of levan (FIG. 13E). Interestingly, the levan-responsive strain exhibited greater reporter activity responses to levan in the presence of 0.2% CS (FIG. 13D) expanding the lower range of responsiveness down to 0.0004% (4 pg/mL) levan and suggesting that the presence of CS may enhance luciferase activity. Consistent with this notion, the addition of 0.2% CS significantly increased the levan-responsive reporter activity when supplied with 0.2% levan (FIGS. 6E and 13D) but did not significantly change fructan-reporter activity in the absence of levan (FIG. 6D). Taken together, these data show that PUL reporters can indicate the presence of target substrates in glycan mixtures.

Sensor-dependent PUL transcription by target glycans is typically resolved by negative feedback mechanisms whereby PUL-encoded activities remove the glycan-derived, PUL-sensor ligand from the periplasm, thereby reducing PUL sensor activation (FIG. 1A). For example, the CS- responsive PUL relies on a glucuronyl hydrolase (g/z) to hydrolyze diOS into its monosaccharide components N-acetyl galactosamine and 5-keto 4-deoxyuronate, thereby removing the sensor ligand and reducing transcription activation. Therefore, a mutant lacking BT3348 exhibits increased PUL- transcription relative to wild-type Bt as the CS-sensor is deactivated by this activity. Accordingly, a "//-deficient strain harboring P-BT3332 exhibits detectable responses between 0.000006% (6 ng/mL) and 0.4% CS (FIG. 6F) but the linearity of these responses reached a maximum at 0.0125% (125 pg/mL) (FIGS. 6B and 13B). Similarly, a strain lacking the inner-membrane fructose importer, BT1758, harboring P-BT1763 exhibited increased luminescence at levan concentrations as low as 0.0004% and this response achieved its maximum at 0.2% levan (FIG. 6C) limiting the upper range of its linear responsiveness (FIG. 13C). Thus, PUL-reporter activity can be genetically tuned to extend target glycan sensitivity by preventing PUL-sensor deactivation, which horizontally shifts the linear ranges of glycan detection. Collectively, this data show PUL-reporters offer new specific and highly sensitive glycan detection reagents that estimate glycan abundances across wide linear ranges spanning more than 3 orders of magnitude.

PUL-reporters facilitate target glycan isolation using SGBPs. PULs often encode SGBPs that facilitate glycan sequestration along the outer membrane (FIG. 1A). SGBPs specifically bind their cognate glycan ligands and can discriminate between compositionally similar, structurally distinct glycans. For example, BT1761, an SGBP encoded in the Bt levan utilization PUL, binds to levan but not inulin and BT333O, an SGBP encoded in the Bt CS utilization PUL differentially binds CS polymers greater than 20 disaccharide units in size. Consistent with these findings, recombinant BT3330 protein exhibited a reduced migration following affinity-PAGE in the presence of CS (FIG. 7 A) but not in its absence (FIG. 7B). Conversely, BT1761 protein exhibited a decreased relative migration following affinity-PAGE in the presence of levan (FIG. 7C) but not in its absence (FIG. 7B). Furthermore, the altered migration of both proteins was specific to the presence of target glycans because BT333O migration was similar in the presence of levan (FIG. 7C) relative to gels lacking glycan (FIG. 7B) and BT1761 migration in CS-containing gels (FIG. 7A) was similar in the absence of glycan (FIG. 7B). These results indicate that each PUL-encoded SGBP can distinguish between target and non-target glycans in vitro and putatively facilitate target glycan isolation from a heterogenous mixture. PUL-reporter strains were thought to exhibit sensitive (FIGS. 6B and 6C) and specific (FIG. 6D) detection of target glycans even in the presence of non-target molecules (FIGS. 13D and 13E), they could serve as glycan detection reagents to identify fractions enriched for glycans of interest using immobilized, recombinant SGBPs. Thus, cell lysates were combined and prepared from E. coli strains engineered to over-express either N-terminally his-tagged BT1761 or BT3330 with an equal mixture of levan and CS. After incubation, SGBPs were captured using Ni-agarose, washed extensively, and subsequently eluted with free histidine so that each fraction could be supplied directly to Bt cultures harboring either P-BT3332 or P-BT1763. Elutions 2 through 8 from columns containing BT3330 elicited significantly increased CS-responsive reporter activity compared to control reactions incubated with lysates prepared from E. coli strains harboring an empty vector (FIGS. 7D and 14A). Conversely, identical fractions supplied to the fructan-responsive reporter strain resulted in activity that was indistinguishable from those eluted from vector control containing lysates (FIGS. 7D and 14B) indicating that BT3330-containing fractions (FIG. 14C) were enriched for CS over levan. Conversely, fractions co-eluting with BT1761 (FIG. 14C) elicited increased activity from the levan- (FIGS. 7E and 14B) but not the CS-responsive reporter strain relative to elutions from a vector control (FIGS. 7E and 14A), indicating BT1761 enriched levan over CS.

To measure SGBP-mediated target glycan enrichment, pooled fractions co-eluting with either BT3330 or BT1761 that elicited significantly increased reporter activity compared to control fractions were concentrated (FIGS. 7D and 7E). The concentrated material was supplied to Bt strains harboring CS- or levan- reporter plasmids and compared to a standard curve of 2-fold dilutions of either CS- or levan (FIGS. 7F and 7G, respectively). Glycans co-eluting with BT3330 elicited increased activity from a wild-type Bt strain harboring P-BT3332 (FIGS. 7F and 14D) but not P-BT1763 (FIG. 14E). Conversely, glycans co-eluting with BT1761 stimulated levan- (FIGS. 7G and 14E) but not CS- responsive reporter activity (FIG. 14D). TheBT3330 eluate contained 205.6 pg/mL CS and the BT1761 eluate contained 244.4 pg/mL levan using a simple linear regression of loglO reduced AUC values from corresponding standard curves (FIG. 7F and 7G, respectively). To verify these results with reporter-independent methods, total fructan or glycosaminoglycan content was measured from identical samples using colorimetric assays (Biovision and Megazyme, respectively), which indicated that concentrated eluates co-purifying with BT3330 containing 168.2 pg/mL CS (FIG. 7H) but no detectable levan and material co-purifying with BT1761 contained 229.7 pg/mL levan (FIG. 71) but no detectable CS. To estimate the concentration of non-target glycans co-purifying with each SGBP, the concentrated eluate was supplied to hyper-sensitive strains defective for sensor deactivation, which exhibit extended target glycan detection ranges (FIGS. 13B and 13C). The concentrated eluates from BT1761 increased CS-responsive reporter activity in the gA-deficient strain harboring P-BT3332 (FIG. 14F) within the linear range of detection (FIG. 14G) indicating that 0.0000086% (86.7 ng/mL) CS co-purified with BT1761, representing a 939.7-fold enrichment for levan over CS. However, an estimate of the SGBP-mediated enrichment of CS over levan was not obtained because the concentrated BT3330 eluate stimulated 1.2-fold increased levan-responsive reporter activity in the fructose importer-deficient strain harboring P-BT1763 (FIG. 14H) but this was outside of the linear range of quantification (FIG. 141). Collectively, these data demonstrate that PUL reporters are powerful detection tools that can facilitate target glycan isolation from heterogenous mixtures and measurement beyond the limits of commercially available solutions.

Discussion

This disclosure provides a new reporter system that kinetically monitors transcription during anaerobic growth in prominent human gut commensal bacteria. Here, introduction of pBolux plasmids containing PUL promoters generates glycan-sensitive bacterial strains that require PUL-encoded transport, depolymerization, and detection activities (FIGS. 3-5). It is demonstrated that Bacteroides PUL sensors elicit concentration-dependent transcriptional responses (FIGS. 1G and 8C) that are recapitulated by corresponding reporter strains (FIGS. 6 and 13) and can indicate the presence of corresponding target molecules at concentrations below the limits of detection of commercially available kits (FIGS. 7F-7I). Moreover, it is demonstrated that reporter sensitivity can be genetically tuned to respond by glycan degree of polymerization by disabling glycan depolymerization machinery (FIGS. 3-4) or achieve lower limits of detection disabling PUL-encoded genes necessary for PUL deactivation (FIGS. 6F, 13B, and 13C-13F). It is also demonstrated that reporter responses are readily quantifiable and can estimate unknown target glycan concentrations (FIGS. 7F and 7G) similar to reporter-independent methods (FIG. 7H and 71). Thus, this example establishes a new glycomics toolset that employs Bacteroides PULs to facilitate simple, rapid, and high-throughput detection and quantification of gut microbial substrates in heterogeneous mixtures.

Examining bacterial glycan utilization has previously relied on detectable growth on commercially purified glycans or steady-state transcriptional changes following introduction of undefined glycan mixtures. It is demonstrated that PUL reporters can indicate bacterial transcriptional responses to target glycans in dramatically smaller volumes (as low as 30 pL) (FIGS. 6 and 7) and require substantially less material than similar measurements using ex vivo approaches such as qPCR, transcriptomics, or NanoLuc. Furthermore, pBoZn -derived reporter strains overcome the need for kinetic sampling of bacterial cultures because changes in gene expression can be measured over time into late stationary phase (FIG. 1L1J) and can indicate target glycans independently of their utilization (FIGS. 1C, IE, 3D, and 4B). Finally, because these strains can be cultured ad infinitum and do not require harvesting mRNA or cDNA synthesis, they offer a high-throughput, cost-effective tool to measure changes in gene transcription during growth. These advantages render Bolux a powerful, efficient, and relatively inexpensive tool to examine Bacteroides gene transcription in various conditions including in response to glycans.

To demonstrate the advantages of pBoZn -derived plasmids, two PUL-reporters were leveraged to devise an affinity isolation approach whereby recombinant PUL-encoded SGBPs isolate target glycans from a mixture (FIG. 7). Because the corresponding reporter strains can specifically and sensitively detect target glycans even in the presence of non-targets (FIGS. 6D, 6E, 13D, and 13E), fractions containing two distinct SGBPs copurifying with miniscule amounts of their corresponding glycan ligands were identified (FIGS. 7A and 7B). Furthermore, using a strain with extended limits of target glycan detection, contaminating non-target glycan abundance demonstrated that this approach achieved 940-fold enrichment of target from non-target molecules (FIGS. 14F and 14G). It is contemplated that similar approaches could be implemented to isolate and characterize unknown PUL-targets such as those derived from the mammalian mucosa, which can exert prebiotic effects when administered in aggregate, for downstream structural characterization. In total, the dedicated glycan utilization machinery possessed by gut microbes has been harnessed to develop tools that enable simple, efficient, and inexpensive glycan detection, isolation and quantification tools that are readily scalable and distributable.

It is demonstrated how glycan-responsive reporters can harness species-specific glycan preferences to detect structurally distinct yet compositionally identical glycans (FIG. 4-5). With some Bacteroides species encoding over one hundred different PULs, many with unknown glycan targets, the accessible glycome of intestinal microbes is vast and largely uncharacterized. Moreover, some annotated PULs putatively facilitate utilization of unknown glycan substrates that confer speciesspecific gut colonization advantages, which represent attractive therapeutic targets and be identified by implementing analogous approaches to those described here. Because PUL reporters can indicate the presence of target glycans at extremely low abundances (Figs. 6B&C,S6B&C FIGS. 6B, 6C, 13B, and 13C), without serving as growth substrates (Figs. 3D,4B&C,5E&F FIGS. 3D, 4B, 4C, 5E, and 5F), and even in the presence of other glycans (Figs. 6D&E,S6D&E FIGS. 6D, 6E, 13D, and 13E), PUL reporters used to efficiently screen various mixtures for the presence of target glycans. Thus, constructing pBolux plasmids containing promoters from PULs with unknown targets could be used to examine differential glycan abundances across low-yield biological samples in high-throughput applications enabling new glycan surveillance approaches.

Methods

Bacterial culture. All E. coli strains were cultured on LB agar (BD) aerobically at 37°C and inoculated from single colonies into LB media (BD) with agitation at 275 rpm. All Bacteroides strains were cultured on brain-heart infusion agar (BHI, Sigma) containing 5% horse blood (Hardy) anaerobically and inoculated from single colonies into TYG incubated under identical conditions. Bacteroides strains were sub-cultured at the indicated dilutions from stationary phase growth in TYG into minimal media containing the indicated carbon sources described in the corresponding figure legends. All strains were cultured in the presence of antibiotic selection where appropriate at the following concentrations: 100 pg/mL ampicillin, 2 g/mL tetracycline, 25 pg/mL erythromycin.

Growth Measurements. Bacteroides strains were cultured anaerobically to stationary phase in TYG and diluted 1:200 into pre-reduced minimal media containing the indicated carbon sources [glucose, galactose, fructose arabinose, xylose, and chondroitin sulfate (Sigma); levan and inulin (Megazyme)] in pre-reduced 96-well or 384-well clear microplates (Corning). To remove free fructose from inulin, a 1% inulin solution was passed over a ZebaSpin de-salting column (ThermoFisher) prior to the addition of cultures. Kinetic absorbance measurements were taken at 600 nm every 15 minutes for 96 hours using a Tecan Infinite M-plex maintained at 37°C in a Coy anaerobic chamber with a 2.5% hydrogen atmosphere.

Strain construction. A plasmid encoding the Psuedorhabdus luminescens lux cassette under control of the B. thetaiotaomicon rpoD promoter (BT1311) was generated by amplifying products from pmini-Tn5 luxCDΔBE Tc using primers W3115 and W3124 and PBTi3ii-rpiL* from strain NS340 with primers W2952 and W2905. The resulting amplicons were assembled with pNBU2-tetQ digested with BamHI-HF and Sall-HF (NEB) using the NEBuilder Assembly Master Mix (NEB). The resulting plasmid (pNBU2-lux-Pl) was verified by Sanger sequencing before its introduction into B. thetaiotaomicron by di-parental mating to generate strain GT962, and the insert was sub-cloned into pLYLOl using BamHI-HF and Sall-HF to generate plux-Pl.

To generate a B. thetaiotaomicron optimized lux cassette, luxC (primers W2952 and W3265), luxD (primers W3266 and W3267), luxE (primers W3268 and W3269), luxA (primers W3270 and W3271) and luxB (primers W3272 and W3273) were amplified from GT962 genomic DNA and assembled with pNBU2-tetQb digested with BamHI-HF and Sall-HF (NEB) using the NEBuilder Assembly Master Mix (NEB). The resulting plasmid (pNBU2-Z«x-Bz) was verified by Sanger sequencing before being sub-cloned into pLYLOl using BamHI-HF and Sall-HF to generate plux-Bt. A promoter-less Bacteroides optimized lux cassette was generated by assembling the lux cassette from plux-Bt, amplified with primers 1080 and 1011, with pLYL0l digested with BamHI (NEB) and Sall-HF (NEB) using the NEBuilder Assembly Master Mix (NEB). The resulting plasmid, pBolux, contains tandem BamHI and Spel restriction sites for cloning and was used for all experiments as a negative control. To construct chondroitin sulfate, levan, and inulin-responsive reporters, the 300 bp region upstream of BT3332 (primers 1232 and 1373), BT1763 (primers 1150 and 1304), or (primers 1943 and 1944) were amplified and combined with pBolux digested with BamHI-HF and Spel-HF (NEB) using the NEBuilder Assembly Master Mix (NEB).

Generating loss-of-function mutations in Bt were performed using pEXCHANGE-tdk or pKNOCK-ermGb. A Bo strain lacking BACOVA_04496 was constructed using a similar allelic exchange method with pSIEl. All mutations were verified by Sanger sequencing across the chromosomal region of interest. All strains, plasmids and primers are listed in Tables 1-4, 5a, and 5b, respectively.

Transcript quantification. mRNA was prepared from 1.0 mL of pelleted Bt cell culture treated with RNA protect (Qiagen) using the RNeasy kit (Qiagen) according to the manufacturer’ s directions. cDNA was subsequently synthesized from 1.0 pg of isolated RNA using Superscript VILO IV master mix (ThermoFisher) according to the manufacturer’s directions. Transcript levels were measured by qPCR using Fast SYBR Green PCR Master Mix (Applied Biosystems) and primers 1060 and 1061 (BT3332) or 1056 and 1057 (BT1763) and monitored using a QuantStudio 12K Flex instrument (Applied Biosystems). All mRNA transcripts were normalized to 16s rRNA measured from 1,000-fold diluted cDNA using primers 1044 and 1045.

Measuring bioluminescence during growth in various monosaccharides. Bt and Bo strains were cultured in TYG containing 2 pg/mL tetracycline overnight before being diluted 200-fold into minimal media containing 0.5% carbon source in a sterile 96-well white, clear-bottomed microplate (Coming 3610). Absorbance at 600 nm and luminescence were measured every 15 minutes for 48 hours using a Tecan Infinite M-plex. RLU were calculated as luminescence values divided by absorbance at 600 nm.

Measuring bioluminescence in response to glycans. Bt and Bo strains were cultured in TYG containing 2 pg/mL tetracycline overnight before being diluted 50-fold into minimal media containing 0.5% galactose anaerobically grown to mid-exponential phase at 37°C (approximately 4 hours). Cultured cells were pelleted by centrifugation before the supernatant was removed, and cells were resuspended in 2X minimal media lacking a carbon source. Equal volumes of each cell suspension were transferred to a pre-reduced, white, clear bottom 384- well microplate (Coming 3765) containing equal volume of the indicated carbon source. Absorbance at 600 nm and luminescence were measured every 15 minutes for 18 hours using a Tecan Infinite M-Plex instrument anaerobically at 37 °C temperature. RLU were calculated as luminescence values divided by absorbance at 600 nm and normalized to identical measurements from each strain supplied only galactose.

Protein overexpression and purification. E. coli BL21 (DE3) cells were transformed with pT7-7 plasmids encoding SGBPs. Fresh transformants were cultured overnight in LB supplemented with 100 pg/mL ampicillin at 37°C with agitation. The following day, cultures were diluted 50-fold in LB supplemented with 100 pg/mL ampicillin and incubated for approximately 2 hours at 37°C with agitation to mid-exponential phase (ODeoo ~ 0.45-0.6). Cultures were cooled on ice before isopropyl P-thiogalactopyranoside (IPTG) was added to a final concentration of 50 pM and incubated at 30°C with agitation for 4 hours. Cells were pelleted by centrifugation at 7197 x g for 10 min at 4°C before the supernatant was decanted and cell pellets were frozen on dry ice and stored at -80°C. Pellets were thawed on ice, resuspended in 20 mM Tris/HCl buffer, pH 8.0, containing 100 mM NaCl and lysed in 2.0 mL tubes containing 250 pL volumes of 0.1 mm silica beads placed in a pre-chilled aluminum block using a Bead beater-96 (BioSpec) for 5 cycles of 40 second beating at 2400 rpm and 5 minutes resting at 4°C. Lysates were centrifuged at 20,000 x g for 10 minutes at 4°C and the supernatant was combined with 0.2 mL pre-equilibrated Ni-NTA sepharose and 10 mL of 20 M Tris/HCl buffer, pH 8.0, containing 100 mM NaCl for 1 hour at room temperature with rocking. The slurry was packed into a 2 mL purification column (Pierce) and the liquid phase removed by gravity flow before washing with 3 mL of wash buffer (20 mM Tris/HCl buffer, pH 8.0, containing 100 mM NaCl). SGBPs were eluted from the column with 3 mL of elution buffer (20 mM Tris/HCl buffer, pH 8.0, containing 100 mM NaCl and 25 mM histidine), concentrated and buffer exchanged using centrifugal concentrators (Millipore) and 6 successive additions of storage buffer (10 mM Tris/HCl buffer containing 10% glycerol). Protein concentrations were estimated by BCA assay (Pierce) and stored at -80°C.

Affinity PAGE. 100 pmol of protein was combined with native PAGE loading buffer and electrophoresed in a 10% polyacrylamide matrix containing 10% (w/v) acrylamide in Tris, glycine buffer, pH 8.8. at 20 mA for 3.5 hours at room temperature. Gels contained 0.1% CS or levan and BSA was used as a negative control. Proteins were visualized by Coomassie Blue staining.

SGBP target glycan isolation. 500 pL of clarified protein lysate was pre-incubated with 1.0 mL of a mixture containing 0.1% levan and 0.1% CS overnight at 4°C with rocking before combination with 200 pL pre-equilibrated Ni-NTA agarose (Sigma) in 9 mL of 20 mM Tris/HCl buffer, pH 8.0, containing 100 mM NaCl, for 1 hour at RT with rocking. The slurry was packed into a 2 mL purification column (Pierce) and the liquid phase removed by gravity flow before washing 4 times with 1 mL of 20 mM Tris/HCl buffer, pH 8.0, containing 100 mM NaCl. SGBPs were eluted from the column in a total of 4 mL elution buffer (20 mM Tris/HCl buffer, pH 8.0, 100 mM NaCl and 25 mM histidine). Glycan content was measured as described above whereby mid-exponential cells were pelleted by centrifugation and re-suspended into 2X minimal media containing 0.4% galactose. Each cell suspension was transferred to a pre-reduced, white, clear bottom 384-well microplate (Coming 3765) containing equal volumes of column fractions. Absorbance and luminescence were measured kinetically under anaerobic conditions as described above. Target glycan containing elution fractions were combined and treated with 0.1 mg/mL Proteinase K in lOmM Tris, pH 7.5 containing 20mM calcium chloride for 2 hours at 65°C. Total glycans were combined with 3 volumes of 200 proof ethanol, incubated overnight with rocking at 4°C, pelleted by centrifugation, and resuspended in one tenth the original volume of water.

Reporter-independent glycan measurements. Total levan content was assayed using the Fructan Assay Kit (K-Fruc, Megazyme), modified to include the hydrolysis and absorbance measurement at 410 nm against a standard curve of 0.2-1 mg levan. CS abundance was estimated using the Total Glycosaminoglycan Assay Kit (K2085, BioVision), according to the manufacturer’s directions against a standard curve of 0.2-10 ug CS.

Statistical analysis. Repeated measurements were analyzed by paired, two-tailed Student’s t-test, 1-way, or 2-way ANOVA using Graphpad Prism where appropriate as indicated in each figure legend.

Example 2: APPLICATIONS FOR HARNESSING GUT MICROBES

Glycans facilitate critical biological functions and control the mammalian gut microbiota composition by supplying differentially accessible nutrients to distinct microbial subsets. Therefore, identifying unique glycan substrates that support defined microbial populations will inform new therapeutic avenues to treat diseases via modulation of the gut microbiota composition and metabolism. However, examining heterogenous glycan mixtures for individual microbial substrates is hindered by glycan structural complexity and diversity, which presents substantial challenges to glycomics approaches. Fortuitously, gut microbes encode specialized sensor proteins that recognize unique glycan structures and in-turn activate predictable, specific, and dynamic transcriptional responses. Herein, the microbial machinery is harnessed to indicate the presence and abundance of compositionally similar, yet structurally distinct glycans, using a developed transcriptional reporter. It demonstrated how these tools can be implemented to examine glycan mixtures, isolate target molecules for downstream characterization, and quantify the recovered products. Thus, a toolkit will dramatically enhance the understanding of the mammalian intestinal environment and identify host- microbial interactions critical for human health. Example 3: DEVELOPMENT OF METHODS FOR DETECTING AND QUANTIFYING GLYCANS

Modern glycomics approaches face technical and biophysical limitations that hinder comprehensive characterization of heterogeneous glycan mixtures typically derived from biological samples. For example, glycomic analyses by mass spectrometry requires sample derivatization and ionization that bias detection due to the non-uniform labeling of individual monosaccharides, covalent decorations, and branched structures exhibited by many glycans. Conversely, nuclear magnetic resonance (NMR) precisely determines composition, sequence, and structural arrangements but requires isolation of individual glycan species to achieve optimal resolution. Isolation of individual target glycans from a complex mixture requires subsequent rounds of fractionation using sequential high-performance liquid chromatographic (HPLC) approaches to sort compositionally similar but structurally distinct molecules. Glycan binding aptamers and proteins, such as lectins, can directly indicate the presence of distinct targets but their narrow specificity paired with high-cost limits their accessibility as high-throughput glycan surveillance tools. Consequently, glycomics strategies often require the application of multiple, tandem methodologies to comprehensively examine the complex mixtures typically extracted from eukaryotic, prokaryotic, and plant materials. Therefore, new, accessible, and inexpensive technologies are imperative to facilitate simple, rapid surveillance of crude glycan preparations that indicate the presence and abundance of distinct structures.

Collectively, the challenges facing their characterization have limited the understanding of the glycans present in various ecological niches, referred to as the glycome, and has lagged behind the understanding of the abundance and dynamics of the organisms occupying those niches. This is exemplified by the mammalian intestinal environment, where the gut microbiome and its transcriptome have been extensively examined by NGS methodologies that take advantage of the linear, uniform compositions of nucleic acids. Thus, the microbial populations and their transcriptional fluctuations have been well-defined, revealing incredible inter-individual compositional variation and uncovering their contributions to human health and disease. This work has revealed how the intestinal glycome influences gut microbial fitness and metabolism, thereby driving the microbiome composition and the synthesis of clinically important metabolites. Thus, the intestinal glycome is an untapped reservoir of molecules that could be employed to modulate gut microbial abundance and metabolism for clinically beneficial outcomes. New glycomic approaches have been developed to interface with existing technologies to detect glycans that serve as gut microbial signals and growth substrates, and subsequently isolate relevant molecules from biologically-derived mixtures for downstream structural characterization using conventional approaches. Bacterial taxa that dominate in the intestinal environment have evolved specialized glycan detection and utilization machinery to identify available substrates that can serve as nutrient sources and respond by producing enzymes that facilitate substrate sequestration, importation, and consumption. Many of these bacteria employ a panoply of different glycan sensor proteins that recognize structurally distinct glycans and facilitate transcription activation of the appropriate enzymatic repertoire to efficiently utilize target glycans. The glycan detection machinery possessed by gut bacteria has been harnessed to indicate the presence and abundance of distinct glycans in vitro using a new transcriptional reporter system. Furthermore, we implemented this reporter in a high- throughput capacity to define the glycome of a biologically derived heterogenous glycan mixture through the prism of gut microbes. Moreover, this reporter-mediated detection is utilized to facilitate isolation of target glycans of interest using affinity purification strategies for downstream structural and functional characterization. Finally, it is demonstrated that these microbial responses can be readily quantified to estimate the relative abundance of recovered glycans similarly to commercially available products. Therefore, a new suite of glycomics tools have been established that can overcome many of the current challenges facing glycomics and have developed an inexpensive, readily distributable glycan surveillance platform to examine crude glycan preparations in a high-throughput manner and affinity purification tools to isolate target molecules for downstream characterization.

Moreover, because the glycans found in the intestinal environment are derived from a myriad of environmental sources including plant and animal tissue, the tools that result from this example extend far beyond the gut microbiota to enhance existing glycomics workflows. Finally, genetic tuning can modulate the responsiveness of these systems to narrow glycan structural specificity and extend the limits of detection indicating their modularity.

Results

Bacteroides PUL sensors elicit dose-dependent transcriptional responses to cognate glycan ligands. Species of the bacterial genus, Bacteroides, are predominantly found in the mammalian intestinal tract and encode genetically linked, coordinately regulated gene clusters called polysaccharide utilization loci (PULs). Most Bacteroides species encode many different PULs, each conferring the ability to consume one or a small subset of structurally distinct glycans. PULs encode genes that facilitate the sequestration, importation, depolymerization, and detection of a target glycan (FIG. 1A). PUL transcripts are typically expressed constitutively at low levels until the target glycan is encountered in the environment, at which time PUL transcripts are rapidly and dramatically increased (FIG. IB). These increases in PUL transcription are mediated by unique sensor proteins that bind mono-, di-, or oligo-saccharide signatures derived from the target glycan in the periplasm directly facilitating transcription of PUL genes in the cytoplasm, an activity necessary for efficient utilization of the target glycan (FIG. 1C). For example, introduction of the glycosaminoglycan, chondroitin sulfate (CS), results in a 92-fold increase in the transcript level of the Bacteroides thelaiolaomicron (Bl) CS-specific PUL, which requires the CS sensor gene, BT3334, because a mutant deficiency for the CS-sensor is unable to increase CS-PUL expression (FIG. ID) and is unable to grow on CS as a sole carbon source (FIG. 1C). Interestingly, CS-dependent changes in PUL gene transcription are dose-dependent, such that adding increasing amounts of CS to the media results in corresponding increases in PUL-encoded susC (BT3332) transcript levels (FIG. IE). Similarly, introduction of the common dietary fiber, levan, increases transcription of the corresponding susC gene, BT1763, 530-fold, which requires the levan sensor, BT1754 (FIG. IF) and is necessary for levan utilization (Fig. 1G) (35). Increasing amounts of levan elicit concentration-dependent increases in the corresponding susC transcript levels (FIG. 1H), However, these responses are kinetic, and require sampling at discreet times following glycan addition (FIGS. 1H and II).

Bacteroides PUL reporters indicate the presence of target glycans. The data show that the rapid, dramatic, and predictable increases in PUL transcript levels can indicate the relative abundance of PUL-specific glycans and therefore be harnessed as glycan biosensors. However, measuring PUL transcription by qPCR or high-throughput sequencing is cumbersome, expensive, and requires kinetic sampling (FIG. IE, 1H, and II). Thus, the first transcriptional reporter plasmid (named \)Bolux. for Bacteroides-optimized lux) was developed that can quantify gene transcription during anaerobic growth across Bacteroides species. Because this reporter encodes a complete bacterial luciferase cassette, measuring activity is achieved kinetically, without the addition of exogenous reagents, and exhibits extremely low background as Bacteroides species do not natively produce bioluminescence. Reporter plasmids harboring the region upstream of the constitutively active rpoD gene, exhibited dramatically increased reporter activity compared to the promoter-less pBolux plasmid in Bt (FIG. 15A), Bacteroides ovatus (Bo; FIG. 15B), Bacteroides fragilis (Bf; FIG. 15C), or Bacteroides caccae (Be, FIG. 15D) without an obvious fitness cost as strains harboring these constructs grew identically to those harboring a promoter-less control plasmid (FIG. 15A-15D, respectively).

Bt PUL-promoters were cloned preceding the plasmid-encoded lux cassette and measured reporter activity in the presence or absence of known target glycans. For example, a strain harboring a reporter plasmid that included the CS-PUL promoter (i.e., region upstream of the corresponding susC gene, BT3332) resulted in a dramatically increased reporter activity following the introduction of CS compared to a control strain harboring the promoter-less plasmid (FIG. 16A The PUL-encoded activities (depicted in FIG. 1A) were responsible for these CS -induced reporter activity increases by culturing the mutants unable to grow in CS (FIG. 1C) in galactose prior to the introduction of a mixture of CS and galactose. The reporter activity was subsequently measured over 18 hours (FIG. 16B) and the area under each curve (AUC) was calculated for simple statistical comparisons (FIG. 16C). For example, CS-induced reporter activity requires the PUL-encoded outer membrane transporter SusC, because a mutant lacking the corresponding gene, BT3332, was unable to transport CS into the bacterial periplasm and exhibits dramatically reduced activity compared to wild-type Bt in the presence of CS (FIGS. 16B and 16C). Reporter activation requires the depolymerization of CS into disaccharides that bind the sensor in the bacterial periplasm (FIG. 1 A) because a mutant deficient in 3 PUL-encoded CS-specific lyases (Alyase) is unable to increase reporter activity in response to CS (FIG. 16B and 16C) but this can be overcome by the addition of pre-digested chondroitin disaccharides (FIG. 16D). In all cases, increased reporter activity requires the corresponding PUL- sensor as a mutant lacking the CS-sensor gene, BT3334, exhibited no increase in the presence of either chondroitin sulfate (FIGS. 16B and 16C) or chondroitin disaccharide (FIG. 16D). The reduced reporter activity observed in all of these mutants was due to the inability to activate the CS-sensor rather than a consequence of impairing CS utilization because a mutant lacking the CS-specific glucuronyl hydrolase, BT3348, which is necessary to hydrolyze the CS-sensor ligand and grow in CS (FIG. 1C), elicits reporter activation similar to wild-type Bt (FIG. 16B and 16C). Thus, these engineered strains can specifically indicate the presence of target glycans in the environment by defined mechanisms possessed by PUL-encoded gene products.

Species-specific responses distinguish between compositionally identical, structurally distinct glycans. Differentiating between compositionally identical glycans requires specialized LC/MS protocols due to the generation of molecules with identical masses following derivatization and ionization. However, different Bacteroides species encode similar PULs that confer distinct substrate specificities for compositionally identical yet structurally distinct glycans. For example, the Btfructan PUL facilitates consumption of the p2,6-linked polyfructan, levan (FIG. 17A), but not the P2,l-linked polyfructan, inulin (FIG. 17B). Conversely, Bo encodes a highly similar PUL to the one in Bt but facilitates utilization of inulin (FIG. 17B) and is unable to mediate the consumption of levan (FIG. I7A). Bt and Bo exhibit disparate fructan specificities despite both PULs encoding nearly identical sensor proteins that bind the monosaccharide fructose, which is the sole degradation product of both inulin and levan, in-tum activating PUL transcription in the cytoplasm and facilitating growth (FIG. 17C). PUL-reporter plasmids were generated in Bt and Bo containing regions upstream of their respective fructan PUL susC genes and determined that both Bt and Bo reporter strains exhibit similar sensor-dependent reporter activity increases when supplied fructose (FIG. 17D). As expected, the Bt fructan PUL-reporter exhibited 12-fold increased activity in levan (FIG. 17E) but only 2-fold increased activity in response to inulin (FIG. 17F). Conversely, the Bo PUL-reporter exhibited a 9.3- fold increase in reporter activity following the introduction of inulin (FIG. 17F) but no increase in levan (FIG. 17E). These results demonstrate that PULs exhibit substrate specificity even when the PUL-sensor responds to identical ligands, indicating that growth substrate specificity can be mediated by additional-PUL encoded activities. Finally, these results highlight that these reporters can detect very low levels of substrates because statistically significant differences between wild-type and sensor-deficient Bt strains were detected (FIG. 17F), indicating that Bt can liberate monomeric fructose from inulin at comparatively low rates insufficient for growth (FIG. 17B). Thus, by uncoupling PUL signaling from growth, microbial responses to glycans can be examined independently of substrate utilization, facilitating detection of structurally distinct but compositionally identical glycans.

Bacteroides PUL reporters elicit linear dose-dependent responses to target glycans. PUL- sensors elicit dose-dependent increases in PUL transcript levels (FIGS. IE and 1G) indicating that these responses could be used to quantify glycan abundance relative to a standard curve. Accordingly, PUL-reporters also exhibit dose-dependent responses to target glycans over a linear range greater than three orders of magnitude (FIGS. 18A and 18B). Quantifying the AUC following 18 hours after the introduction of two-fold serial dilutions of either CS or levan revealed a linear response from a wild-type Bl strain harboring the CS-reporter supplied between 0.4 to 0.0004% (4.0 to 0.004 mg/mL) CS but no response from any amount of levan (FIG. 18C). Conversely, strains harboring the levan- reporter plasmid exhibited similar linear dose-dependent responses between two-fold serial dilutions of levan but not CS (FIG. 18D). Furthermore, both the CS and levan-reporters retained linear responsiveness even when 0.2% of the other glycan was present (FIGS. 18E and 18F). These data indicate that the dose-dependent responses of PUL-reporters could be used as standard curves to quantify target glycan abundances present in complex heterogenous mixtures.

Mutations can narrow glycan specificity and enhance reporter sensitivity. PUL-sensor mediated transcription activation requires enzymatic conversion of target glycans into mono-, di-, or oligosaccharide signatures mediated by PUL-encoded polysaccharide lyases or glycosyl hydrolases that cleave the glycosidic bonds tethering distinct repeating disaccharides. The CS-PUL encodes 3- discreet CS-specific lyases that are collectively required for growth on CS (FIG. 1C) and increased PUL-reporter activity in response to CS (FIGS. 16B and 16C). These lyases also depolymerize a similar glycosaminoglycan, hyaluronan (HA), into disaccharides detected by the same PUL-sensor. However, HA depolymerization is also mediated by an additional unknown enzymatic activity that is unable to act on CS and therefore, a lyase-deficient mutant can grow on HA (FIG. 19A) but not CS (FIG. 1C). Consequently, a lyase-deficient mutant is no longer able to elicit dose-dependent responses in CS compared to wild-type Bt (FIG. 19B) but retains this ability in HA (FIG. 19C). These data demonstrate how directed mutations can further enhance glycan detection specificity by differentiating between two structurally similar, yet compositionally distinct glycans, which cannot be achieved using commercially available kits.

Following activation by binding glycan-derived ligands, PUL-sensors are deactivated and returned to steady state by the processive removal of their cognate glycan-derived ligands (FIG. 1 A). Deactivation of the CS -reporter requires hydrolysis of glycan-derived disaccharides into their monosaccharide components by a glucuronyl hydrolase (gh), encoded by BT3348 in Bt. Thus, a gh- deficient mutant accumulates CS-derived disaccharides, prevents PUL-sensor deactivation, which promotes increased PUL transcription relative to wild-type Bt. Consequently, it was determined that the g/j-del'icient strain exhibited significant CS-reporter activity increases when supplied CS amounts as low as 0.000006% (60 ng/mL), which were undetectable by wild-type Bt harboring the same CS- reporter construct (FIG. 19D). Therefore, by successfully uncoupling PUL transcription from bacterial growth, mutations that prevent PUL-sensor deactivation can be employed, extending the lower limits of detection and quantification well below those of commercially available solutions.

Conclusions

PUL transcription activation is directly facilitated by PUL sensor proteins that detect mono-, di-, or oligosaccharide signatures derived from their corresponding target glycan. These glycan signatures are generated by progressive and compartmentalized enzymatic activities that facilitate glycan sequestration outside of the cell, followed by target glycan importation and subsequent depolymerization (FIG. 1A). PUL reporters can capture these events and elicit dose-dependent luminescence increases following the introduction of target glycans into cultured Bacteroides reporter strains. These properties are harnessed to reveal the presence and abundance of previously undetectable glycans and dramatically enhance understanding of the interactions between gut microbes and their mammalian hosts.

A high-throughput glycan detection platform.

A bioluminescent reporter is demonstrated to be coupled to PUL promoters from different Bacteroides species (FIG. 15) to distinguish between compositionally identical, but structurally distinct glycans (FIG. 17). It has been contemplated to develop libraries of reporter strains representing every predicted PUL across 4 different species. It is further contemplated to develop a high-throughput detection platform that harnesses arrayed reporter strains to simply, accurately, and inexpensively detect the abundance of all glycans that are recognized by these PULs. Thus, leading to the building of a comprehensive resource that can aid in identifying the presence of distinct glycans from a myriad of biological sources including plant, animal, and microbial products. Reporter plasmids are generated representing every annotated PUL present in B. lhelaiolaomicron (strain VPI-5482; 96 predicted PULs), B. ovalus (ATCC 8483; 107 predicted PULs), B. fragilis (NCTC 9343, 55 predicted PULs), and B. caccae (ATCC 43185; 60 predicted PULs). These species were chosen because they encode PUL subsets with distinct glycan specificities and biological niches. PULs are identified using computational methods that examine the genomic sequence for susCD gene pairs, which are the minimum PUL components, and various arrangements of genes whose products encode glycan binding, hydrolysis, and lyase activities. PULs are often polycistronic transcripts initiated from one or a small set of promoters upstream of the corresponding susC gene. Publicly available RNAseq datasets from each organism are used to identify putative transcription start sites to guide putative PUL promoter selection prior to cloning. Plasmids are introduced into their corresponding Bacteroides species, and the resulting strains validated by glycan- dependent and -independent approaches described below. Validated reporter strains are cultured in rich media, combined with glycerol to a final concentration of 10%, and all strains will be dispensed into 50x 96-well plates for storage at -80°C using a Tecan Evo Freedom 200 instrument located at the Penn State Cancer Institute.

Glycan-dependent validation is achieved when significant increases in bioluminescence are detected following the addition of glycan mixtures to strains harboring putative reporter plasmids that are greater than identical cultures supplied galactose alone. Additionally, glycan-responsive reporters are introduced into mutant strains lacking corresponding PUL-sensors when available and examined for reduced PUL-reporter activation in response to identical glycan mixtures (as demonstrated for CS in FIG. 16). Glycan-independent validation are achieved using strains engineered to express constitutively active forms of the putative PUL regulator that can increase corresponding PUL transcription in the absence of target glycan, an approach we previously developed. Candidate reporter plasmids containing putative PUL promoters are introduced into strains engineered to express the corresponding constitutively active PUL regulator (cartooned in FIG. 20A). To demonstrate the feasibility of this approach, a draft array was generated including reporter constructs representing 95 predicted and experimentally established Bt PULs and validated all possible reporters using glycan- dependent or -independent approaches. Over half of the initially constructed reporters were experimentally validated (FIGS. 20B-20E). When PUL-reporters exhibited no glycan-responsive increases in engineered strains, alternative plasmids were constructed using promoter regions from upstream genes, which generated PUL-regulator-dependent bioluminescence increases (FIGS. 20E- 201). Furthermore, to determine whether constitutively active PUL-regulators elicited specific activation of target PULs, RNAseq analysis was performed comparing genome-wide transcription in strains harboring an empty vector or over-expressing BT3172*, which increased luminescence from P-BT3174 (FIG. 20E) and is a constitutively-active form of a PUL regulator important for Bt intestinal colonization and putatively senses an unknown host-derived glycan. A strain expressing BT3172* exhibited between 8 and 120-fold increases in the corresponding PUL genes, BT3171-BT3180, but no other PUL transcripts were differentially expressed, indicating specific regulation of target genes by these engineered regulatory proteins. Therefore, this strategy can be used to specifically validate each reporter in lieu of known sources containing target glycans and additional PUL reporter constructs in Bo, Bf, and Be can be validated in this manner or by measuring reporter activity increases in conditions known to activate the corresponding PUL or using glycan-dependent approaches described above.

To examine the ability of individual reporters to elicit target-glycan dependent increases in reporter activity, the activity of every Bt reporter is measured following the introduction of commercially purified glycans derived from plants: arabinogalactan and levan; microbes: dextran and HA, or animal tissue: HS and CS. Because genome wide Bt transcriptional responses to these glycans have been previously characterized, bioluminescence from the corresponding reporter strain is observed to be increased. Accordingly, the addition of each glycan to the array elicited the PUL- reporter activity increases. For example, arabinogalactan increased activity from P-BT0268 315-fold (FIG. 21 A), levan increased activity from P-BT1763 39-fold FIG. 21B), dextran increased P-BT3090 activity 130-fold (FIG. 21C), HA increased P-BT3332 activity 50-fold (FIG. 21D), HS increased P- BT4662 activity 50-fold (FIG. 21E) and CS increased P-BT3332 activity 42-fold (FIG. 21F). While the addition of arabinogalactan, levan, dextran, HA, and HS elicited a dominant response from the corresponding PUL reporters, CS elicited 6 additional significant responses including 3 that increased greater than 10-fold compared to the galactose treated control, P-BT1632, P-BT2818, and P-BT4662 (FIG. 21F). It was determined that the activation of these reporters was due to the presence of contaminating glycans present in the CS mixture (which is estimated at -70% purity by Sigma) rather than the results of promiscuous, non-specific activation by CS or the CS-sensor in the following ways: 1.) A mutant lacking the CS-sensor reduced the activation of P-BT3332 45-fold but only reduced P- BT1632 reporter 1.3-fold (FIG. 22A), 2.) The addition of purified CS disaccharide, which serves as the ligand for the CS-sensor, only activates P-BT3332 (FIG. 22B), and 3.) The overexpression of a constitutively active form of BT3334 (BT3334*) increased luminescence from P-BT3332 greater than 100-fold while the other examined reporters were less that 2-fold different (FIG. 22C). Thus, the activation of additional reporters by CS is due to the presence of additional glycans targeted by distinct Bt PULs indicating that the array can effectively detect distinct molecules present in biologically derived glycan mixtures. Consistent with this notion, 15 PUL-reporters were significantly increased following addition of a biologically-derived glycan mixture, porcine mucosal O-glycans, (PMOGs), consistent with the characterized Bt transcriptional responses. Thus, this approach can detect multiple PUL-specific ligands in biological mixtures.

Validated, arrayed reporter libraries are generated representing a total of 318 predicted PULs across 4 distinct Bacteroides species. Including all PUL-reporters in a single array requires increased array capacity such that all strains can be cultured in 384- well plates prior to the introduction of samples, which is done in 1536-well plates to accommodate 3 distinct samples per reporter strain and a negative control condition (galactose). Increasing capacity using 1536-well plates enables glycan screening to be achieved with reporters from all species simultaneously, thereby increasing the likelihood of detecting molecules of interest with smaller volumes of material than those required for screening in 384-well plates. Arrays containing all strains are manufactured by a Tecan Freedom Evo 200 that can repeatedly dispense into 384-well plates prior to long-term cryogenic storage. Cultured strains are anaerobically transferred using a semi- automated, repeating pipetting instrument. Reporter activity is measured in a Tecan Spark instrument housed in a Coy flexible anaerobic chamber over 18 hours.

An arrayed library of PUL reporters are generated that can detect small quantities of distinct glycan ligands present in crude biological extracts. Based on results with microbial and mammalian glycans, each strain exhibits accurate and sensitive reporter activity increases in response to specific glycan ligands. This offers users a rapid and inexpensive survey of known and unknown glycans as observed though the lens of the gut microbiota. Finally, tailored arrays, containing PUL-reporter subsets of interest in wild-type or mutant backgrounds that exhibit altered target glycan specificities or sensitivities (FIG. 19) are generated in 96- or 384-well microplates.

A glycan isolation platform.

Determining which PULs are activated via increased reporter activity indicates the presence of a distinct target glycan in samples with unknown glycan content. However, the identity of these molecule(s) may be unknown as only a small fraction of PULs have experimentally determined glycan targets. This is in part due to the challenges facing modern glycomics, which limit structural characterizations of heterogenous glycans mixtures. To overcome these challenges, PUL-reporters are employed as detection agents following glycan fractionation to isolate individual structures from heterogenous mixtures for downstream structural characterization. Moreover, affinity purification approaches are developed using PUL encoded surface glycan binding proteins (SGBPs) to facilitate simple, low-cost isolation of target molecules. Thus, glycans of interest can be identified in a biological sample using the arrays described previously and subsequently isolate individual glycans from these mixtures for downstream compositional and structural analysis. This stepwise approach helps circumvent challenges facing current glycomics strategies which are hindered by a paucity of simple and effective glycan separation strategies.

Bacter aides PULs frequently encode SGBPs that localize the bacterial outer membrane to aid in substrate sequestration prior to importation via the corresponding SusC transporter (FIG. 1A). SGBPs display specificity for target glycan structures and are capable of distinguishing between compositionally identical but structurally distinct glycans. Herein, SGBPs are encoded by PULs with unknown target specificities to isolate the corresponding glycan of interest from heterogenous mixtures (FIG. 21F and 21G) that can activate distinct reporters (FIG. 7A). SGBPs fused to hexahistidine tags are expressed in E. coli strain BL21 (DE3) for purification with nickel-conjugated agarose reagents as previously described. Immobilized recombinant protein are incubated with mixtures containing the putative glycan ligands, extensively washed to remove unbound material, and finally eluted with any bound glycans using free histidine. Histidine is used rather than imidazole because it is non-toxic to all Bacteroides cultures tested to date, allowing elution fractions to be added directly to cultured PUL-reporter strains. Corresponding PUL-reporter activity is measured following the addition of the resulting wash and elution fractions against all reporter strains that were activated by the initial glycan mixture. If elutions elicit increased reporter activity corresponding to target glycans but not non-target glycans, then these fractions are considered enriched for target glycans and analyzed by NMR.

To develop this approach, PULs with experimentally defined target specificities are chosen that encode conserved SGBPs. For example, the CS-PUL encodes BT3330 and the levan-PUL encodes BT1761, each exhibiting previously determined in vitro binding affinities for CS and levan, respectively. Accordingly, purified, recombinant over-expressed BT3330 and BT1761 specifically bind to CS and levan, respectively, using affinity-PAGE (FIGS. 7B-7D). Cell lysates prepared from E. coli strains over-expressing either BT333O or BT1761 were individual combined with an equal mixture of CS and levan, incubated, packed into a column, washed extensively, and eluted with histidine. Eluates from both reactions contained the over-expressed SGBP (FIGS. 7E and 7F) and were supplied to either the CS- or levan reporter strains. Eluates containing BT3330 elicited significant luminescence increases in P-BT3332 but not P-BT1763 containing strains (FIG. 7E), and eluates containing BT1761 increased reporter activity from strains harboring P-BT1763 but not P- BT3332 (FIG. 7F). The elutions were pooled, concentrated, and measured using standard curves from each reporter, which indicated that material recovered from BT333O contained 205.6 pg/mL CS (FIG. 7G) but no detectable levan. Conversely, BT1761 eluates contained 244.4 pg/mL of levan (FIG. 7H) with no detectable CS. These measurements were validated using a total glycosaminoglycans assay kit (Biovision) or fructan assay kit (Megazyme), which indicated we recovered 228.5 pg/mL CS but no levan from BT333O and 271.5 pg/mL levan but no CS from BT1761 (FIGS. 71 and 7J, respectively). Finally, reporters that were differentially activated in response to CS (FIG. 2 IF) were no longer significantly increased following the addition of material co-purifying with BT333O (FIG. 7K). Collectively, the viability of this approach is demonstrated built upon an incredibly sensitive and specific glycan-detection platform with the ability to rapidly expand the identity of PUL-target glycans.

This method is optimized by identifying stepwise changes in pH, salt concentrations, incubations times, and flow rates that improve target glycan retention. Finally, common parameters that can facilitate target glycan enrichment from SGBPs from the CS, levan, or heparan PULs are applied to PULs with unknown target glycans present in PMOGs, a more complex, commercially available biologically-derived glycan mixture (FIG. 21G) that can be used as a source of PUL-target glycans as demonstrated with BT333O (FIG. 7L). First, a glycan isolation assay is implemented with the BT4735-PUL reporter using the validated BT4756-promoted PUL-reporter plasmid (FIG. 201). This PUL contains the putative SGBP encoded by BT4133, which will be over-expressed, purified and incubated with a 0.1% PMOG solution. The unbound material (flow-through), wash, and elution fractions are examined using all PUL-reporters significantly activated by PMOGs (FIG. 21 G). Elution fractions that elicit BT4135 PUL-reporter activation, but not non-target reporters are considered enriched for the V-BT4136 activating glycan and are structurally characterized using compositional and NMR analyses of recovered material.

Concomitant with the development of affinity approaches, sequential UHPLC protocols are implemented to fractionate PMOGs by anion exchange chromatography using DEAE over a step- wise sodium chloride gradient as previously described and subsequently fractionated using reverse phase C- 18 as previously described. The resulting material is dialyzed and dried by rotary evaporator, resuspended in water, and supplied to cultured strains harboring selected PUL-reporters that exhibited increased activation upon screening with a complete array.

Affinity purification protocols are developed to simply isolate glycans that activate distinct PUL reporters using SGBPs cloned, over-expressed, and purified from the corresponding PUL. Successful implementation of this approach is demonstrated using SGBPs with known binding affinities (FIG. 7) and have contemplated implementing similar approaches with unknown glycans present in PMOGs. Therefore, a universal strategy is developed to isolate target glycans of interest from a heterogenous mixture that increases activity from a corresponding PUL-reporter.

A tunable glycan measurement system using PUL reporters. Measuring the abundance of individual glycans requires expensive, non-renewable reagents that can fail to distinguish between compositionally similar molecules. It has been demonstrated that PUL-sensors elicit dose-dependent increases in target gene transcription in response to their cognate glycan ligands (FIGS. IE and 1G), and this is recapitulated with the corresponding PUL-reporters (FIGS. 18A-18D) even in the presence of additional non-target glycans (FIG. 18E) across wide linear ranges. Therefore, glycan measurement methods are developed using standard curves derived from the AUC of PUL reporter dose-dependent responses. Additionally, mutant strains are identified and characterized that enhance PUL-reporter sensitivity and narrow target glycan specificity by taking advantage of these system’s abilities to distinguish between structurally distinct glycans (FIG. 19). This example generates a simple and inexpensive approach to measuring unknown target glycan quantities present in heterogenous mixtures.

Dose-dependent reporter responses to known glycan ligands are examined to define the limits of detection and quantification for each system. A standard set of measurement conditions are optimized that generate consistent results across Bt PUL-reporters specific for 1.) CS/HA, 2.) levan, 3.) arabinan, 4.) a-mannan and 5.) heparan sulfate by measuring the corresponding reporter activity against a standard curve of purified ligand. The supporting monosaccharide identity and abundance, supplement bacterial cultures with additional macronutrients, and determine time for computing AUC is varied to identify conditions that generate the optimal linear range across all 5 reporters. Subsequently, PUL-reporter measurements are compared against lab-generated or biologically derived glycan mixtures using each of the 5 reporters and validate these measurements using commercially available kits designed to measure the corresponding glycans (CS/HA and heparan sulfate: Biovision; levan and arabinan: Megazyme; mannan: Bio-rad). In many cases, measurements may be lower than the limits of detection or quantification of commercial kits.

It has been demonstrated that a mutation preventing CS-specific PUL-sensor deactivation generates detectable PUL-reporter responses at CS levels below the limits of detection in wild-type Bt (FIG. 19D). This indicates that blocking the consumption of the PUL-ligand leads to PUL-sensor hyperactivation and increasing the sensitivity of PUL-reporters to their target glycan. The linear range for the CS PUL-reporter had dramatically shifted to linear responsiveness to CS in a Bt mutant defective for PUL-deactivation, reaching as low as 0.000006% CS compared to 0.0001% observed in wild-type Bt (FIG. 19A). Additional mutants are generated that block PUL-reporters with known glycan specificities and determine 1.) whether the corresponding PUL-reporters exhibit expanded limits of detection, 2.) the increased sensitivity elicits dose-dependent responses, and 3.) these responses can accurately measure unknown target glycan concentrations present in biological mixtures. This is determined using PULs with known target glycans that have commercially available kits to validate measurements: CS using the ST334<S-deficient mutant, levan using a mutant defective for the inner membrane fructose transporter encoded by BT1758, and arabinan using a mutant lacking the inner membrane oligo-arabinose transporter, araP.

PUL-reporters produce glycan measurements resembling those generated using commercially available kits or MS/MS. Furthermore, genetically disrupting PUL-sensor deactivation increases the sensitivity of additional PUL-reporters, reducing their limits of detection and quantification without reducing specificity. Finally, disabling specific enzymatic activities “blinds” Bacteroides PUL- sensors to glycan subsets enabling narrower detection specificities relative to wild-type strains while retaining wide linear ranges of quantification.

Outlook

The fruits of this example are the development and production of a high-throughput tool, that can rapidly surveille samples for a diverse collection of glycans derived from plant, animal, or microbial sources. The detection of an activated PUL-reporter indicates the presence of a known or unknown glycan ligand corresponding to a microbial signal or growth substrate that corresponds to a locus containing glycan binding protein(s). Therefore, activation of PUL-reporters that respond to unknown glycan structures reveals gene products that can aid in glycan isolation using the methods described herein, for downstream functional and structural analyses using existing methodologies. Finally, these PUL-reporters can perform as robust glycan measurement tools that circumvent limitations associated with measuring some glycans using mass spec, NMR, or lectins and that by harnessing defined PUL mutations, further enhancing target sensitivity and refining specificity. The products herein represent an end-to-end pipeline from crudely extracted cellular material to isolating molecules of interest that differentially activated microbial gene expression.

Table 4. GT1893, GT1934, and GT3173 Primers and Promoters

Table 5A. Bt Reporter Array _Primers

Table 5B. Bt Reporter Array_Promoters

It will be apparent to those skilled in the art that various modifications and variations can be made in the present disclosure without departing from the scope or spirit of the invention. Other embodiments of the disclosure will be apparent to those skilled in the art from consideration of the specification and practice of the methods disclosed herein. It is intended that the specification and examples be considered as exemplary only, with a true scope and spirit of the invention being indicated by the following claims.