PREVOTELLA COPRI FORMULATIONS AND METHODS OF USE - WASHINGTON UNIVERSITY ST LOUIS

Title:

PREVOTELLA COPRI FORMULATIONS AND METHODS OF USE

Document Type and Number:

WIPO Patent Application WO/2023/201092

Kind Code:

Abstract:

Provided are probiotic compositions and methods of using such compositions for treatment of a spectrum of diseases like malnutrition. The probiotic compositions provided herein have Prevotella copri or engineered strains with genes from Prevotella copri.

Inventors:

GORDON JEFFREY (US)
WANG YI (US)
CHANG HAO-WEI (US)
BARRATT MICHAEL (US)
WEBBER DANIEL (US)
HIBBERD MATTHEW (US)
AHMED TAHMEED (BD)

Application Number:

PCT/US2023/018738

Publication Date:

October 19, 2023

Filing Date:

April 14, 2023

Export Citation:

Click for automatic bibliography generation Help

Assignee:

WASHINGTON UNIVERSITY ST LOUIS (US)
INTERNATIONAL CENTRE FOR DIARRHOEAL DISEASE RES BANGLADESH (BD)

International Classes:

A61K35/741; A23L33/135; A61K31/175; A61P1/00

Domestic Patent References:

WO2021203081A1	2021-10-07
WO2020252086A1	2020-12-17

Foreign References:

US20190151376A1	2019-05-23
US20170216375A1	2017-08-03

Other References:

LINARES-PASTÉN JAVIER A, HERO JOHAN SEBASTIAN, PISA JOSÉ HORACIO, TEIXEIRA CRISTINA, NYMAN MARGARETA, ADLERCREUTZ PATRICK, MARTINE: "Novel xylan degrading enzymes from polysaccharide utilizing loci of Prevotella copri DSM18205", GLYCOBIOLOGY, 15 June 2021 (2021-06-15), XP093102586, DOI: 10.1093/glycob/cwab056

Attorney, Agent or Firm:

RILEY-VARGAS, Rebecca C. et al. (US)

Download PDF:

View/Download PDF PDF Help

Claims:

CLAIMS

What is claimed is:

1 . A composition comprising a probiotic strain and at least a carrier, wherein the probiotic bacterial strain is operable to enhance utilization of xylooligosaccharides, fructooligosaccharides, oligogalacturonate, galactooligosaccharides, galactose, glucuronate, galacturonate and arabinooligosaccharides, or combinations thereof, when administered to a subject in need thereof compared to a subject lacking the probiotic strain.

2. The composition of claim 1 , wherein the probiotic bacterial strain comprises a genome sequence at least about 90% identical to any one of the sequences deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2.

3. A composition comprising a probiotic strain and a carrier, wherein the probiotic bacterial strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of a polynucleotide sequence encoding a protein from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of a genome sequence deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2.

4. The composition of any one of claims 1 -3, wherein the probiotic bacterial strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of polynucleotide sequences from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

5. The composition of any one of claims 1-4, wherein the probiotic bacterial strain is P.copri.

6. The composition of any one of claims 1-4 wherein the probiotic bacterial strain has a genome at least about 90% identical to the genome of any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

7. The composition of any one of claims 1-6, wherein the probiotic bacterial strain is any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

8. The composition of claim 1-7, wherein the composition further comprises a microbiome- directed therapeutic food (MDF).

9. The composition of claim 8, wherein the MDF comprises chickpea flour, peanut flour, soy flour, green banana, sugar, at least one oil, optionally an amino acid mix, a micronutrient premix, wherein the micronutrient premix provides at least 60% of the recommended daily allowance of vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc for a child aged 6-24 months.

10. The composition of claim 9, wherein the MDF contains no milk, powdered milk or milk product.

11. The composition of any one of claims 9 or 10, wherein the MDF has about 400 to about 600 kcal per 100 g of the composition, about 20 g to about 36 g of fat per 100 g of the composition, about 11 g to about 16 g of protein per 100 g of the composition, a protein energy ratio (PER) of about 8% to about 12%, and a fat energy ratio (FER) of about 45% to about 60%.

12. The composition of claim 1-7, wherein the MDF is selected from MDCF-1, MDCF-2, MDCF- 3, MDCF-2SS, MDSF, or MD-RUTF.

13. The composition of any one of claims 1-12, further comprising an additional probiotic bacterial strain.

14. The composition of claim 13, wherein the additional probiotic bacterial strain is a strain of Bifidobacterium longum subspecies infantis.

15. The composition of claim 14, wherein the additional probiotic bacterial strain is Bifidobacterium longum subspecies infantis Bg_2D9.

16. The composition of claim 15, wherein the additional probiotic bacterial strain is Bifidobacterium longum subsp. infantis with NRRL deposit # NRRL B-68253.

17. The composition of claim 1 , wherein the subject is an undernourished child 0-5 years of age.

18. The composition of claim 17, wherein the child is on a limited breast milk diet.

19. The composition of claim 17, wherein the child is on a no breast milk diet.

20. The composition of claim 1 , wherein the subject is a prospective mother.

21. The composition of claim 1 , wherein the composition is administered before, during or after pregnancy and combinations thereof.

22. The composition of claim 1 and 17-20, wherein the subject is additionally administered a second composition comprising an MDF, at least one additional probiotic bacterial strain or both.

23. The composition of claim 22, wherein the second composition is administered before, simultaneously or after the administration of the composition.

24. The composition of any one of claims 1- or 8-22, wherein the probiotic bacterial strain is an engineered probiotic bacterial strain.

25. A composition of claim 24, wherein the engineered probiotic bacterial strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of a polynucleotide sequence encoding a protein from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of a genome sequence deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 ,

ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2.

26. The composition of claim 24, wherein the engineered probiotic bacterial strain comprises a polynucleotide sequence at least about 60% identical to a polynucleotide sequence in any one of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz, within its genome or as an extrachromosomal element.

27. The composition of any one of claims 1-26, wherein the probiotic bacterial strain is present in an amount of more than 10² cfu per gram of the composition.

28. The composition claim 27, wherein the composition comprises at least a viable cell of the probiotic bacterial strain.

29. The composition of any one of claims 1-28, wherein the composition is formulated for oral administration.

30. The composition of any one of claims 1-28, wherein the composition is formulated for orogastric or nasogastric administration.

31. The composition of any one of claims 1-30, wherein the composition is in the form of a powder, a capsule, a tablet, a sachet, a liquid, an emulsion, or a suspension.

32. The composition of any one of claims 1-31, wherein the composition comprises an ingestible carrier.

33. The composition of claim 32, wherein the ingestible carrier comprises a milk component.

34. The composition of claim 33, wherein the ingestible carrier comprises baby formula or baby food.

35. The composition of claim 34, wherein the ingestible carrier comprises F-75 or F-100 formulas.

36. The composition of claim 32, wherein the ingestible carrier comprises a beverage.

37. The composition of any one of claims 1-34, further comprising one or more prebiotic, adjuvant, stabilizer, biological compound, dietary supplement, drug or combination thereof.

38. The composition of any one of claims 1-35, wherein administering the composition modifies the gut microbiota of a subject in need thereof.

39. An isolated bacterial strain comprising a genome sequence at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% identical to the genome sequence of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

40. The isolated bacterial strain of claim 39 comprising a genome sequence more that 99% identical to the genome sequence of any one of the P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

41. A method of treatment, the method comprising administering to a subject in need thereof, a therapeutically effective quantity of a composition of any one of claims 1-16 or 24-40.

42. The method of treatment of claim 41 , wherein the subject is a child 0-5 years of age.

43. The method of treatment of claim 42, wherein the subject is exhibiting symptoms of or diagnosed with undernutrition, Moderate Acute Malnutrition(MAM), Severe Acute Malnutrition (SAM), or stunting.

44. The method of treatment of claim 42, wherein the subject is an infant with a limited to no breastmilk diet.

45. The method of treatment of any one of claims 41-44, wherein the subject is exhibiting symptoms of or diagnosed with necrotizing enterocolitis, nosocomial infections, or enteric inflammation.

46. The method of treatment of any one of claims 41-45, wherein the child is on a limited breast milk diet.

47. The method of treatment of any one of claims 41-45, wherein the child is on a no breast milk diet.

48. The method of treatment of any one claim 42-47, wherein the subject is administered a second composition comprising an MDF, at least one additional probiotic bacterial strain or both.

49. The method of treatment of claim 48, wherein the second composition is administered before, simultaneously or after the administration of the composition.

50. The method of treatment of claim 48, wherein the MDF comprises chickpea flour, peanut flour, soy flour, green banana, sugar, at least one oil, optionally an amino acid mix, a micronutrient premix, wherein the micronutrient premix provides at least 60% of the recommended daily allowance of vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc for a child aged 6-24 months.

51. The method of treatment of any one of claims 48-50, wherein the MDF contains no milk, powdered milk or milk product.

52. The method of treatment of any one of claims 48-51 , wherein the MDF has about 400 to about 600 kcal per 100 g of the composition, about 20 g to about 36 g of fat per 100 g of the composition, about 11 g to about 16 g of protein per 100 g of the composition, a protein energy ratio (PER) of about 8% to about 12%, and a fat energy ratio (FER) of about 45% to about 60%.

53. The method of treatment of any one of claims 48-51 , wherein the MDF is selected from MDCF-1, MDCF-2, MDCF-3, MDCF-2SS, MDSF, or MD-RUTF.

54. The method of treatment of claim 46, wherein the additional probiotic bacterial strain is a strain of Bifidobacterium longum subspecies infantis.

55. The method of treatment of any one of claims 48 or 54, wherein the additional probiotic bacterial strain is Bifidobacterium longum subspecies infantis Bg_2D9.

56. Use of the compositions of any one of claims 1-16 or 24-40 for modifying the gut microbiota of a subject in need thereof.

57. Use of the compositions of any one of claims 1-16 or 24-40 for enhancing the utilization of one or more of xylooligosaccharides, fructooligosaccharides, oligogalacturonate, galactooligosaccharides, galactose, glucuronate, galacturonate and arabinooligosaccharides, or combinations thereof.

58. A synbiotic formulation comprising at least a probiotic bacterial strain comprising a polynucleotide sequence at least about 90% identical to any one of the sequences deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2 and an MDF.

59. The synbiotic formulation of claim 58, wherein the probiotic bacterial strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of polynucleotide sequences from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

60. The synbiotic formulation of any of claims 58 or 59, wherein the probiotic bacterial strain is P. copri.

61. The synbiotic formulation of any of claims 58-60, wherein the probiotic bacterial strain has a genome at least about 90% identical to the genome of any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

62. The synbiotic formulation of any of claims 58-60, wherein the probiotic bacterial strain is any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

63. The synbiotic formulation of any of claims 58-62, wherein the MDF comprises chickpea flour, peanut flour, soy flour, green banana, sugar, at least one oil, optionally an amino acid mix, a micronutrient premix, wherein the micronutrient premix provides at least 60% of the recommended daily allowance of vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc for a child aged 6-24 months.

64. The synbiotic formulation of any of claims 58-63, wherein the MDF contains no milk, powdered milk or milk product.

65. The synbiotic formulation of any of claims 58-64, wherein the MDF has about 400 to about 600 kcal per 100 g of the composition, about 20 g to about 36 g of fat per 100 g of the composition, about 11 g to about 16 g of protein per 100 g of the composition, a protein energy ratio (PER) of about 8% to about 12%, and a fat energy ratio (FER) of about 45% to about 60%.

66. The synbiotic formulation of any of claims 58-65, wherein the MDF is selected from MDCF- 1 , MDCF-2, MDCF-3, MDCF-2SS, MDSF, or MD-RUTF.

67. The symbiotic formulation of any of claims 58-66, for use in a subject diagnosed with undernutrition, Moderate Acute Malnutrition(MAM), Severe Acute Malnutrition (SAM), or stunting.

68. A food formulation selected from a group consisting of MDCF-1 , MDCF-2, MDCF-3, MDCF- 2SS, MDSF, or MD-RUTF, for treatment of MAM, SAM or stunting.

69. The food formulation of claim 68, wherein the food formulation is administered to augment the benefits of P. copri.

70. The food formulation of claim 69, wherein the P. copri is administered as a composition of any one of any one of claims 1-7 or 25-40.

71. The food formulation of claim 69, wherein the P. copri is not externally administered but exists in the subjects gut microbiome.

Description:

PREVOTELLA COPRI FORMULATIONS AND METHODS OF USE

ACKNOWLEDGEMENT OF GOVERNMENT SUPPORT

[0001] This invention was made with government support under Grant No. DK30292 awarded by the National Institutes of Health (NIH). The government has certain rights in the invention.

CROSS REFERENCE TO RELATED APPLICATIONS

[0002] This application claims the benefit of U.S. Provisional Application No. 63/330,837 filed

April 14, 2022 the disclosure of which are incorporated herein by reference in its entirety for all purposes.

BACKGROUND

[0003] 1. Field

[0004] The current invention relates to the field of treatment of malnutrition using the compositions and methods provided herein.

[0005] 2. Background

[0006] The gut microbiome is a complex ecosystem with diverse microorganisms including bacteria, archaea, viruses, and fungi. More than a 100 trillion microorganisms live within a human body at any given point in time. The gut metagenome carries approximately 150 times more genes than are found in the human genome. The microbiome has a huge impact on health and wellbeing. Mechanisms by which these gut microorganisms impact health are manifold and include enhanced nutrient uptake, appetite signaling, competitive protection against harmful microorganisms, production of antimicrobials, role in development of the intestinal mucosa and immune system of the host, to a list a few. Imbalances in the microbiome are linked to development and progression of major human diseases including gastrointestinal diseases, infectious diseases, liver diseases, gastrointestinal cancers, metabolic diseases, respiratory diseases, mental or psychological diseases, and autoimmune diseases.

[0007] Childhood undernutrition is a vexing, pressing, and in many respects overwhelming global health issue. Undernutrition contributes to more than 40% of deaths worldwide among children under 5 years old. Acute undernutrition affects more than 50 million children and is defined by a low weight-for-height Z (WHZ) score [the number of standard deviations from the median value for a reference, multinational World Health Organization (WHO) cohort of children with healthy growth phenotypes]. Preschool children with severe wasting (WHZ < -3) have a 10- fold higher mortality rate than that of their well-nourished counterparts. In 2014, chronic undernutrition, which manifests as stunting [low height-for-age Z score (HAZ)], affected 159 million children, with almost all living in low-income countries. Despite these categorical distinctions, deficits in ponderal and linear growth frequently coexist and increase the risk that children will experience persistent stunting, defective immune responses, and impaired neurocognitive function into adulthood. Current approaches to treatment have only modest effects in correcting these long-term sequelae, suggesting that certain features of host biology are not being adequately repaired. This has led to the hypothesis that healthy growth is dependent, in part, on normal postnatal development of the gut microbiota and that perturbations in its development are causally related to undernutrition.

[0008] Addressing microbiome imbalances using probiotic formulations is becoming an important part of treatment plans for relevant disease for childhood undernutrition. The microbiome is however not static but evolves with dietary intake, and environmental factors. The microbiota also varies greatly between individuals from different geographical and socioeconomical backgrounds. Therefore, therapies are not a one-size-fits all approach. The effectiveness of any intervention to address microbiome imbalances is contingent on the various factors that impact the microbiome.

[0009] There is therefore a need to understand and tailor probiotic formulations to specific populations and diet contexts.

SUMMARY OF THE INVENTION

[0010] In some aspects, the current disclosure encompasses a composition comprising a probiotic strain and at least a carrier, wherein the probiotic bacterial strain is operable to enhance utilization of xylooligosaccharides, fructooligosaccharides, oligogalacturonate, galactooligosaccharides, galactose, glucuronate, galacturonate and arabinooligosaccharides, or combinations thereof, when administered to a subject in need thereof compared to a subject lacking the probiotic strain. In some aspects, the probiotic bacterial strain comprises a genome sequence at least about 90% identical to any one of the sequences deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2.

[0011] In some aspects, the current disclosure also encompasses a composition comprising a probiotic strain and a carrier, wherein the probiotic bacterial strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of a polynucleotide sequence encoding a protein from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of a genome sequence deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2.

[0012] In some aspects, the probiotic bacterial strain as provided herein comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of polynucleotide sequences from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. In some aspects, the probiotic bacterial strain is P.copri.

[0013] In some aspects, the probiotic bacterial strain as provided herein has a genome at least about 90% identical to the genome of any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. In some aspects, the probiotic bacterial strain is any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

[0014] In some aspects the compositions as disclosed herein may further comprise a microbiome-directed therapeutic food (MDF). In some aspects, the MDF comprises chickpea flour, peanut flour, soy flour, green banana, sugar, at least one oil, optionally an amino acid mix, a micronutrient premix, wherein the micronutrient premix provides at least 60% of the recommended daily allowance of vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc for a child aged 6-24 months. In some aspects, the MDF contains no milk, powdered milk or milk product. In some aspects, the MDF has about 400 to about 600 kcal per 100 g of the composition, about 20 g to about 36 g of fat per 100 g of the composition, about 11 g to about 16 g of protein per 100 g of the composition, a protein energy ratio (PER) of about 8% to about 12%, and a fat energy ratio (FER) of about 45% to about 60%. Non-limiting examples of MDF include MDCF-1 , MDCF-2, MDCF-3, MDCF-2SS, MDSF, or MD-RUTF.

[0015] In some aspects, the compositions may further comprise an additional probiotic bacterial strain. In some aspects, the additional probiotic bacterial strain is a strain of Bifidobacterium longum subspecies infantis. In some aspects, the additional probiotic bacterial strain is Bifidobacterium longum subspecies infantis Bg_2D9. In some aspects, the additional probiotic bacterial strain is Bifidobacterium longum subsp. infantis with NRRL deposit # NRRL B-68253. [0016] In some aspects the compositions as disclosed herein may be administered to a subject, wherein the subject is an undernourished child 0-5 years of age. In some aspects, the subject is a child is on a limited breast milk diet. In some aspects, the child is on a no breast milk diet. In some aspects, the subject may be a prospective mother. In some aspects, the composition may be administered before, during or after pregnancy and combinations thereof. In some aspects, the subject may be additionally administered a second composition comprising an MDF, at least one additional probiotic bacterial strain or both. In some aspects, the second composition is administered before, simultaneously or after the administration of the composition. In some aspects, the probiotic bacterial strain is an engineered probiotic bacterial strain.

[0017] In some aspects, the engineered probiotic bacterial strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of a polynucleotide sequence encoding a protein from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of a genome sequence deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2. In some aspects, the engineered probiotic bacterial strain comprises a polynucleotide sequence at least about 60% identical to a polynucleotide sequence in any one of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz, within its genome or as an extrachromosomal element.

[0018] In some aspects, the probiotic bacterial strain is present in an amount of more than 10 ² cfu per gram of the composition. In some aspects, the compositions as disclosed herein comprise at least a viable cell of the probiotic bacterial strain. In some aspects, the composition is formulated for oral administration. In some aspects, the composition is formulated for orogastric or nasogastric administration. In some aspects, the composition is in the form of a powder, a capsule, a tablet, a sachet, a liquid, an emulsion, or a suspension. In some aspects, the composition comprises an ingestible carrier. In some aspects, the ingestible carrier comprises a milk component. In some aspects, the ingestible carrier comprises baby formula or baby food. In some aspects, the ingestible carrier comprises F-75 or F-100 formulas. In some aspects, the ingestible carrier comprises a beverage.

[0019] In some aspects, the compositions further comprise one or more prebiotic, adjuvant, stabilizer, biological compound, dietary supplement, drug or combination thereof. In some aspects, the compositions as disclosed herein modify the gut microbiota of a subject in need thereof.

[0020] In some aspects, the current disclosure also encompasses an isolated bacterial strain comprising a genome sequence at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% identical to the genome sequence of P. copri strain NRRL deposit no. xxxxx or yyyyy or In some aspects, the isolated strain comprises a genome sequence more that 99% identical to the genome sequence of any one of the P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

[0021] In some aspects the current disclosure also encompasses a method of treatment, the method comprising administering to a subject in need thereof, a therapeutically effective quantity of any of the compositions disclosed herein. In some aspects, the subject is a child 0-5 years of age. In some aspects, the subject exhibits symptoms of or is diagnosed with undemutrition, Moderate Acute Malnutrition (MAM), Severe Acute Malnutrition (SAM) or stunting. In some aspects, the subject is an infant with a limited to no breastmilk diet. In some aspects, the subject is exhibiting symptoms of or diagnosed with necrotizing enterocolitis, nosocomial infections, or enteric inflammation. In some aspects, the child is on a limited breast milk diet. In some aspects, the child is on a no breast milk diet. In some aspects, the subject is administered a second composition comprising an MDF, at least one additional probiotic bacterial strain or both. In some aspects, the second composition is administered before, simultaneously or after the administration of the composition. In some aspects, the MDF comprises chickpea flour, peanut flour, soy flour, green banana, sugar, at least one oil, optionally an amino acid mix, a micronutrient premix, wherein the micronutrient premix provides at least 60% of the recommended daily allowance of vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc for a child aged 6-24 months. In some aspects, the MDF contains no milk, powdered milk or milk product. In some aspects, the MDF has about 400 to about 600 kcal per 100 g of the composition, about 20 g to about 36 g of fat per 100 g of the composition, about 11 g to about 16 g of protein per 100 g of the composition, a protein energy ratio (PER) of about 8% to about 12%, and a fat energy ratio (FER) of about 45% to about 60%. In some aspects, the MDF is selected from MDCF-1, MDCF-2, MDCF-3, MDCF- 2SS, MDSF, or MD-RUTF. In some aspects, the method comprises administration of additional probiotic bacterial strain, wherein the strain is a strain of Bifidobacterium longum subspecies infantis. In some aspects, the additional probiotic bacterial strain is Bifidobacterium longum subspecies infantis Bg_2D9.

[0022] In some aspects, the current disclosure also encompasses use of the compositions as disclosed herein for modifying the gut microbiota of a subject in need thereof. In some aspects, the current disclosure also encompasses use of the compositions as disclosed herein for enhancing the utilization of one or more of xylooligosaccharides, fructooligosaccharides, oligogalacturonate, galactooligosaccharides, galactose, glucuronate, galacturonate and arabinooligosaccharides, or combinations thereof.

[0023] In some aspects, the current disclosure also encompasses a synbiotic formulation comprising at least a probiotic bacterial strain comprising a polynucleotide sequence at least about 90% identical to any one of the sequences deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131, ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2 and an MDF. In some aspects, the probiotic bacterial strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of a polynucleotide sequence encoding a protein from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of a genome sequence deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to P. copri Bg131 , ERZ17359674 corresponding to P. copri BgF5_2 and ERZ17359677 corresponding to P. copri BgD5_2. In some aspects, the probiotic bacterial strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of polynucleotide sequences from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. In some aspects the probiotic bacterial strain is P.copri. In some aspects, the probiotic bacterial strain has a genome at least about 90% identical to the genome of any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. In some aspects, probiotic bacterial strain is any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. In some aspects of the symbiotic formulation, the MDF comprises chickpea flour, peanut flour, soy flour, green banana, sugar, at least one oil, optionally an amino acid mix, a micronutrient premix, wherein the micronutrient premix provides at least 60% of the recommended daily allowance of vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc for a child aged 6-24 months. In some aspects, the MDF contains no milk, powdered milk or milk product. In some aspects, the MDF has about 400 to about 600 kcal per 100 g of the composition, about 20 g to about 36 g of fat per 100 g of the composition, about 11 g to about 16 g of protein per 100 g of the composition, a protein energy ratio (PER) of about 8% to about 12%, and a fat energy ratio (FER) of about 45% to about 60%. In some aspects, the MDF is selected from MDCF-1 , MDCF-2, MDCF-3, MDCF-2SS, MDSF, or MD-RUTF.

[0024] In some aspects, the current disclosure also encompasses a food formulation for example MDCF-1, MDCF-2, MDCF-3, MDCF-2SS, MDSF, or MD-RUTF or variants thereof, for treatment of MAM, SAM or stunting. In some aspects, the food formulation may be administered to augment the benefits of P. copri in the gut microbiome. In some aspects, the P. copri is administered as a composition as disclosed herein. In some aspects, the P. copri is not externally administered but exists in the subject’s gut microbiome.

BRIEF DESCRIPTION OF THE DRAWINGS

[0025] Embodiments of the present inventive concept are illustrated by way of example in which like reference numerals indicate similar elements and in which:

[0026] FIG. 1 A shows photographs of the various food formulations developed for the trial.

[0027] FIG. 1B is a schematic of the timeline and phases of the study.

[0028] FIG. 2A shows a schematic of the study design.

[0029] FIG. 2B shows the Bioinformatic workflow for MAG assembly, refinement and quantitation. Pipeline for MAG assembly from short-read only or short-read plus long-read shotgun sequencing data. Steps are indicated on the left while the bioinformatic tools employed to accomplish each step are described within each box.

[0030] FIG. 2C shows comparison of MAG assembly summary statistics derived from CheckM (completeness, contamination) or Quast (contigs, length, N50) for 82 high-quality MAGs obtained from short- plus long-read hybrid assemblies versus 918 high-quality MAGs from short-read only assembly methods. Boxplots show the median, first and third quartiles; whiskers extend to the largest value no further than 1.5 x the interquartile range. ***, P < 0.001 (Wilcoxon test).

[0031] FIG. 2D shows volcano plot indicating the results of linear mixed-effects modeling of the relationship between MAG abundance and WLZ scores for all trial participants, irrespective of treatment. Bacterial genera that are abundant in the list of MAGs significantly associated with WLZ are colored by their taxonomic classification. [0032] FIG. 2E shows the distribution of WLZ-associated MAGs across taxonomic groups. Left subpanel, density plot showing WLZ-associated MAGs tabulated based on their genus-level classification. β ₁ refers to the coefficient in the mixed linear effects model presented at the bottom of the figure. Genera containing >3 significantly WLZ-associated MAGs are shown. Right subpanel, number of significant WLZ-associated MAGs assigned to each genus depicted in the left subpanel.

[0033] FIG. 2F shows results of gene set enrichment analysis (GSEA) of WLZ-associated MAGs ranked by the magnitude of their difference in abundance in response to MDCF-2 versus RUSF treatment. Plotted values indicate the mean Iog2-fold difference (±SEM) in each model coefficient between the two treatment groups. The statistical significance of enrichment (q-value, GSEA) of MAGs that are positively or negatively associated with WLZ is shown.

[0034] FIG. 2G shows results of gene set enrichment analysis (GSEA) of WLZ-associated MAGs ranked by the magnitude of their change in ‘abundance over time’ in response to MDCF-2 versus RUSF treatment. Plotted values indicate the mean Iog2-fold difference (±SEM) in each model coefficient between the two treatment groups. The statistical significance of enrichment (q- value, GSEA) of MAGs that are positively or negatively associated with WLZ is shown.

[0035] FIG. 2H shows enrichment of metabolic pathways in WLZ- and treatment-associated MAGs. MAGs were ranked by their WLZ association (negative to positive) or treatment association (RUSF-associated to MDCF-2 associated) and GSEA was employed to determine overrepresentation of pathways in MAGs at the extremes of each ranked list. The results (Normalized Enrichment Score, NES) only include pathways that display a statistically significant enrichment (q<0.05, GSEA) in both the WLZ-associated MAG and treatment-associated MAG analyses. For carbohydrate utilization pathways, disaccharides and oligosaccharides are indicated with an asterisk.

[0036] FIG. 3A provides LC-MS analysis of glycans for monosaccharides present in MDCF-2 and RUSF, and in the food ingredients used to formulate them. MeantSD are plotted. *, P<0.05, **, P<0.01 (t-test).

[0037] FIG. 3B provides LC-MS analysis of glycans for glycosidic linkages present in MDCF-2 and RUSF, and in the food ingredients used to formulate them. Mean+SD are plotted. *, P<0.05, **, P<0.01 (t-test).

[0038] FIG. 3C shows polysaccharide structures of glycans enriched in components of MDCF- 2 or RUSF. [0039] FIG. 3D depicts the principal polysaccharides in MDCF-2, RUSF and their component ingredients. Mean values + SD are plotted. *, P<0.05; ***, P<0.001 (t-test).

[0040] FIG. 3E shows the structure of the galactans.

[0041] FIG. 3F shows the structure of the mannans.

[0042] FIG. 4A shows the principal taxonomic features and expressed functions of MDCF-2 and RUSF-treated fecal microbiomes. Significant enrichment of taxa (q<0.1 ; GSEA) along the first principal component (PC1) of MAG abundance or transcript abundance is shown.

[0043] FIG. 4B shows percent variance explained by top 10 principal components of a PGA analysis including abundance of MAGs.

[0044] FIG. 4G shows percent variance explained by top 10 principal components of a PGA analysis including transcripts across all available timepoints and study participants.

[0045] FIG. 4D shows significant enrichment of taxa (q<0.05, GSEA) along the first three principal components (PC1-PC3) of the fecal microbiome or meta-transcriptome.

[0046] FIG. 4E shows carbohydrate utilization pathways significantly enriched (q<0.1 ; GSEA) by treatment group (β ₁ , circles) or the interaction of treatment group and study week (β ₃, squares). Right subpanel: Each point represents a MAG transcript assigned to each of the indicated functional pathways (rows), ranked by the direction and statistical significance of their differential expression in MDCF-2 versus RUSF treated participants (defined as the direction of the foldchange × -log10(P-value)). Transcripts are colored by their MAGs of origin. Larger, black outlined circles indicate leading edge transcripts assigned to the pathway described at the left of the panel.

[0047] FIG. 4F shows carbohydrate utilization pathways significantly enriched (q<0.1; GSEA) in upper- vs lower-WLZ quartile responders (β ₁, diamonds) or the interaction of WLZ-response quartile and study week (β ₃, triangles) (see linear mixed effects model). Right subpanel: Transcripts assigned to each functional pathway. Coloring and outlined circles have identical meaning as in panel b. The enrichment of glucuronate and galacturonate pathways was driven by the same transcripts, hence these pathways were considered as a single unit.

[0048] FIG. 5A shows unrooted, marker gene-based phylogenetic tree of 51 Prevotella MAGs from this study, plus 1 ,049 P. copri genomes and MAGs previously assigned to four clades. Pink stars denote the two WLZ-associated P. copri MAGs. The nine remaining P. copri MAGs from this study are highlighted by the green pentagons. The 40 Prevotella MAGs not classified as P. copri based on their having an average branch length >0.5 from all 1 ,049 reference P. copri isolates are grouped together and depicted as a yellow triangle

[0049] FIG. 5B shows mcSEED carbohydrate utilization pathways in 51 Prevotella MAGs from the current study. MAGs are hierarchically clustered based on the predicted presence (red) or absence (white) of these pathways.

[0050] FIG. 6 shows phylogenetic tree and inferred carbohydrate utilization phenotypes of Bifidobacterium MAGs. The phylogenetic tree indicates the relatedness of 34 Bifidobacterium MAGs and 14 reference genomes, as determined by sequence similarity among 142 core genes. The size of the pink circles in the dendrogram correspond to bootstrap support for the nodes (out of 100 bootstraps). Type stains used for taxonomic assignments and phenotypic comparisons are bolded. The matrix describes the presence (orange) or absence (white) of 25 predicted carbohydrate utilization phenotypes encompassing host- and plant-derived glycans. LNT, lacto- N-tetraose; LNnT, lacto-N-neotetraose; FL, 2'- and 3'-fucosyllactose; SL, 3'- and 6'-sialyllactose; Nglyc, N-glycans; Nglyc_core, N-glycan core (Fuca1-6GlcNAcβ1-Asn); GNB, galacto-N-biose; GIcNAc6S, N-acetylglucosamine-6-sulfate; Muc, mucin O-glycans; IMO, isomaltooligosaccharides and panose; Mlz, melezitose; AXOS, arabinoxylooligosaccharides; XGIOS, xyloglucan oligosaccharides; ST, starch and glycogen; RST, resistant starch; GALAJ, type I galactan and arabinogalactan; AGII, type II galactan and arabinogalactan; GA, gum arabic; AR, arabinan; XL, xylan; AX, arabinoxylan; bMAN, β-mannan; XGL, xyloglucan; Gin, ginsenosides; Rgl, rhamnoglycosides.

[0051] FIG. 7A is a representation of seven highly conserved PULs, present in Bg0018 and Bg0019, among the nine other P. copri MAGs identified in study participants and six P. copri isolates obtained from Bangladeshi children. The phylogenetic tree (left) indicates the relatedness of P. copri MAGs and isolates as determined by a marker gene-based phylogenetic analysis. Tree tips are colored by their P. copri clade designation. The β ₁(WLZ) coefficient for each P. copri MAG is indicated on the right of the figure; significant associations (q<0.05) are bolded. The color-coded matrix in the center indicates the extent of conservation of PULs in Bg0019 and Bg0018 versus the other P. copri MAGs identified in the fecal microbiomes of study participants. The known or predicted polysaccharide targets of these PULs are noted. The number of differentially expressed PUL transcripts in MAG Bg0018 and Bg0019 are shown in the colored cells; they were identified based on analysis of MDCF-2 versus RUSF treated participants and/or from upper versus lower WLZ-response quartile participants who all received MDCF-2.

[0052] FIG. 7B shows the relationship between PUL conservation in the 11 P. copri MAGs identified in study participants and the strength of each MAG's association with WLZ. [0053] FIG. 7C shows the CAZyme components of select P. copri PULs.

[0054] FIG. 7D shows the locus structure of PUL7 in MAG Bg0019. Abbreviations: GH, CAZy glycoside hydrolase family assignment; CE, carbohydrate esterase.

[0055] FIG. 8A shows significant changes in fecal glycosidic linkage levels (q<0.05) over time in upper- compared to lower-WLZ quartile responders. Likely polysaccharide sources for each of the 14 glycosidic linkages are noted in the middle column. PULs present in P. copri MAGs Bg0018 and Bg0019 with known or predicted cleavage activity for the listed polysaccharide sources are noted on the right subpanel.

[0056] FIG. 8B is a boxplot of changes in the levels of fecal glycosidic linkages relative to initiation of treatment among upper- and lower-WLZ quartile responders. Levels of these 14 linkages increased to a significantly greater extent over time in the comparison of upper- vs lower WLZ-quartile (Model: linkage abundance ~ WLZ-response quartile x study week + (1 |PID)). Note that boxplots indicate the median, first and third quartiles; whiskers extend to the largest value no further than 1 .5 x the interquartile range.

[0057] FIG. 8C shows the β ₃ coefficient for the interaction of WLZ-response quartile and study week is shown for CAZymes in PULs in Bg0018 and Bg0019. Predicted PUL substrates and potential glycosidic linkages in each of these substrates are shown at right. Glycosidic linkages with significant differences in fecal levels in upper versus lower WLZ-quartile responders are highlighted in bold font

[0058] FIG. 8D shows the polysaccharide structures, cleavage sites, and predicted products of CAZyme activity. Glycosidic linkages highlighted with arrows are those predicted as sites of cleavage by CAZymes expressed by the set of PULs, that are present in P. copri MAG Bg0019 and/or Bg0018. Consensus PUL numbers are listed except in the case of Bg0019 PUL3, which is not represented in Bg0018. The size of the arrows (large versus small) denotes the relative likelihood (high versus low, respectively) of cleavage of glycosidic linkages by P. copri CAZymes when considering steric hinderance at branch points.

[0059] FIG. 8E shows MDCF-2 polysaccharide substrates (left subpanels) and glycosidic linkage cleavage products predicted to be liberated by conserved P. copri MAGs Bg0019 and Bg0018 PULs. Linkages highlighted with arrows are putative sites of cleavage by the P. copri CAZymes based on their known or predicted enzyme activities; enzymes are labeled by their CAZyme module or modules predicted to perform the cleavage. The size of these arrows (large versus small) denotes the relative likelihood (high versus low, respectively) of glycosidic linkage cleavage by these CAZymes, considering steric hindrance at glycan branch points.

[0060] FIG. 8F shows the expression of PUL genes in MDCF-2 treated, upper- vs lower-WLZ quartile responders (only PUL genes with mcSEED or CAZy annotations are shown).

[0061] FIG. 8G shows predicted activity of PUL17b CAZymes, including cleavage of α-1,2- and α-1 ,3-linked arabinofuranose (Araf) side chains by GH51 (blue) and the α-1 ,5-Araf-linked backbone of branched arabinan by GH43 (brown, includes GH43_4 and GH43_5 subfamilies), respectively. Preferential cleavage of linear, unbranched regions of this glycan would be expected to yield oligosaccharide fragments containing t-Araf, 2-Araf, 5-Araf, and 2, 3-Araf linkages, which are enriched in MDCF-2 treated, upper quartile WLZ-responders.

[0062] FIG. 8H shows predicted activities of PUL7 GH26, GH5_4, or GH26-GH5_4 family CAZymes (magenta) on β-1 ,4 linked mannose residues of galactomannan, yielding products containing 4,6-manose, the most significantly differentially abundant linkage in the upper quartile WLZ-responders (see panel a).

[0063] FIG. 9A depicts the experimental design for studying the relationship between P. copri colonization efficiency and pre-colonization with B. longum subsp. Infantis. Mice were weaned at P28 and P25 for experiments 1 and 2, respectively.

[0064] FIG. 9B shows the phylogenetic tree of P. copri isolates and MAGs. The phylogenetic distance between each pair of comparisons is shown in the matrix.

[0065] FIG. 9C provides the total absolute abundance of P. copri strains in fecal samples collected from pups at P42. Mean values + SD are shown. Each dot represents a separate mouse. P-values (Mann-Whitney U test) are noted.

[0066] FIG. 10A Energy contribution from different modules of the ‘weaning diet supplemented with MDCF-2’.

[0067] FIG. 10B shows the study design outlining the timing of bacterial colonization of dams and diet switches.

[0068] FIG. 10C shows study shows the gavages administered to members of each treatment arm.

[0069] FIG. 10D provides the absolute abundance of B. infantis Bg2D9 (Arm 1) and B. infantis Bg463 (Arm 2) in fecal samples obtained from pups.

[0070] FIG. 10E provides absolute abundance of P. copri in fecal samples collected from pups in the indicated treatment arms at the indicated postnatal time points. Inset: the absolute abundance of P. copri in fecal samples collected from pups at P21 (Mann-Whitney U test)

[0071] FIG. 10F provides body weights of the offspring of dams, normalized to postnatal day 23. [linear mixed effects model (see Methods)]. Mean values ± SD are shown. Each dot in panels d-f represent an individual animal. P values were calculated using a Mann-Whitney U test (panel e insert) or a linear mixed effect model.

[0072] FIG. 11A shows ultra-high performance liquid chromatography-triple quadrupole mass spectrometric (UHPLC-QqQ-MS) quantitation of levels of arabinose-containing glycosidic linkages in cecal glycans.

[0073] FIG. 11 B shows ultra-high performance liquid chromatography-triple quadrupole mass spectrometric (UHPLC-QqQ-MS) quantitation of levels of total arabinose in cecal glycans.

[0074] FIG. 11C provides GC-MS quantitation of cecal acetate levels. Mean values ± SD are shown. P-values were calculated using a Mann-Whitney U test.

[0075] FIG. 11D is an illustration of the singular value decomposition and its application to microbial RNA-seq analysis. Matrix M stores the TPM value for each bacterium in each sample. Reads mapped to P. copri, P. stercorea, and the two strains of B. longum subsp. infantis were removed and transcripts with low expression were filtered out using edgeR before generating matrix M.

[0076] FIG. 11 E shows projection of samples onto a space determined by PC1 and PC2. Centroids are denoted by a white “X”. Shaded ellipses represent the 95% confidence interval of the sample distribution.

[0077] FIG. 11 F shows projection of the transcriptional responses of reconstructed metabolic pathways for each bacterium listed in M on the same PC space as depicted. Bacteria that can utilize arabinose, based on mcSEED metabolic reconstruction, are highlighted using bold font.

[0078] FIG. 11G shows differential expression analysis of genes involved in carbohydrate utilization, amino acid biosynthesis, and fermentation in arabinose-utilizing bacteria. Violin plots show the distribution of Iog2 fold-differences for all expressed genes in the indicated strain. Abbreviations: Glu, glutamate; Gin, glutamine; Leu, leucine; lie, isoleucine; Vai, valine.

[0079] Fig. 12A provides the number of Recon2 reactions with statistically significant differences in their predicted flux between the w/ P copri and w/o P. copri groups.

[0080] FIG. 12B provides the number of Recon2 reactions in each Recon2 subsystem that are predicted to have statistically significant differences in their activities between the two treatment groups. Colors denote values normalized to the sum of all statistically significantly different Recon2 reactions found in all selected cell clusters for a given Recon2 subsystem in each treatment group.

[0081] FIG. 12C is a proportional representation of cell clusters identified by snRNA-Seq.

Asterisks denote ‘statistically credible differences’ as defined by scCODA.

[0082] FIG. 12D shows selected Recon2 reactions in enterocyte clusters distributed along the villus involved in the urea cycle and glutamine metabolism.

[0083] FIG. 12E provides targeted mass spectrometric quantifications of citrulline levels along the length of the gut and in plasma. Mean values ± SD and P-values from the Mann-Whitney U test are shown.

[0084] FIG. 12F shows the effect of colonization with bacterial consortia containing or lacking P. copri on extracellular transporters for monosaccharides, amino acids and dipeptides. Sar: sarcosine. These transporters were selected and the spatial information of their expressed region along the length of the villus was assigned based on published experimental evidence. Arrows in panels b and e indicate the “forward” direction of each Recon2 reaction. The Wilcoxon Rank Sum test was used to evaluate the statistical significance of the net reaction scores (FIG. 12A, FIG. 12B, FIG. 12D and FIG. 12E) between the two treatment groups. P-values were calculated from Wilcoxon Rank Sum tests and adjusted for multiple comparisons (Benjamini-Hochberg method); a q-value < 0.05 was used as the cut-off for statistical significance.

[0085] FIG. 13A is a dot plot of marker gene expression across epithelial cell types. The average expression level and percentage of nuclei that express a given gene within a cell type are indicated by dot color and size, respectively.

[0086] FIG. 13B provides an integrated UMAR plot for all jejunal nuclei isolated from 8 animals representing the two treatment arms (n=4 mice/arm) in the parameter screen experiment.

[0087] FIG. 13C provides the number and directionality of statistically significant differentially expressed genes in each cell cluster.

[0088] Fig. 14 illustrates NicheNet-based analysis of the effects of P. copri colonization on cellcell signaling activities. Each row represents different sender cell clusters. Each column represents ligands expressed by these sender cells. Cells are colored based on the Iog2-fold difference in expression of ligands in the sender cell clusters between w/ P. copri and w/o P. copri mice. Ligands (columns) are grouped based on receiver cell clusters and the indicated functions of downstream signaling pathways in these receiver cells.

[0089] FIG. 15A provides the study design for validating the effects of P. copri colonization in gnotobiotic mother-pup dyads.

[0090] FIG. 15B provides body weights of the offspring of dams, normalized to postnatal day 23 linear mixed effects model.

[0091] FIG. 15C provides a targeted mass spectrometric analysis of jejunal citrulline. . Each dot represents a single animal. Mean values ± SD are shown. P-values were calculated from the linear mixed effect model (panel b) or Mann-Whitney U test. N.S., P-value > 0.05.

[0092] FIG. 15D provides a targeted mass spectrometric analysis of acylcarnitine levels. . Each dot represents a single animal. Mean values + SD are shown. P-values were calculated from the linear mixed effect model (panel b) or Mann-Whitney U test. N.S., P-value > 0.05.

[0093] FIG. 15E provides a targeted mass spectrometric analysis of colonic acylcamitine levels.

[0094] Fig. 15F provides plasma levels of non-esterified fatty acids. Each dot represents a single animal. Mean values + SD are shown. P-values were calculated from the linear mixed effect model (panel b) or Mann-Whitney U test. N.S., P-value > 0.05. . Each dot represents a single animal. Mean values + SD are shown. P-values were calculated from the linear mixed effect model

(panel b) or Mann-Whitney U test. N.S., P-value > 0.05.

[0095] Fig. 16 shows normalized number of Recon2 reactions in Recon2 subsystems predicted to have statistically significant differences in their activities between the w/ P. copri and w/o P. copri treatment groups.

[0096] Fig. 17A shows the study design for testing the effects of pre-weaning colonization with two P. copri strains closely related to MAGs Bg0018 and Bg0019 on host weight gain, and MDCF- 2 glycan degradation.

[0097] Fig. 17B provides absolute abundance of P. copri strains and total bacterial load in cecal contents collected at P53.

[0098] Fig. 17C provides body weights of the offspring of dams, normalized to postnatal day 23 [linear mixed effects model (see Methods^.

[0099] Fig. 17D shows the comparison of polysaccharide utilization loci (PULs) highly conserved in the two P. copri MAGs (Bg0018 and Bg0019) identified in the RCT as being significantly positively associated with WLZ and MDCF-2 glycan metabolism, with their representation in the three cultured P. copri strains.

[0100] Fig. 17E provides UHPLC-QqQ-MS analysis of total arabinose and galactose in glycans present in cecal contents collected at euthanasia (P53).

[0101] Fig. 17F provides UHPLC-QqQ-MS of glycosidic linkages containing arabinose in cecal contents. Mean values + SD are shown. P-values were calculated using a Mann-Whitney U test.

[0102] Fig. 17G provides UHRLC-QqQ-MS of glycosidic linkages containing galactose in cecal contents. Mean values ± SD are shown. P-values were calculated using a Mann-Whitney U test.

[0103] Fig. 18A provides comparison of weight-for-length z-score (WLZ) between the MDCF-2 and RUSF groups at different time points up to 2 years after cessation of the 3-month intervention in 12-18 month children with primary MAM.

[0104] Fig. 18B provides comparison of length-for-age z-score (LAZ) between MDCF-2 and RUSF groups at different time points up to 2 years after cessation of the 3-month intervention in 12-18 month children with primary MAM.

[0105] Fig. 18C provides comparison of weight-for-age z-score (WAZ) between MDCF and RUSF group at different time points up to 2 years after cessation of the 3-month intervention in 12-18 month children with primary MAM.

[0106] Fig. 19 shows LC-MS of ileal and colonic acylcarnitines in gnotobiotic mice colonized with P. copri D5.2 and F5.2. Mean values ± SD are shown. P-values were calculated using a Mann-Whitney U test.

[0107] The drawing figures do not limit the present inventive concept to the specific embodiments disclosed and described herein. The drawings are not necessarily to scale, emphasis instead being placed on clearly illustrating principles of certain embodiments of the present inventive concept.

DETAILED DESCRIPTION

[0108] The present disclosure encompasses compositions and methods of treatment for subjects in need thereof, where the methods of treatment comprise administering a disclosed composition. In some embodiments, the methods of treatment address malnutrition, including undernutrition, in part by modifying the gut microbiota of the subject. The global burden of childhood undernutrition is great, causing 3.1 million deaths annually and accounting for 21% of life years lost among children younger than 5 years. More than 18 million children in this age range are affected by severe acute malnutrition (SAM), the most extreme form of undemutrition. SAM is responsible for nearly half of all undernutrition-related mortality. Various aspects of this invention demonstrate that there is a correlation between childhood malnutrition and deficiencies in components of the gut microbiota whose restoration is associated with improved outcomes for acutely malnourished children. In one aspect, the present disclosure is a result of extensive experimental studies that correlate the evolution of the gut microbiome with the various therapeutic and dietary interventions that help improve the health of SAM patients. The presence of one particular bacterial strain Prevotella copri (P. copri) in these synbiotic studies was correlated with much better outcomes for the patients. In another aspect, the present disclosure also stems from extensive screening and in-depth characterization of the gut microbiome for identification of bacterial strains for enhanced survival (fitness) in children who consume diets with limited breastmilk content. While exclusive breastfeeding of infants is recommended by the WHO for the first 6 months, in many low-income settings, gruels, animal milk and complementary foods are often introduced into the diet at an early age for economic and/or cultural reasons. Surprisingly, Prevotella copri obtained from these extensive screening efforts exhibits superior fitness over multiple other strains, in population with complementary plant-based diets. Metagenomic characterization of the strains helped define DNA sequences involved in the uptake, or utilization or both of xylooligosaccharides, fructooligosaccharides, oligogalacturonate, galactooligosaccharides, galactose, glucuronate, galacturonate and arabinooligosaccharides, or combinations thereof by the isolated strain compared to comparable strains without these DNA sequences.

[0109] The current disclosure describes isolated and engineered strains of Prevotella copri comprising one or more of these DNA sequences, and therapeutic or synbiotic formulations comprising these strains, that when administered into a subject in need thereof, enhance the capacity for uptake or utilization of certain plant-based polysaccharides. Such treatments improve outcomes for malnourished children. In some aspects, the disclosed strain compositions can be administered alone. In some aspects, the disclosed strain compositions can be administered in combination with food formulations. In some aspects, the disclosed strain compositions can be administered with additional probiotic compositions. In some aspects, the strain compositions can be administered with additional food and probiotic formulations. Some aspects of this invention further provide methods for modifying gut microbiota, thus providing advantageous outcomes including but not limited to reducing symptoms of, or treating, acute malnutrition, enteric inflammation, necrotizing enterocolitis, and allergies, promoting recolonization of the gut after diarrhea or antibiotic consumption, and improving vaccine performance by administering therapeutically effective quantities of these formulations.

[0110] DEFINITIONS

[0111] Unless defined otherwise, all technical and scientific terms used herein have the meaning commonly understood by a person skilled in the art to which this invention belongs. The following references provide one of skill with a general definition of many of the terms used in this invention: Singleton et al., Dictionary of Microbiology and Molecular Biology (2nd ed. 1994); The Cambridge Dictionary of Science and Technology (Walker ed., 1988); The Glossary of Genetics, 5th Ed., R. Rieger et al. (eds.), Springer Verlag (1991); and Hale & Marham, The Harper Collins Dictionary of Biology (1991). As used herein, the following terms have the meanings ascribed to them unless specified otherwise.

[0112] The phraseology and terminology employed herein are for the purpose of description and should not be regarded as limiting. For example, the use of a singular term, such as, “a” is not intended as limiting of the number of items. Also, the use of relational terms such as, but not limited to, “top,” “bottom,” “left,” “right,” “upper,” “lower,” “down,” “up,” and “side,” are used in the description for clarity in specific reference to the figures and are not intended to limit the scope of the present inventive concept or the appended claims.

[0113] Further, as the present inventive concept is susceptible to embodiments of many different forms, it is intended that the present disclosure be considered as an example of the principles of the present inventive concept and not intended to limit the present inventive concept to the specific embodiments shown and described. Any one of the features of the present inventive concept may be used separately or in combination with any other feature. References to the terms “embodiment,” “embodiments,” and/or the like in the description mean that the feature and/or features being referred to are included in, at least, one aspect of the description. Separate references to the terms “embodiment,” “embodiments,” and/or the like in the description do not necessarily refer to the same embodiment and are also not mutually exclusive unless so stated and/or except as will be readily apparent to those skilled in the art from the description. For example, a feature, structure, process, step, action, or the like described in one embodiment may also be included in other embodiments but is not necessarily included. Thus, the present inventive concept may include a variety of combinations and/or integrations of the embodiments described herein. Additionally, all aspects of the present disclosure, as described herein, are not essential for its practice. Likewise, other systems, methods, features, and advantages of the present inventive concept will be, or become, apparent to one with skill in the art upon examination of the figures and the description. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the present inventive concept, and be encompassed by the claims.

[0114] Any term of degree such as, but not limited to, “substantially” as used in the description and the appended claims, should be understood to include an exact, or a similar, but not exact configuration. For example, “a substantially planar surface” means having an exact planar surface or a similar, but not exact planar surface. Similarly, the terms “about” or “approximately,” as used in the description and the appended claims, should be understood to include the recited values or a value that is three times greater or one third of the recited values. For example, about 3 mm includes all values from 1 mm to 9 mm, and approximately 50 degrees includes all values from 16.6 degrees to 150 degrees. For example, they can refer to less than or equal to + 5%, such as less than or equal to ± 2%, such as less than or equal to ± 1 %, such as less than or equal to ± 0.5%, such as less than or equal to + 0.2%, such as less than or equal to ± 0.1 %, such as less than or equal to + 0.05%. As used herein, “about” refers to numeric values, including whole numbers, fractions, percentages, etc., whether or not explicitly indicated. The term “about” generally refers to a range of numerical values, for instance, + 0.5-1%, ± 1-5% or ± 5-10% of the recited value, that one would consider equivalent to the recited value, for example, having the same function or result.

[0115] Lastly, the terms “or” and “and/or,” as used herein, are to be interpreted as inclusive or meaning any one or any combination. Therefore, “A, B or C” or “A, B and/or C” mean any of the following: “A,” “B" or “C”; “A and B”; “A and C”; “B and C”; “A, B and C.” An exception to this definition will occur only when a combination of elements, functions, steps or acts are in some way inherently mutually exclusive.

[0116] When introducing elements of the present disclosure or the preferred aspects(s) thereof, the articles "a", "an", "the" and "said" are intended to mean that there are one or more of the elements. The terms "comprising", "including" and "having" are intended to be inclusive and mean that there may be additional elements other than the listed elements.

[0117] The term “comprising” means “including, but not necessarily limited to”; it specifically indicates open-ended inclusion or membership in a so-described combination, group, series and the like. The terms “comprising” and “including” as used herein are do not exclude additional, unrecited elements or method processes. The term “consisting essentially of’ is more limiting than “comprising” but not as restrictive as “consisting of.” Specifically, the term “consisting essentially of’ limits membership to the specified materials or steps and those that do not materially affect the essential characteristics of the claimed invention.

[0118] The terms "nucleic acid”, "nucleic acid molecule”, and "polynucleotide” are used interchangeably herein. The terms “nucleic acid encoding . . or “nucleic acid molecule encoding . . . ” should be understood as referring to the sequence of nucleotides which encodes a polypeptide.

[0119] As used herein, the term “polynucleotide”, which may be used interchangeably with the term “nucleic acid” generally refers to a biomolecule that comprises two or more nucleotides. In some aspects, a polynucleotide comprises at least two, at least five at least ten, at least twenty, at least 30, at least 40, at least 50, at least 100, at least 200, at least 250, at least 500, or any number of nucleotides. For example, the polynucleotides may include at least 500 nucleotides, at least about 600 nucleotides, at least about 700 nucleotides, at least about 800 nucleotides, at least about 900 nucleotides, at least about 1000 nucleotides, at least about 2000 nucleotides, at least about 3000 nucleotides, at least about 4000 nucleotides, at least about 4500 nucleotides, or at least about 5000 nucleotides. A polynucleotide may be single-stranded or double-stranded. In some aspects, a polynucleotide is a site or region of genomic DNA. In some aspects, a polynucleotide is an endogenous gene that is comprised within the genome of an unmodified cell or universal donor cell. In some aspects, a polynucleotide is an exogenous polynucleotide that is not integrated into genomic DNA. In some aspects, a polynucleotide is an exogenous polynucleotide that is integrated into genomic DNA. In some aspects, a polynucleotide is a plasmid. In some aspects, a polynucleotide is a circular or linear molecule.

[0120] The term “DNA sequence” refers to a heritable sequence of DNA, i.e., a genomic sequence, with functional significance. The term “gene” can be used to refer to, e.g., a cDNA and/or an mRNA encoded by a genomic sequence, as well as to that genomic sequence.

[0121] Nucleic acid is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence. For example, a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Generally, “operably linked” means that the DNA sequences being linked are contiguous, and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers do not have to be contiguous.

[0122] The isolated strains of “Prevotella copri’ for use in compositions as disclosed herein refers to P. copri strains available at Professor Jeffery I. Gordon’s laboratory at Washington University, School of Medicine at St. Louis and corresponds to NRRL deposit nos. xxxx, yyyy or zzzz at the ARS Culture Collection (NRRL). A genome sequence of the three strains has also been deposited at the European Nucleotide Archive under project number PRJEB45356 and correspond to accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2 respectively.

[0123] The term “carbohydrate”, as used herein, refers to an organic compound with the formula Cm(H2O)n, where m and n may be the same or different number, provided the number is greater than 3.

[0124] The term “glycan” refers to a linear or branched homo- or heteropolymer of two or more monosaccharides linked glycosidically. As such, the term “glycan” includes disaccharides, oligosaccharides and polysaccharides. The term also encompasses a polymer that has been modified, whether naturally or otherwise; non-limiting examples of such modifications include acetylation, alkylation, esterification, etherification, oxidation, phosphorylation, selenization, sulfonation, or any other manipulation.

[0125] The term “N-glycan,” as used herein, refers to a polymer of sugars that has been released from a glycoconjugate but was formerly linked to the glycoconjugate via a nitrogen linkage (see definition of N-linked glycan below). “N-linked glycans” are glycans that are linked to a glycoconjugate via a nitrogen linkage. A diverse assortment of N-linked glycans exist.

[0126] As used herein “Polysaccharide Utilization Loci” or “PUL” is used interchangeably and corresponds to PUL predictions as provided in the PUL database (Terrapon et al. 2018). The “fiber degrading capacity” of a subject’s gut microbiota may be defined by its compositional state and/or its functional state. For instance, the compositional stage of a subject’s gut microbiota may be defined by the absence, presence and abundance of primary and secondary consumers of dietary fiber, while the functional state may be defined by the representation of relevant genomic loci (polysaccharide utilization loci (PULs), carbohydrate-active enzymes (CAZymes), etc.), expression from these loci, and/or activity of proteins encoded by these loci. An increase in the fiber degrading capacity of a subject may be effected by increasing the abundance of microorgansims with genomic loci for import and metabolism of glycans, as exemplified by PULs and/or loci encoding CAZymes; and/or increasing the abundance or expression of one or more proteins encoded by a PUL and/or one or more CAZyme (with or without concomitant changes in microorganism abundance). Thus, for example PUL17 on the genome of P. copri refers to the genome loci encoding pectin degrading enzymes. [0127] As used herein, the term “malnutrition” refers to one or more forms of undernutrition - for example, wasting (low weight-for-length), stunting (low length-for-age), underweight (low weight-for age), deficiencies in vitamins and minerals, etc. A subject in need of treatment for malnutrition may also be referred to herein as a malnourished subject.

[0128] A length-for-age Z Score (LAZ) refers to the number of standard deviations of the actual length of a child from the median length of the children of his/her age as determined from the standard sample. This is prefixed by a positive sign (+) or a negative sign (-) depending on whether the child's actual length is more than the median length or less than the median length. The terms length and height are used interchangeably herein. Therefore, length-for-age Z Score (LAZ) and height-for-age Z Score (HAZ) refer to the same measurement.

[0129] A weight-for-age Z score (WAZ) refers to the number of standard deviations of the actual weight of a child from the median weight of the children of his/her age as determined from the standard sample. This is prefixed by a positive sign (+) or a negative sign (-) depending on whether the child's actual weight is more than the median weight or less than the median weight.

[0130] A weight-for-length Z score (WLZ) refers to the number of standard deviations of the actual weight of a child from the median weight of the children of his/her length as determined form the standard sample. This is prefixed by a positive sign (+) or a negative sign (-) depending on whether the child's actual weight is more than the median weight or less than the median weight for the same length. The terms length and height are used interchangeably herein. Therefore, weight-for-height Z score (WHZ) and weight-for-length Z score (WLZ) refer to the same measurement.

[0131] A mid-upper-arm-circumference score (MUAC) is an independent anthropometric measurement used to identify malnutrition.

[0132] Moderate acute malnutrition (MAM) is defined by a WHZ less than or equal to -2 and greater than or equal to -3.

[0133] Severe acute malnutrition (SAM) is defined by a WHZ less than -3 and/or bipedal edema, and/or a mid-upper arm circumference (MUAC) less than 11.5 cm.

[0134] As used herein, a “healthy child” has a LAZ and WLZ consistently no more than 1.5 standard deviations below the median calculated from a World Health Organization (WHO) reference healthy growth cohort as described in WHO Multicentre Reference Study (MGRS), 2006 (www.who.int/childgrowth/mgrs/en). [0135] As used herein, “stunting” or linear growth faltering is defined by a LAZ of less than or equal to -2. In some aspects, shunting can occur in the absence of wasting (MAM, SAM), but is often a co-morbidity in children with MAM or SAM.

[0136] As used herein, “statistically significant” is a p-value <0.05, <0.01, <0.001 , <0.0001, or <0.00001.

[0137] The terms “treat,” "treating," or "treatment" as used herein, refer to both therapeutic treatment and prophylactic or preventative measures, wherein the object is to prevent or slow down (lessen) an undesired physiological change or disease/disorder. Beneficial or desired clinical results include, but are not limited to, alleviation of symptoms, diminishment of extent of disease, stabilization (i.e., not worsening) of disease, a delay or slowing of disease progression, amelioration or palliation of the disease state, and remission (whether partial or total), whether detectable or undetectable. “Treatment” can also mean prolonging survival as compared to expected survival if not receiving treatment. Those in need of treatment include those already with the disease, condition, or disorder as well as those prone to have the disease, condition or disorder or those in which the disease, condition or disorder is to be prevented.

[0138] As used herein, the term "effective amount" means an amount of a substance (e.g. a composition including formulations and combinations of the present disclosure) that leads to measurable and beneficial effects for the subject administered the substance, i.e., significant efficacy. As used herein the term “therapeutically effective amount” refers to an amount of the formulation or therapeutic combination that alleviates, in whole or in part, symptoms associated with the disorder or condition, or halts or slows further progression or worsening of those symptoms or prevents or provides prophylaxis for the disorder or condition. A therapeutically effective amount is also one in which any toxic or detrimental effects of compositions of the invention are outweighed by the therapeutically beneficial effects.

[0139] As used herein, the term “raw banana” refers to an unripe, green banana in the genus Musa. “Raw bananas” are also referred to as “green bananas” in the art, and the terms are used interchangeably herein. As is understood in the art, raw bananas are processed (e.g., baked, boiled, steamed, etc.) after which the pulp may or may not be dried prior to use.

[0140] The term “modifying” as used in the phrase “modifying the gut microbiota” is to be construed in its broadest interpretation to mean a change in the representation of microbes in the gastrointestinal tract of a subject. The change may be a decrease or an increase in the presence of a particular microbial strain, species, genus, family, order, or class. In some aspects, “modifying the gut microbiota” can “repair the gut microbiota” or “improve gut microbiota health”. To “repair the gut microbiota of a subject,” which is synonymous with “improve gut microbiota health,” means to change the microbiota of a subject, in particular the relative abundances of age- and health- discriminatory taxa, in a statistically significant manner towards chronologically-age matched reference healthy subjects. The term encompasses complete repair and levels of repair that are less than complete. The term also encompasses preventing or lessening a change in the relative abundances of age-and health-discriminatory taxa, wherein the change would have been significantly greater absent intervention.

[0141] As used herein the term “enhanced uptake” is intended to mean that the presence of the DNA sequence enhances the active transport of glycans, polysaccharides, or both into the bacterial cell compared to the same cell, or a cell of a similar background without the DNA sequence. In some aspects, the DNA sequence is known (based on assays known to a person of ordinary skill in the art including but not limited to binding assays, assays using glycan- recognizing probes comprising one or more of antibodies, lectins, carbohydrate molecules coupled with enzyme assays, immunohistochemistry, confocal microscopy, electron microscopy and flow cytometry) or predicted (based on sequence homology studies or curation using mcSEED analysis) to increase binding and intracellular transport of glycans, or plant derived oligosaccharides, or both by the microbe.

[0142] As used herein the term “enhanced utilization” is intended to mean that the presence of the DNA sequence enhances one or more of transport of glycans, transport of plant-derived polysaccharides, or both into the bacterial cell, and their subsequent metabolic processing [or metabolism]. In some aspects the DNA sequence is known (based on assays known to a person of ordinary skill in the art including but not limited to carbohydrate fermentation assays or glycan- recognizing probes comprising one or more of antibodies, lectins, carbohydrate molecules or enzyme assays) or predicted to (based on sequences homology studies or curation using mcSEED analysis) to increase microbial breakdown of N-glycans or plant derived oligosaccharides, or both.

[0143] As used herein, the term “subject” refers to a mammal. In some aspects, a subject is non-human primate or rodent. In some aspects, a subject is a human. In some aspects, a subject has, is suspected of having, or is at risk for, a disease or disorder. In some aspects, a subject has one or more symptoms of a disease or disorder. In particular aspects, a subject is malnourished. In some aspects, the subject is a child of 0-5 years of age. In some aspects, the subject is a child of 0-5 years of age, suspected of developing or having symptoms of malnutrition. I. Compositions

[0144] In one aspect, the present disclosure encompasses a composition comprising a probiotic strain and at least a carrier, wherein the probiotic bacterial strain is operable to enhance utilization of xylooligosaccharides, fructooligosaccharides, oligogalacturonate, galactooligosaccharides, galactose, glucuronate, galacturonate and arabinooligosaccharides, or combinations thereof, when administered to a subject in need thereof compared to a subject lacking the probiotic strain. In some aspects, the probiotic strain is an isolated strain of Prevotella copri isolated from the gut of Bangladeshi children, which were found to have enhanced capability to absorb and utilize various food substrates including arabinoxylan, pectin, b-mannan, b-glucan, xylan, arabinoxylan, glucomannan, xyloglucan, b-1,3-glucan, pectin galactan, starch or arabinogalactan. The genome of the strains of Prevotella copri of NRRL deposit no. xxxxx or yyyy or zzzz have been deposited in the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2 respectively. These strains were found to be highly beneficial to the children in protecting against undernutrition SAM, MAM, or stunting, either alone or in conjunction with other food supplements and probiotics. As such, in some aspects, the current disclosure encompasses a composition comprising a carrier and an isolated bacterial strain comprising a genome sequence at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% or 100% identical to the genome sequence as deposited in the the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2 respectively. These isolated strains correspond to the P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. Thus, in some aspects, the current disclosure also encompasses a composition comprising a carrier and an isolated bacterial strain comprising a genome sequence 100% identical to the genome sequence of any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. In some aspects, the current disclosure also encompasses an isolated bacterial strain comprising a genome sequence at least 95%, or at least 96%, or at least 97%, or at least 98%, or at least 99% identical to the genome sequence of any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. Further characterization of these strain was conducted, and specific genetic loci could be identified that imparted on the strains disclosed the beneficial properties of glycan utilization as provided herein. Table A-D provides the corresponding location of the Polysaccharide Utilization Loci (PUL) in the genome as identified by their short locus tags, that enhance utilization of the one or more of arabinoxylan, pectin, b-mannan, b-glucan, xylan, arabinoxylan, glucomannan, xyloglucan, b-1 ,3-glucan, pectin galactan, starch or arabinogalactan, in each of the 3 strains

[0145] TABLE A

TABLE B

TABLE C

TABLE D

[0146] In some aspects, the isolated strain of P. copri as disclosed herein comprises at least one polynucleotide sequence from P. copri of NRRL deposit no. xxxxx or yyyy or zzzz that enhances utilization of arabinoxylan, pectin, b-mannan, b-glucan, xylan, arabinoxylan, glucomannan, xyloglucan, b-1 ,3-glucan, pectin galactan, starch or arabinogalactan as provided in Table A. In some aspects, the current disclosure encompasses a composition comprising a carrier and an isolated strain of P. copri comprising at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of a polynucleotide sequence encoding a protein from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of a genome sequence deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2. In some aspects, the isolated strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of polynucleotide sequences from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz.

[0147] In some aspects, the current disclosure also encompasses composition comprising a carrier and a probiotic strain comprising at least one polynucleotide sequence from P. copri of NRRL deposit no. xxxxx or yyyy or zzzz that enhances utilization of arabinoxylan, pectin, b- mannan, b-glucan, xylan, arabinoxylan, glucomannan, xyloglucan, b-1,3-glucan, pectin galactan, starch or arabinogalactan as provided in Table A. In some aspects, the current disclosure encompasses a composition comprising a carrier and a probiotic strain comprising at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of a polynucleotide sequence encoding a protein from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of a genome sequence deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2. In some aspects, the probiotic strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of polynucleotide sequences from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. In some aspects the probiotic bacterial strain comprises a genome sequence at least about 90% identical to any one of the sequences deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2. In some aspects, the probiotic bacterial strain has a genome at least about 90%, or at least about 91%, or at least about 92%, or at least about 93%, or at least about 94%, or at least about 95%, or at least about 96%, or at least about 97%, or at least about 98%, or at least about 99% or more to the genome of any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz or the genome as deposition at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2.

[0148] In some aspects, the current disclosure also encompasses composition comprising a carrier and an engineered probiotic strain comprising at least one polynucleotide sequence from P. copri of NRRL deposit no. xxxxx or yyyy or zzzz that enhances utilization of arabinoxylan, pectin, b-mannan, b-glucan, xylan, arabinoxylan, glucomannan, xyloglucan, b-1 ,3-glucan, pectin galactan, starch or arabinogalactan as provided in Table A. In some aspects, the current disclosure encompasses a composition comprising a carrier and an engineered probiotic strain comprising at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of a polynucleotide sequence encoding a protein from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of a genome sequence deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2. In some aspects, the engineered probiotic strain comprises at least two, at least three, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10 or at least 20, at least 30 or more of polynucleotide sequences from one or more of the polysaccharide utilization loci PUL3a, PUL3b, PUL9, PUL10, PUL15, PUL16, PUL17, PUL18, PUL 19, PUL20, PUL22, or PUL30 or any combination thereof, of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz. In some aspects the engineered probiotic strain comprises a genome sequence at least about 90% identical to any one of the sequences deposited at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2. In some aspects, the engineered probiotic bacterial strain has a genome at least about 90%, or at least about 91%, or at least about 92%, or at least about 93%, or at least about 94%, or at least about 95%, or at least about 96%, or at least about 97%, or at least about 98%, or at least about 99% or more to the genome of any one of P. copri strain NRRL deposit no. xxxxx or yyyyy or zzzzz or the genome as deposition at the European Nucleotide Archive with accession numbers ERZ17359655a corresponding to Prevotella copri Bg131 , ERZ17359674 corresponding to Prevotella copri BgF5_2 and ERZ17359677 corresponding to Prevotella copri BgD5_2.

[0149] In some aspects, the current disclosure encompasses compositions comprising more than about 10 ², or more than about 10 ³, or more than about 10 ⁵, or more than about 10 ⁷, or more than about 10 ⁹, or more than about 10 ¹¹, or more than about 10 ¹³ cfu per gram of P. copri (NRRL deposit #XXXX or YYYY or both). In some aspects, the composition may comprise more than about 10 ², or more than about 10 ³, or more than about 10 ⁵, or more than about 10 ⁷, or more than about 10 ⁹, or more than about 10 ¹¹, or more than about 10 ¹³ cfu of per gram of one or more isolated P. copri strains as disclosed herein. In some aspects, the composition may comprise more than about 10 ², or more than about 10 ³, or more than about 10 ⁵, or more than about 10 ⁷, or more than about 10 ⁹, or more than about 10 ¹¹, or more than about 10 ¹³ cfu of per gram of an engineered probiotic strain as disclosed herein. In some aspects, the composition may comprise more than about 10 ², or more than about 10 ³, or more than about 10 ⁵, or more than about 10 ⁷, or more than about 10 ⁹, or more than about 10 ¹¹, or more than about 10 ¹³ cfu per gram of a combination of strains comprising at least one of the DNA sequences as disclosed herein. In some aspects, the compositions disclosed herein comprise at least one suitable carrier.

[0150] In some aspects, the composition may comprise viable P. copri , engineered probiotic cells, or combination thereof. In some aspects, the composition may comprise a mixture of viable and non-viable cells. In some aspects, the compositions disclosed herein comprise at least one suitable carrier.

[0151] In some aspects the composition may further comprise additional bacterial strains thus forming a mixture of probiotic strains. As used herein, the term “probiotic” refers to any live microorganism which when administered to a subject in adequate amounts confers a health benefit. In some aspect, the compositions of the current disclosure may comprise an isolated P. copri or engineered probiotic strain as disclosed herein and an additional probiotic strain. In some aspects the additional probiotic strains may include one of more of naturally occurring or engineered strains, particular but non-limiting examples of which include Arthrobacter agilis, Arthrobacter citreus, Arthrobacter globiformis, Arthrobacter leuteus. Arthrobacter simplex, Azotobacter chroococcum, Azotobacter paspali, Azospirillum brasiliencise, Azospriliium lipoferum, Bacillus brevis, Bacillus macerans, Bacillus pumilus, Bacillus polymyxa, Bacillus subtilis, Bacteroides lipolyticum, Bacteroides succinogenes, Brevibacterium lipolyticum, Brevibacterium stationis, Bacillus laterosporus, Bacillus bifidum, Bacillus laterosporus, Bifidophilus infantis, Streptococcus thermophilous, Bifidophilus longum, Bifidobacterium infantis, Bifidobacteria animalis, Bifidobacteria bifidus, Bifidobacteria breve, Bifidobacteria longum, Kurtha zopfil, Lactobacillus paracasein, Lactobacillus acidophilus, Lactobacillus planetarium, Lactobacillus salivarius, Lactobacillus rueteri, Lactobacillus bulgaricus, Lactobacillus helveticus, Lactobacillus easel, Lactobacillus rhamnosus. Lactobacillus sporogenes, Lactococcus lactis, Myrothecium verrucaris, Prevotella spp., Pseudomonas calcis, Pseudomonas dentrificans, Pseudomonas flourescens, Pseudomonas glathei, Phanerochaete chrysosporium, Saccharomyces boulardii, Streptmyces fradiae, Streptomyces cellulosae, Stretpomyces griseofiavus and combinations thereof.

[0152] In some aspects the formulation may comprise a viable mixture of probiotic cells. In some aspects the formulation may comprise non-viable mixture of probiotic cells. In some aspects the formulation may comprise a mixture of viable and non-viable mixture of pro-biotic cells.

[0153] In some aspects, the compositions as disclosed herein further comprise a suitable carrier. “Carrier” is understood as any substance that facilitates the growth, transportation and/or administration of the strains of the present invention. Depending on the purpose and/or use to which said strains are intended for, the “carriers” could be of different nature. The present invention relates to pharmaceutically acceptable “carriers” such as those commonly associated to capsules, tablets or powder, as well as a “carriers” formed by ingredients or food products. In some aspects, the carrier is an ingestible carrier. Non-limiting examples of ingestible carriers include milk components, baby formula, baby food including but not limited to F-75 or F-100 formulas used for the management of malnutrition, human milk oligosaccharides, breast milk, sugar, flavor enhancers.

[0154] In some aspects the formulation may further comprise a prebiotic material, an excipient, an adjuvant, stabilizers, a biological compound, dietary supplements, proteins, a vitamin, a drug, a vaccine or a combination thereof. "Prebiotic" means one or more non-digestible food substance that promotes the growth of health beneficial micro-organisms, or probiotics in the intestines. They are not broken down in the stomach, or upper intestine or absorbed in the Gl tract of the person ingesting them, but they are fermented by the gastrointestinal microbiota or by probiotics. In some aspects, the current disclosure also encompasses synbiotic formulations comprising the at least a probiotic strain as disclosed herein. Synbiotics refer to nutritional supplements combining probiotics and prebiotics in a form of synergism. Non-limiting examples of prebiotics include acacia gum, alpha glucan, arabinogalactans, beta glucan, dextrans, fructooligosaccharides, fucosyllactose, galactooligosaccharides, galactomannans, gentiooligosaccharides, glucooligosaccharides, guar gum, inulin, isomaltooligosaccharides, lactoneotetraose, lactosucrose, lactulose, levan, maltodextrins, milk oligosaccharides, partially hydrolyzed guar gum, pecticoligosaccharides, resistant starches, retrograded starch, sialooligosaccharides, sialyllactose, soyoligosaccharides, sugar alcohols, xylooligosaccharides, or their hydrolysates, or combinations thereof. Non-limiting examples of proteins include dairy based proteins, plant-based proteins, animal-based proteins and artificial proteins. Dairy based proteins include, for example, casein, caseinates (e.g., all forms including sodium, calcium, potassium caseinates), casein hydrolysates, whey (e.g., all forms including concentrate, isolate, demineralized), whey hydrolysates, milk protein concentrate, and milk protein isolate. Plant based proteins include, for example, soy protein (e.g., all forms including concentrate and isolate), pea protein (e.g., all forms including concentrate and isolate), canola protein (e.g., all forms including concentrate and isolate), other plant proteins that commercially are wheat and fractionated wheat proteins, corn and it fractions including zein, rice, oat, potato, peanut, green pea powder, green bean powder, and any proteins derived from beans, lentils, and pulses. As used herein the term “vitamin” is understood to include any of various fat-soluble or water-soluble organic substances (non-limiting examples include vitamin A, Vitamin B1 (thiamine), Vitamin B2 (riboflavin), Vitamin B3 (niacin or niacinamide), Vitamin B5 (pantothenic acid), Vitamin B6 (pyridoxine, pyridoxal, or pyridoxamine, or pyridoxine hydrochloride), Vitamin B7 (biotin), Vitamin B9 (folic acid), and Vitamin B12 (various cobalamins; commonly cyanocobalamin in vitamin supplements), vitamin C, vitamin D, vitamin E, vitamin K, folic acid and biotin) essential in minute amounts for normal growth and activity of the body and obtained naturally from plant and animal foods or synthetically made, pro-vitamins, derivatives, analogs. Non-limiting examples of excipients include binders, emulsifiers, diluents, fillers, disintegrants, effervescent disintegration agents, preservatives, antioxidants, flavormodifying agents, lubricants and glidants, dispersants, coloring agents, pH modifiers, chelating agents, and release-controlling polymers. Non-limiting list of adjuvants include potassium alum, aluminum hydroxide, aluminum phosphate, calcium phosphate hydroxide, paraffin oil, adjuvant 65, killed bacteria of the species Bordetella pertussis, Mycobacterium bovis, toxoids, plant saponins from quillaja and soybean, cytokines: IL-1 , IL-2, IL-1 , Freund's complete adjuvant, Freund's incomplete adjuvant and squalene.

[0155] In some aspects, the current disclosure also encompasses synbiotic formulations comprising the compositions as disclosed herein and further comprising a food formulation. In some aspects, any suitable food formulation can be combined with the disclosed compositions.

[0156] In some aspects, the food formulation as disclosed herein is an edible composition that impacts the subject’s gut microbiota in a manner to modulate expression of nucleic acids encoding proteins in particular enzyme families, such that physiological parameters of the subject are improved, e.g., ponderal growth or rate of ponderal growth. Components of the food formulation and some exemplary formulations are provided below in sections a-f. In some aspects, the food formulations as disclosed herein can be used with the probiotic compositions disclosed herein. However, the current disclosure also encompasses the use of these food formulation without the use of additional compositions comprising a probiotic bacterial strain, but to promote the beneficial functions of the target P. copri strains already present in a subject’s microbiota.

(a) food formulation comprising chickpea flour, peanut flour, soy flour, raw banana

[0157] In one aspect, a food formulation of the present disclosure comprises chickpea flour, peanut flour, soy flour, and raw banana, wherein the chickpea flour, the peanut flour, the soy flour, and the raw banana provide at least 8.5 g of protein per 100 g of the food formulation. In preferred aspects, the food formulation contains no cow’s milk or powdered cow’s milk, or no milk or powdered milk of any kind, or no milk, powdered milk, or milk product of any kind. In still further aspects, the food formulation also contains no seeds, nuts, nut butters, dried fruit, cocoa nibs, cocoa powder, chocolate, rice flour, lentil flour, or any combination thereof. For example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no cow’s milk or powdered cow’s milk and (a) no seed, nuts, and nut butter, and/or (b) no cocoa nibs, cocoa powder or chocolate, and/or (c) no rice flour and lentil flour, and/or (d) no dried fruit. In another example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no milk or powdered milk of any kind and (a) no seed, nuts, and nut butter, and/or (b) no cocoa nibs, cocoa powder or chocolate, and/or (c) no rice flour and lentil flour, and/or (d) no dried fruit.

[0158] In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide 8.5 g to about 40 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 9 g to about 40 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 10 g to about 40 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 11 g to about 40 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 9 g to about 30 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 10 g to about 28 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 11 g to about 26 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 12 g to about 24 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 12 g to about 14 g of protein per 100 g of the food formulation. In some aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide about 13 g to about 15 g of protein per 100 g of the food formulation. In other aspects, the chickpea flour, the peanut flour, the soy flour, and the raw banana, in total, provide 8.5 g, about 9 g, about 9.5 g, about 10 g, about 10.5 g, about 11 g, about 11.5 g, about 12 g, about 12.5 g, about 13 g, about 13.5 g, about 14 g, about 14.5 g, or about 15 g, about 15.5 g, about 16 g, about 16.5 g, about 17 g, about 17.5 g, about 18 g, about

18.5 g, about 19 g, about 19.5, about 20 g, about 20.5, about 21 g about 21 .5, about 22 g about

22.5, about 23 g, about 23.5, about 24 g, about 24.5, about 25 g, about 25.5, about 26 g, about

26.5, about 27 g, about 27.5, about 28 g, about 28.5, about 29 g, about 29.5, about 30 g, about

30.5, about 31 g, about 31 .5, about 32 g, about 32.5 g, about 33, about 33.5 g, about 34 g, about

34.5 g, about 35 g, about 35.5 g, about 36, about 36.5 g, about 37 g, about 37.5 g, about 38 g, about 38.5 g, about 39 g, about 39.5 g, about 40 g of protein per 100 g of the food formulation.

[0159] In each of the above aspects, the weight ratio of the chickpea flour to the peanut flour to the soy flour to the raw banana may vary. Typically, chickpea flour has about 20%-40% protein by weight, peanut flour has about 20%-50% protein by weight, soy flour has about 20%-50% protein by weight, and raw banana has about 1-30% protein by weight. The weight percentages of protein in each ingredient may vary however, depending upon the varietal of plant and, in the case of the flours, the method used to manufacture the flour. In some aspects, the weight ratio is about 1 : about 1 : about 0.8: about 1 .9, respectively (chickpea flour: peanut flour: soy flour: raw banana), or a weight ratio adjusted as needed to reflect differences in the ingredients.

[0160] In an exemplary aspect, a food formulation of the present disclosure comprises about 9- 11 g of chickpea flour, about 9-11 g of peanut flour, about 7-9 g of soy flour, and about 17-21 g of raw banana. In preferred aspects, the food formulation contains no cow’s milk or powdered cow’s milk, or no milk or powdered milk of any kind. In still further aspects, the food formulation also contains no seeds, nuts, nut butters, dried fruit, cocoa nibs, cocoa powder, chocolate, rice flour, lentil flour, or any combination thereof. For example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no cow’s milk or powdered cow’s milk and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit. In another example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no milk or powdered milk of any kind and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit.

[0161] In another exemplary aspect, a food formulation of the present disclosure comprises about 10 g of chickpea flour, about 10 g of peanut flour, about 8 g of soy flour, and about 19 g of raw banana. In preferred aspects, the food formulation contains no cow’s milk or powdered cow’s milk, or no milk or powdered milk of any kind. In still further aspects, the food formulation also contains no seeds, nuts, nut butters, dried fruit, cocoa nibs, cocoa powder, chocolate, rice flour, lentil flour, or any combination thereof. For example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no cow’s milk or powdered cow’s milk and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit. In another example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no milk or powdered milk of any kind and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit.

[0162] In another exemplary aspect, a food formulation of the present disclosure comprises about 11.9 g of chickpea flour, about 10 g of peanut flour, about 13 g of soy flour, and about 15 g of raw banana. In preferred aspects, the food formulation contains no cow’s milk or powdered cow’s milk, or no milk or powdered milk of any kind. In still further aspects, the food formulation also contains no seeds, nuts, nut butters, dried fruit, cocoa nibs, cocoa powder, chocolate, rice flour, lentil flour, or any combination thereof. For example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no cow’s milk or powdered cow’s milk and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit. In another example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no milk or powdered milk of any kind and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit.

[0163] In another exemplary aspect, a food formulation of the present disclosure comprises about 13 g of chickpea flour, about 13 g of peanut flour, about 11 g of soy flour, and about 14.90 g of raw banana. In preferred aspects, the food formulation contains no cow’s milk or powdered cow’s milk, or no milk or powdered milk of any kind. In still further aspects, the food formulation also contains no seeds, nuts, nut butters, dried fruit, cocoa nibs, cocoa powder, chocolate, rice flour, lentil flour, or any combination thereof. For example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no cow’s milk or powdered cow’s milk and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit. In another example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no milk or powdered milk of any kind and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit.

[0164] In another exemplary aspect, a food formulation of the present disclosure comprises about 8.68 g of chickpea flour, about 13.87 g of peanut flour, about 16.30 g of soy flour, and about 8.75 g of raw banana. In preferred aspects, the food formulation contains no cow’s milk or powdered cow’s milk, or no milk or powdered milk of any kind. In still further aspects, the food formulation also contains no seeds, nuts, nut butters, dried fruit, cocoa nibs, cocoa powder, chocolate, rice flour, lentil flour, or any combination thereof. For example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no cow’s milk or powdered cow’s milk and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit. In another example, food formulations of the present disclosure comprising chickpea flour, peanut flour, soy flour, and raw banana may contain no milk or powdered milk of any kind and (i) no seed, nuts, and nut butter, and/or (ii) no cocoa nibs, cocoa powder or chocolate, and/or (iii) no rice flour and lentil flour, and/or (iv) no dried fruit. (b) food formulation comprising glycan equivalents of chickpea flour, peanut flour, soy flour, raw banana

[0165] In another aspect, a food formulation of the present disclosure is a food formulation of (a), wherein some or all the chickpea flour, the peanut flour, the soy flour, and/or the raw banana is replaced with a glycan equivalent thereof. As used herein, a “glycan equivalent” refers to a food formulation with a similar glycan content. The term “similar” generally refers to a range of numerical values, for instance, ± 0.5-1%, ± 1-5% or ± 5-10% of the recited value, that one would consider equivalent to the recited value, for example, having the same function or result. Because a glycan equivalent has a similar glycan content to the ingredient it is replacing, it may be substituted about 1 :1. For instance, if 3 g of chickpea flour is to be replaced with a glycan equivalent thereof, one of skill in the art would use about 3 g of the chickpea glycan equivalent. A glycan equivalent may be defined in terms of its monosaccharide content and optionally by an analysis of the glycosidic linkages. Methods for measuring monosaccharide content and analyzing glycosidic linkages are known in the art.

[0166] In some aspects, some or all the chickpea flour is replaced with a glycan equivalent of chickpea flour. For instance, a food formulation of (a) may comprise a glycan equivalent of about 0.5 g or more of chickpea flour. In another example, a food formulation of (a) may comprise a glycan equivalent of about 1 g, about 2 g, about 3 g, about 4 g, about 5 g, about 6 g, about 7 g, about 8 g, about 9 g, or about 10 g, or about 11 g, or about 12 g, or about 13 g, or about 14 g, or about 15 g of chickpea flour. In another example, a food formulation of (a) may comprise a glycan equivalent of about 0.1 g to about 15 g of chickpea flour, or about 0.5 to about 5 g of chickpea flour. In another example, a food formulation of (a) may comprise a glycan equivalent of about 1 g to about 15 g of chickpea flour, or about 1 g to about 5 g of chickpea flour, or about 2.5 g to about 7.5 g of chickpea flour, to about 5 g to about 15 g of chickpea flour. In further aspects, some or all the peanut flour is also replaced with a glycan equivalent of peanut flour, some or all the soy flour is also replaced with a glycan equivalent of soy flour, and/or some or all the raw banana is also replaced with a glycan equivalent of raw banana.

[0167] In some aspects, some or all the peanut flour is replaced with a glycan equivalent of peanut flour. For instance, a food formulation of (a) may comprise a glycan equivalent of about 0.5 g or more of peanut flour. In another example, a food formulation of (a) may comprise a glycan equivalent of about 1 g, about 2 g, about 3 g, about 4 g, about 5 g, about 6 g, about 7 g, about 8 g, about 9 g, or about 10 g, or about 11 g, or about 12 g, or about 13 g, or about 14 g, or about 15 g of peanut flour. In another example, a food formulation of Section l(a) may comprise a glycan 15 g of peanut flour. In another example, a food formulation of Section l(a) may comprise a glycan equivalent of about 0.1 g to about 15 g of peanut flour, or about 0.5 to about 5 g of peanut flour. In another example, a food formulation of (a) may comprise a glycan equivalent of about 1 g to about 10 g of peanut flour, or about 1 g to about 15 g of peanut flour, or about 2.5 g to about 12.5 g of peanut flour, to about 5 g to about 10 g of peanut flour. In further aspects, some or all the chickpea flour is also replaced with a glycan equivalent of chickpea flour, some or all the soy flour is also replaced with a glycan equivalent of soy flour, and/or some or all the raw banana is also replaced with a glycan equivalent of raw banana.

[0169] In some aspects, some or all the soy flour is replaced with a glycan equivalent of soy flour. For instance, a food formulation of (a) may comprise a glycan equivalent of about 0.5 g or more of soy flour. In another example, a food formulation of (a) may comprise a glycan equivalent of about 1 g, about 2 g, about 3 g, about 4 g, about 5 g, about 6 g, about 7 g, or about 8 g, or about 9 g, or about 10 g, or about 11 g, or about 12 g, or about 13 g, or about 14 g, or about 15 g of soy flour. In another example, a food formulation of (a) may comprise a glycan equivalent of about 0.1 g to about 15 g of soy flour, or about 0.5 to about 10 g of soy flour. In another example, a food formulation of (a) may comprise a glycan equivalent of about 1 g to about 15 g of soy flour, or about 1 g to about 5 g of soy flour, or about 2 g to about 7.5 g of soy flour, to about 10 g to about 15 g of soy flour. In further aspects, some or all the chickpea flour is also replaced with a glycan equivalent of chickpea flour, some or all the peanut flour is also replaced with a glycan equivalent of peanut flour, and/or some or all the raw banana is also replaced with a glycan equivalent of raw banana.

[0170] In some aspects, some or all the raw banana is replaced with a glycan equivalent of raw banana. For instance, a food formulation of (a) may comprise a glycan equivalent of about 0.5 g or more of raw banana. In another example, a food formulation of (a) may comprise a glycan equivalent of about 1 g, about 2 g, about 3 g, about 4 g, about 5 g, about 6 g, about 7 g, about 8 g of raw banana, about 9 g of raw banana, about 10 g of raw banana, about 11 g of raw banana, about 12 g of raw banana, about 13 g of raw banana, about 14 g of raw banana, about 15 g of raw banana, about 16 g of raw banana, about 17 g of raw banana, about 18 g of raw banana, or about 19 g of raw banana. In another example, a food formulation of (a) may comprise a glycan equivalent of about 0.1 g to about 8 g of raw banana, or about 0.5 to about 5 g of raw banana. In another example, a food formulation of (a) may comprise a glycan equivalent of about 1 g to about 8 g of raw banana, or about 1 g to about 4 g of raw banana, or about 2 g to about 6 g of raw banana, to about 4 g to about 8 g of raw banana. In further aspects, some or all the chickpea flour is also replaced with a glycan equivalent of chickpea flour, some or all the peanut flour is also replaced with a glycan equivalent of peanut flour, and/or some or all the soy flour is also replaced with a glycan equivalent of soy flour.

[0171] A micronutrient premix in a food formulation of the present disclosure is present in an amount that provides at least 60% of the recommended daily allowance (RDA), for a given age group, of minimally vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc. The RDA of vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc, for various age groups, is known in the art. Given that different age groups may have different RDA’s, it will be appreciated by a person of skill in the art that certain food formulations may not be suitable for subjects of all ages. For example, a food formulation with 60% of the Vitamin C RDA for a subject 7-12 months in age (e.g., 40 mg) will not contain at least 60% of the Vitamin C RDA for a subject 21 years of age (e.g., 75-90 mg). The term “vitamin “B,” as used herein, is inclusive of all B vitamins, unless otherwise specified. Although food formulations of the present disclosure are described as comprising a micronutrient premix, the addition of each vitamin and mineral separately, or the use of multiple premixes, is also contemplated and encompassed by the aspects described herein. Similarly, in alternative aspects, the micronutrient premix can be formulated separately and administered as a distinct food formulation in conjunction with a food formulation comprising chickpea flour or a glycan equivalent thereof, peanut flour or a glycan equivalent thereof, soy flour or a glycan equivalent thereof, raw banana or a glycan equivalent thereof.

[0172] In various aspects, a micronutrient premix provides at least 60%, at least 61 %, at least 62%, at least 63%, at least 64%, at least 65%, at least 66%, at least 67%, at least 68%, at least

69%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least

76%, at least 77%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81%, at least

82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least

89%, at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least

96%, at least 97%, at least 98%, at least 99%, or at least 100% of the recommended daily allowance (RDA), for a given age group, of minimally vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc. In certain aspects, a micronutrient premix provides more than 100% of the RDA, for a given age group, of minimally vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc. In a specific aspect, the micronutrient premix provides at least 75% of the recommended daily allowance (RDA), for a given age group, of minimally vitamins A, C, D and E, all B vitamins, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc. The RDA of vitamins and minerals for different age groups is well known in the art.

[0173] In a specific aspect, a micronutrient premix provides at least 60%, at least 61%, at least 62%, at least 63%, at least 64%, at least 65%, at least 66%, at least 67%, at least 68%, at least 69%, at least 70%, at least 71%, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 77%, at least 78%, at least 79%, or at least 80% of the recommended daily allowance (RDA) for children aged 12-24 months of vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc.

[0174] In another specific aspect, the micronutrient premix provides at least 70% of the recommended daily allowance (RDA) for children aged 12-24 months of minimally vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc.

[0175] In another specific aspect, the micronutrient premix provides at least 75% of the recommended daily allowance (RDA) for children aged 12-24 months of minimally vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc.

[0176] A micronutrient premix may further comprise vitamins and minerals in addition to the vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc .

[0177] In an exemplary aspect, a food formulation of the present disclosure contains vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, phosphorus, potassium, and zinc in the amounts listed in Table E and Table F. In a preferred aspect, a food formulation of the present disclosure contains the nutrients of Table E in the amounts listed in Table E. In another preferred aspect, a food formulation of the present disclosure contains the nutrients of Table F in the amounts listed in Table F. In yet another preferred aspect, a food formulation of the present disclosure contains the nutrients of both Table A and Table B, in the amounts listed in Table E and Table F respectively.

Table E. Vitamin Premix

[0178] In an exemplary aspect, a food formulation of the present disclosure contains the micronutrients in Table F, in the amounts in Table F.

Table F: Mineral Premix

[0179] For a 100 g food formulation, 2.982 g of the Mineral Premix is used. Accordingly, to calculate the amount of a given mineral in a 100 g food formulation, the amounts listed above are multiplied by 2.982.

(d) macronutrient content

[0180] A micronutrient premix in a composition of the present disclosure is present in an amount that provides at least 60% of the recommended daily allowance (RDA), for a given age group, of minimally vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc. The RDA of vitamin A, vitamin C, vitamin D, vitamin E, vitamin B, calcium, copper, iron, magnesium, manganese, phosphorus, potassium, and zinc, for various age groups, is known in the art. Given that different age groups may have different RDA’s, it will be appreciated by a person of skill in the art that certain compositions may not be suitable for subjects of all ages. For example, a composition with 60% of the Vitamin C RDA for a subject 7-12 months in age (e.g., 40 mg) will not contain at least 60% of the Vitamin C RDA for a subject 21 years of age (e.g., 75-90 mg). The term “vitamin “B,” as used herein, is inclusive of all B vitamins, unless otherwise specified. Although compositions of the present disclosure are described as comprising a micronutrient premix, the addition of each vitamin and mineral separately, or the use of multiple premixes, is also contemplated and encompassed by the embodiments described herein. Similarly, in alternative embodiments, the micronutrient premix can be formulated separately and administered as a distinct composition in conjunction with a composition comprising chickpea flour or a glycan equivalent thereof, peanut flour or a glycan equivalent thereof, soy flour or a glycan equivalent thereof, raw banana or a glycan equivalent thereof. [0083] In various embodiments, a micronutrient premix provides at least 60%, at least 61 %, at least 62%, at least 63%, at least 64%, at least 65%, at least 66%, at least 67%, at least 68%, at least 69%, at least 70%, at least 71 %, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 77%, at least 78%, at least 79%, at least 80%, at least 81 %, at least 82%, at least 83%, at least 84%, at least 85%, at least 86%, at least 87%, at least 88%, at least 89%, at least 90%, at least 91 %, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or at least 100% of the recommended daily allowance (RDA), for a given age group, of minimally vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc. In certain embodiments, a micronutrient premix provides more than 100% of the RDA, for a given age group, of minimally vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc. In a specific embodiment, the micronutrient premix provides at least 75% of the recommended daily allowance (RDA), for a given age group, of minimally vitamins A, C, D and E, all B vitamins, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc. The RDA of vitamins and minerals for different age groups is well known in the art.

[0181] In a specific embodiment, a micronutrient premix provides at least 60%, at least 61 %, at least 62%, at least 63%, at least 64%, at least 65%, at least 66%, at least 67%, at least 68%, at least 69%, at least 70%, at least 71 %, at least 72%, at least 73%, at least 74%, at least 75%, at least 76%, at least 77%, at least 77%, at least 78%, at least 79%, or at least 80% of the recommended daily allowance (RDA) for children aged 12- 18 months of vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc.

[0182] In another specific embodiment, the micronutrient premix provides at least 70% of the recommended daily allowance (RDA) for children aged 12-18 months of minimally vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc. [0086] In another specific embodiment, the micronutrient premix provides at least 75% of the recommended daily allowance (RDA) for children aged 12- 18 months of minimally vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc.

[0087] A micronutrient premix may further comprise vitamins and minerals in addition to the vitamin A, vitamin B, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, manganese, phosphorous, potassium and zinc .

(e) additional ingredients

[0183] Food formulations of the present disclosure may further comprise one or more additional ingredient listed in Table G.

Table G

[0184] In some aspects, a food formulation further comprises at least one sweetener. In one aspect, a food formulation further comprises sugar (i.e. sucrose), and optionally one or more additional sweetener. The amount of sugar may vary. In one example, a food formulation comprises up to about 30 g of sugar per 100 g of the food formulation. In another example, a food formulation comprises about 0.1 g to about 30 g of sugar, or about 1 g to about 30 g of sugar, per 100 g of the food formulation. In another example, a food formulation comprises about 10 g to about 30 g of sugar per 100 g of the food formulation. In another example, a food formulation comprises about 20 g to about 30 g of sugar per 100 g of the food formulation. In another example, a food formulation comprises about 25 g to about 30 g of sugar per 100 g of the food formulation. In another example, a food formulation comprises about 27 g to about 30 g of sugar, or about 28 g to about 30 g of sugar, per 100 g of the food formulation. In another example, a food formulation comprises about 27 g, 27.1 g, 27.2 g, 27.3 g, 27.4 g, 27.5 g, 27.6 g, 27.7 g, 27.8 g, 27.9 g or 28 g of sugar per 100 g of the food formulation. In another example, a food formulation of the disclosure comprises about 28 g, 28.1 g, 28.2 g, 28.3 g, 28.4 g, 28.5 g, 28.6 g, 28.7 g, 28.8 g, 28.9 g or 29 g of sugar per 100 g of the food formulation. In another example, a food formulation of the disclosure comprises about 29 g, 29.1 g, 29.2 g, 29.3 g, 29.4 g, 29.5 g, 29.6 g, 29.7 g, 29.8 g, 29.9 g or 30 g of sugar per 100 g of the food formulation.

[0185] In some aspects, a food formulation further comprises at least one fat. A fat may be an animal fat, or more preferably a vegetable oil. In some aspects, a fat is chosen from avocado oil, canola oil, coconut oil, com oil, cottonseed oil, flaxseed oil, grape seed oil, hemp seed oil, olive oil, palm oil, peanut oil, rice bran oil, safflower oil, soybean oil, or sunflower oil. In further aspects, one fat provides at least 50% by weight (wt%) of the total fat in the food formulation. For instance, one fat may provide about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, or about 95% by weight of the total fat in the food formulation. In one example the fat is soybean oil. In one example the fat is canola oil. In still further aspects, two or more fats provide at least 50% by weight of the fat in the food formulation. For instance, two or more fats may provide about 50%, about 55%, about 60%, about 65%, about 70%, about 75%, about 80%, about 85%, about 90%, or about 95% by weight of the total fat in the food formulation. In one example, at least one fat is soybean oil or canola oil. In one example, the fat is soybean oil and canola oil.

[0186] In other aspects, a food formulation further comprises soybean oil, and the soybean oil provides at least 50% by weight of the total fat in the food formulation. In further aspects, the soybean oil provides at least 75% by weight of the total fat in the food formulation. In still further aspects, the soybean oil provides at least 90% by weight of the total weight of fat in the food formulation. In still further aspects, the soybean oil provides at least 95% by weight of the total fat in the food formulation. In each of the above aspects, the food formulation may further comprise a fat chosen from animal fat or vegetable oil.

[0187] In still other aspects, a food formulation further comprises about 20 g of soybean oil. In one aspect, a food formulation comprises about 15 g, about 16 g, about 17 g, about 18 g, about 19 g, about 20 g, or about 21 g of soybean oil per 100 g of the food formulation. In another aspect, a food formulation further comprises about 15 g to about 21 g, about 16 g to about 21 g, about 17 g to about 21 g, about 18 g to about 21 g, about 19 g to about 21 g, about 20 g to about 21 g, about 15 g to about 20 g, about 16 g to about 20 g, about 17 g to about 20 g, about 18 g to about 20 g, or about 19 g to about 20 g of soybean oil per 100 g of the food formulation. In still another aspect, a food formulation of the disclosure comprises about 17 g, 17.1 g, 17.2 g, 17.3 g, 17.4 g, 17.5 g, 17.6 g, 17.7 g, 17.8 g, 17.9 g or 18 g of soybean oil per 100 g of the food formulation. In still yet another aspect, a food formulation of the disclosure comprises about 18 g, 18.1 g, 18.2 g, 18.3 g, 18.4 g, 18.5 g, 18.6 g, 18.7 g, 18.8 g, 18.9 g or 19 g of soybean oil per 100 g of the food formulation. In still yet another different aspect, a food formulation further comprises about 19 g, 19.1 g, 19.2 g, 19.3 g, 19.4 g, 19.5 g, 19.6 g, 19.7 g, 19.8 g, 19.9 g or 20 g of soybean oil. In a different aspect, a food formulation of the disclosure comprises about 20 g, 20.1 g, 20.2 g, 20.3 g, 20.4 g, 20.5 g, 20.6, 20.7 g, 20.8 g, 20.9 g or 21 g of soybean oil per 100 g of the food formulation.

(f) exemplary food formulations

[0188] In one aspect, a food formulation of the present disclosure may contain (per 100g) about 10 g chickpea flour or a glycan equivalent thereof, about 10g peanut flour or a glycan equivalent thereof, about 8 g soy flour or a glycan equivalent thereof, about 19 g raw banana or a glycan equivalent thereof, about 29.9g sugar, about 20 g soybean oil, and about 3.1 g micronutrient premix. In another aspect, a food formulation of the present disclosure may contain (per 100g) about 10 g chickpea flour, about 10g peanut flour, about 8 g soy flour, about 19 g raw banana, about 29.9g sugar, about 20 g soybean oil, and about 3.1 g micronutrient premix. In preferred aspects, the micronutrient premix referenced in this paragraph contains the nutrients listed in Table A and Table B in the amount specified in Table E and Table F, respectively.

[0189] In some aspects, a food formulation of the present disclosure as described in this section (f), has total protein of about 11.6 g, total fat of about 20.8 g, total carbohydrate of about 46.2 g, and total fiber of about 4.5 g. For example, a food formulation of the present disclosure may contain (per 100g) about 10 g chickpea flour or a glycan equivalent thereof, about 10g peanut flour or a glycan equivalent thereof, about 8 g soy flour or a glycan equivalent thereof, about 19 g raw banana or a glycan equivalent thereof, about 29.9g sugar, about 20 g soybean oil, and about 3.1 g micronutrient premix, and have total protein of about 11 .6 g, total fat of about 20.8 g, total carbohydrate of about 46.2 g, and total fiber of about 4.5 g. In another example, a food formulation of the present disclosure may contain (per 100g) about 10 g chickpea flour, about 10g peanut flour, about 8 g soy flour, about 19 g raw banana, about 29.9 g sugar, about 20 g soybean oil, and about 3.1 g micronutrient premix, and have total protein of about 11.6 g, total fat of about 20.8 g, total carbohydrate of about 46.2 g, and total fiber of about 4.5 g. In preferred aspects, the micronutrient premix referenced in this paragraph contains the nutrients listed in Table E and Table F in the amount specified in Table E and Table F, respectively.

[0190] In exemplary aspects, a food formulation of the present disclosure as described in this section (f), has a protein energy ratio (PER) of about 11.4, a fat energy ratio (FER) of about 46.0, and total calories of about 400 to about 560 kcal per 100 g of the food formulation. For example, a food formulation of the present disclosure may contain (per 100g) about 10 g chickpea flour or a glycan equivalent thereof, about 10g peanut flour or a glycan equivalent thereof, about 8 g soy flour or a glycan equivalent thereof, about 19 g raw banana or a glycan equivalent thereof, about 29.9g sugar, about 20 g soybean oil, and about 3.1 g micronutrient premix, wherein the food formulation has a protein energy ratio (PER) of about 11.4, a fat energy ratio (FER) of about 46.0, and total calories of about 400 to about 560 kcal per 100 g of the food formulation. In another example, a food formulation of the present disclosure may contain (per 100g) about 10 g chickpea flour, about 10g peanut flour, about 8 g soy flour, about 19 g raw banana, about 29.9g sugar, about 20 g soybean oil, and about 3.1 g micronutrient premix, wherein the food formulation has a protein energy ratio (PER) of about 11 .4, a fat energy ratio (FER) of about 46.0, and total calories of about 400 to about 560 kcal per 100 g of the food formulation. In yet another example, a food formulation of the present disclosure may contain (per 100g) about 10 g chickpea flour or a glycan equivalent thereof, about 10g peanut flour or a glycan equivalent thereof, about 8 g soy flour or a glycan equivalent thereof, about 19 g raw banana or a glycan equivalent thereof, about 29.9g sugar, about 20 g soybean oil, and about 3.1 g micronutrient premix, and have total protein of about 11.6 g, total fat of about 20.8 g, total carbohydrate of about 46.2 g, and total fiber of about 4.5 g, wherein the food formulation has a protein energy ratio (PER) of about 11.4, a fat energy ratio (FER) of about 46.0, and total calories of about 400 to about 560 kcal per 100 g of the food formulation. In still another example, a food formulation of the present disclosure may contain (per 100g) about 10 g chickpea flour, about 10g peanut flour, about 8 g soy flour, about 19 g raw banana, about 29.9g sugar, about 20 g soybean oil, and about 3.1 g micronutrient premix, and have total protein of about 11.6 g, total fat of about 20.8 g, total carbohydrate of about 46.2 g, and total fiber of about 4.5 g, wherein the food formulation has a protein energy ratio (PER) of about 11.4, a fat energy ratio (FER) of about 46.0, and total calories of about 400 to about 560 kcal per 100 g of the food formulation. In preferred aspects, the micronutrient premix referenced in this paragraph contains the nutrients listed in Table A and Table B in the amount specified in Table A and Table B, respectively. [0191] Food formulations of the present disclosure may be formulated into a beverage, a food or a supplement. Non-limiting examples include a bar, a paste, a gel, a cookie, a cracker, a powder, a pellet, a powdered drink to be reconstituted, a blended beverage, a carbonated beverage, and the like. When food formulations of the present disclosure are intended to be administered and consumed by humans, the ingredients in the food formulations are typically Food Chemicals Codex (FCC) purity or U.S. Pharmacopeia (USP) - National Formulary quality, as appropriate, and free from foreign materials. In some aspects, a food formulation may be a therapeutic food. In some aspects, a food formulation may be a ready-to-use food. The term “ready-to-use food” refers to a food that comes ready to use as provided. Specifically, a ready-to- use food doesn’t require reconstitution or refrigeration, and stays fresh for at least 6 months, preferably one year, or more preferably two years. In some aspects, a food formulation may be a ready-to-use therapeutic food, as defined in U.S. Department of Agriculture, “Commercial Item Description: Ready-to-Use Therapeutic Food (RUTF)” A-A-20363B (2012), which is designed to meet the guidelines established at the FAO-WHO 45 ^th session of the Codex Alimentarius Commission (November 21 , 2022).

[0192] Table H provides a list of exemplary food formulations that may be used with the compositions disclosed herein.

Table H:

[0193] Tables l(a), J(a), K(a) and L(a) further provides food formulations modified from the formulations listed in Table H. The corresponding metrics for the formulation including PER, FER and SERs are provided in Tables l(b), J(b), K(b) and L(b). The 4 exemplary formulations include MDCF-2, MDCF-2SS, MDSF, and MD_RUTF. The formulations provided here are exemplary only, and ingredients can be changed based on factors like availability, target age, function, regulatory requirements etc.

Table l(a): Formulation for MDCF-2 Table l(b): Metrics for MDCF-2

Table J(a): Formulation for MDCF-2SS

Table J(b): Metrics for MDCF-2SS

Table K(a): Formulation for MDSF

Table K(b): Metrics for MDSF

Table L(a): Formulation for MD-RUTF

Table L(b): Metrics for MD-RUTF [0194] In some aspects, the current disclosure also encompasses a food formulation as disclosed herein, for example MDCF-1 , MDCF-2, MDCF-3, MDCF-2SS, MDSF, or MD-RUTF or variants thereof, for treatment of MAM, SAM or stunting. In some aspects, the food formulation may be administered to augment the benefits of P. copri in the gut microbiome. In some aspects, the P. copri is administered as a composition as disclosed herein. In some aspects, the P. copri is not externally administered but exists in the subject’s gut microbiome.

[0195] In some aspects, the compositions of the current disclosure may be formulated for any route of administration, for example oral, gastric, orogastric, nasogastric, implanted, buccal, and rectal.

[0196] In some aspects, the compositions of the current disclosure may be formulated in unit dosage form as a solid, semi-solid, liquid, capsule, powder, emulsions, suspensions, tablets and suitably packaged. In some aspects, the strains of the disclosure, or combination of strains and food formulations disclosed herein may be encapsulated. These formulations are a further aspect of the invention. In some aspect the formulations may be mixed with liquids for suitable for orogastric or nasogastric delivery. Usually, the amount of a strain of the invention, or a combination of strains of the invention, is between 0.1-95% by weight of the formulation, or between 0.1-1% or 1 %-10% or 10%-20%, or20%-30%, or 30%-40%, or40%- 50%, or 50%-60%, or 60%-70%, or 70%-80% or 80%-90% or 90%-99% by weight of the formulation. Methods of formulating compositions are discussed in, for example, Hoover, John E., Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa. (1975), and Liberman, H. A. and Lachman, L., Eds., Pharmaceutical Dosage Forms, Marcel Decker, New York, N.Y. (1980).

[0197] In some aspects, administration of the compositions comprising at least one probiotic strain as disclosed herein, can be combined with simultaneous, or staggered administration of other probiotic strains, for example Bifidobacterium longum subspecies infantis (B. infantis) ID number Bg40721_2D9_SN_2018, food formulations, for example MDF (revisions 1 and 2), or both. Dosage and forms of such formulations can be empirically determined by a person of skill in the art.

II. Methods

[0198] In some aspects, the current disclosure encompasses a method of treatment, the method comprising administering to a subject in need thereof, a therapeutically effective quantity of a composition as disclosed in Section I. In some aspects, the methods disclosed herein may be used in the prevention or treatment of malnutrition, Moderate Acute Malnutrition (MAM), Severe Acute Malnutrition (SAM), stunting, necrotizing enterocolitis, nosocomial infections, enteric inflammation, inflammatory disorders, immunodeficiency, inflammatory bowel disease, irritable bowel syndrome, cancer (particularly of the gastrointestinal and immune systems), diarrheal disease, antibiotic associated diarrhea, pediatric diarrhea, appendicitis, allergies, autoimmune disorders, multiple sclerosis, Alzheimer's disease, rheumatoid arthritis, coeliac disease, diabetes mellitus, organ transplantation, bacterial infections, viral infections, fungal infections, periodontal disease, urogenital disease, sexually transmitted disease, HIV infection, HIV replication, HIV associated diarrhea, surgical associated trauma, surgical-induced metastatic disease, sepsis, weight loss, anorexia, fever control, cachexia, wound healing, ulcers, gut barrier function, allergy, asthma, respiratory disorders, circulatory disorders, coronary heart disease, anemia, disorders of the blood coagulation system, renal disease, disorders of the central nervous system, hepatic disease, ischemia, nutritional disorders, osteoporosis, endocrine disorders, epidermal disorders, psoriasis, acne vulgaris, panic disorder, behavioral disorder and/or post- traumatic stress disorders. In some aspects, the current disclosure also encompasses a method for modifying, repairing, or improving the gut microbiota of a subject in need thereof by administration of a therapeutically effective quantity of a composition as provided in Section I, to a subject in need thereof. In some aspects, the current disclosure also encompasses administration of a therapeutically effective quantity of the disclosed compositions to a subject in need thereof, to enhance the uptake, or utilization, or both of milk N-glycans, or plant-derived polysaccharides, or both.

[0199] As used herein the term “therapeutically effective quantity” refers to an amount of the formulation that alleviates, in whole or in part, symptoms associated with the disorder or condition, or halts or slows further progression or worsening of those symptoms or prevents or provides prophylaxis for the disorder or condition. An “effective amount’ refers to an amount effective, at dosages and for periods of time necessary, to achieve the desired therapeutic result. A therapeutically effective amount is also one in which any toxic or detrimental effects of compounds of the invention are outweighed by the therapeutically beneficial effects. In some aspects the therapeutically effective quantity may be a quantity that results in reduction in biomarkers of enteric inflammation in the subject. In some aspects the therapeutically effective quantity may be an amount that results in increases in the levels of beneficial plasma protein biomarkers. In some aspects the therapeutically effective quantity may be a quantity that results in significant improvement in ponderal growth as evidenced from weight-for-age z score (WAZ) or mid-upper arm circumference (MUAC) or any other objective measure known in the art. In some aspects the therapeutically effective quantity may be an amount that is sufficient to bring about improvement in musculoskeletal and brain development as demonstrated by objective measures known in the art. In some aspects the therapeutically effective quantity may be amounts that result in enhanced colonization of the beneficial probiotic populations in the gut as demonstrated by various objective means used in the art including but not limited to fecal cultures, genomic analysis of fecal or intestinal swabs. In some aspects, the therapeutically effective quantity may be an amount of the formulation that when administered in conjunction with a vaccine, improves the immunogenicity and efficacy of the vaccine for the subject. In some aspects, the therapeutically effective quantity may be an amount of the formulation that improves the overall health of the subject, as measured by objective measures known in the art.

[0200] In some aspects, the amount of a composition administered to a subject and the frequency of administration may vary depending upon the subject or host treated and the particular mode of administration. It will be appreciated by those skilled in the art that the unit content of agent contained in an individual dose of each dosage form need not in itself constitute a therapeutically effective amount, as the necessary therapeutically effective amount could be reached by administration of a number of individual doses.

[0201] Additionally, compositions as disclosed herein may be combined with food formulations as described herein or additional probiotic strains or both. The formulations may be administered together, or the administration may be staggered. Amounts of food formulations or probiotic formulations or both can vary and may be determined by a person of skill in the art. A detailed description of suitable amounts of food formulation for administration is provided in US 2022/0312817, the entire contents of which are hereby incorporated by reference.

[0202] As discussed above, administration can be oral, gastric, orogastric, nasogastric, implanted, buccal, and rectal. In some aspects the compositions in section I may be administered orally as any one of but not limited to a solid, semi-solid, liquid, capsule, powder, emulsions, suspensions and tablet or combinations thereof. In some aspects the compositions in section I may be administered, mixed with any one of but not limited to water, juice, gruel, milk, breast milk, baby food, baby formula including F-75 and F-100 or any other commercially available formula, beverage, food products, fruits and vegetables, raw foods and cooked foods. In some aspects the compositions may be administered once daily. In some aspects the compositions may be administered more than once daily. In some aspects the compositions in section I may be administered orogastrically. In some aspect the compositions may be administered nasogastrically.

[0203] Compositions described herein can be administered in a variety of methods well known in the arts. Administration can include, for example, methods involving oral ingestion, direct injection, drug-releasing biomaterials, polymer matrices, gels, permeable membranes, osmotic systems, multilayer coatings, microparticles, implantable matrix devices, mini-osmotic pumps, implantable pumps, injectable gels and hydrogels, liposomes, micelles (e.g., up to 30 μm), nanospheres (e.g., less than 1 μm), microspheres (e.g., 1-100 μm), reservoir devices, a combination of any of the above, or other suitable delivery vehicles to provide the desired release profile in varying proportions. Other methods of controlled-release delivery of agents or compositions will be known to the skilled artisan and are within the scope of the present disclosure.

[0204] In some aspects, the methods disclosed herein comprise administration of therapeutically effective quantities of the compositions in a subject exhibiting symptoms of or diagnosed with malnutrition. A subject in need of treatment for malnutrition may have a LAZ ≤1 , a MUAC ≤1, a WAZ ≤1 , a WLZ ≤1 , deficiencies in vitamins and minerals, or any combination thereof. In some embodiments, a subject in need of treatment for malnutrition has a LAZ ≤1 , ≤2, or ≤3. In some embodiments, a subject in need of treatment for malnutrition has a MUAC ≤1 , ≤2, or ≤3. In some embodiments, a subject in need of treatment for malnutrition has a WAZ ≤1 , ≤2, or ≤3. In some embodiments, a subject in need of treatment for malnutrition has a WLZ ≤1 , ≤2, or ≤3. In some embodiments, a subject in need of treatment for malnutrition has a LAZ ≤2, a MUAC ≤2, a WAZ ≤2, a WLZ ≤2, or any combination thereof. In some embodiments, a subject in need of treatment for malnutrition has a WAZ ≤1.5 and a WLZ ≤1 .5. In some embodiments, a subject in need of treatment for malnutrition has a WAZ ≤2 and a WLZ ≤2. In some embodiments, the subject has moderate acute malnutrition. In some embodiments, the subject has severe acute malnutrition (SAM). In some aspects the subject is a child or an infant who consume diets with limited breastmilk content. As used herein the term “limited breastmilk diet” is where breastmilk comprises less than 50% of an infant’s total caloric intake. In some aspects breastmilk may comprise 0% of the infant’s total caloric intake. In some aspects breastmilk may comprise less than 10% of the infant’s total caloric intake. In some aspects breastmilk may comprise less than 20% of the total caloric intake. In some aspects breastmilk may comprise less than 30% of the total caloric intake. In some aspects breastmilk may comprise less than 40% of the total caloric intake. In some aspects breastmilk may comprise less than 50% of the total caloric intake. In some aspects the child is exhibiting one or more of the symptoms including but not limited to a very lowweight-for-height (WHZ, less than 3 z-scores below the median WHO growth standards) or a mid-upper arm circumference (MUAC) of less than 11.5cm, visible severe wasting, or nutritional oedema. In some aspects the child is an infant of age 0-24 months. In some aspect the child is of 0-5 years of age. In some aspects the child is from a underdeveloped or developing country. In some aspects the child is from a developed country. In some aspects the child is from an household below the poverty line for a particular country or earning an income below the objective measure of poverty defined for the country of residence. In some aspect the child is exhibiting symptoms of or has been clinically diagnosed with malnutrition.

[0205] In some aspects the present disclosure also encompasses methods for modifying, repairing or improving the health of the gut microbiota of a subject in need thereof. As used herein the term “modifying the gut microbiota” means any intervention that results in change in the gut microbiome as measured by one of many methods available in the art. The change may be a decrease or an increase in the presence of a particular microbial strain, species, genus, family, order, or class. These methods to monitor gut microbiota are well known in the art and may include but are not restricted to fecal cultures, genomic analysis of the feces, or analysis of fecal or intestinal swabs. In some aspects, the present disclosure encompasses methods for repairing or improving the health of the gut microbiota of a subject in need thereof. The “health” of a subject's gut microbiota may be defined by relative abundances of microbial community members, expression of microbial genes, biomarkers, mediators of gut barrier function. To “repair the gut microbiota of a subject,” which is synonymous with “improve gut microbiota health,” means to change the microbiota of a subject, in particular the relative abundances of age- and health- discriminatory taxa, in a statistically significant manner towards chronologically-age matched reference healthy subjects. The term encompasses complete repair (i.e., the measure of gut microbiota health does not deviate by 1 .5 standard deviation or more) and levels of repair that are less than complete. The term also encompasses preventing or lessening a change in the relative abundances of age-and health-discriminatory taxa, wherein the change would have been significantly greater absent intervention. A subject with a gut microbiota in need of repair (e.g., a microbiota in “disrepair”, an “immature” gut microbiota, etc.) has a measure of gut microbiota health that deviates by 1.5 standard deviation or more (e.g., 2 std. deviation, 2.5 std. deviation, 3 std. deviation, etc.) from that of chronologically-age matched subjects, wherein the term “chronological age” means the amount of time a subject has lived from birth. Subjects five years or younger are grouped (or binned) by month. Subjects older than 5 years may be grouped by longer intervals of time (e.g., months or years). In some embodiments, a subject with a gut microbiota in need of repair is a subject with malnutrition, SAM, a subject at risk of malnutrition, a subject with a diarrheal disease, a subject recently treated for diarrheal disease (e.g., within 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 6 weeks, 7 weeks, or 8 weeks), a subject recently treated with antibiotics (e.g., within 1 week, 2 weeks, 3 weeks, 4 weeks, 5 weeks, 6 weeks, 7 weeks, or 8 weeks), a subject undergoing treatment with an antibiotic, a subject who will be undergoing treatment with an antibiotic with about 1-4 weeks or about 1-2 weeks.

[0206] In some aspects the subject may be an individual clinically diagnosed with a disease or disorder or syndrome or exhibiting symptoms of disease or disorder or syndrome. In some aspects the subject may be a healthy individual.

[0207] The aforementioned methods are not limited to subjects of a particular age. In one aspect, a subject may be less than six months of age. In one aspect, a subject may be at least six months of age. In one example, a subject may be at least six months of age. In another example, a subject may be eighteen years or younger. In still other examples, a subject may be ≤ 15 years, ≤ 14 years, ≤ 13 years, ≤ 12 years, ≤ 11 years, ≤ 10 years, ≤ 9 years, ≤ 8 years, ≤ 7 years, ≤ 6 years, ≤ 5 years, ≤ 4 years, ≤ 3 years, < 2 years. In still other examples, a subject may be a newborn to six months of age, six months to five years of age, six months to 2 years of age, or six months to 18 months of age. In some aspects the subject is a pre-term baby. In some aspects the subject may be an animal. In some aspect the animal may be a mouse model.

[0208] An additional aspect of this invention is a method of improving immunogenicity and efficacy of a vaccine in children who consume diets with limited breast milk, the method comprising administration of effective amounts of the compositions detailed in section I of detailed description.

[0209] Microbiome can transfer from mother to infant. In some aspects of the invention, the compositions detailed in section I, may be administered to women during pregnancy to facilitate colonization of the probiotic in the infant gut.

[0210] In some aspects, effective amounts of the compositions detailed in section I may be administered prophylactically to reduce the occurrence of malnutrition in children growing up in an household below the poverty line of a particular country or earning an income below the objective measure of poverty defined for the country of residence. In some aspects, the compositions disclosed herein may be administered to “improve a subject’s health". To “improve a subject’s health” means to change one or more aspects of a subject’s health in a statistically significant manner towards chronologically-age matched reference healthy subjects, as well as to prevent or lessen a change in one or more aspects of the subject’s health wherein the change would have been significantly greater absent intervention. The improved aspect of the subject’s health may be growth or rate of growth, for example as measured by a score on an anthropometric index; signs or symptoms of disease; relative abundances of health discriminatory plasma proteins, including but not limited to biomarkers, mediators of gut barrier function, bone growth, neurodevelopment, acute and inflammation, and the like. Those in need of treatment to improve their health include those already with a disease, condition, or disorder as well as those prone to have the disease, condition or disorder or those in which the disease, condition or disorder is to be prevented.

EXAMPLES

[0211] The following examples are included to demonstrate preferred embodiments of the disclosure. It should be appreciated by those of skill in the art that the techniques disclosed in the examples that follow represent techniques discovered by the inventor to function well in the practice of the present disclosure, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the present disclosure.

Example 1 : Methods for Examples 2-6

[0212] The following examples 2-6 describes characterization of the bacterial targets and structure-function relationships of a microbiome-directed complementary food prototype, MDCF- 2. Evidence is accumulating that perturbed postnatal development of the gut microbiome contributes to childhood malnutrition. Designing effective microbiome-directed therapeutic foods to repair these perturbations requires knowledge about how food components interact with the microbiome to alter its expressed functions. Herein is described the use of biospecimens from a randomized, controlled study of a microbiome-directed complementary food prototype (MDCF-2) that produced superior rates of weight gain compared to a commonly used ready-to-use supplementary food (RUSF) in 12-18-month-old Bangladeshi children with moderate acute malnutrition (MAM).

[0213] Collection and handling of biospecimens obtained from participants in the randomized controlled clinical study of the efficacy of MDCF-2

[0214] The human study entitled ‘Community-based Clinical Trial With Microbiota-Directed Complementary Foods (MDCFs) Made of Locally Available Food Ingredients for the Management of Children With Primary Moderate Acute Malnutrition (MAM)’, was approved by the Ethical Review Committee at the icddr.b (Protocol PR-18073; ClinicalTrials.gov identifier: NCT04015999). Informed consent was obtained for all participants. The objective of the study was to determine whether twice daily, controlled administration of a locally produced, microbiota- directed complementary food (MDCF-2) for 3 months to children with MAM provided superior improvements in weight gain, microbiota repair, and improvements in the levels of key plasma biomarkers/mediators of healthy growth, compared to a commonly used rice- and lentil-based ready-to-use supplementary food (RUSF) composition. A total of 124 male and female children with MAM (WLZ -2 to -3) between 12- and 18-months-old who satisfied the inclusion criteria were enrolled, with 62 children randomly assigned to each treatment group using the permuted block randomization method. Children in each treatment group were fed the corresponding dietary intervention (MDCF-2 or RUSF) twice daily at a study center for the first month, once daily at a study center and once daily at home for the second month, and twice daily at home for the third month, after which children returned to their normal feeding routine with continued intensive monitoring for one additional month. Fifty-nine participants in each treatment group completed the 3-month intervention and 1 -month post-treatment follow-up. To ensure sample integrity for DNA and RNA analyses, fecal biospecimens were collected within 20 minutes of their production and immediately transferred to liquid nitrogen-charged vapor shippers for transport to a -80 °C freezer at the study center. Coded biospecimens were shipped to Washington University on dry ice where they were stored at -80 °C, along with associated metadata, in a dedicated repository with approval from the Washington University Human Research Protection Office.

Defining the relationship between MAG abundances and WLZ

[0215] Linear mixed-effects models were used to relate the abundances of MAGs identified in each trial participant to WLZ using the formula:

WLZ~ β ₁(MAG) + β ₂(study week) + (1 |P/D)

[0216] The data normalization strategies prior to linear modeling did not include a consideration of MAG assembly length. Therefore, the TPM was analyzed (reads per kilobase per million) output of Kallisto (v0.43.0) by applying a filter requiring each MAG’s abundance >5 TPM in >40% of the 707 fecal samples collected at time points where anthropometry was also measured. This filtering approach yielded 837 MAGs. The unfiltered count output from Kallisto was then used to perform a variance stabilizing transformation [DESeq2] to control for heteroskedasticity, and the dataset was filtered to the same 837 MAGs. Subsequently the linear mixed effects models were fitted to the transformed abundances of each MAG across all 707 fecal samples (lme430, v1 .1-27.1 ; lmerTest31 , v3.1-3). ANOVA was used to determine the statistical significance of the fixed effects in the model - specifically, the relationship between MAG abundance and WLZ. ‘WLZ-associated MAGs’ were defined as those having P-values adjusted for false discovery rate (q-values) <0.05.

[0217] Determining the effects of MDCF-2 supplementation on the abundances of WLZ- associated MAGs and MAGs belonging to a given species [0218] Dream32 (variancePartition R package, v1.24.0) an empirical Bayesian linear mixed- effects modeling framework, was employed to model MAG abundance as a function of treatment group, study week, and their interaction, controlling for the repeated measurements taken from each study participant with random effect term for participant. The equation used to quantify the effects of treatment on MAG abundance took the form:

MAG _i ~ (treatment group) + β ₂(study week) + β ₃(treatment group x study week) + ( 1IPID)

[0219] The ‘treatment group’ coefficient β1 indicates whether MDCF-2 produced changes in the mean abundance of a given MAG relative to RUSF over the 3-month intervention, while the ‘treatment group x study week - interaction’ coefficient β3 indicates whether MDCF-2 affected the rate of change of a given MAG more so than RUSF (i.e., was a MAG increasing or decreasing more rapidly in the microbiomes of participants in the MDCF-2 versus the RUSF treatment group). Each coefficient for each MAG abundance analysis is described by an associated t-statistic - a standardized measure, based on standard error, of a given coefficient’s deviation from 0 which can be used to calculate a P-value and infer the significance of the effect of a given coefficient on the dependent variable. The t-statistics produced by this method can also be used as a ranking factor for input to GSEA. For this analysis, gene sets were defined as groups of MAGs that were either significantly positively (n=75) or significantly negatively (n=147) associated with WLZ. This analysis was conducted for both the ‘treatment group; β1’ coefficient and the ‘interaction; β3’ coefficient. Statistical significance is reported as q-values after adjustment for false-discovery rate (Benjamini-Hochberg method).

Microbial RNA-Seq analysis of MAG gene expression

[0220] For RNA extraction, approximately 50 mg of a fecal sample, collected from each participant at the baseline, 1-month, or 3-month timepoints, was pulverized under liquid nitrogen with a mortar and pestle and transferred to 2 mL cryotubes. A 3.97 mm steel ball and the equivalent of 250 μL of 0.1 mm zirconia/silica beads were subsequently added to each sample tube, together with 500 μL of a mixture of phenol:chloroform:isoamyl alcohol (25:24:1 , pH 7.8- 8.2), 210 μL of 20% SDS, and 500 μL of 2X Qiagen buffer A (200 mM NaCI, 200 mM Trizma base, 20 mM EDTA). After a 1 -minute treatment in a bead beater (Biospec Minibeadbeater-96), samples were centrifuged at 3,220 × g for 4 minutes at 4 °C. One hundred microliters of the resulting aqueous phase was transferred by a liquid handling robot (Tecan) to a deep 96-well plate along with 70 μL of isopropanol and 10 μL of 3M NaOAc, pH 5.5. The solution was mixed by pipetting 10 times. The crude DNA/RNA mixture was incubated at -20 °C for 1 hour and then centrifugated at 3,220 x g at 4°C for 15 minutes before removing 210 μL of the supernatant to yield nucleic acid-rich pellets. A Biomek FX robot was used to add 300 μL Qiagen Buffer RLT to the pellets and to resuspend the RNA/DNA by pipetting up and down 50 times. A 400 μL aliquot was transferred from each well to an Qiagen AHPrep 96 DNA plate, which was centrifuged at 3,220 x g for 1 minute at room temperature. The RNA flow-through was purified as described in the AHPrep 96 protocol. cDNA libraries were prepared from extracted RNA using an Illumina Total RNA Prep with Ribo-Zero Plus and dual unique indexes. Libraries were balanced, pooled, and sequenced in two runs of an Illumina NovaSeq using S4 flow cells.

[0221] As an initial pre-processing step, raw reads were aggregated across the two NovaSeq runs, resulting in a total of 5.0x10 ⁷±4.7x10 ⁶ paired-end 150 nt reads per sample (mean±SD). Adapter sequences and low-quality bases were removed from raw reads (Trim Galore33, vO.6.4), and pairs of trimmed reads were filtered out if either one of the paired reads was less than 100 nt long. Pre- and post-trimmed sequence quality and adapter contamination were assessed using FastQC34 (vO.11.7). Filtered reads were pseudoaligned to the set of 1 ,000 annotated, dereplicated high quality MAGs to quantify transcripts with Kallisto35. Reads that pseudoaligned to rRNA genes were excluded, leaving an average of 7.1x10 ⁶±3.9x10 ⁶ bacterial mRNA reads (mean±SD) per sample. Counts tables were further filtered to retain only transcripts that pseudoaligned to the 837 MAGs that passed the abundance and prevalence thresholds described above. To minimize inconsistently quantified counts related to low-abundance MAGs, a transcript count of zero was assigned, on a per-sample basis, to any MAG with a DNA abundance <0.5 TPM in that sample.

[0222] Differential expression analysis (edgeR36, v3.32.1) was conducted using the following steps: (i) transcript filtering for presence/absence and prevalence; (ii) library size normalization using TMM (trimmed mean of M-values); (iii) estimating per-gene count dispersions; and (iv) testing for differentially expressed genes. Transcripts were first filtered using edgeR default parameters, followed by a parameter sweep of transcript abundance and prevalence threshold combinations. Based on this analysis, transcripts with > 5 counts per million mapped reads (CPM) in > 35% of samples were retained for differential expression analysis. The transcripts that passed this filtering were normalized using a TMM-based scaling factor. Negative binomial dispersions was estimated and fit trended per-gene dispersions (using the power method) to negative binomial generalized linear models, which were used to characterize (i) the effect of treatment group and study week among all participants and (ii) the effect of WLZ-quartile and study week among MDCF-2 participants in the upper and lower quartile of WLZ-response using the following model formulae:

Transcripts ~ β ₁(treatment group) + β ₂(study week) + β ₃(treatment group x study week)

TranscriptSi ~ β ₁(WLZ response quartile) + β ₂(study week) + β ₃(WLZ response quartile x study week)

[0223] From these models, genes that exhibited significant differential expression were identified using the quasi-likelihood F-test (edgeR, function glmQLFTest) which accounts for the uncertainty in estimating the dispersion for each gene.

[0224] For subsequent functional metabolic pathway enrichment analyses, the following was undertaken (i) ordered transcripts assigned to WLZ-associated MAGs based on a ranking metric calculated as the direction of the fold change × -log ₁₀(P-value) for a given differential expression analysis, (ii) defined gene sets as groups of these transcripts assigned to the same metabolic pathway, and (iii) performed GSEA (fgsea37, v3.14). This set of analyses allowed the identification of differentially expressed metabolic pathways comprised of >10 genes overtime (i) between treatment groups, (ii) between WLZ response quartiles or (iii) as a function of interacting terms in the linear mixed effect models (treatment group x study week; WLZ-response quartile x study week). Enrichment results were considered statistically significant if they exhibited q-values <0.1 after controlling for false-discovery rate (Benjamini-Hochberg method).

[0225] For targeted transcriptional analyses of the CAZymes encoded by P. copri MAGs Bg0018 and Bg0019, Dream32 was employed in R with no additional filtering, and the formula above relating transcripts to WLZ response quartile, study week, and the interaction of both terms, with the addition of a random effect for participant.

Principal Components Analysis

[0226] Principal Components Analysis (PCA) was performed on VST-transformed DNA or transcript counts for the 837 MAGs passing the filter described in the section entitled ‘Defining the relationship between MAG abundances and WLZ’ above. The PCA related to transcripts encompassed 27,518 genes expressed by these MAGs at thresholds for levels and prevalence that are described in the section entitled ‘Microbial RNA-Seq analysis of MAG gene expression’ above. PCA was performed in R using the ‘prcomp’ function, with each data type centered but not scaled, since the dataset was already VST-normalized. The functions ‘get_eigenvalues,’ ‘get_pca_ind,’ and ‘get_pca_var’ from the factoextra (v1.0.7) package were utilized to extract (i) the variance explained by each principal component, (ii) the coordinates for each sample along principal components, and (ill) the contributions of each variable to principal components 1-3. ‘Adonis2’ function within the vegan library (v2.5-7) was used to test for the statistical significance of baseline differences in the microbiome (MAGs) or meta-transcriptome between the two treatment groups.

LC-MS analyses of carbohydrates present in MDCF-2, RUSF, their component food ingredients, and fecal biospecimens

[0227] Sample preparation for glycan structure analysis - Samples of MDCF-2, RUSF, their respective ingredients, and fecal biospecimens were ground with a mortar and pestle while submerged in liquid nitrogen. A 50 mg aliquot of each homogenized sample was lyophilized to dryness. Lyophilized samples were shipped to the Department of Chemistry at the University of California, Davis. On receipt, samples were pulverized to a fine powder using 2 mm stainless steel beads (for foods) or2 mm glass beads (for feces). A 10 mg/mL stock solution of each sample was prepared in Nanopure water. All stock solutions were again bead homogenized, incubated at 100 °C for 1 h, bead homogenized again, and stored at -20 °C until further analysis.

[0228] Monosaccharide composition analysis - Briefly, 10 μL aliquots were withdrawn from homogenized stock solutions and transferred to a 96-well plate containing 2 mL wells. Each sample aliquot was acid hydrolyzed (4 M trifluoroacetic acid for 1 h at 121 °C) and quenched by adding 855 μL of ice-cold Nanopure water. Hydrolyzed samples, plus an external calibration standard comprised of 14 monosaccharides with known concentrations (0.001-100 μg/mL each) were derivatized with 0.2 M 1-phenyl-3-methyl-5-pyrazolone (PMP) in methanol plus 28% NH4OH for 30 min at 70 °C. The derivatized glycosides were fully dried by vacuum centrifugation, reconstituted in Nanopure water (Thermo Fischer Scientific), and excess PMP was extracted with chloroform. A 1 μL aliquot of the aqueous layer was injected into an Agilent 1290 Infinity II ultrahigh-performance liquid chromatography (UHPLC) system, separated using a 2-minute isocratic elution on a C18 column (Poroshell HPH, 2.1 x 50 mm, 1.9 μm particle size, Agilent Technologies), and analyzed using an Agilent 6495A triple quadrupole mass spectrometer (QqQ- MS) operated in dynamic multiple reaction monitoring (dMRM) mode. Monosaccharides in the food and fecal samples were identified and quantified by comparison to the external calibration curve.

[0229] Glycosidic linkage analysis - Under an argon atmosphere, a 5 μL aliquot from each homogenized stock solution was permethylated in a 200 μL reaction that contained 5 μL saturated NaOH and 40 μL iodomethane in 150 μL of DMSO. Permethylated glycosides were extracted with dichloromethane, and the extract was dried by vacuum centrifugation. The extracted glycosides were subjected to acid hydrolysis (4 M trifluoroacetic acid for 2 h at 100 °C) followed by vacuum centrifugation to dryness. Samples were then derivatized with PMP as described above for monosaccharide analysis, followed by another vacuum centrifugation to complete dryness. Methylated monosaccharides were then reconstituted with 100 μL of 70% methanol in water. A 1 μL aliquot of the aqueous layer was injected into an Agilent 1290 Infinity II UHPLC system, separated using a 16-minute gradient elution on a C18 column (ZORBAX RRHD Eclipse Plus, 2.1 x 150 mm, 1.8 μm particle size, Agilent Technologies), and analyzed using an Agilent 6495A QqQ-MS operated in multiple reaction monitoring (MRM) mode. A standard pool of oligosaccharides and reference MRM library were used to identify and quantify glycosidic linkages in all samples.

[0230] Fenton’s iinniittiiaattiioonn ttoowwaarrdd defined oligosaccharide groups (FITDOG) polysaccharide analysis - To separate endogenous oligosaccharides from the background food matrix, polysaccharides were precipitated with 80% aqueous ethanol. Dried precipitates were reconstituted, homogenized, and 10 mg/mL stock solutions were prepared. The FITDOG reaction was carried out using a 100 μL aliquot of the 10 mg/mL resuspended food pellet and 900 μL of reaction buffer (44 mM sodium acetate, 1.5% H ₂O ₂, 73 μM Fe ₂(SO ₄) ₃(H ₂O) ₅). The reaction mixture was incubated at 100 °C for 45 minutes, quenched with 500 μL 2 M NaOH, and then neutralized with 61 μL of glacial acetic acid. The resulting oligosaccharides were then reduced to their corresponding alditols with sodium borohydride (NaBH ₄) to prevent anomerization during chromatographic separation. For the reduction of oligosaccharides, a 400 μL aliquot of the reaction mixture was incubated with 400 μL 1 M NaBH ₄ at 65 °C for 60 minutes. Oligosaccharide products were then enriched using C18 and porous graphitized carbon (PGC) 96-well solid-phase extraction plates. For the C18 enrichment, cartridges were primed with 2 x 250 μL acetonitrile (ACN) and then 5 x 250 μL water washes prior to loading the reduced sample. Cartridge effluent was collected and subjected to subsequent PGC clean-up. PGC cartridges were primed with 400 μL water, 400 μL 80% ACN/0.1% (v/v) trifluoroacetic acid (TFA), and then 400 μL water prior to loading the C18 effluent. Washing was performed with 8 x 400 μL water, and the oligosaccharides were eluted with 40% ACN/0.05% (v/v) TFA and then dried using a vacuum centrifugal dryer. Oligosaccharides were reconstituted with 100 μL Nanopure water and a 10 μL aliquot was injected into the HPLC-Q-TOF instrument. Separation was carried out using an Agilent 1260 Infinity II HPLC with a PGC column (Hypercarb, 1 x 150 mm, 5 μm particle size, Thermo Scientific) coupled to an Agilent 6530 Accurate-Mass Q-TOF mass spectrometer. Oligosaccharide identification was based on MS/MS fragmentation and retention time (RT) compared to reacted polysaccharide standards (amylose, cellulose, mannan, galactan, linear arabinan, and xylan). Food polysaccharides were quantified using an external calibration curve that included the three most abundant oligosaccharides from each parent polysaccharide as the quantifier species.

[0231] Statistical analysis of carbohydrate composition - The abundance trends of glycosidic linkages over time and between WLZ-response quartiles using linear mixed-effects models (Ime4, ImerTest packages in R) of the following form:

Linkage _i ~ β ₁(WLZ response quartile) + β ₂(study week) + β ₃(WLZ response quartile x study week) + (1 |P/D)

[0232] Linkages displaying a significant interaction (q <0.05) between WLZ response quartile and study week (β3 coefficient) were identified.

Metagenome assembled genomes (MAGs)

[0233] Short-read shotgun sequencing - DNA was isolated from 942 fecal samples and shotgun sequencing libraries were prepared using a reduced-volume Nextera XT (Illumina) protocol. Libraries were quantified, balanced, pooled, and sequenced [Illumina NovaSeq 6000, S4 flow cell; 2.3±1.4x10 ⁷150 nt paired-end reads/sample (mean±SD)]. Reads were demultiplexed (bcl2fastq, Illumina), trimmed to remove low quality bases, and processed to remove read-through adapter sequences (Trim Galore33, v0.6.4). Read pairs where the length of either read was <50 nt after quality and adapter trimming were discarded. The remaining reads were mapped to the human genome (UCSC hg19) using bowtie2 (v2.3.4.1) and were filtered to remove H. sapiens sequences.

[0234] Preprocessed, short-read shotgun data were aggregated from each participant’s fecal sample set (n=7-8 samples/participant; 118 participants) prior to MAG assembly. This strategy was adopted to enable the contig abundance calculations required by the MAG assembly algorithms employed below, while at the same time mitigating the risk of chimeric assemblies inherent to co-assembly across individuals. Assemblies were generated for all 118 datasets using MegaHit (v1.1.4), and the resulting contigs were quantified in each assembly by mapping preprocessed reads to the assembled contigs with Kallisto. Contigs were assembled into MAGs using MaxBin2 (v2.2.5) and MetaBAT2 (v2.12.1). The parallel results of both binning strategies were merged and dereplicated using DAS Tool (v1 .1 .2) on a per-participant basis.

[0235] Long-read shotgun sequencing- Long-read sequencing was applied to fecal samples obtained at the 0- and 3- month time points from each of the 15 upper quartile WLZ responders in the MDCF-2 treatment group. Aliquots containing 400-1000 ng of DNA from each biospecimen were transferred to a 96-well, 0.8 mL, deep-well plate (Nunc, ThermoScientific) and prepared for long-read sequencing using the SMRTbell Express Template Prep Kit 2.0 (PacBio). All subsequent DNA handling and transfer steps were performed with wide-bore, genomic DNA pipette tips (ART, ThermoScientific). Barcoded adapters were ligated to A-tailed DNA fragments by overnight incubation at 20 °C. Adapter-ligated fragments were then treated with the SMRTbell Enzyme Cleanup Kit to remove damaged or partial SMRTbell templates. A high molecular weight DNA fraction was purified using AMPure beads (ratio of 0.45x well-mixed AMPure bead volume to sample) and eluted in 12 mL of PacBio elution buffer. DNA libraries were sequenced on a Sequel System (Pacific Biosciences) using a Sequel Binding Kit 3.0 and Sequencing Primer v4 with 24 hours of data collection. A total of 3.0x10 ⁹±9.8x10 ⁸ bp/sample were collected, with an average subread length of 5,654±871 bp (meaniSD).

[0236] Hybrid assembly of short- and long-read data was performed using OPERA-MS (v0.9.0). OPERA-MS uses assembly graph and coverage-based methods to cluster contigs into MAGs based on optimizing per-cluster Bayesian information criterion (BIC). Prior to hybrid assembly, continuous long reads (CLR) were combined across the two available timepoints for each participant and reads that mapped to the human genome were removed. Illumina short reads and PacBio long reads (CLR) were provided to OPERA-MS and assembled using the built-in OPERA- MS genome database and default settings (the latter includes polishing of output MAGs with Pilon).

[0237] MAG dereplication, curation and abundance calculations - After assembling MAGs by both short-read only and short- plus long-read strategies, all MAGs from all assembly strategies were assessed for completeness and contamination (‘lineage_wf’ command in CheckM, v1.1.3) and refined (‘tetra’, ‘outliers’, and ‘modify’ commands) to remove contaminating contigs. Additional refinement based on the distribution of phylogenetic markers present in each MAG was performed [‘phylo-markers’, ‘clade-markers’, and ‘clean-bin’ commands in MAGpurify (v2.1.2)]. A final MAG quality assessment was performed using CheckM, followed by a stringent (> 90% complete, ≤ 5% contaminated, ANI > 99%) bulk dereplication (options ‘-I 50000’, -- completeness 90’, ‘--contamination 5’, ‘-pa 0.9’, ‘-sa 0.99’ in dRep (v2.6.2). The final dataset contained 681 ±99.4 (mean+SD) MAGs/participant. All MAGs satisfied the threshold criteria of having an abundance >5 TPM when present at any time point in an individual. MAG assembly summary statistics were collected from CheckM and quast analyses (v4.5) and aggregated. Initial MAG annotations were performed using prokka (v1.14.6). To quantify the abundance of each MAG in each sample MAGs were processed to create a single Kallisto quantification index. Reads from each fecal DNA sample were then mapped to this index.

[0238] MAG taxonomy - Taxonomic assignments were initially made by employing the Genome Taxonomy Database Toolkit (GTDB-Tk) and corresponding database (release 95). MAG assignments were complimented by using Kraken2 (v2.0.8) and Bracken (v2.5) and a Kraken2- compatible version of the GTDB reference.

[0239] P. copri has been partitioned into four distinct clades (‘A-D’) based on marker gene phylogeny. To classify Prevotella MAGs in this study, an unrooted, marker gene-based phylogeny was constructed using Phylophlan (v3.0.60). This tree encompassed 17 reference isolate genomes and 1006 MAGs from a previous study plus any MAGs from the set classified by GTDB- Tk as belonging to the genera Prevotella (n=51 ) or Prevotella massilia (n=13). Putative Prevotella MAGs from the present study that clustered within the four previously identified P. copri clades were assigned to the corresponding clade based on visualization with Graphlan (v1 .1.4).

[0240] Certain Bifidobacterium species consist of multiple closely related subspecies (e.g., B. longum). Therefore, a pan-genome for 34 Bifidobacterium MAGs was calculated in the dataset, plus 14 reference isolate genomes (FIG. 6), using Roary (v3.12.0) and a 60% minimum sequence identity threshold for blastp. The reference isolate genomes included 10 Bifidobacterium species and three subspecies of Bifidobacterium longum (subsp. longum, infantis, and suis). Concatenated nucleotide sequences of 142 identified core genes were aligned using MAFFT (v7.313). The resulting alignment was trimmed [microseq R package (v2.1.4)] and was then used to construct a maximum likelihood phylogenetic tree [IQ-TREE (v1.6.12)]. The Bifidobacterium gallicum DSM 20093 genome was selected as an outgroup. Putative Bifidobacterium MAGs from this study that clustered together with reference genome clades were assigned to the corresponding clade. Using this method, the initial GTDB-Tk-based classifications of all Bifidobacterium MAGs were confirmed or updated or resolved nearly all closely related subspecies (FIG. 6).

Subsystems-based annotation and prediction of functional capabilities (‘inferred metabolic phenotypes’) of MAGs

[0241] MAG genes were assigned functions, and metabolic pathways were reconstructed using a combination of (i) public domain tools for sequence alignment and clustering, (ii) custom scripts to process the results of sequence alignments (e.g., for domain annotation in multifunctional proteins), and (iii) a reference collection of 2,856 human gut bacterial genomes for which reconstructed and manually-curated metabolic pathways were available related to 98 distinct metabolites and 106 metabolic phenotypes. These annotations are captured in the mcSEED database, a microbial community-centered adaptation of the SEED genomic platform, featuring subsystems-based annotation and pathway reconstruction applied to representative human gut bacterial genomes that were initially automatically annotated by RAST or downloaded from the PATRIC database. Each mcSEED subsystem includes a set of functional roles (e.g., enzymes, transporters, transcriptional regulators) that contribute to the prediction of functional metabolic pathways and pathway variants involved in utilization and catabolism carbohydrates and amino acids, biosynthesis of vitamins/cofactors and amino acids, and generation of fermentation endproducts such as short-chain fatty acids.

[0242] Briefly, a reference database was constructed containing 995,591 functionally annotated proteins comprising the entire set of curated metabolic subsystems from the 2,856 reference genomes, plus an additional 2,988,751 proteins (‘outgroup’ not included in these metabolic subsystems), clustered at 90% amino acid identity (‘cluster’ command, MMSeqs; v1-c7a). The predicted protein sequences were aligned from the set of 1 ,000 high-quality MAGs against this reference protein database (DIAMOND, v2.0.0). To account for any influence of MAG fragmentation on metabolic reconstruction, gene fragments were identified using prodigal (v2.6.3) and were annotated in parallel. The following method were implemented to account for instances of multidomain structure that require multiple annotations. For each MAG query protein, top 50 hits were used based on the bitscore, and the start and end position coordinates of the corresponding alignments were clustered using DBSCAN (Scikit-learn), center of each clustered start and end position was used as potential domain boundary coordinates, and split query proteins into domains with database hits attributed to the corresponding domains. Next, for each domain >35 amino acids gaussian kernel density modeling was used (Kernel Density function, neighbors module, Scikit-learn, vO.22.1) of the sequence identity distribution of each set of hits to that domain. A highest local minimum (argrelextrema function, signal module, Scikit-learn) was employed as a threshold to remove low confidence hits. Finally, functional annotations were applied from the reference database to each query protein or domain by majority rule within each set of high-scoring, domain-specific reference hits. High-identity hits to proteins from the outgroup of the reference database were used as criteria to vote “against” applying annotation to each query. This procedure yielded a set of 199,334 annotated MAG proteins, representing 1 ,308 unique protein products across a set of 80 mcSEED subsystems.

Phenotype prediction strategies - The results of gene-level functional annotation were integrated into in silica predictions of the presence or absence (denoted as binary: “1 ” for presence or “0” for absence) of 106 functional metabolic pathways using a semi-automated process based on a combination of the following three approaches:

[0243] Pathway Rules (PR)-based phenotype predictions - This approach uses explicit logic-based “pathway rules” to assign binary phenotypes. These rules combine (i) expert curators’ knowledge regarding the gene composition of various metabolic pathway variants contained in the mcSEED database with (ii) a decision tree method to identify patterns of gene representation in reference genomes corresponding to an intact functional pathway variant (and a respective binary phenotype value denoted as “1”). A total of 106 functional pathway-specific decision trees were generated (Rpart, v4.1.15), where the presence or absence of a particular phenotype was the response variable, and the presence or absence of functional roles (encoded by genes) in each reference pathway were predictor variables. The resulting pathway rules were formally encoded into a custom R script that allowed us to process MAG gene data and assign values (1 or 0) for each of the 106 functional metabolic pathways.

[0244] Machine Learning (ML)-based phenotype predictions ->30 ML methods (Caret, V6.0.86) were compared, using a leave one out’ cross-validation approach in which a single reference genome was removed from the set of 2,856 reference genomes, trained ML models on the remaining genomes, then applied the models to the “test” genome to predict phenotypes. This procedure was then repeated for each genome and each metabolic phenotype. The results of this analysis identified Random Forest as the best-performing method (i.e., it produced the greatest number of correctly predicted phenotypes in the reference training dataset). Random Forest models were built for each phenotype based on the reference dataset, optimized model parameters using a grid search, and used these models to predict binary (1/0) values for the same set of 106 phenotypes for all MAGs.

[0245] Neighbor Group (NG)-based phenotype predictions - This approach identifies reference bacteria that are closely related to the MAGs in this study and uses these high-quality reference genomes for phenotype predications that are robust to variation in MAG quality. Examination of groups of closely related reference organisms suggested that close phylogenetic neighbor genomes tend to either possess or lack an entire pathway variant, whereas more distant neighbors (e.g., other neighbor groups) often carry more divergent pathway variants that specify the same phenotype. This observation was used to develop heuristics that minimize false negative phenotype assignments emerging from the other two prediction strategies. A set of NGs was compiled that comprised of MAGs and closely related reference genomes (Mash/MinHash distance ≤ 0.1, corresponding to ANI >90%). At this similarity threshold, 640 of the 1 ,000 MAGs from this study were assigned to NGs containing from as few as four to more than 100 members. Within each NG and for each metabolic pathway, a binary phenotype value was tentatively assigned for a given MAG based on the NG genome with the closest matching gene annotation pattern (based on Hamming distance), even if some of the genes were absent in the query MAG. Limited comparisons of genes was required for the function of each respective pathway.

[0246] Consensus phenotype predictions - A procedure was developed to reconcile inconsistent phenotype predictions between the three strategies described above, based on observing discordant gene patterns and/or discordant predicted phenotypes within a given group of neighbor genomes. In the rare case of irreconcilable disagreement between the prediction methods, assignment of a consensus phenotype defaulted to that produced by the ML method. Consensus confidence scores were assigned to each prediction based on the degree of concordance between the three techniques. The complete phenotype prediction process was validated using the 2,856 reference genomes in the mcSEED database, their functionally annotated genes and the accompanying patterns of presence/absence of functional metabolic pathways (curator-inferred binary phenotypes). The consensus phenotype predictions were combined into a binary phenotype matrix (BPM) containing 1,000 MAGs and 106 phenotypes.

[0247] Gene annotation and phenotype prediction for Bifidobacteria-specific carbohydrate utilization pathways - MAG annotation pipeline described above was adapted (also see Fig. 12D, 12E and 12F) to obtain functional annotations of genes comprising 25 additional carbohydrate utilization pathways for a set of 34 Bifidobacterium MAGs followed by inference of respective binary phenotypes. As input data for this set of Bifidobacterium-specific phenotypes, a set of 14 metabolic subsystems were curated in 387 reference human gut-derived Bifidobacterium genomes using the mcSEED framework. The reconstructed metabolic pathways and a corresponding BPM for reference Bifidobacterium genomes were used to predict carbohydrate utilization phenotypes in the 34 Bifidobaterium MAGs. Finally, the automatically generated BPM was further manually curated to account for the variability of certain pathways in this taxonomically restricted set of predictions.

[0248] Applying enrichment analyses to predicted MAG phenotypes - Not all successfully annotated MAG genes were components of an intact functional pathway. To enable inferred phenotype-based analysis, gene annotations were filtered to those that were part of a complete functional pathway (with a respective binary phenotype value denoted as “1”). This filter resulted in a list of 208,246 genes used for microbiome and meta transcriptome phenotype enrichment analyses. [0249] Example 2: Reconstructing bacterial genomes associated with ponderal growth responses

[0250] Children aged 12-18 months, with MAM (WLZ -2 to -3) were fed two 25g servings/d corresponding to 100-125kcal/serving, with fresh daily produced RUSF, MDCF-1 , MDCF-2, MDCF-3 as shown in FIG. 1A and 1B. Levels of >1300 plasma proteins were monitored that are key regulators of many aspects of growth and health. The effects on gut microbiota were also monitored. Further experiments were conducted with MDCF-2.

[0251] FIG. 2A summarizes the design of the completed randomized, controlled feeding study of children with MAM, aged 15.4+2.0 months (mean+SD) at enrollment. These children lived in an impoverished urban area (Mirpur) located in Dhaka, Bangladesh. The 3-month intervention involved twice-daily dietary supplementation with either MDCF-2 or RUSF. A total of 59 children in each treatment group completed the intervention and a 1-month follow-up 4. There were no statistically significant differences in the amount of nutritional supplement consumed between children receiving MDCF-2 versus RUSF, no differences in the proportion of children who satisfied current World Health Organization requirements for minimum meal frequency or minimum acceptable diet, and no differences in the amount of breast milk consumed between the two treatment groups. Fecal samples were collected every 10 days during the first month and every 4 weeks thereafter.

[0252] To reconstruct the genomes of bacterial taxa present in the gut microbiomes of study participants, DNA was isolated from all fecal samples (n=942; 7-8 samples/participant) and performed short-read shotgun sequencing. DNA recovered from fecal biospecimens collected at t=0 and 3 months from the subset of participants comprising the upper-quartile of the ponderal growth response to MDCF-2 (n=15) were subjected to additional long-read sequencing. Pooled shotgun sequencing data was assembled from each participant’s fecal samples (short-read only, or short- plus long-reads when available) and aggregated contigs into metagenome-assembled genomes (MAGs) (FIG. 2B and 2C). The resulting set of 1 ,000 high-quality MAGs (defined as >90% complete and ≤5% contaminated based on marker gene analysis) represented 65.6±8.0% and 66.2±7.9% of all quality controlled, paired-end shotgun reads generated from all 942 fecal DNA samples analyzed in the MDCF-2 and RUSF treatment groups, respectively [2.3±1.4x10 ⁷ 150 nt paired-end reads/sample (mean+SD)]. Taxonomy was assigned to MAGs using a consensus approach that included marker gene and kmer-based classification together with the Genome Taxonomy Database. Abundances were calculated for each MAG in the 707 fecal samples that spanned the beginning of treatment through the 1-month post-intervention timepoint and for which matching anthropometric measurements from children had been collected. A total of 837 MAGs satisfied the abundance and prevalence thresholds. A linear mixed-effects models was used to identify 222 MAGs whose abundances were significantly associated with WLZ [β1 (MAG), q<0.05, FIG. 2D] over the 90-day course of the intervention and 30-day follow-up. MAGs that were significantly positively associated with WLZ were predominantly members of the genera Agathobacter, Blautia, Faecalibacterium and Prevotella while members of Bacteroides, Bifidobacterium and Streptococcus were prevalent among MAGs negatively associated with WLZ (FIG. 2D and 2E).

[0253] Changes in MAG abundances were subsequently modeled as a function of treatment group, study week, and the interaction between treatment group and study week, controlling for repeated measurements taken from the same individual (see equation in FIG. 2F and Methods). The ‘treatment group’ coefficient describes the mean difference in abundance of a given MAG between the MDCF-2 and RUSF groups over the course of the intervention (FIG. 2F), while the interaction coefficient in the equation describes the difference in the rate of change in abundance of a given MAG (FIG. 2G). Restricting this analysis to the time of initiation of treatment did not reveal any statistically significant differences in MAG abundances between the two groups (q>0.05, one linear model per MAG). Expanding the analysis to include all time points disclosed that MAGs whose abundances increased faster in the MDCF-2 group compared to in the RUSF group were significantly enriched for those positively associated with WLZ [q=3.41x10 ^-3, gene set enrichment analysis (GSEA); FIG. 2F]. In contrast, MAGs with a higher mean abundance as well as those that increased more rapidly in RUSF-treated children were significantly enriched for those negatively associated with WLZ (q= 1.57x10 ^-9 and q=3.41x10 ^-3, respectively; GSEA) (FIG. 2E and 2F).

[0254] A ‘subsystems’ approach was adapted from the SEED genome annotation platform to identify genes that comprise metabolic pathways represented in WLZ-associated MAGs. To do so, genes were aligned to a reference collection of 2,856 human gut bacterial genomes that had been subjected to in silico reconstructions of metabolic pathways in mcSEED, a microbial community-centered implementation of SEED. Putative functions were asigned to a subset of 199,334 proteins in all 1 ,000 MAGs; these proteins, which represented 1 ,308 nonredundant functions, formed the basis for predicting which of 106 metabolic pathways, curated across a reference collection of 2,856 representative human gut bacterial genomes and reflecting major nutrient utilization capabilities, were present or absent in each MAG. This effort generated a set of inferred metabolic phenotypes for each MAG. GSEA disclosed multiple metabolic pathways involved in utilization of carbohydrates that were significantly (q<0.05) enriched in WLZ- associated MAGs, and in MAGs ranked by abundance response to MDCF-2 compared to RUSF treatment. While other non-carbohydrate pathways were also identified using this approach (e.g., those involved in aspects of amino acid and bile acid metabolism), pathways involved in carbohydrate utilization predominated (P = 0.006, Fisher’s test; FIG. 2H; Tables 1, 2 and 3).

Table 1 : GSEA for the presence or absence of a functional pathway in MAGs ranked by

WLZ association

Table 2: GSEA of pathway enrichment in MAGs ranked by change in abundance in response to MDCF-2 compared to RUSF treatment ('treatment group' coefficient)

Table 3: GSEA of metabolic pathways in MAGs ranked by change in abundance in response to MDCF-2 compared to RUSF treatment (interaction between 'treatment group' and 'study week' coefficients)

Example 3: Carbohydrate composition of M DCF-2 and RUSF

[0255] Prior to analyzing the transcriptional responses of MAGs to each nutritional intervention, the carbohydrates present in MDCF-2 and RUSF were characterized, as well as their constituent

Bangladeshi-sourced food ingredients [chickpea flour, soybean flour, peanut paste and mashed green banana pulp in the case of MDCF-2; rice, lentil and milk powder in the case of RUSF (Table

4 and Table 30A-C). Table 4: Composition of MDCF-2 and RUSF diets.

[0256] Ultrahigh-performance liquid chromatography-triple quadrupole mass spectrometry (UHPLC-QqQ-MS) was used to quantify 14 monosaccharides and 49 unique glycosidic linkages. Polysaccharide content was defined using a procedure in which polysaccharides were chemically cleaved into oligosaccharides, after which the structures of these liberated oligosaccharides were used to characterize and quantify their ‘parent’ polysaccharide.

[0257] The results revealed that L-arabinose, D-xylose, L-fucose, D-mannose, and D- galacturonic acid (GalA) are significantly more abundant in MDCF-2 (q<0.05; t-test) as are eight linkages, three of which contain these monosaccharides (FIG. 3A and 3B; Table 5 and 6).

Table 5: Difference in monosaccharide composition between MDCF-2 and RUSF (μg of monosaccharide I mg of dried diet

Table 6: Difference in glycosidic linkage content between MDCF-2 and RUSF (peak area, arbitrary units / ng dried diet)

[0258] Integrating the quantitative polysaccharide and glycoside linkage data allowed to conclude that MDCF-2 contains significantly more galactans and mannans than RUSF (q<0.05; t-test), while RUSF contains significantly more starch and cellulose (q<0.05; t-test) (FIG. 3D;

Table 7).

Table 7: Difference in polysaccharide content between MDCF-2 and RUSF (μg polysaccharide I mg of dried diet)

[0259] Galactans are represented in MDCF-2 as unbranched 1-1 ,4-linked galactan as well as arabinogalactan I (FIG. 3E). Mannans are present as unbranched 1-1 ,4-linked mannan (1- mannan), galactomannan and glucomannan (FIG. 3C, and 3F). Arabinan is abundant in both compositions, although the representation of arabinose and glycosidic linkages containing arabinose are significantly greater in MDCF-2 than in RUSF (see FIG. 3A and 3B for results of statistical tests). Arabinan in MDCF-2 is largely derived from its soybean, banana, and chickpea components, while in RUSF, this polysaccharide originates from rice and lentil. Arabinans in both compositions share a predominant 1 ,5-linked-L-arabinofuranose (Araf) backbone. Soybean arabinans are characterized by diverse side chains composed of 1 ,2- and 1 ,3-linked-L-Araf connected by 1,2,3-, 1,2,5-, and 1 ,3, 5-L-Araf branch points, while chickpea, lentil, and banana arabinans primarily contain 1 ,3-linked side chains from 1 ,3,5- L-Araf branch points (FIG. 3C).

Example 4: MDCF-2 effects on WLZ-associated MAG gene expression

[0260] Microbial RNA-Seq was performed using RNA isolated from fecal samples collected from all study participants just prior to initiation of treatment, and at the 1-, and 3-month time points (n=350 samples). Transcripts were then quantified by mapping reads from each sample to all 1 ,000 MAGs. The resulting counts tables were filtered based on the abundance and prevalence of MAGs in the full set of all fecal samples. These filtering steps were designed to exclude MAGs with minimal contributions to the meta-transcriptome from subsequent differential expression analysis (exclusion criteria were benchmarked against a simulated meta-transcriptomic dataset using the approach described in the Methods).

[0261] Principal components analysis (RCA) was used to determine baseline differences in overall microbiome or meta-transcriptome configurations between the treatment groups, and to subsequently identify microbes that were principal drivers in shifts during treatment. FIG. 4A-4D plot (i) the percent variance explained by the top 10 principal components (PCs) in analyses of 837 MAGs in fecal samples collected across all timepoints from all study participants (FIG. 4B- 4D) and (ii) the taxa enriched (q <0.05; GSEA) along the first three principal components of the MAG abundance and meta-transcriptome datasets (FIG. 4A). There were no statistically significant differences in microbiome or meta-transcriptome configuration between groups prior to treatment (P > 0.1 ; PERMANOVA). Analysis of MAG contributions to each PCA analysis highlights the remarkable enrichment of Prevotella spp., and to a lesser extent, Bifidobacterium spp., along the principal axis of variation (PC1 ) of the transcript PCA, and the absence of enrichment of these organisms along PC1 of the DNA-based PCA.

[0262] Next the transcripts expressed by the 222 MAGs whose abundances were significantly associated with WLZ were studied. Transcripts were ranked by their response to MDCF-2 versus RUSF treatment or by their response over time (negative binomial generalized linear model; see equation in FIG. 4E). GSEA was then performed to identify metabolic pathways enriched in these ranked transcripts. The analysis revealed a MDCF-2-associated pattern of gene expression characterized by significant enrichment (q<0.1 ; GSEA) of three metabolic pathways related to carbohydrate utilization [a-arabinooligosaccharide (aAOS), arabinose and fucose; FIG. 4E], three pathways related to de novo amino acid synthesis (arginine, glutamine, and lysine biosynthesis), and one pathway for de novo vitamin synthesis (folate). In contrast, none of the 106 metabolic pathways exhibited statistically significant enrichment in their expression in children who received RUSF.

[0263] MAGs which were responsible for the observed enrichment of expressed pathways were investigated. To do so, leading edge’ transcripts were turned; a term defined by GSEA as those transcripts responsible for enrichment of a given pathway (Methods). Among positively WLZ- associated MAGs, two belonging to P. copri (MAG Bg0018 and MAG Bg0019) were the source of 11 of the 14 leading-edge transcripts related to aAOS utilization- a pathway whose expression was significantly elevated in children treated with MDCF-2 compared to RUSF (FIG. 4E). Of the 11 P. copri MAGs in the dataset, these two were the only MAGs assigned to this species that were significantly positively correlated with WLZ. Both MAGs are members of a P. copri clade (Clade ‘A’) that is broadly distributed geographically (FIG. 5A); furthermore, P. copri exhibits substantial strain-level genomic and functional diversity (FIG. 5B) for the predicted carbohydrate utilization pathways represented in all 51 MAGs assigned to the genus Prevotella that were identified in the 1 ,000 MAG dataset).

[0264] Although P. copri MAGs were the greatest source of leading-edge transcripts related to aAOS utilization, other MAGs in the microbiome display expression responses consistent with their participation in metabolizing MDCF-2 glycans (or their breakdown products); these include MAGs that are negatively correlated with WLZ. For example, MAGs expressing leading-edge transcripts assigned to aAOS, arabinose and fucose utilization arose from Bifidobacterium longum subsp. longum (Bg0006), Bifidobacterium longum subsp. suis (Bg0001 ), Bifidobacterium breve (Bg0010; Bg0014), Bifidobacterium sp. (Bg0070), and Ruminococcus gnavus (Bg0067).

[0265] Features of the metabolism of these glycans in Bifidobacterium and Ruminococcus MAGs are distinct from those expressed by the P. copri MAGs. For example, B. longum subsp. longum MAG Bg0006 encodes extracellular exo-α-1,3-arabinofuranosidases that belong to glycoside hydrolase (GH) family (e.g., BIArafA); these enzymes cleave terminal 1 ,3-linked-L-Araf residues present at the ends of branched arabinans and arabinogalactans, two abundant glycans found in MDCF-2 (FIG. 3C and 3E). In contrast, P. copri possesses an endo-α-1 ,5-L-arabinanase that cleaves interior α-1 ,5-L-Araf linkages, generating aAOS. Integrating these predictions suggests a complex set of interactions between primary arabinan degraders like P. copri and members of B. longum, such as Bg0001 and Bg0006, that are capable of metabolizing products of arabinan degradation (see FIG. 6 for reconstructions of carbohydrate utilization pathways in Bifidobacterium MAGs). It could not be discerned whether the arabinose available to Bifidobacterium is derived from free arabinose or the breakdown products of arabinan polysaccharides. It is important to consider that in these 12- to 18-month-old children with MAM, responses to MDCF-2 are occurring in the context of the underlying co-development of their microbial community and host biology, during the period of transition from exclusive milk feeding to a fully weaned state. A MAG defined as positively associated with WLZ by linear modeling is an organism whose fitness (abundance) increases. The studies in healthy 1- to 24-month-old children living in Mirpur have documented how B. longum and other members of Bifidobacterium decrease in absolute abundance during the period of complementary feeding. For the negatively WLZ-associated Bifidobacterium MAGs described above, the levels of consumption of MDCF-2 metabolic products during the period of complementary feeding, and the nature of the changes in metabolism that occurs in these organisms as a result, may not be sufficient to overcome a more dominant effect exerted on their abundance/fitness and impact on ponderal growth by background diet and/or the state of community-host co-development.

[0266] Based on these observations, further evidence that the two P. copri MAGs are related to the magnitude of ponderal growth responses, and to levels of fecal glycan structures generated from MDCF-2 metabolism, was sought.

Example 5: Carbohydrate utilization pathways and clinical responses

[0267] As noted above, the primary outcome measure of the clinical trial was the rate of change of WLZ over the 3-month intervention. Participants receiving MDCF-2 were stratified into WLZ- response quartiles and analysed on (i) children in the upper- and lower-WLZ-quartiles (n=15/group) and (ii) transcripts expressed by the 222 MAGs whose abundances were significantly associated with WLZ. Enrichment of carbohydrate utilization pathways were tested in transcripts rank-ordered by the strength and direction of their relationship with WLZ-quartile or, in a separate analysis, the interaction between WLZ-quartile and study week; GSEA to identify enriched pathways were performed.

[0268] Eight carbohydrate utilization pathways were significantly enriched in transcripts differentially expressed in upper compared to lower WLZ quartile responders. One of these pathways (fructooligosaccharides utilization), plus three other pathways that are involved in arabinose, b-glucoside, and xylooligosaccharide utilization, were enriched in transcripts with a positive ‘WLZ quartile x study week’ interaction coefficient, suggesting that the extent of the difference in expression of these pathways increases over the course of treatment (Fig. 4E).

[0269] Remarkably, over half of the leading-edge transcripts (67/99; 68%) from the eight, upper WLZ-quartile enriched carbohydrate utilization pathways were expressed by P. copri MAGs Bg0018 and Bg0019. Moreover, these two MAGs contributed no leading-edge transcripts to lower WLZ-response quartile enriched pathways.

[0270] P. copri is a member of the phylum Bacteroidota. Members of this phylum contain syntenic sets of genes known as polysaccharide utilization loci (PULs) that mediate detection, import and metabolism of a specific glycan or set of glycans25. To further define how expressed genomic features distinguish the capacity of Bg0019 and Bg0018 to respond to MDCF-2, PULs were identified and compared to those present in the nine other P. copri MAGs in this study. These two WLZ-associated P. copri MAGs share (i) seven PULs designated as highly conserved (i.e. , a given pair of shared PULs that encode protein products with >90% amino acid identity and have identical genomic organization) plus (ii) three PULs designated as present but ‘structurally distinct’ (i.e., displaying divergence expected to impact function). The representation of these 10 PULs varied among the other nine P. copri MAGs which span three of the four principal clades of this organism (FIG. 7A). Strikingly, the representation of these PULs is significantly associated with the relationship between each of the 11 P. copri MAGs in the 1 ,000 MAG dataset and WLZ across both treatment groups [Pearson r between Euclidean distance from Bg0019 PUL profile and 131 (MAG) = -0.79 (P = 0.0035); FIG. 7B], Five of the seven highly conserved PULs are related to utilization of mannan and galactan - glycans that are significantly more abundant in MDCF-2 than RUSF. Expression of three of these seven PULs, as well as two of the conserved but structurally distinct PULs, are also related to the enrichment of transcripts in carbohydrate utilization pathways that distinguish upper from lower WLZ-quartile responders (‘WLZ-response quartile’ or ‘WLZ quartile x study week’ terms in FIG. 7F). PULs that generate these leading-edge transcripts are predicted to metabolize 13-glucan, glucomannan, 13-mannan, xylan, pectin/pectic galactan and arabinogalactan (see FIG. 7A for which of these 10 PULs contribute differentially- expressed transcripts).

[0271] A comparative analysis of MAGs Bg0018 and Bg0019 and 22 reference P. copri genomes in PULDB26 indicated that one of the highly conserved PULs (PUL7) contains a bimodular GH26|GH5_4 13-glycanase with 52% amino acid sequence identity to an enzyme known to cleave 13-glucan, 13-mannan, xylan, arabinoxylan, glucomannan, and xyloglucan (FIG. 7C and 7D). The gene encoding this multifunctional enzyme did not satisfy the criteria for statistically significant differential expression between MDCF-2 and RUSF treatment, nor between upper versus lower quartile WLZ-responders. However, it was consistently expressed across these conditions/comparisons and its enzymatic product is expected to contribute to the utilization of a broad range of plant glycans, including those represented in MDCF-2. Together, these results highlight both the versatility in carbohydrate metabolic capabilities of these two WLZ-associated P. copri MAGs, as well as the specificity of their treatment-inducible metabolic pathways for carbohydrates prominently represented in MDCF-2.

[0272] To contextualize our observations regarding conserved polysaccharide-degradation features of our P. copri MAGs, we selected a set of six P. copri isolates, obtained from Bangladeshi children who participated in our clinical trials, and representing a diverse PUL conservation repertoire and phylogenetic distance from the WLZ-associated Bg0018 and Bg0019 (FIG. 7A) for further analysis. These isolates include BgD5_2 and BgF5_2, strains which are highly phylogenetically related to Bg0018 and Bg0019 and possess 9/10 conserved PULs when compared to these MAGs (see Tables 28 and 29 for details of functional conservation between the genomes of these and other P. copri strains and MAGs).

[0273] The same fecal samples collected at the 0- and 3-month time points from participants in the upper and lower WLZ quartiles in the MDCF-2 treatment group that had been used for the DNA- and RNA-level analyses were subjected to UHPLC-QqQ-MS-based quantitation of 49 glycosidic linkages. These linkages were measured after their liberation by in vitro hydrolysis of fecal glycans. Linear mixed-effects modeling demonstrated that, with treatment, fecal levels of 14 of these linkages increased significantly more (q<0.05) in participants in the upper compared to the lower WLZ response quartile (FIG. 8A and 8B, Table 8). These 14 differentially abundant glycosidic linkages are all represented in MDCF-2.

Table 8: Changes in fecal glycosidic linkage levels over time in upper- compared to lower- WLZ quartile responders [0274] Differences in levels of these 14 glycosidic linkages can be explained in part by the specificity of the expressed CAZymes encoded by PULs conserved between P. copri MAGs Bg0018 and Bg0019. Among the 14 significantly differentially abundant linkages, t-Araf, 4- Mannose, t-Xylopyranose, 5-Araf and 2-Xylopyranose exhibit the greatest difference in fecal levels between upper and lower quartile responders over time; notably, all are elevated in upper quartile responders. FIG. 8A, 8C and 8D describe their likely polysaccharide sources in MDCF- 2, show the P. copri PULs predicted to generate glycan fragments containing these linkages, and highlight that these fragments are likely resistant to further degradation and thus can accumulate in the feces (FIG. 8A). For example, t-Araf is a component of arabinan, arabinoxylan and arabinogalactan type l/ll in soybean, chickpea, peanut and banana (FIG. 3A and 3B), and would be expected to accumulate in the intestine as CAZymes encoded by P. copri Bg0019 PULs 4, 7, 8, 16 and 17b cleave accessible linkages, exposing additional t-Araf (FIG. 8B-E). Exo- α-1 ,2/1 ,3-L-arabinofuranosidase and endo-α-1,5-L-arabinanase encoded by PUL17b (FIG. 8E-H) are predicted to remove successive residues from the 1 ,2 and 1 ,3-linked-L-Araf chains of branched arabinan and hydrolyze the 1,5-linked-L-Araf backbone from this polysaccharide. In P. copri Bg0019, this activity is complemented by two PUL4-encoded pectate lyases that assist in cleaving branched arabinan sidechains. In another example, CAZyme activities encoded by these two WLZ-associated P. copri MAGs also explain the greater increase in fecal levels of 4,6- mannose over time in upper- compared to lower-WLZ quartile responders (FIG. 8A). This linkage is a characteristic component of soybean galactomannan and is expected to accumulate in the feces upon partial degradation of this glycan by endo-1 ,4-β-mannosidases encoded by PUL7 and PUL8 (FIG. 8F).

[0275] CAZyme transcripts assigned to PULs 4, 7, 8, 16 and 17b were detectable in all but one of the 30 participants assigned to the two WLZ responder quartiles, with levels of expression of the majority of these CAZymes being modestly elevated in upper compared to lower WLZ-quartile responders over the course of treatment [these include the GH51 CAZyme encoded by PUL17b plus the GH26, GH26-GH5_4, GH130 and carbohydrate esterase family 7 (CE7) transcripts from PUL7; see FIG. 8B and 8C]. However, their differential expression did not satisfy the criteria for statistical significance. This latter finding raised the question of what other factors might contribute to the observed differences in fecal linkage content between upper and lower quartile responders. Intake of MDCF-2 was not significantly different between the upper- and lower-WLZ quartile participants [P>0.05; linear mixed-effects model; daily MDCF-2 consumption ~ days-on-treatment + WLZ-response quartile + WLZ-response quartile:days-on-treatment + (1 |PID)]. Data from a food frequency questionnaire (FFQ) administered at each fecal sampling disclosed that the mean correlation between the abundances of the 14 glycosidic linkages elevated in upper WLZ-quartile responders and FFQ queries was strongest for the question related to consumption of legumes and nuts and the levels of t-Araf, 5-Araf, 2,3-Araf, t-GalA, and 2,4,6-Glucose. Consumption of this food group was also the most discriminatory response between upper compared to lower WLZ quartile responders (Table 9).

Table 9: Effect of food groups on upper compared to lower WLZ quartile responders

[0276] Together, these observations suggest that children consuming more of the classes of complementary food ingredients present in MDCF-2 may also exhibit enhanced growth responses; they also provided a rationale for performing a direct test in gnotobiotic mice, described in the accompanying paper, that a P. copri isolate, which shares features of the carbohydrate metabolic apparatus present in Bg0018 and Bg0019, is a key mediator of the degradation of MDCF-2 glycans, promotes ponderal growth, and has marked effects on multiple aspects of metabolism in intestinal epithelial cell lineages.

Example 6: Discussion

[0277] The current study illustrates an approach for characterizing the bacterial targets and structure-function relationships of a microbiome-directed complementary food prototype, MDCF- 2. This MDCF produced significantly greater weight gain during a 3-month-long, randomized controlled study of 12- to 18-month-old Bangladeshi children with moderate acute malnutrition compared to a calorically more dense, commonly employed, ready-to-use supplementary food (RUSF). Metagenome-assembled genomes (MAGs) were studied, specifically (i) treatment- induced changes in expression of carbohydrate metabolic pathways in MAGs whose abundances were significantly associated with WLZ, and (ii) mass spectrometric analysis of the metabolism of glycans present in the two therapeutic food compositions. Quantifying monosaccharides, glycosidic linkages and polysaccharides present in MDCF-2, RUSF and their component foods disclosed that MDCF-2 contains a greater content of galactans and mannans (e.g., galactan, arabinogalactan I, galactomannan, 13-mannan, glucomannan). Two types of comparisons were performed of the transcriptional responses of MAGs that were found to be significantly associated with WLZ: one involved participants who consumed MDCF-2 versus RUSF and the other focused on MDCF-2 treated children in the upper versus in lower quartiles of WLZ responses. The results revealed that two P. copri MAGs, both positively associated with WLZ, were the principal contributors to MDCF-2-induced expression of metabolic pathways involved in the utilization of its component glycans (13-glucan, glucomannan, 13-mannan, xylan, arabinoxylan, pectin/pectic galactan and starch).

[0278] UHPLC-QqQ-MS was able to identify statistically significant changes in glycan composition in a complex matrix like feces in children consuming a therapeutic food, even in the face of varied (non-uniform) background diets. Moreover, the approach of identifying MAGs, characterizing their gene expression as a function of treatment type and host response, and correlating gene expression with fecal glycosidic linkage content revealed just two P. copri strains among 75 WLZ-positively correlated MAGs. The findings that (i) these two MAGs possess PULs that are uniquely conserved compared to other P. copri MAGs in the study population, and (ii) PUL content correlates with WLZ association and levels of a number of glycosidic linkages from therapeutic food ingredients, highlight how this approach can be used to identify the strain-level specificity and genomic features of bacterial targets of MDCF-2, as well as the chemical structures present in the food components of MDCF-2 that these strains utilize.

[0279] Intriguingly, although intake of MDCF-2 did not differ in children in the upper quartile of WLZ improvement, children in the upper quartile trended toward diets containing more legumes and nuts than their lower WLZ quartile counterparts. The “legumes and nuts” food group includes major components of MDCF-2. It is postulated herein that MDCF-2 ‘kick-starts’ a microbiome response that includes changes in the fitness and expressed metabolic functions of key growth- associated bacterial strains, such as P. copri . Background diet can further modify this response, as evidenced by the higher levels of microbial metabolic products of legume/nut-associated glycans in the feces of children with upper quartile WLZ responses. More detailed, quantitative assessments of food consumption during future clinical studies of MDCF-2 could serve to not only facilitate design of improved compositions, but also to inform future recommendations regarding complementary feeding practices - recommendations that recognize the important role of the gut microbiome in the healthy growth of children.

[0280] Linking dietary glycans and microbial metabolism in this fashion provides a starting point for culture-based initiatives designed to retrieve isolates of these ‘effector’ taxa for use as potential probiotic agents, or if combined with key nutrients that they covet, synbiotic compositions for repairing the microbiomes of children who already manifest undernutrition or who are judged to be at risk for growth faltering. This repair could take the form of rebalancing the representation and/or expressed functions of beneficial organisms so that the microbiome assumes an age- appropriate configuration for healthy microbiome-host co-development.

[0281] Much remains unknown about whether or how the direct breakdown products of MDCF- 2 glycan metabolism, or other secondary P. copri metabolites, are related to weight gain. Furthermore, interactions between P. copri , MDCF-2 glycans, and WLZ response does not exclude the contribution of other macro-or micronutrients. Direct tests of the role played by organisms such as P. copri in mediating microbial community and host responses to components of microbiome-targeted therapeutic foods can come from ‘reverse translation’ experiments of the type illustrated in the study that accompanies this report. To study this gnotobiotic mouse model colonized with defined collections of cultured were used, WLZ-associated gut bacterial taxa with or without P. copri , (ii) single nucleus RNA-Seq and microbial RNA- Seq and (iii) UHPLC-QqQ- MS to characterize the contributions of P. copri to regulating gene expression in gut epithelial cell lineages, processing of MDCF-2 glycans, and metabolism in intestinal and extra-intestinal tissues.

[0282] Some additional observations from the current study are provided below.

Short sequencing read only versus hybrid (long and short read) MAG assembly

[0283] The impact of the addition of long read sequencing data on various quality characteristics of MAGs assembled from data collected from the 0- and 3-month time points from all upper WLZ quartile responders (n=15) in the MDCF-2 treatment group was explored. The final set of high-quality, dereplicated MAGs, 918 MAGs represented contigs assembled from short read only data, while 82 were derived from hybrid short and long read assemblies. Although the mean quality characteristics of MAGs from each assembly type did not differ in completeness (determined by marker gene analysis) or total length, MAGs derived from hybrid assemblies displayed a significantly lower rate of contamination, fewer contigs, and greater N.

Comparing MAG assembly accuracy and quantitation using a pseudo-alignment and expectation maximization approach

[0284] MAG assembly algorithms that synthesize both contig sequence characteristics and contig abundance to assemble MAGs (e.g., MaxBin2, MetaBAT2) require accurate contig quantitation. Alignment-free quantitation approaches (e.g., Kallisto) have demonstrated superior speed and accuracy compared to read mapping-based quantitation in the context of metagenomic analyses where read-mapping ambiguity is common.

[0285] The utility of Kallisto-based quantitation was studied for (i) contigs, prior to MAG assembly, and (ii) MAGs themselves after assembly and curation. For this analysis, we employed a ‘mouse gut metagenome toy dataset’ from CAMI II that included 64 ‘mock fecal samples’; these mock samples were produced using sequencing data from 791 publicly available bacterial genomes (representing 549 species) and genomic abundances that mirrored bacterial 16S rRNA gene profiles of 64 actual mouse fecal biospecimens. Three components of this reference dataset were utilized for the analyses: (i) simulated sequencing data (1.67*10 ⁷ Illumina paired-end 150 nt reads) from each of the 64 mock fecal samples, (ii) anonymized reference contigs from the 791 reference genomes, and (iii) reference abundances of contigs/genomes in each fecal sample.

[0286] The effect of Kallisto quantification of contigs on the fidelity of MAG assembly was first investigated. The reference contigs using either Kallisto or bowtie2 and the short-read simulated Illumina data. Next, MAGs were assembled using MaxBin248, MetaBAT249 and CONCOCT82 with data from either Kallisto or bowtie2 contig quantitation as input. The output of each MAG assembly method for each sample was combined using DAS Tool. Finally, each MAG set was compared against 791 intact reference genomes using AMBER83. MAGs generated using Kallisto contig quantification and DAS Tool dereplication were more complete (P=6.4x10-14; Wilcoxon test) and less contaminated (P <2.2x10-16; Wilcoxon test) than those generated using bowtie2. Additionally, a significantly greater number of MAGs (P<0.05; Fisher's exact test) were detected using Kallisto contig quantitation.

[0287] Next, the same simulated dataset was employed to test the accuracy of Kallisto-based MAG quantitation. The short-read data was mapped for each of the 64 fecal samples to the set of 791 reference genomes using Kallisto and bowtie2. The abundance profiles generated by each quantitation method were then correlated to the ‘true’ abundance profile for each sample. The correlations between true genome abundances and Kallisto genome abundances were stronger than those calculated using bowtie (mean Pearson’s r2= 0.99 for Kallisto versus r2= 0.97 for bowtie; P<2.2x10-16, Wilcoxon test comparing each distribution of correlation coefficients).

[0288] The false positive and false negative rate of MAG detection across all samples were determined. Notably, Kallisto quantitation resulted in more false positive abundances across the 64 mock fecal samples [300.2±50.1 versus 69.3±28.4 for bowtie2, respectively (mean±SD); P< 2.2x10-16, Wilcoxon test] while bowtie2 quantitation resulted in more false negative abundances [0.09±0.42 versus 17.2±26.1 (mean±SD), respectively; P<2.2x10-16, Wilcoxon test]. Importantly, analysis of the average values of false positive abundance generated using Kallisto suggested that a low abundance filter would significantly reduce the false positive rate. For example, applying a filter to this dataset that required >5 TPM for a MAG to be designated as ‘detected’ resulted in a false positive rate significantly lower than that of bowtie2 (P=0.02, Wilcoxon test).

[0289] As a greater number of high quality (less contaminated and more complete) MAGs assembled could be assembled using Kallisto quantitation, plus the increased accuracy of MAG quantitation using this method, Kallisto was used for all quantitation tasks in the MAG analysis workflow described in the current study.

Analysis of consistency in MAG functional metabolic pathway annotation

[0290] A global comparison of binary phenotype assignments derived using the Pathway Rules (PR), Machine Learning (ML), and Neighbor Group (NG) approaches described in Methods revealed a remarkably low frequency of inconsistencies: in a subset of 640 MAGs where all three methods could be applied, only 4.5% of NG-based phenotype assignments were inconsistent between one or more other methods. These inconsistencies reflect different biases associated with each approach. The NG-based approach exhibits limited performance for small (<5-member) NGs with underrepresented local diversity of gene patterns. Alternatively, PR/ML-based methods appear to be less robust with respect to genome incompleteness in MAGs, resulting in omission (absence) of genes essential for the function of a pathway and, more generally, for pathways with less than three essential genes. Our consensus approach (Methods) resolved 70% of observed inconsistencies toward PR/ML-based assignments. In the remaining cases, a consensus phenotype was assigned in favor of the NG-based method. The overall level of inconsistencies between PR- and ML-based phenotype assignments (across the entire set of 199,334 assignments in 1 ,000 MAGs) was much lower (<0.7%). A detailed investigation of selected cases showed that, in general, the ML-based method yielded higher accuracy phenotype assignments. Therefore, in rare cases of irreconcilable disagreement between these two methods in the set of 360 MAGs without NGs, the semi-automated assignment of the consensus phenotype was made in favor of the ML-based approach. These assignments were considered low confidence.

Non-carbohydrate related differentially expressed transcripts in upper versus lower WLZ quartile responders

[0291] Transcripts expressed at greater levels in upper WLZ response quartile participants ( β ₁ WLZ quartile term) were also enriched for pathways involved in biosynthesis of vitamin B3 and B9 and the essential amino acids tryptophan, lysine, histidine and leucine. Leading-edge analysis revealed that P. copri Bg0018 and Bg0019 were major contributors to increased expression of transcripts involved in the biosynthesis of vitamins B3 and B9 plus four essential amino acids (tryptophan, histidine, leucine, lysine) among the upper quartile participants but contributed minimally (2 transcripts assigned to the arginine biosynthetic pathway) to enrichment of functional pathways among the lower WLZ quartile responders.

Example 7: Methods for Examples 8-12

Bacterial genome sequencing and annotation

[0292] Monocultures of each isolate were grown overnight at 37°C in Wilkins-Chalgren Anaerobe Broth (Oxoid Ltd.; catalog number: CM0643) in a Coy Chamber under anaerobic conditions (atmosphere; 75% N2, 20% CO2 and 5% H2) without shaking. Cells were recovered by centrifugation (5000 x g for 10 minutes at 4 °C) and high molecular weight genomic DNA was purified (MagAttract® HMW DNA kit, Qiagen) following the manufacturer’s protocol and the amount quantified (Qubit fluorometer). The sample was passed up and down through a 29-gauge needle 6-8 times and the fragment size distribution was determined (~30 kbp; TapeStation, Agilent) (Tables 10, 11 and 12).

Table 10: Bacterial strains used in the defined community gnotobiotic mouse experiments

Table 11 : P. copri PS131.S11 PULs PUL 18 no CAZyme 3,126,507

3,131 ,588

PUL 19 b-1 ,2-glucan 3,143,047

3,153,155

PUL 20 no CAZyme 3,182,564 PUL 12 ¹

3,187,111

PUL 21 A/ and O-glycans 3,188,964

3,206,116

PUL 22 a-glucoside, a-1 ,6- 3,288,752 glucan (dextran)

3,307,423

PUL 23 unknown 16,235 -

40,890

PUL 24 no CAZyme 61,128 - 75,818

PUL 25 type II 126,212 - rhamnogalacturonan 176,300

PUL 26 a-glucan (starch) 186,195 - 197,081

PUL 27a starch 208,477 - PUL 18a ² PUL 17a ¹

227,992

PUL 27b arabinogalactan 228,820 - PUL 18b ² PUL 17b ² 247,973

PUL 28 no CAZyme 252,400 -

260,174

PUL 29 no CAZyme 296,439 - PUL 15 ¹ 303,639

PUL 30 b-1 ,3-glucan 325,753 - PUL 16 ¹ PUL 11 ¹ 337,891

PUL 31 a-glucan (starch) 362,093 - PUL 1 ¹ CAZyme 372,659 cluster ² ¹ Functionally conserved Structurally distinct

Table 12: Bacterial strains used in the P. copri colonization dependency gnotobiotic mouse experiments.

Taxonomic Strain Contig length (bp) Complete Contamin assignment name ness (%; ation (%; CheckM) CheckM)

Prevotella copri G8 3992021*, 138455*, 90197*, 98.65 1.86 26100*

[0293] Fragmented genomic DNA (400-1000 ng) was prepared for long-read sequencing using a SMRTbell Express Template Prep Kit 2.0 (Pacific Biosciences) adapted to a deep 96-well plate (Fisher Scientific) format. All DNA handling and transfer steps were performed with wide-bore, genomic DNA pipette tips (ART). Barcoded adapters were ligated to A-tailed fragments (overnight incubation at 20 °C) and damaged or partial SMRTbell templates were subsequently removed (SMRTbell Enzyme Cleanup Kit). High molecular weight templates were purified (volume of added undiluted AMPure beads = 0.45 times the volume of the DNA solution). Libraries prepared from different strains were pooled (3-6 libraries/pool). A second round of size selection was then performed; AMPure beads were diluted to a final concentration of 40% (v/v) with SMRTbell elution buffer with the resulting mixture added at 2.2 times the volume of the pooled libraries. DNA was eluted from the AMPure beads with 12 μL of SMRTbell elution buffer. Pooled libraries were quantified (Qubit), their size distribution was assessed (TapeStation) and sequenced [Sequel System, Sequel Binding Kit 3.0 and Sequencing Primer v4 (Pacific Biosystems)]. The resulting reads were demultiplexed and Q20 circular consensus sequencing (CCS) reads were generated (Cromwell workflow configured in SMRT Link software). Genomes were assembled using Flye (v2.8.1) with hifi-error set to 0.003, min-overlap set at 2000, and other options set to default. Genome quality was evaluated using CheckM (v1.1.3).

[0294] Prokka (v1.14) was applied to identify potential open reading frames (ORF) in each assembled genome. Additional functional annotation of these ORFs using a ‘subsystems’ approach adapted from the SEED genome annotation platform was performed. Functions were assigned to 9,820 ORFs in 20 isolate genomes using a collection of mcSEED metabolic subsystems that capture the core metabolism of 98 nutrients/metabolites in four major categories (amino acids, vitamins, carbohydrates, and fermentation products) projected over 2,856 annotated human gut bacterial genomes. In silica reconstructions of selected mcSEED metabolic pathways were based on functional gene annotation and prediction using homology-based methods and genome context analysis. Reconstructions were represented as a binary phenotype matrix (BPM) where for amino acids and B vitamins, “1” denotes a predicted prototroph and “0” an auxotroph, for carbohydrates, “1” and “0” refer to a strain’s predicted ability or inability, respectively, to utilize the indicated mono-, di- or oligosaccharide, and for fermentation end products, a “1” and “0” indicate a strain’s predicted ability/inability to produce the indicated compound, respectively.

[0295] To calculate phylogenetic relationships between five P. copri isolates and MAGs

Bg0018 and Bg0019, CheckM (v1.1.3) was first used to extract and align the amino acid sequences of 43 single copy marker genes in each isolate or each of the two MAGs, plus an isolate genome sequence of Bacteroides thetaiotaomicron VPI-5482 (accession number: 226186.12). Concatenated marker gene sequences were analyzed using fasttree (v2.1.10) to construct a phylogenetic tree using the Jones-Taylor-Thornton model and ‘CAT’ evolution rate approximation, followed by tree rescaling using the ‘Gamma20’ optimization. The tree was subsequently processed in R using ‘ape’ (v5.6-2) to root the tree with the B. thetaiotaomicron genome and extract phylogenetic distances between genomes, followed by ‘ggtree’ (v3.2.1 ) for tree plotting.

[0296] The similarity between the genomes of these strains and MAGs was quantified by calculating the ANI score with pyani (ANIm implementation of ANI, v0.2.10). Firstly, ANIm scores were calculated for all possible combinations between MAGs and the genomes of cultured bacterial strains, and subsequently removed any MAG-strain genome combination with <10% alignment coverage. For the remaining MAGs, a “highly similar” genome in the collection of cultured bacterial strains was defined as having > 94% ANIm score. The degree of binary phenotype concordance was then defined between each genome in the collection of cultured bacterial strains and its “highly similar” MAG. A binary phenotype concordance score was calculated by dividing the number of binary phenotypes shared between a cultured strain’s genome and a MAG by the total number of binary phenotypes annotated in the strain and MAG. A ‘Representative MAG’ for each genome was defined as having a binary phenotype concordance score >90%.

[0297] PULs were predicted for P. copri PS131.31 based on methods described in Terrapon et al. (Terrapon et al. Automatic prediction of polysaccharide utilization loci in Bacteroidetes species. Bioinformatics. 2015. 31, 647-655) and displayed with the PULDB interface. PULs were placed into three categories: (i) ‘functionally conserved’ (PULs containing shared ORFs encoding the same CAZymes and SusC/SusD proteins in the same organization in their respective genomes with >90% amino acid identity between proteins); (ii) ‘structurally distinct’ (PULs present in respective genomes but where one or more CAZymes or one or both SusC/SusD proteins are missing or fragmented in a way likely to impact function, or where extra PUL elements are present), and (iii) ‘not conserved’ (PULs present in respective genomes but with mutations likely to completely compromise function, or no PUL identified).

Colonization and husbandry

[0298] Gnotobiotic mouse experiments were performed using protocols approved by the Washington University Animal Studies Committee. Germ-free C57BL/6J mice were maintained in plastic flexible film isolators (Class Biologically Clean Ltd) at 23 °C under a strict 12-hour light cycle (lights on a 0600h). Autoclaved paper ‘shepherd shacks’ were kept in each cage to facilitate natural nesting behaviors and provide environmental enrichment.

[0299] A weaning diet containing MDCF-2 was formulated. Ingredients represented in the different diet modules were combined and the mixture was dried, pelleted, and sterilized by gamma irradiation (30-50 KGy). Sterility was confirmed by culturing the pellets in LYBHI medium and Wilkins-Chalgren Anaerobe Broth under aerobic and anaerobic conditions for 7 days at 37 °C followed by plating on LYBHI- and blood-agar plates. Nutritional analysis of each irradiated diet was performed by Nestle Purina Analytical Laboratories (St. Louis, MO) (Table 13).

Table 13: Nutritional analysis of the diets

[0300] Pregnant C57BI/6J mice originating from trio matings were given ad libitum access to an autoclaved breeder chow (Purina Mills; Lab Diet 5021 ) throughout their pregnancy and to postpartum day 2. Key points about the experimental design doe the gnotobiotic mouse experiments described in FIG. 10B and FIG. 15A are: (i) all bacterial strains were cultured in Wilkins-Chalgren Anaerobe Broth (except for F. prausnitzii which was cultured in LYBHI medium) and were harvested after overnight growth at 37 °C (Table 13); (ii) all gavage mixtures contained equivalent amounts (by OD600) of their constituent bacterial strains except for F. prausnitzii which was concentrated 100-fold before preparing the gavage mixture; (iii) each bacterial consortium was administered to the postpartum dams in a volume of 200 μL using an oral gavage needle (Cadence Science; catalog number: 7901); (iv) the number of dams and pups per treatment group [two dams and 7-8 pups/treatment group (FIG. 10B); four dams and 18-19 pups/treatment group (FIG. 15A)]; (v) half of the bedding was replaced with fresh bedding in each cage each day from postpartum day 1 to 14, after which time bedding was changed every 7 days; (vi) diets were provided to mothers as well as to their weaning and post-weaning pups ad libitum, (vii) fecal samples were collected from mice when they were euthanized (without prior fasting) and snap frozen in liquid nitrogen and stored at -80 °C before use.

[0301] Pups were weighed on P23, P35, and P53, and normalized to the weight on P23. A linear mixed-effects model was used to evaluate the effect of different microbial communities on normalized mouse weight gain: normalized weight-β ₁(Arm) + β ₂ (postnatal day) + (1 I mouse)

(1 )

Defining the absolute abundances of bacterial strains in ileal, cecal and fecal communities

[0302] The absolute abundances of bacterial strains were determined in the fecal microbiota. In brief, 3.3x10 ⁶ cells of Alicyclobacillus acidiphilus DSM 14558 and 1.49x10 ⁷ cells of Agrobacterium radiobacter DSM 30147 (ref. 49) were added to each weighed frozen sample prior to DNA isolation and preparation of barcoded libraries for shotgun sequencing. Sequencing was performed for 136 samples (Illumina NextSeq instrument; unidirectional 75 nt reads) at an average depth of 2.0x10 ⁶ + 4.0x10 ⁵ reads/sample (mean+SD). Bacterial abundances were determined by assigning reads to each bacterial genome, followed by a normalization for genome uniqueness in the context of a given community. The resulting count table was imported into R (v4.0.4). The absolute abundance of a given strain i in sample j in reference to the spike-in A. acidiphilus (Aa) and A. radiobacter (Ar) genomes was calculated using the following equation:

[0303] The statistical significance of observed differences in the abundance of a given strain across different treatment groups and time was tested using a linear mixed effects model within the R packages Ime4 (v1.1-27) and ImerTest (v3.1-3). For the experiment described in FIGs. 9A- C, 37 fecal samples were sequenced [5.8x10 ⁶±1.6x 10 ⁶ unidirectional 75 nt reads/sample (mean ± SD)] (Tables 14 and 15)] while for the experiment described in FIG. 15A, 37 cecal samples were sequenced [1.3x10 ⁶ ± 1.3x10 ⁵ unidirectional 75 nt reads/sample] and absolute abundances were determined as above. The change in P. copri absolute abundance in fecal samples during the course of the experiment was determined by a linear mixed-effects model:

P. copri absolute abundance-’ β ₁(Arm) + β ₂(postnatal day) + (1 I mouse)

(3)

Table 14: Sample metadata

Table 15: Absolute abundance of strains [Iog10(genome equivalents per gram sample)] in cecal contents collected from progeny of dams at sacrifice

Microbial RNA-Seq

[0304] RNA was isolated from cecal contents collected at the end of the experiment. cDNA libraries were generated from isolated RNA samples using the ‘Total RNA Prep with Ribo-Zero Plus’ kit (Illumina). Barcoded libraries were sequenced [Illumina NovaSeq instrument; bidirectional 150 nt reads; 8.0x10 ⁷ ± 1.6x10 ⁷ reads/sample (mean±SD); n=40 samples]. Raw reads were trimmed by using TrimGalore (v0.6.4). Trimmed reads longer than 100 bp were mapped to reference genomes with Kallisto (v0.43.0). Mapping of reads was skipped for strains that were not gavaged in all arms (P. copri , P. stercorea, B. longum subsp. infantis strains Bg2D9 and Bg463) in order to compare transcriptional changes induced by the differential presence of these strains. The resulting Kallisto pseudocount dataset, comprised of 48,390 transcripts, was imported into R (v4.0.4). edgeR (v 3.36.0) was used to filter the pseudocount dataset by expression level, resulting in a dataset of 22,387 expressed genes.

[0305] For analysis of metabolic pathway expression, cecal content pseudocounts for each transcript were first normalized by the absolute abundance of the corresponding strain in order to minimize the confounding effects of differences in strain abundance. The rlog transformation in the R DESeq2 package (v1.34.0) was then applied to this dataset for variance stabilization. To obtain relative expression of metabolic pathways, only transcripts corresponding to complete mcSEED metabolic pathways (/.e., pathways with binary phenotype scores of 1 ; see above) were retained and transformed expression values were averaged across genes within a given pathway in each strain. The resulting aggregated pathway expression dataset was then centered prior to singular value decomposition with NumPy (v1.21.5). Principal component analysis of samples was calculated by projecting the centered pathway dataset onto right singular vectors (FIG. 11D). Sample projections along the first two principal components were converted to a Euclidian distance matrix, and PERMANOVA was used to test for the significance of separation of samples by experimental group. Pathway PCA loadings were calculated by projecting the transpose of the centered dataset onto left singular vectors. Metabolic pathways were subsequently ranked by their corresponding loadings, and directionality was resolved such that more negative left singular vector 1 projections corresponded to higher pathway expression in the P. copri colonization context.

[0306] For differential expression analysis of microbial transcripts, the filtered version of Kallisto pseudocounts was imported into edgeR. For each community member, ‘within-taxon sum-scaling’ was applied by calculating the trimmed mean of M-value library size corrections based on the total pool of RNA reads from that member. This organism-scaled transcript set was then used for dispersion estimation and fitting of a generalized linear model (GLM). In addition to ‘experiment arm’, the absolute abundance of the organism was included in each cecal sample as a covariate in the GLM to reduce false discoveries due to differences in the abundances of community members. A likelihood ratio test (edgeR) was then used to detect differential expression between samples obtained from members of the w/ P. copri versus w/o P. copri treatment groups. Transcripts with statistically significant differences in their expression were identified [q-value < 0.05 (adjusted P value < 0.05)] after multiple hypothesis correction was applied to the entire set of transcripts from a given organism via the Benjamini-Hochberg method.

Histomorphometric analysis of villus height and crypt depth

[0307] Jejunal and ileal segments were fixed in formalin, embedded vertically in paraffin; 5 gm- thick sections were prepared and stained with hematoxylin and eosin. Slides were scanned (NanoZoomer instrument, Hamamatsu). For each animal, 10 well-oriented crypt-villus units were selected from each intestinal segment for measurement of villus height and crypt depth using QuPath (v0.3.2). Measurements were performed with the investigator blinded with respect to colonization group. A two-tailed Mann-Whitney U test was applied to the resulting datasets.

Single nucleus (sn) RNA-seq

[0308] Jejunal segments of 1.5 cm in length were collected from mice and snap frozen in liquid nitrogen [n=4 animals/treatment group (2 males and 2 females); 2 treatment groups in total]. The method for extracting nuclei was adapted from a previously described protocol for the pancreas ⁶². Briefly, tissues were thawed and minced in lysis buffer [25mM citric acid, 0.25M sucrose and 0.1% NP-40, and 1 x protease inhibitor (Roche)]. Nuclei were released from cells using a pestle douncer (Wheaton), washed 3 times with buffer [25mM citric acid, 0.25M sucrose, and 1 x protease inhibitor], and filtered successively through 100tm, 70tm, 40tm, 20tm and finally 5tm diameter strainers (pluriSelect) to obtain single nuclei in resuspension buffer [25mM KCI, 3mM MgCI2, 50mM Tris, 1mM DTT, 0.4U/tL RNase inhibitor (Sigma) and 0.4U/tL Superase inhibitor (ThermoFisher)]. Approximately 10,000 nuclei per sample were subjected to gel bead-in-emulsion (GEM) generation, reverse transcription and construction of libraries for sequencing according to the protocol provided in the 3’ gene expression v3.1 kit manual (10X Genomics). Libraries were balanced, pooled and sequenced [Illumina NovaSeq S4; 3.23x10 ⁸ ± 1.39x10 ⁷ paired-end 150 nt reads/nucleus (mean±SD) from jejunal samples, respectively]. Read alignment, feature-barcode matrices and quality controls were processed by using the CellRanger 5.0 pipeline with the flag - -include-introns’ to ensure that reads would be allowed to map to intronic regions of the mouse reference genome (GRCm38/mm10). Nuclei with over 2.5% reads from mitochondria-encoded genes reads or ribosomal protein genes were filtered out.

[0309] Analysis of snRNA-seq datasets - Sample integration, count normalization, cell clustering and marker gene identification was performed using Seurat 4.0. Briefly, filtered featurebarcode matrices outputted from CellRanger were imported as a Seurat object using CreateSeuratObject (min.cells = 5, min.features = 200). Each sample was normalized using SCTransform ^63,64 and integrated using SelectlntegrationFeatures, PrepSCTIntegration, FindlntegrationAnchors, and IntegrateData from the Seurat software package. The integrated dataset, incorporating nuclei from all samples, was subject to unsupervised clustering using FindNeighbors (dimensions = 1 :30) and FindClusters (resolution = 1 ) from the Seurat package, which executes a shared nearest-neighbor graph clustering algorithm to identify putative cell clusters. Cell type assignment was performed manually based on expression of reported markers.

[0310] Cross-condition differential gene expression analysis was performed based on a “pseudobulk” strategy; for each cell cluster, gene counts were aggregated to obtain sample-level counts; each pseudo-bulked sample served as an input for edgeR-based differential gene expression analysis.

[0311] For NicheNet-based analysis (v1.1.0), all clusters in snRNA-seq dataset were used as senders for crypt stem cells, proliferating TA/stem cells, villus base enterocytes, mid-villus enterocytes and villus tip enterocytes, plus goblet cells). The nichenet_seuratobj_aggregate (assay_oi = “RNA”) function was used with its default settings to incorporate differential gene expression information from Seurat into our NicheNet analysis and to select bona fide ligandreceptor interactions.

[0312] Compass-based in silico metabolic flux analysis (v0.9.10.2) was performed using transcripts from each of six epithelial cell clusters (crypt stem cells, proliferating TA cells, villusbase, mid-villus and villus tip enterocytes and goblet cells). The reaction scores calculated by Compass were filtered based on (i) the confidence levels of the Recon2 reactions and (ii) the completeness of information for Recon2 reaction annotations. Only Recon2 reactions that are supported by biochemical evidence (defined by Recon2 as having a confidence level of 4) and that have complete enzymatic information for the reaction were advanced to the follow-on analysis (yield: 2,075 pass filter reactions in 83 Recon2 subsystems).

[0313] A “metabolic flux difference” was calculated to determine whether the presence or absence of P. copri affected Compass-based predictions of metabolic activities at the Recon2 reaction level in the six cell clusters. The “net reaction score” was calculated as follows c= c _f - c _r

(4) were cf denotes the Compass score for a given reaction in the “forward” direction, and, if the biochemical reaction is reversible, c _r denotes the score for the “reverse” reaction.

[0314] A Wilcoxon Rank Sum test was used to test significance of the net reaction score between the two treatment groups. P values from the Wilcoxon Rank Sum tests were adjusted for multiple comparisons with the Benjamini-Hochberg method.

[0315] Cohen’s d can be used to show the effect size of cf or c _r for each reaction between two groups (in mice harboring communities with and without P. copri ). Briefly, Cohen’s d of two groups, j and k, was calculated based on Equations 4 and 5. n, S, and a in Equation 4 represent the number, the variance, and the mean of the observations (in our case, the net reaction scores). Cohen’s d was defined as:

[0316] If both ai and a* are non-negative numbers, a positive Cohen’s d indicates the mean of group j is greater than that of group k whereas a negative Cohen’s d means the mean of group j is smaller in that comparison. The magnitude of Cohen’s d represents the effect size and is correlated with the difference between the means of the two groups. Because the mean of the net subsystem scores as well as the net reaction scores could be negative, the following adjustments were made to Cohen’s d in order to preserve the concordance of sign and the order of group means. The adjusted Cohen’s d represents the metabolic flux difference m, and is defined as:

[0317] scCODA (VO.1.8) is a Bayesian probabilistic model for detecting ‘statistically credible differences’ in the proportional representation of cell clusters, identified from snRNA-seq datasets, between different treatment conditions. This method accounts for two main challenges when analyzing snRNA-seq data: (i) low sample number and (ii) the compositionality of the dataset (an increase in the proportional representation of a specific cell cluster will inevitably lead to decreases in the proportional representation of all other cell clusters. Therefore, applying univariate statistical tests, such as a t-test, without accounting for this inherent negative correlation bias will result in reported false positives). scCODA uses a Bayesian generalized linear multivariate regression model to describe the ‘effect’ of treatment groups on the proportional representation of each cell cluster; Hamiltonian Monte Carlo sampling is employed to calculate the posterior inclusion probability of including the effect of treatment in the model. The type I error (false discovery) is derived from the posterior inclusion probability for each effect. The set of “statistically credible effects” is the largest set of effects that can be chosen without exceeding a user-defined false discovery threshold α (α=0.05 by default). Application of scCODA was done using default parameters, including choice of prior probability in the Bayesian model and the setting for Hamiltonian Monte Carlo sampling. The enteroendocrine cell cluster was used as the reference cluster.

Mass spectrometry

UHPLC-QqQ-MS of cecal glycosidic linkages and GC-MS of short-chain fatty acids

[0318] Ultra-high performance liquid chromatography-triple quadrupole mass spectrometric (UHPLC-QqQ-MS) quantification of glycosidic linkages and monosaccharides present in cecal glycans was performed. Levels of short-chain fatty acid levels in cecal contents were measured by GC-MS.

LC-MS of acylcarnitines, amino acids, and biogenic amines in host tissues

[0319] Acylcarnitines were measured in jejunum, colon, liver, gastrocnemius, quadriceps, and heart muscle, and plasma, while 20 amino acids plus 19 biogenic amines were quantified in jejunum, liver, and muscle. Plasma levels of non-esterified fatty acids were measured using a UniCel DxC600 clinical analyzer (Beckman Coulter).

Targeted mass spectrometry of cecal amino acids and B-vitamins

[0320] Methods for targeted LC-QqQ-MS of amino acids and B vitamins were adapted from a previous established methods and as described herein. Cecal samples were extracted with ice- cold methanol, and a 200μL aliquot was dried (vacuum centrifugation; LabConco CentriVap) and reconstituted with 200μL of a solution containing 80% methanol in water. A 2μL aliquot of extracted metabolites was then injected into an Agilent 1290 Infinity II UHPLC system coupled with an Agilent 6470 QqQ-MS operated in positive ion dynamic multiple reaction monitor mode (dMRM). The native metabolites were separated on HILIC column (ACQUITY BEH Amide, 2.1 x 150 mm, 1.7 μm particle size, Waters) using a 20 minute binary gradient with constant flow rate of 0.4 mL/minute. The mobile phases were composed of 10mM ammonium formate buffer in water with 0.125% formic acid (Phase A) and 10mM ammonium formate in 95% acetonitrile/H ₂O (v/v) with 0.125% formic acid (Phase B). The binary gradient was listed as follows: 0-8 minutes: 91- 90% B; 8-14 minutes: 90-70% B; 15-15.1 minutes: 70-91% B; 15.1-20 minutes: 91% B. A pool of 20 amino acids and 7 B vitamins standards with known concentrations (amino acid pool: 0.1ng/mL-100ug/mL; B vitamin pool: 0.01 ng/mL-10μg/mL) was injected along with the samples as an external calibration curve for absolute quantification.

Example 8: A manipulatable model of maternal-pup transmission of cultured WLZ-associated taxa

[0321] Selection of bacterial strains - To test the role of P. copri in the context of a defined human gut microbial community that captured features of the developing communities of children who had been enrolled in the clinical study of MDCF-2, 20 bacterial isolates were selected, 16 of which were cultured from the fecal microbiota of 6- to 24-month-old Bangladeshi children living in Mirpur (Table 10). They included strains initially identified by the close correspondence of their 16S rRNA gene sequences to (i) a group of taxa that describe a normal program of development of the microbiota in healthy Bangladeshi children and (ii) taxa whose abundances had statistically significant associations (positive or negative) with the rate of weight gain (b-WLZ) in clinical study participants, and statistically significant correlations with plasma levels of WLZ-associated proteins. The relatedness of these strains to the 1 ,000 MAGs assembled from fecal samples obtained from all participants in the clinical study was determined by average nucleotide sequence identity (ANI) scores, alignment coverage parameters and their encoded metabolic pathways. A cultured, bacterial strain was deemed as representing a specific MAG if the whole genome alignment coverage was >10%, ANI was >94%, and the binary phenotype concordance score was >90% (see Methods). Based on these criteria, four of the 20 strains were classified as corresponding to MAGs positively associated with WLZ, including P. copri , and eight strains as corresponding to MAGs negatively associated with WLZ.

[0322] Liquid chromatography-mass spectrometry (LC-MS) analysis of glycosidic linkages and polysaccharides in MDCF-2 and RUSF disclosed that cellulose, galactan, arabinan, xylan, and mannan represent the principal non-starch polysaccharides in MDCF-2. Gene set enrichment analysis (GSEA) of fecal microbial RNA-Seq datasets generated from children in the MDCF-2 and RUSF arms of the clinical trial disclosed that MDCF-2 produced a meta-transcriptome that was enriched for components of metabolic pathways involved in the utilization of arabinose, a- arabinooligosaccharides (aAOS), and fucose. One-third of the leading-edge’ transcripts associated with these pathways (/.e., transcripts most discriminatory for the pathway response) were derived from the two P. copri MAGs whose abundances were positively correlated with WLZ (MAGs Bg0018 and Bg0019); These leading-edge transcripts include 11 of the 14 related to aAOS utilization. Moreover, a comparison of the fecal meta-transcriptomes of children in the MDCF-2 arm of the clinical study who were classified as being in the upper versus lower quartiles of WLZ responses to treatment revealed that those in the upper quartile exhibited significant enrichment in the expression of metabolic pathways for utilization of xylooligosaccharides, fructooligosaccharides, oligogalacturonate, galactooligosaccharides, galactose, glucuronate, galacturonate and a-arabinooligosaccharides. A majority of the leading-edge transcripts in these pathways were also derived from the two P. copri MAGs. Another feature that distinguished these two MAGs from the other nine P. copri MAGs present in the microbiomes of study participants is that they share 10 functionally conserved PULs, including seven that are completely conserved and three that are partially conserved, albeit structurally distinct (see Methods for the criteria used to classify the degree of PUL conservation). These 10 PULs encode a diverse set of glycoside hydrolases (Table 11) including a multifunctional glycoside hydrolase with broad substrate specificity for glycans present in MDCF-2 (range of substrates: β-glucan, β- mannan, xylan, arabinoxylan, glucomannan, and xyloglucan). Notably, the degree of representation of the seven completely conserved PULs among the 11 P. copri MAGs identified in study participants was highly predictive of each MAG’s association with WLZ, suggesting a link between metabolism of carbohydrates by P. copri and growth responses among the malnourished children.

[0323] The Bangladeshi P. copri strain PS131.S11 was the only P. copri strain in the 20- member collection. There were several reasons why PS131.S11 was chosen over four other cultured P. copri strains obtained from Bangladeshi children. First, based on phylogenetic distance, P. copri PS131.S11 was most similar to MAGs Bg0018 and Bg0019 (FIGs. 9A-C). Second, it has an overall binary phenotype concordance score of 97% and 96% when compared to Bg0018 and Bg0019, respectively. Among 55 carbohydrate utilization pathways analyzed, 53 are shared across PS131.S11 , Bg0018 and Bg0019. Importantly, a total of 93% and 95% of the reconstructed carbohydrate utilization pathways induced in Bg0018 and Bg0019 by MDCF-2 are represented in PS131.811. Third, P. copri PS131.S11 contains 32 PULs including six of the 11 highly and partially conserved PULs shared by Bg0018 and Bg0019. These six PULs in P. copri PS131.S11 were predicted to be involved in utilizing arabinoxylan (PUL15), 13-glucan (PUL8 and PUL30), pectin (PUL3), pectic galactan (PUL14), starch (PUL27a), and xylan (PUL8) (Table 11). Although the strict criteria for conservation with the Bg0018/Bg0019 PULs was not met, an additional arabinogalactan-targeted PUL (PUL27b) immediately adjacent to the conserved PUL27a was also identified.

[0324] P. stercorea was the only other Prevotella species present in the 20-member collection. Although none of the WLZ positively (or negatively) associated MAGs identified in the clinical study belonged to P. stercorea, this isolate was included in the collection to assess the specificity of the responses of P. copri to MDCF-2. The P. stercorea isolate did not possess any of the PULs presents in P. copri PS131.S11 or Bg0018/Bg0019, even after relaxing the criteria for sequence conservation to account for the taxonomic divergence between the two species. The cultured P. stercorea strain has 10 PULs, only five of which encode known carbohydrate utilization enzymes. The glycoside hydrolases in these five PULs were predicted to have very different carbohydrate specificities from those found in the P. copri strain and two P. copri MAGs (the P. stercorea PULs mainly target non-plant glycans) (Table 11).

[0325] B. infantis is a prominent early colonizer of the gut. Therefore, it was ensured that it was well represented at the earliest stages of assembly of the defined community so that later colonizers such as P. copri could establish. The collection of cultured isolates also included two strains of Bifidobacterium longum subsp. infantis (B. infantis) recovered from Bangladeshi children - B. infantis Bg463 and B. infantis Bg2D9. The Bg463 strain had been used in earlier preclinical studies that led to development of MDCF-2.

Example 9: Initial colonization and phenotyping

[0326] Design - The 20-strain collection was used to perform a 3-arm, fixed diet study that involved ‘successive’ waves of maternal colonization with four different bacterial consortia (FIGs. 10A-C). The sequence of introduction of taxa into dams was designed to emulate temporal features of the normal postnatal development of the human gut community, e.g., consortia 1 and 2 were comprised of strains that are prominent colonizers of healthy infants/children in the first postnatal year while those in consortium 3 are prominent in the second postnatal year. This dam- to-pup colonization strategy also helped overcome the technical challenge of reliable delivery of bacterial consortia to newborn pups via oral gavage.

[0327] Dually-housed germ-free dams were switched from a standard breeder chow to a ‘weaning-diet’ supplemented with MDCF-2 on postpartum day 2, two days before initiation of the colonization sequence. This diet was formulated to emulate the diets consumed by children in the clinical trial during MDCF-2 treatment (See Methods; FIG. 10A; Tables 13, 16, 17). It contained (i) powdered human infant formula, (ii) complementary foods consumed by 18-month-old children living in Mirpur, Bangladesh where the study took place, and (iii) MDCF-2. The contributions of the milk, complementary food and MDCF-2 ‘modules’ to total caloric content (53%, 17%, and 30%, respectively) were based on published studies of the diets of cohorts of healthy and undernourished 12- to 23-month-old children from several low- and middle-income countries, including Bangladesh, as well as the amount of MDCF-2 given to the 12-18-month-old children with MAM in the clinical study.

Table 16: Ingredients in each diet module

Table 17: Representation of modules in the weaning diet supplemented with MDCF-2

[0328] In Arm 1 , dams received the following series of oral gavages: (i) on postpartum day 4, a consortium of five ‘early’ infant gut community colonizers; (ii) on postpartum day 7, P. copri and P. stercorea; (iii) on postpartum days10 and 12 additional age-discriminatory and WLZ-associated taxa, and (iv) on postpartum day 21 , P. copri , P. stercorea, and Faecalibacterium prausnitzii (FIG. 10C). At this last time point, the three strains were given by oral gavage to both the dams and their offspring to help promote successful colonization. In Arm 2, pups were subjected to the same sequence of microbial exposures and the same diet manipulations as in Arm 1 , except that B. infantis Bg463 rather than B. infantis Bg2D9 was included in the first gavage mixture. Arm 3 was a replicate of Arm 2 but without the Prevotella gavages. Pups in all three arms were subjected to a diet sequence that began with exclusive milk feeding (from the nursing dam) followed by a weaning period where pups had access to the weaning phase diet supplemented with MDCF-2. Pups were weaned at P24, after which time they received MDCF-2 alone ad libitum until P53 when they were euthanized. The rationale for the timing of the first three gavages was based on the diet sequence [gavage 1 of early colonizers at a time (P4) when mice were exclusively consuming the dam’s milk, gavage 2 as the pups were just beginning to consume the human weaning (complementary food) diet, gavage 3 somewhat later during this period of ‘complementary feeding and the fourth gavage to help to ensure a consistent level of P. copri colonization at the end of weaning (and subsequently through the post-weaning period)]. The relative abundances of these strains in fecal samples collected from dams on days postpartum days 21 , 24, and 35, as well as the absolute abundances of these strains in fecal samples collected from their offspring on P21 , P24, P35, and P53 were quantified by shotgun sequencing of community DNA (n=2 dams and 5-8 pups analyzed/arm).

[0329] A relationship between B. infantis and P. copri coionization - B. infantis Bg2D9 successfully colonized pups at P21 in Arm 1 [8.4±0.5 Iog10 (genome equivalents/g feces) (mean±SD); relative abundance, 9.0±3.9% (mean±SD)]. In contrast, the abundance of B. infantis Bg463 was 5-8 orders-of-magnitude lower in Arms 2 and 3 [3.22±1.9 and 0.6±1.5 log 10 (genome equivalents/g feces) (mean±SD), respectively]. These differences were sustained through P53 (FIG. 10D). The results also revealed that exposure to B. infantis Bg2D9 in Arm 1 was associated with an absolute abundance of P. copri in the pre-weaning period (P21) that was 3 orders-of- magnitude greater than in Arm 2 mice exposed to B. infantis Bg463; P<0.005, Mann-Whitney U test] (FIG. 10E). Administering the fourth gavage on P21 elevated the absolute abundance of fecal P. copri in Arm 2 to a level comparable to Arm 1; this level was sustained throughout the post-weaning period (P24 to P53) [FIG. 10E; P>0.05; mixed linear effects model (Methods). This effect of the fourth gavage was also evident in the ileal and cecal microbiota.

[0330] The effects of B. infantis on P. copri did not generalize to P. stercorea. Unlike P. copri , the absolute abundance of P. stercorea in feces sampled on P21 and P24 was not significantly different in mice belonging to Arms 1 and 2 (P>0.05, Mann-Whitney U test). Prior to weaning at P24, the absolute abundance of P. stercorea was 5-orders of magnitude lower than that of P. copri . Throughout the post-weaning period, the absolute abundance of P. stercorea remained similar in members of both treatment arms (P>0.05, Mann-Whitney U test) but 2-orders of magnitude below that of P. copri .

[0331] Based on these results, the colonization dependency of P. copri on B. infantis was directly tested in two independent experiments whose designs are outlined in FIG. 9A. Dually- housed germ-free dams were switched from standard breeder chow to the weaning Bangladeshi diet supplemented with MDCF-2 on postpartum day 2. On postpartum day 4, one group of dams was colonized with B. infantis Bg2D9. On postpartum days 7 and 10, both groups of gnotobiotic mice were gavaged with a consortium containing five P. copri strains. These five P. copri strains (1A8, 2C6, 2D7, G8, and PS131.S11) were all isolated from fecal samples obtained from Bangladeshi children (Table 12). Pups were separated from their dams at the completion of weaning and their diet was transitioned to MDCF-2. The results disclosed that the total absolute abundance of P. copri in feces collected on P42 from mice that had received B. infantis Bg2D9 was three orders of magnitude higher than in animals never exposed to B. infantis (FIG. 9B; Tables 14 and 15) - a finding that confirmed what was observed between Arms 1 and 2 of the initial colonization experiment (see FIG. 10E). There was no statistically significant difference in weight gain from P23 to P42 between the mono- and bi-colonization groups. However, interpretation of this result was confounded by the fact that compared to the bi-colonized animals with significantly higher levels of P. copri , mono-colonized mice with low levels of P. copri had massive, fluid-filled cecums, similar to those commonly seen in germ-free mice. This pronounced cecal enlargement adds substantially to body weight and in a comparison of the two treatment groups obscures the ability to discern whether increased levels of P. copri has ponderal growthpromoting effects.

[0332] Effects on weight gain and metabolism of MDCF-2 glycans - Gnotobiotic mice in Arm 1 exhibited a significantly greater increase in weight gain between P23 (the first time point measured, 2 days after the final gavage) and P53 compared to mice in the two other experimental arms [P < 0.05 compared to Arm 2; P<0.01 compared to Arm 3; linear mixed-effects model (see Methods)] (FIG. 10F). Unlike the mono- and bi-colonization experiments described above, cecal sizes were comparable across the three treatment groups. Based on these results, we advanced samples collected from mice in Arms 1 and 3 for additional analyses of the metabolism of MDCF- 2 glycans.

[0333] Integrating results from mass spectrometric and microbial RNA-Seq data generated from cecal contents harvested from mice at the time of euthanasia (P53) provided several lines of evidence for the important role played by P. copri in metabolizing the principal polysaccharide components of MDCF-2. First, unlike P. stercorea, P. copri PS131.S11 contains and expresses PULs involved in processing MDCF-2 glycans: i.e., PUL27a and PUL27b specify and express CAZymes known or predicted to digest starch and arabinogalactan, while PUL2 possesses and expresses a fucosidase that could target the terminal residues found in arabinogalactan II (Table 11). Second, UHPLC-QqQ-MS-based measurements of 49 glycosidic linkages in cecal contents disclosed that animals in Arm 1 harboring P. copri had (I) significantly lower levels of t-p-Ara, t- f-Ara, 2-f-Ara, 2,3-f-Ara, and 3,4-p-Xyl/3,5-f-Ara (P<0.05; Mann-Whitney U Test; FIG. 11A) and (ii) significantly lower amounts of arabinose in cecal glycans (P<0.05; Mann-Whitney U test; FIG. 11B). Third, GC-MS-based measurements of cecal short-chain fatty acids showed significantly higher levels of acetate, indicating increased fermentation by the P. copri -containing microbial community (P<0.01 ; Mann-Whitney U test) (FIG. 11C; Table 18). Together, these results indicate that mice with the P. copri -containing community exhibit a greater degree of liberation of arabinose from MDCF-2 glycans.

Table 9: Targeted mass spectrometric analysis of short-chain fatty acids in the cecal contents of gnotobiotic mice colonized with defined consortia

[0334] Increased levels of enzyme-resistant arabinose linkages, such as 5-f-Ara, 2-f-Ara, and 2,3-f-Ara, has been previously reported in the feces of MDCF-2 treated children in the upper- compared to lower quartile of WLZ response. The lower levels of these resistant arabinose- containing linkages documented in gnotobiotic mice harboring P. copri versus those lacking the organism indicate more complete degradation of branched arabinans in their cecums - a portion of the gastrointestinal tract that is specialized for microbial fermentation. Because (i) P. copri was the only Prevotella sp. in the defined community that encodes and expresses CAZymes capable of degrading linkages in MDCF-2 glycans (Table 11), (ii) P. copri has higher absolute abundance than P. stercorea, and (iii) previous analyses linked the abundance of P. copri but not P. stercorea MAGs to host growth, Arm 1 of this experiment is referred as ‘w/ P. copri ’ and Arm 3 as ‘w/o P. copri ’, as described herein.

[0335] Effects on expressed metabolic functions in other community members - To investigate the transcripts driving the observed differences in microbial glycan processing in the ‘w/ P. copri ’ versus ‘w/o P. copri ’ arms, we performed microbial RNA-Seq on cecal contents collected at the time of euthanasia (Tables 19 and 20). Transcript abundance tables were filtered and counts were aggregated based on mcSEED reconstructions of metabolic pathways to give an average expression value across the genes in a given metabolic pathway in a given organism (see Methods'). Principal component analysis (PCA) was performed on these aggregated tables and compared the contribution of each expressed metabolic pathway to each principal component and the clustering of samples from each experimental Arm in a space determined by PC1 and PC2 (FIG. 11 D). The results revealed significant separation of meta-transcriptomes aggregated by metabolic pathway between cecal samples from the with and without P. copri Arms (P<0.001 ;

PERMANOVA) (FIG. 11 E).

Table 19: Sample metadata

Table 20: Level of gene expression in P. copri PS131.S11 PULs (TPM normalized)

[0336] In order to identify pathways that drive separation of samples along PC1 , the contribution of each pathway was used in each community member to each singular vector to rank the pathways. Notably, arabinose utilization was consistently among the most upregulated pathways with P. copri colonization (FIG. 11F). Moreover, three of the four bacteria capable of arabinose utilization (Blautia obeum, Bifidobacterium catenulatum, and Mitsuokella multacida) were significantly more abundant in the cecums of mice colonized with P. copri .

[0337] Subsequently, differential expression analysis was performed (using edgeR; see Methods) to further assess the effects of P. copri -colonization on the transcriptomic profiles of community members at gene-level resolution. Differentially expressed transcripts associated with complete metabolic pathways are summarized in FIG. 11G. Among the four arabinose-utilizing strains, B. obeum and M. multacida demonstrated significantly higher expression of all of their genes involved in arabinose utilization with P. copri colonization. Both B. obeum and M. multacida also demonstrated statistically significant upregulation of all or most of their genes involved in biosynthesis of glutamine and glutamate, as well as branched-chain amino acids (isoleucine, leucine, and valine). In addition, B. obeum displayed elevated transcription of genes involved in acetate production in the P. copri -colonized mice. Integrating the mass spectrometric and microbial RNA-seq results generated from this defined consortium indicates that P. copri colonization leads to liberation of arabinose from MDCF-2 glycans, which in turn becomes bioavailable to other community members, including positively WLZ-associated members such as B. obeum, resulting in their increased fitness and altered expressed metabolic functions.

Example 10: snRNA-Seq of intestinal gene expression

[0338] Histomorphometric analysis of villus height and crypt depth in jejunums harvested from mice harboring communities with and without P. copri (n=8 and 7, respectively) disclosed no statistically significant architectural differences between the two treatment groups (P>0.05; Mann- Whitney U test; Table 21). snRNA-Seq was used subsequently, to investigate whether these two colonization states produced differences in expressed functions along the crypt-villus axis in jejunal tissue collected from P53 animals (n=4/treatment arm; FIG. 12A-F; FIG. 13A-C; Tables 22 and 23). A total of 30,717 nuclei passed our quality metrics (see Methods). Marker gene-based annotation disclosed cell clusters that were assigned to the four principal intestinal epithelial cell lineages (enterocytic, goblet, enteroendocrine, and Paneth cell) as well as to vascular endothelial cells, lymphatic endothelial cells, smooth muscle cells and enteric neurons (FIG. 13A-C). Marker gene analysis allowed us to further subdivide the enterocytic lineage into three clusters: ‘villusbase’, ‘mid-villus’ and ‘villus-tip’. Pseudobulk snRNA-seq analysis, which aggregates transcripts for each cell cluster and then uses edgeR to identify differentially expressed genes in each cluster ^18,19, disclosed that a majority of all statistically significant differentially expressed genes (3,651 of 5,765; 63.3%) were assigned to the three enterocyte clusters (FIG. 13C).

Table 21: Quantification of jejunal villus height and crypt depth from gnotobiotic mice harboring defined bacterial consortia

Table 22: snRNA-Seq dataset generated from jejunums of gnotobiotic mice colonized with defined consortia of cultured bacterial strains, sample metadata

Table 23: Proportional representation of cell clusters identified from snRNA-seq dataset

[0339] NicheNet was used initially, to evaluate the effects of the P. copri community on intercellular communications. NicheNet integrates information on signaling and gene regulation from publicly available databases to build a “prior model of ligand-target regulatory potential” and then predicts potential communications between user-defined “sender” and “receiver” cell clusters. After incorporating snRNA-Seq-based expression data from both sender and receiver cells, NicheNet computes a list of potential ligand-receptor interactions between senders and receivers. The ligand-receptor interactions in the resulting list are then ranked based on the effect of the ligand-receptor interactions on downstream genes in their signaling pathway (/.e., more downstream genes are expressed in a ‘high-ranking’ interaction). After this ranking step, an additional filter is applied, with ligand-receptor interactions having firm experimental validation in the literature designated as “bona fide” interactions. Finally, NicheNet uses information generated by Seurat from a snRNA-Seq dataset to identify altered “bona fide” ligand-receptor interactions.

[0340] The six epithelial cell clusters (crypt stem cells, proliferating TA/stem cells, villus base, mid-villus, and villus tip enterocytes and goblet cells) were designated as “receiver cells” while all clusters (both epithelial and mesenchymal) were designated “sender cells”. NicheNet analysis was then conducted for each sender-receiver pair. FIG. 14 shows bona fide ligand-receptor interactions that are altered between the two colonization conditions for each receiver cell cluster.

Ligands identified include those known to affect cell proliferation (igf-1), cell adhesion (cadm1, cadm3, cdh3, Iama2, npnt), zonation of epithelial cell function/differentiation along the length of the villus (bmp4, bmp5), as well as immune responses (cadml, il15, tgfbl, tnc) (FIG. 14). Among all receiver cell clusters, crypt stem cells exhibited the highest number of altered bona fide ligandreceptor interactions. For example, lgf-1 signaling is known to enhance intestinal epithelial regeneration. The colonization with the P. copri -containing consortium was found associated with markedly elevated expression of igf-1 in goblet cells and lymphatic endothelial cells - an interaction that propagates downstream to activate Igf-1 signal transduction in crypt stem cells.

[0341] The Compass algorithm was subsequently applied to our snRNA-Seq datasets to generate in silico predictions of the effects of the consortia containing and lacking P. copri on the metabolic states of (i) stem cell and proliferating TA cell clusters positioned in crypts of Lieberkuhn, (ii) the three villus-associated enterocyte clusters, and (iii) the goblet cell cluster. Compass combines snRNA-seq data with the Recon2 database. This database describes 7,440 metabolic reactions grouped into 99 Recon2 subsystems, plus information about reaction stoichiometry, reaction reversibility, and associated enzyme(s). Using snRNA-seq data, Compass computes a score for each metabolic reaction. If the metabolic reaction was reversible, then one score eas calculated for the “forward” reaction and another score was calculated for the “reverse” reaction. A ‘metabolic flux difference’ was calculated (see Methods) to quantify the difference in net flux for a given reaction (/.e., the forward and reverse activities) between the two treatment groups.

[0342] FIG. 15A-F shows the predicted metabolic flux differences for Recon2 reactions in enterocytes distributed along the length of the villus and in goblet cells. In clusters belonging to the enterocyte lineage, the number of statistically significant differences was greatest in villus base enterocytes and decreases towards the villus tip (FIG. 15A). Mice in the w/ P. copri treatment group had the greatest predicted increases (relative to their w/o P. copri counterparts) in activities of subsystems related to energy metabolism, the metabolism of carbohydrates, amino acids and fatty acids, as well as various transporters, in their villus base and mid-villus enterocytes (FIG. 15B, FIG. 16).

[0343] While enterocytes prioritize glutamine as their primary energy source, they were also able to utilize fatty acids and glucose. The Compass-defined increase in reactions related to fatty acid oxidation that occur in the villus enterocytes of mice in the w/ P. copri group extended to their crypts of Lieberkuhn (FIG. 17B). Fatty acid oxidation has been linked to intestinal stem cell maintenance and regeneration. Mice colonized with P. copri exhibited ‘statistically credible increases’ in the proportional representation of crypt stem cells and proliferating TA/stem cells but not in their villus-associated enterocytic clusters (FIG. 17C; see Table 23 for results regarding all identified epithelial and mesenchymal cell clusters). [The term ‘statistically credible difference’ was defined by scCODA (see Methods)]. Compared to mice lacking P. copri , those colonized with this organism also had predicted increases in energy metabolism in their goblet cells, as judged by the activities of subsystems involved in glutamate (Glu) metabolism, the urea cycle, fatty acid oxidation and glycolysis (FIG. 17B).

[0344] Citrulline is generally poorly represented in human diets; as it is predominantly synthesized via the metabolism of glutamine in small intestinal enterocytes and transported into the circulation. Studies of various enteropathies and short bowel syndrome have demonstrated that citrulline is a quantitative biomarker of metabolically active enterocyte mass and its levels in plasma were indicative of the absorptive capacity of the small intestine. Citrulline was markedly lower in blood from children with severe acute malnutrition compared to levels found in healthy controls from the same community. Low plasma citrulline levels have also been reported in cohorts of children with environmental enteric dysfunction, with higher levels predictive of future weight gain.

[0345] Both glutamate and arginine were found important for citrulline production in enterocytes. Glutaminase (GIs) and glutamate dehydrogenase (GluD) in the glutamine pathway provided ammonia for generating carbamoyl phosphate (FIG. 17D). Arginine is a primary precursor for ornithine synthesis: ornithine transcarbamylase (Ots) produces citrulline from carbamoyl phosphate and ornithine. Compass predicted that mice harboring P. copri exhibited statistically significant increases in these reactions in their villus base and mid-villus enterocyte clusters [q<0.05 (adjusted P-value); Wilcoxon Ranked Sum test; FIG. 17D]. Targeted mass spectrometric analysis confirmed that citrulline was significantly increased in jejunal, ileal and colonic tissue segments, as well as in the plasma of mice harboring P. copri (P<0.05; Mann-

Whitney U test; Fig. 17E).

[0346] The presence of P. copri was also associated with significantly greater predicted activities in Recon2 subsystems involved in transport of nine amino acids (including the essential amino acids leucine, isoleucine, valine, and phenylalanine), dipeptides and monosaccharides (glucose and galactose) in villus base and mid-villus enterocytes (FIG. 17F). This prediction suggested greater absorptive capacity for these important growth-promoting nutrients, which are known to be transported within the jejunum at the base and middle regions of villi.

Example 11: Additional assessment of host metabolic effects produced by P. copri

[0347] To validate some of these Compass-based predictions, the experiment described above was repeated but with just two of its arms (“w/ P. copri ” and “w/o P. copri ”) and with a larger number of animals (4 dually housed germ-free dams yielding 18-19 viable pups per arm). The same cultured strains, the same sequence of their introduction and the same sequence of diet switches were applied (FIG. 17A). B. infantis strain Bg2D9 was utilized in both arms. Reproducible colonization of consortium members within each arm was confirmed by quantifying their absolute abundances in cecal samples collected at the time of euthanasia (P53; see Table 24). Consistent with the previous experiment, animals in the w/ P. copri arm exhibited significantly greater weight gain between P23 and P53 [P<0.05; linear mixed-effects model (see Methods)] (FIG. 17B).

Table 24: Absolute abundances of bacterial strains in dam-pup dyads colonized with cultured bacterial consortia in the validation experiment, sample metadata

[0348] Mass spectrometric analysis of host metabolism - Targeted mass spectrometry was used to quantify levels of 20 amino acids, 19 biogenic amines, and 66 acylcarnitines in the jejunum, colon, gastrocnemius, quadriceps, heart muscle, and liver of the two groups of mice. Additionally, the 66 acylcarnitines were quantified in their plasma (FIG. 15C-E). Consistent with the previous experiment, citrulline, the biomarker for metabolically active enterocyte biomass, was significantly elevated in the jejunums of mice belonging to the w/ P. copri group (P<0.05; Mann- Whitney U test) (FIG. 15C).

[0349] Significant elevations of acylcarnitines derived from palmitic acid (C16:0), stearic acid (C18:0), oleic acid (C18:1 ), linoleic acid (C18:2), and linolenic acid (C18:3) were observed in the jejunums of P. copri -colonized animals (P<0.01 ; Mann-Whitney U test) (FIG. 15D); and which are the major fatty acids found in soybean oil, a principal source of lipids in MDCF-2. These acylcarnitine chain lengths, were found at higher abundance than all other medium or long-chain acylcarnitine species in the samples, indicating their role as primary dietary lipid energy sources . Elevation of these species suggested an increased transport and β-oxidation of long-chain dietary lipids in the jejunum.

[0350] Analysis of colonic tissue showed significant elevation of C16:0, C18:1 , and C18:2 acylcarnitines in P. copri -colonized animals, suggesting that β-oxidation was also elevated in tissue compartments not directly involved in lipid absorption (P<0.01 ; Mann-Whitney U test) (FIG. 15E). This finding was matched by a significant elevation in plasma levels of non-esterified fatty acids in w/ P. copri animals, suggesting higher circulation of dietary lipids which would support fatty acid p-oxidation in peripheral tissues (P<0.05; Mann-Whitney U test) (FIG. 15F; Table 25). Targeted LC-MS was further conducted in liver, gastrocnemius muscle, quadriceps and heart. The statistically significant difference in levels of acylcarnitines whose chain length corresponded to components of soybean oil was an increase in 18:2 and 18:3 species in the myocardium of w/ P. copri compared to w/o P. copri animals. Additionally, jejunal levels of C3 and C4 acylcarnitines as well as colonic levels of C4 and C5 acylcarnitines known to be derived from branched-chain amino acid catabolism, were significantly elevated in the P. copri -colonized animals (P<0.05; Mann-Whitney U test; FIG. 15D, 15E). Together, these results suggested that the presence of P. copri induced differential fuel utilization via fatty acid b-oxidation at sites involved in dietary nutrient absorption. Table 25: Non-esterfied fatty acids in plasma (NEFA; mmol/L)

Example 11: Evaluation of the effects of preweaning P. copri colonization

[0351] To directly determine whether pre-weaning colonization with P. copri strains resembling MAGs Bg0018 and Bg0019 is sufficient to promote growth and produce the metabolic effects described above, an additional experiment was performed. The design was similar to that described above but there were several important modifications. First, P. stercorea was not included in the second gavage mixture; it only contained P. copri . Second, two strains of P. copri (D5.2 and F5.2) were used, cultured from fecal samples from Bangladeshi children, that displayed greater similarity to Bg0018 and Bg0019 than the PS131 strains quantified by ANI and their content of PULs and mcSEED metabolic pathways. Both P. copri D5.2 and F5.2 shared 102/106 (96%) metabolic pathway completeness annotations with MAG Bg0018 and 101/106 (95%) annotations with MAG Bg0019. Similarly, 9 of the 10 functionally conserved PULs shared by MAGs Bg0018 and Bg0019 were conserved in P. copri D5.2 and F5.2. Third, the fourth gavage was omitted, previously administered at the end of the weaning period, that had included P. copri and P. stercorea. The control group of animals did not receive P. copri (n=2 dams and 13 pups/treatment group).

[0352] Shotgun sequencing of DNA isolated from cecal contents collected at the time of euthanasia (P53) confirmed that animals in the experimental group had been colonized with both P. copri isolates as well as all other members of the defined consortia. In animals colonized with both isolates, P. copri D5.2 was present at significantly higher absolute abundance than the F5.2 strain (FIG. 17B); their relative abundances were 37.8±4.4% and 15.5+1.0%, respectively, compared with 31 ±6.6% and 24+8.0% for P. copri PS131 in the first and second experiments. Colonization of all administered strains was confirmed in the control group. Comparing the experimental and control groups disclosed that carriage of these two isolates was associated with a significantly greater total bacterial load, indicating that their colonization augmented community biomass without displacing other bacteria (FIG. 17B).

[0353] A significantly greater increase in body weight was observed between P23 and P53 in mice colonized with P. copri D5.2 and F5.2 compared to those without P. copri [P<0.0001 ; linear mixed-effects model] (FIG. 17C). The difference in the mean percent increase in postweaning weight between the experimental and control groups in this experiment (24%) was comparable to that document in the two previous experiments (25% in the first and 13% in the second); as in these previous experiments, the weight difference was not attributable to differences in cecal size.

[0354] Mass spectrometry confirmed that preweaning colonization with P copri affected intestinal lipid metabolism and was a major determinant of MDCF-2 glycan degradation. Targeted LC-MS of ileal and colonic tissue revealed significant elevation of long-chain acylcarnitines corresponding to soybean oil lipids, consistent with changes observed in the prior experiment (Fig 15e).

[0355] A comparison of the two isolates used in this experiment against our previously used isolate, P. copri PS131 , and the 10 functionally conserved PULs of MAGs Bg0018 and Bg0019 disclosed that these isolates contain PULs conserved between Bg0018 and Bg0019 involved in the degradation of substrates including galactose and mannose containing glycans (Fig 17D). UHPLC-QqQ-MS-based measurement of total monosaccharides in cecal contents indicated that the presence of these two more MAG Bg0018- and Bg0019-like strains resulted in significantly lower levels of arabinose, consistent with previous observations using P. copri PS131 , as well as galactose, a finding that was specific to this experiment (Fig. 17E). We simultaneously observed that P. copri D5.2 and F5.2 colonization significantly lower levels of all arabinose-containing linkages measured, as well as three galactose-containing linkages (Fig. 17F and G). Together, these data suggest that the different PUL content of these new isolates leads to enhanced degradation of dietary glycans by the microbial community.

[0356] Targeted UHPLC-QqQ-MS-based measurement of glycosidic linkages in cecal contents indicated that the presence of these two more MAG Bg0018- and Bg0019-like strains resulted in X effects (FIG. 17D). Targeted UPHLC-QqQ-MS measurements of all 20 amino acids and seven B-vitamins revealed that P. copri colonization was associated with significantly higher cecal levels of two essential amino acids (tryptophan, lysine), and seven non-essential amino acids (glutamate, glutamine, aspartate, asparagine, arginine, proline, glycine) and higher levels of pantothenic acid (B5).

[0357] It was concluded that pre-weaning colonization with P. copri augments weight gain in the context of the MDCF-2 diet, that the organism is a major determinant/effector of MDCF-2 glycan degradation, and that its presence in the community produces substantial changes in intestinal tissue fatty acid metabolism. Example 12: Summary of results from Examples 7-11

[0358] In the disclosed examples, a ‘reverse translation’ strategy was illustrated that can be used to address the mechanisms by which microbiome-targeted nutritional interventions impact the operations of microbial community members and how these changes can alter human physiology at a molecular, cellular and systems level. Gnotobiotic mice were colonized with defined consortia of age- and WLZ-associated bacterial strains cultured from the study population. Dam-to-pup transmission of these communities occurred in the context of a sequence of diets that re-enacted those consumed by children in the clinical study. Microbial RNA-Seq and targeted mass spectrometry of glycosidic linkages present in intestinal contents provided evidence that Prevotella copri, represented by an isolate similar to MAGs identified as WLZ-associated in the clinical trial, was crucial to the metabolism of polysaccharides contained in MDCF-2. snRNA-Seq and targeted mass spectrometry indicated that P. copri increased the uptake and metabolism of lipids, including those fatty acids that are most prominently represented in the soybean oil that comprises the principal lipid component of MDCF-2. Additional effects on uptake and metabolism of amino acids (including essential amino acids) and monosaccharides were predicted. The effects on nutrient processing and energy metabolism involved proliferating epithelial progenitors in the crypts as well as their descendant lineages distributed along the villus. snRNA-Seq revealed discrete spatial features of these effects, with populations of enterocytes positioned at the base-, mid- and tip regions of villi manifesting distinct patterns of differential expression of a number of metabolic functions.

[0359] In summary, the above-described examples illustrated an approach for identifying members of a gut microbial community that function as principal metabolizers of MDCF components as well as key effectors of host biological responses. Characterizing their genomic features and expression, can be used for developing microbiome-based diagnostics for stratification of populations of undernourished children who are candidates for treatment with a given MDCF, and for monitoring their treatment responses, including in adaptive clinical trial designs. Further, a knowledge base needed is provided for (i) creation of ‘next generation’ MDCFs composed of (already) identified bioactive components, but from alternative food staples which are more readily available, affordable and culturally acceptable for populations living in different geographic locales; (ii) more informed decisions about the dose of an MDCF for undernourished children as a function of their stage of development and disease severity, and (iii) evolving policies about complementary feeding practices that build upon traditional macro- and micro-nutrient- centric considerations, but now add insights about how food components impact the fitness and expressed beneficial functions of growth-promoting elements of a child’s microbiome. Finally, the recovered growth-promoting strains can be used as next-generation probiotics, and/or as components of synbiotics for repairing gut microbial communities that cannot be resuscitated with food-based interventions alone.

Example 13: Effects of MDCF-2 as provided in Examples 1-6 persist beyond cessation of the 3-month intervention

[0360] In order to study if the effect of intervention with administration of MDCF-2 as provided in Examples 1-6, last beyond the 3-month period of intervention, weight-for-length z-score (WLZ), length-for-age z-score (LAZ), weight-for-age z-score (WAZ) between the MDCF-2 and RUSF groups at different time points up to 2 years after cessation of the 3-month intervention in 12-18 month children with primary MAM was calculated. Table 26 provides the baseline characteristics of the children in the primary MAM study.

Table 26: Baseline characteristics of children in the primary MAM study

[0361] Figures 18A-C provide data for the WLZ, LAZ and WAZ comparison corresponding to: 9=One-month follow-up after cessation of intervention; 10=Six-month follow-up after cessation of intervention; 11 =12-month follow-up after cessation of intervention; 12=18-month follow-up after cessation of intervention; and 13=24-month follow-up after cessation of intervention.

[0362] Table 27 provides a compilation of the results. Mixed effect multiple linear model adjusted for baseline anthropometry (WLZ, LAZ, WAZ score), interventions (RUSF or MDCF-2) child age (in days, continuous), child gender (male and female) and child past seven days morbidity status (yes, no) were developed.

Table 27: Results

Results show that the effect of the intervention persist beyond cessation of the 3-month intervention and lead to a delayed but lasting improvement in stunting (up to 2 years after cessation of treatment - Figs 18A-C and Table 27). This is a significant finding as there are few if any reported treatments that affect linear growth (LAZ) of stunted children in this way and thus expands the benefits of the MDF/P. copri combination beyond solely ponderal growth (weight gain).

TABLE 28

Table 30B(i) : Glycosidic linkage composition (peak area, arbitrary units / ng dried diet or ingredient) - MDCF-2

Table 30B(ii) : Glycosidic linkage composition (peak area, arbitrary units / ng dried diet or ingredient) - RUSF

Table 30C : Polysaccharide composition (FITDOG, μg polysaccharide I mg of dried diet or ingredient)

Previous Patent: METHOD OF PRODUCING SYNGAS FROM BIOMASS UTILIZING TAIL GAS FOR TAR REMOVAL

Next Patent: HYDRODYNAMIC AND GRAVITY METHOD OF FORMING AND SHAPING TAPERED MICROFLUIDIC DEVICES