DIAGNOSING INFLAMMATORY BOWEL DISEASES

Title:

DIAGNOSING INFLAMMATORY BOWEL DISEASES

Document Type and Number:

WIPO Patent Application WO/2023/002491

Kind Code:

Abstract:

A method of diagnosing an inflammatory bowel disease (IBD) of a subject is disclosed. The method comprises analyzing the RNA expression level of particular human gene in a fecal RNA sample of the subject, wherein when the expression level is above a predetermined amount it is indicative of the inflammatory bowel disease.

Inventors:

ITZKOVITZ SHALEV SHAUL (IL)
BAHAR HALPERN KEREN (IL)
EGOZI ADI (IL)
BEN-HORIN SHOMRON SILAN (IL)
UNGAR BELLA (IL)
BEN-MOSHE SHANI (IL)

Application Number:

PCT/IL2022/050793

Publication Date:

January 26, 2023

Filing Date:

July 21, 2022

Export Citation:

Click for automatic bibliography generation Help

Assignee:

YEDA RES & DEV (IL)
TEL HASHOMER MEDICAL RES INFRASTRUCTURE & SERVICES LTD (IL)

International Classes:

C12Q1/6883

Domestic Patent References:

WO2016120625A1	2016-08-04
WO2009120877A2	2009-10-01

Foreign References:

US20090258848A1	2009-10-15
US20200308644A1	2020-10-01
US7056716B2	2006-06-06
US4458066A	1984-07-03

Other References:

CUI ET AL., DIGESTIVE DISEASES AND SCIENCES, vol. 66, 2021, pages 1488 - 1498
J. SAMBROOK ET AL.: "Molecular Cloning: A Laboratory Manual", 1989, COLD SPRING HARBOUR LABORATORY PRESS
"PCR Protocols: A Guide to Methods and Applications", 1990, ACADEMIC PRESS
P. TIJSSEN: "Hybridization with Nucleic Acid Probes--Laboratory Techniques in Biochemistry and Molecular Biology (Parts I and Π", 1993, ELSEVIER SCIENCE
K. FRENKEL ET AL., FREE RADIC. BIOL. MED., vol. 19, 1995, pages 373 - 380
"Short Protocols in Molecular Biology", 2002, JOHN WILEY & SONS
S. A. NARANG ET AL., METH. ENZYMOL., vol. 68, 1979, pages 109 - 151
E. S. BELOUSOV ET AL., NUCLEIC ACIDS RES., vol. 25, 1997, pages 3440 - 3444
D. GUSCHIN ET AL., ANAL. BIOCHEM, vol. 250, 1997, pages 203 - 211
M. J. BLOMMERS ET AL., BIOCHEMISTRY, vol. 33, 1994, pages 7886 - 7896
E. S. MANSFIELD ET AL., MOL. CELL. PROBES, vol. 9, 1995, pages 145 - 156

Attorney, Agent or Firm:

EHRLICH, Gal et al. (IL)

Download PDF:

View/Download PDF PDF Help

Claims:

WHAT IS CLAIMED IS:

1. A method of diagnosing an inflammatory bowel disease (IBD) of a subject comprising analyzing the RNA expression level of at least one human gene in a fecal RNA sample of the subject, wherein the gene is selected from the group consisting of CSF3R, CASP4, NFKB1A, CFFAR, FAM49B, RNF145, FOSF2, PEF1, PTPRE, GK, MX2, NAGK, MCTP2, SFC03A1, STAT1, RASSF3, MARCKS, SAT1, VPS37B, RNF149, HLA-E, PLAUR, MSN, HIFIA, NBPF14, CXCR1, CSF2RA, CLEC2B, GBP5, IL1B, FZD3, MMP25 and OSM wherein when the expression level is above a predetermined amount it is indicative of the inflammatory bowel disease.

2. A method of diagnosing a disease of the gastrointestinal tract of a subject comprising analyzing the expression level of at least one gene in a fecal wash of the subject, wherein the expression level is indicative of the disease of the gastrointestinal tract.

3. A method of diagnosing an inflammatory bowel disease (IBD) of a subject comprising analyzing the RNA expression level of at least one human gene in a fecal RNA sample of the subject, wherein when the expression level of a human gene set forth in Table 1, 5 or 6 is statistically significantly altered over the level of said gene in a fecal RNA sample of a control subject, it is indicative of the inflammatory bowel disease.

4. The method of claims 1 or 3, further comprising depleting said fecal RNA sample of microbial RNA prior to the analyzing.

5. The method of claim 2, wherein said fecal wash is of the sigmoid colon of the subject.

6. The method of claim 2, wherein said fecal wash is of the rectum of the subject.

7. The method of any one of claims 1-2, wherein said analyzing the expression level comprises performing whole cell transcriptome analysis.

8. The method of any one of claims 1-2, wherein said analyzing the expression level comprises performing RT-PCR.

9. The method of claim 2, wherein said analyzing is effected at the RNA level.

10. The method of claim 2, wherein said analyzing is effected at the protein level.

11. The method of claim 1, wherein said fecal sample comprises a fecal wash, the at least one gene is selected from the group consisting of CSF3R, CFLAR, FAM49B, MX2, STAT1,

CASP4, NFKB1A, RNF145, FOSL2, PEL1, PTPRE and GK.

12. The method of claims 2 or 9, wherein said at least one gene is selected from the group consisting of CSF3R, CASP4, NFKB1A, RNF145, FOSL2, PEL1, PTPRE, MX2, NAGK, MCTP2, SLC03A1, STAT1, RASSF3 and GK.

13. The method of claims 2 or 9 wherein said at least one gene is selected from the group consisting of MX2, CSF3R, NAGK, MCTP2, SLC03A1, CASP4, NFKBIA, STAT1, RNF145 and RASSF3.

14. The method of claim 1, wherein said fecal sample comprises a solid fecal sample, the at least one gene is selected from the group consisting of MARCKS, SAT1, NFKBIA, VPS37B, RNF149, HLA-E, PLAUR, MSN, HIF1A, NBPF14, CXCR1, CSF2RA, CLEC2B, GBP5, IL1B, FZD3, MMP25 and OSM.

15. The method of claim 2, wherein the disease is an inflammatory bowel disease

(IBD).

16. The method of any one of claims 1, 3 or 15, wherein said IBD comprises ulcerative colitis or Crohn’s colitis.

17. The method of claim 2, wherein the disease is a colon cancer.

18. The method of claim 2, wherein the disease is irritable bowel syndrome.

19. The method of any one of claims 1-17, wherein said diagnosing the IBD comprises determining the severity of the IBD.

20. The method of any one of claims 1-16, wherein the expression level of said at least one gene correlates with the degree of histological inflammation.

21. A method of treating an inflammatory bowel disease of a subject in need thereof comprising:

(a) confirming that the subject has the inflammatory bowel disease according to the method of any one of claims 1 or 3; and

(b) administering to the subject a therapeutically effective amount of an agent useful for treating the disease.

22. A method of treating a disease of the gastrointestinal tract of a subject in need thereof comprising:

(a) confirming that the subject has the inflammatory bowel disease according to the method of claim 2; and

(b) administering to the subject a therapeutically effective amount of an agent useful for treating the disease.

23. A method of selecting an agent for the treatment of an inflammatory bowel disease (IBD) comprising:

(a) contacting the agent with an RNA sample derived from feces of a subject having the IBD; and

(b) analyzing the amount of at least one RNA set forth in Table 1, wherein a decrease in the amount of said at least one RNA in the presence of the agent as compared to the amount of said at least one RNA in the absence of the agent is indicative of an agent which is suitable for the treatment of the inflammatory bowel disease.

24. The method of claim 23, wherein said RNA sample is a solid fecal sample, the at least one gene is selected from the group consisting of MARCKS, SAT1, NFKBIA, VPS37B, RNF149, HLA-E, PLAUR, MSN, FHF1A, NBPF14, CXCR1, CSF2RA, CLEC2B, GBP5, IL1B, FZD3, MMP25 and OSM.

25. The method of claim 23, wherein said RNA sample is a fecal wash of the subject, the at least one gene is selected from the group consisting of CSF3R, CFLAR, FAM49B, MX2, STAT1, CASP4, NFKB1A, RNF145, FOSL2, PEL1, PTPRE and GK.

Description:

DIAGNOSING INFLAMMATORY BOWEL DISEASES

RELATED APPLICATION

This application claims the benefit of priority of Israel Patent Application No. 285031 filed 21 July, 2021, the contents of which are incorporated herein by reference in their entirety.

SEQUENCE LISTING STATEMENT

The file entitled 92757. xml, created on 21 July 2022, comprising 53,248 bytes, submitted concurrently with the filing of this application is incorporated herein by reference.

FIELD AND BACKGROUND OF THE INVENTION

The present invention, in some embodiments thereof, relates to methods of diagnosing gastric diseases and more particularly inflammatory bowel diseases.

Biologic therapies have revolutionized therapy for moderate to severe IBD. While 50-60% of patients significantly improve with biologies and experience less hospitalizations and surgeries, many patients are either primary non-responders or experience loss of response over time. Non- invasive markers that may provide information on histological inflammation, and therefore predict patient prognosis or response to therapies, are critically needed.

Several studies performed RNA sequencing of colonic biopsies obtained during lower endoscopies, with the aim of staging the disease and predicting therapeutic outcomes. Furthermore, certain mucosal micro-RNA and long noncoding RNA have been associated with IBD natural history. Recent studies used single cell RNA sequencing (scRNAseq) and single cell mass -cytometry of IBD biopsy samples to reveal distinct populations and genes that are altered in specific disease states. In addition to transcriptomics, unique DNA methylation patterns have been identified in biopsies of IBD patients compared to controls. Data from RNA bulk sequencing of intestinal biopsies has also been integrated with genome-wide-associations to identify genes most associated with regulatory pathways in IBD. Nevertheless, an outstanding challenge of the analysis of biopsies is that they provide localized information and may miss out on inflammatory processes, especially in cases where endoscopic inflammation is not apparent.

A complementary method to assess intestinal inflammation is the use of fecal samples. A recent study demonstrated that patients with active Crohn's disease had a distinct microRNA profile measured in their stool. Fecal proteomics can also inform on intestinal inflammation status. Indeed, calprotectin, a leukocyte protein, is a widely applied biomarker of intestinal inflammation. Nevertheless, the calprotectin assay is limited in sensitivity and specificity and only few additional proteins have been shown to be both resistant to proteolysis and associated with inflammation. An advantage of fecal samples is that they may provide broad sampling of processes that occur throughout the gastrointestinal tract. Recent works demonstrated that fecal host transcriptomes may carry prognostic information related to colorectal cancer, however the utility of this approach to the staging and prognosis of IBDs has not been explored.

Background art includes Cui et ah, Digestive Diseases and Sciences (2021) 66:1488-1498; and US Patent Application No. 20200308644.

SUMMARY OF THE INVENTION

According to an aspect of the present invention there is provided a method of diagnosing an inflammatory bowel disease (IBD) of a subject comprising analyzing the RNA expression level of at least one human gene in a fecal RNA sample of the subject, wherein the gene is selected from the group consisting of CSF3R, CASP4, NFKB1A, CFLAR, FAM49B, RNF145, FOSL2, PEL1, PTPRE, GK, MX2, NAGK, MCTP2, SLC03A1, STAT1, RASSF3, MARCKS, SAT1, VPS37B, RNF149, HLA-E, PLAUR, MSN, FHF1A, NBPF14, CXCR1, CSF2RA, CLEC2B, GBP5, IL1B, FZD3, MMP25 and OSM wherein when the expression level is above a predetermined amount it is indicative of the inflammatory bowel disease.

According to an aspect of the present invention there is provided a method of diagnosing a disease of the gastrointestinal tract of a subject comprising analyzing the expression level of at least one gene in a fecal wash of the subject, wherein the expression level is indicative of the disease of the gastrointestinal tract.

According to an aspect of the present invention there is provided a method of diagnosing an inflammatory bowel disease (IBD) of a subject comprising analyzing the RNA expression level of at least one human gene in a fecal RNA sample of the subject, wherein when the expression level of a human gene set forth in Table 1, 5 or 6 is statistically significantly altered over the level of the gene in a fecal RNA sample of a control subject, it is indicative of the inflammatory bowel disease.

According to embodiments of the invention, the expression level of the at least one gene correlates with the degree of histological inflammation.

According to an aspect of the present invention there is provided a method of treating an inflammatory bowel disease of a subject in need thereof comprising: (a) confirming that the subject has the inflammatory bowel disease according to the method described herein; and

(b) administering to the subject a therapeutically effective amount of an agent useful for treating the disease.

According to an aspect of the present invention there is provided a method of treating a disease of the gastrointestinal tract of a subject in need thereof comprising:

(a) confirming that the subject has the inflammatory bowel disease according to the method described herein; and

(b) administering to the subject a therapeutically effective amount of an agent useful for treating the disease.

According to an aspect of the present invention there is provided a method of selecting an agent for the treatment of an inflammatory bowel disease (IBD) comprising:

(a) contacting the agent with an RNA sample derived from feces of a subject having the IBD; and

(b) analyzing the amount of at least one RNA set forth in Table 1, wherein a decrease in the amount of the at least one RNA in the presence of the agent as compared to the amount of the at least one RNA in the absence of the agent is indicative of an agent which is suitable for the treatment of the inflammatory bowel disease.

According to embodiments of the invention, the method further comprises depleting the fecal RNA sample of microbial RNA prior to the analyzing.

According to embodiments of the invention, the fecal wash is of the sigmoid colon of the subject.

According to embodiments of the invention, the fecal wash is of the rectum of the subject.

According to embodiments of the invention, the analyzing the expression level comprises performing whole cell transcriptome analysis.

According to embodiments of the invention, the analyzing the expression level comprises performing RT-PCR.

According to embodiments of the invention, the analyzing is effected at the RNA level.

According to embodiments of the invention, the analyzing is effected at the protein level.

According to embodiments of the invention, the fecal sample comprises a fecal wash, the at least one gene is selected from the group consisting of CSF3R, CFLAR, FAM49B, MX2, STAT1, CASP4, NFKBIA, RNF145, FOSL2, PEL1, PTPRE and GK. According to embodiments of the invention, the at least one gene is selected from the group consisting of CSF3R, CASP4, NFKB1A, RNF145, FOSL2, PEL1, PTPRE, MX2, NAGK, MCTP2, SLC03A1, STAT1, RASSF3 and GK.

According to embodiments of the invention, the at least one gene is selected from the group consisting of MX2, CSF3R, NAGK, MCTP2, SLC03A1, CASP4, NFKBIA, STAT1, RNF145 and RASSF3.

According to embodiments of the invention, the fecal sample comprises a solid fecal sample, the at least one gene is selected from the group consisting ofMARCKS, SAT1, NFKBIA, VPS37B, RNF149, HLA-E, PLAUR, MSN, HIF1A, NBPF14, CXCR1, CSF2RA, CLEC2B, GBP5, IL1B, FZD3, MMP25 and OSM.

According to embodiments of the invention, the disease is an inflammatory bowel disease

(IBD).

According to embodiments of the invention, the IBD comprises ulcerative colitis or Crohn’s colitis.

According to embodiments of the invention, the disease is a colon cancer.

According to embodiments of the invention, the disease is irritable bowel syndrome.

According to embodiments of the invention, the diagnosing the IBD comprises determining the severity of the IBD.

According to embodiments of the invention, the RNA sample is a solid fecal sample, the at least one gene is selected from the group consisting of MARCKS, SAT1, NFKBIA, VPS37B, RNF149, HLA-E, PLAUR, MSN, HIF1A, NBPF14, CXCR1, CSF2RA, CLEC2B, GBP5, IL1B, FZD3, MMP25 and OSM.

According to embodiments of the invention, the RNA sample is a fecal wash of the subject, the at least one gene is selected from the group consisting of CSF3R, CFLAR, FAM49B, MX2, STAT1, CASP4, NFKBIA, RNF145, FOSL2, PEL1, PTPRE and GK.

Unless otherwise defined, all technical and/or scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of embodiments of the invention, exemplary methods and/or materials are described below. In case of conflict, the patent specification, including definitions, will control. In addition, the materials, methods, and examples are illustrative only and are not intended to be necessarily limiting. BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWINGS

Some embodiments of the invention are herein described, by way of example only, with reference to the accompanying drawings. With specific reference now to the drawings in detail, it is stressed that the particulars shown are by way of example and for purposes of illustrative discussion of embodiments of the invention. In this regard, the description taken with the drawings makes apparent to those skilled in the art how embodiments of the invention may be practiced.

In the drawings:

FIG. 1 - Illustration of experimental layout.

FIGs. 2A-E - Fecal wash gene expression patterns are more indicative of histological inflammation compared to those of biopsies. A - Principal Component Analysis (PCA) plot showing biopsies (blue circles) and fecal washes (brown circles). Red outer circles denote samples that correspond to patients with histological inflammation determined based on pathology examination of the colonic biopsies. B - Hierarchical clustering of fecal wash samples (brown branches) and colonic biopsies (blue branches). Samples corresponding to patients with active histological inflammation are marked in red. Naming nomenclature: sample name-condition- endoscopic inflammation (0/1) - histological inflammation (0/1). C, D - PCA plots of biopsies (C), and fecal wash samples (D). Red outer circles denote IBD patients with corresponding histological inflammation. E - Transcriptomic signatures of fecal washes for patients with histological inflammation are more correlated among themselves than those of biopsy transcriptomic signatures. Violin plots demonstrating that the correlation distances between pairs of samples that both have histological inflammation (red dots) are significantly smaller than the distances between mixed samples with and without histological inflammation when examining fecal washes (brown dots, bottom) but not when examining biopsies (blue dots, top). White circles are medians, blackboxes denote the 25-75 percentiles.

FIGs. 3A-C - Differentially expressed genes between inflamed and non-inflamed fecal washes. A - Volcano plot, each dot is a gene, x-axis is the log2-ratio of expression between samples with and without histological inflammation, y axis is -loglO (p value), where p value is computed using Wilcoxon rank sum tests. Genes with corresponding q-values below 0.1 are marked in red (q- values computed using Benjamini-Hochberg FDR correction). Names of representative up- regulated genes are shown. B - Hierarchical clustering of fecal wash samples over 100 genes consisting of 50 genes with the maximal ratio of expression levels and 50 with the lowest ratio between histologically inflamed and non-inflamed washes. Samples corresponding to patients with active histological inflammation are marked with red branches. C - Gene Set Enrichment Analysis (GSEA) over the Hallmark and Kegg sets. Shown are all gene sets with q-value<0.3. Inflamed washes (red circles) were associated withimmune cell pathways, while non-inflamed washes (blue circles) expressed more epithelial cell related pathways. Naming nomenclature: sample name- condition-endoscopic inflammation (0/1) - histological inflammation (0/1).

FIGs. 4A-B - Cell compositions of inflamed versus non inflamed fecal washes and biopsies, inferred by computational deconvolution. A - Hierarchical clustering of cell type representation in fecal wash samples and colonic biopsies. Fecal washes from patients with histological inflammation are marked in red. B - Inferred relative representation of genes associated with different cell types in histologically inflamed and non-inflamed colonic biopsies and fecal washes. Immune-related cell types, more abundant in the fecal washes of patients with histological inflammation, are marked with a red box. Naming nomenclature: sample name- condition-endoscopic inflammation (0/1) - histological inflammation (0/1). White circles are medians, gray boxes denote the 25-75 percentiles.

FIGs. 5A-E - Expression of individual genes in fecal washes has a higher statistical power in classifying histological inflammation compared to biopsy gene expression. A - ROC curve example for the gene NFKBIA using fecal washes (blue, AUC=0.97) and biopsies (red, AUC=0.67). B - AUC of 5% genes with the highest AUC for biopsies and washes. The AUC of the top classifier genes is significantly higher for fecal washes compared to biopsies (p=l.85*10 ⁷²). C - Comparison of AUC for individual genes based on biopsies (X axis) and fecal washes (Y axis). NFKBIA (black dot) is shown as an example. Gray boxes mark the top AUC (>0.9) for both groups. Fecal washes contain 150 genes with AUC>0.9 whereas biopsies contain only 10 such genes. D, E - Expression levels for the eight genes with the highest AUC levels for washes (D) and biopsies (E). White circles are medians, gray boxes mark the 25-75 percentiles.

FIGs. 6A-B - Protein and mRNA levels in fecal washes are only weakly correlated. A - Fecal calprotectin levels as measured by a commercial EFISA assay are correlated with the Mass- Spectrometry proteomics levels of the same protein, S100A8, S100A9. Each blue dot denotes a fecal sample. B - Correlation between protein levels and fecal wash mRNA levels for the same samples. Each dot is the average expression over the four samples.

FIGs. 7A-F - Analysis of the Spearman correlation distances between pairs of washes (A,C,E) /biopsies (B,D,F) with endoscopic inflammation (A-B) or histological inflammation (C- F). C-D - Analysis stratified over patient ages between 40 and 60 only. E-F - Analysis stratified for patients not receiving biologies. White circles are medians, black boxes denote the 25-75 percentiles. FIG. 8 - Analysis of the Spearman correlation distance between each wash and its matching biopsy in comparison to the mean of the distances to other biopsies. In 23 out of 31 samples the distance to other biopsies was higher (only samples with more than 10000 Unique Molecular Identifiers (UMIs) in both washes and biopsies were included, Figures 7A-F).

FIG. 9 - Clusters of human colonic cell types based on a recent single cell RNAseq atlas. The average expression of each cluster was used as input signature for CIBERSORTx computational deconvolution.

FIGs. 10A-E illustrate that bacterial rRNA depleted stool transcriptomics are informative in assessing intestinal inflammation. A. Box plots of fractions of reads mapped to human exonic regions from seven samples, before depleting bacterial rRNA (left) and after depleting bacterial rRNA (right). Red lines denote group medians, blue boxes mark IQR. Gray horizontal lines mark the change within each sample. Ranksum paired test p-value = 0.0156. B. Principal component analysis (PCA) on transcriptomes of 106 wash samples (above 10000 UMIs) and 7 stool samples (above 7500 UMIs). Included in the analysis are genes with maximal expression level of at least 0.005 across all samples. C. Clustergram of 106 wash samples and 7 stool samples. Red branches mark a cluster enriched with inflamed samples. Inflamed washes are colored pink, non-inflamed washes are colored blue, inflamed stools are colored brown and non-inflamed stools are colored turquoise. D. differential gene expression between stool samples of inflamed IBD patients (n=3 samples) and of non-inflamed individual (n=7 samples). All samples included in the analysis had more than 5000 UMIs. Grey dots demarcate all genes used for the analysis. Dots encircled in red and blue are genes upregulated and downregulated in IBD inflamed fecal washes, respectively (|fold change| > 1.5, FDR < 0.01). Venn diagrams at the top of the plot demonstrate the overlap between genes downregulated (top left) or upregulated (top right) in both washes and stool samples of inflamed IBD patients, together with the number of non-overlapping genes in each sampling method. P-values were calculated using hypergeometric test. E. Violin plots of selected genes with different expression in inflamed (n=3) and non-inflamed (n=7) stool samples. White dots mark the group median, purple lines are plotted between the means of the groups.

DESCRIPTION OF SPECIFIC EMBODIMENTS OF THE INVENTION

The present invention, in some embodiments thereof, relates to methods of diagnosing gastric diseases and more particularly inflammatory bowel diseases.

Before explaining at least one embodiment of the invention in detail, it is to be understood that the invention is not necessarily limited in its application to the details set forth in the following description or exemplified by the Examples. The invention is capable of other embodiments or of being practiced or carried out in various ways.

Colonoscopy is the gold standard for evaluation of inflammation in inflammatory bowel disease (IBD), yet entails cumbersome preparations and risks of injury. Existing non-invasive prognostic tools are limited in their diagnostic power. Moreover, transcriptomics of colonic biopsies have been inconclusive in their association with clinical features.

The present inventors have now examined whether host transcriptomics of fecal samples could serve as a diagnostic tool for IBD patients. Specifically, the present inventors sequenced the RNA of biopsies and fecal-wash samples from IBD patients and controls undergoing lower endoscopy. The present inventors showed that the host fecal-transcriptome carried information that was distinct from biopsy RNAseq and fecal proteomics. Transcriptomics of fecal washes, yet not of biopsies, from patients with histological inflammation were significantly correlated to one another (p=5.3*10 ¹²), as illustrated in Figures 2A-E. Fecal-transcriptome was significantly more powerful in identifying histological inflammation compared to intestinal biopsies (150 genes with area-under-the-curve >0.9 in fecal samples versus 10 genes in biopsy RNAseq), as illustrated in Figures 5A-E.

The present inventors thus deduce that fecal wash host transcriptome is a powerful non- invasive biomarker reflecting histological inflammation, opening the way to the identification of important correlates and therapeutic targets that may be obscured using biopsy transcriptomics. Since the fecal wash host transcriptome was shown to be informative on the state of histological inflammation in the gastrointestinal tract, the present inventors propose that RNA transcriptome analysis of fecal samples themselves can also serve as a diagnostic tool for IBD.

According to one aspect of the present invention a method is provided for diagnosing an inflammatory bowel disease (IBD) of a subject comprising analyzing the RNA expression level of at least one human gene in a fecal RNA sample of the subject, wherein when the expression level of a human gene set forth in Table 1 is statistically significantly altered (e.g. increased) over the level of the gene in a fecal RNA sample of a control subject, it is indicative of the inflammatory bowel disease.

According to another aspect of the present invention there is provided a method of diagnosing an inflammatory bowel disease (IBD) of a subject comprising analyzing the RNA expression level of at least one human gene in a fecal RNA sample of the subject, wherein the gene is selected from the group consisting ofCSF3R, CASP4, NFKBIA, RNF145, FOSF2, PEF1, RTPRE, GK, MX2, NAGK, MCTP2, SFC03A1, STAT1, RASSF3, MARCKS, SAT1, NFKBIA, VPS37B, RNF149, HLA-E, PLAUR, MSN, HIF1A and NBPF14, wherein when the expression level is above a predetermined amount it is indicative of the inflammatory bowel disease, thereby diagnosing the inflammatory bowel disease.

Inflammatory bowel diseases (IBD) are severe gastrointestinal disorders characterized by intestinal inflammation and tissue remodeling, that increase in frequency and may prove disabling for patients. The major forms of IBD, ulcerative colitis (UC) and Crohn's disease are chronic, relapsing conditions that are clinically characterized by abdominal pain, diarrhea, rectal bleeding, and fever.

As used herein, the term “diagnosing” refers to determining presence or absence of the disease, classifying the disease (e.g. classifying the disease according to the histological inflammation status), determining a severity of the disease, monitoring disease progression, forecasting an outcome of a pathology and/or prospects of recovery and/or screening of a subject for the inflammatory bowel disease.

According to a specific embodiment, the diagnosing refers to determining if the subject is in remission from the disease.

In another embodiment, the diagnosing comprises determining if the subject is suitable for a particular treatment. Thus, for example if the RNA determinants indicate an increase in inflammation, an anti-inflammatory drug (e.g. anti-TNF-a therapy).

The RNA sample may be derived from solid feces (i.e. stool sample) or a fecal wash (as described herein below). The RNA may comprise total RNA, mRNA, mitochondrial RNA, chloroplast RNA, DNA-RNA hybrids, viral RNA, cell free RNA, and mixtures thereof. In one embodiment, the RNA sample is substantially devoid of DNA. In another embodiment, the RNA sample is substantially devoid of protein.

The sample may be fresh or frozen.

Isolation, extraction or derivation of RNA may be carried out by any suitable method. Isolating RNA from a biological sample generally includes treating a biological sample in such a manner that the RNA present in the sample is extracted and made available for analysis. Any isolation method that results in extracted RNA may be used in the practice of the present invention. It will be understood that the particular method used to extract RNA will depend on the nature of the source.

Methods of RNA extraction are well-known in the art and further described herein under.

Phenol based extraction methods: These single-step RNA isolation methods based on Guanidine isothiocyanate (GITC)/phenol/chloroform extraction require much less time than traditional methods (e.g. CsCE ultracentrifugation). Many commercial reagents (e.g. Trizol, RNAzol, RNAWLZ) are based on this principle. The entire procedure can be completed within an hour to produce high yields of total RNA.

Silica gel - based purification methods: RNeasy is a purification kit marketed by Qiagen. It uses a silica gel-based membrane in a spin-column to selectively bind RNA larger than 200 bases. The method is quick and does not involve the use of phenol.

Oligo-dT based affinity purification of mRNA: Due to the low abundance of mRNA in the total pool of cellular RNA, reducing the amount of rRNA and tRNA in a total RNA preparation greatly increases the relative amount of mRNA. The use of oligo-dT affinity chromatography to selectively enrich poly (A)+ RNA has been practiced for over 20 years. The result of the preparation is an enriched mRNA population that has minimal rRNA or other small RNA contamination. mRNA enrichment is essential for construction of cDNA libraries and other applications where intact mRNA is highly desirable. The original method utilized oligo-dT conjugated resin column chromatography and can be time consuming. Recently more convenient formats such as spin-column and magnetic bead based reagent kits have become available.

The sample may also be processed prior to carrying out the diagnostic methods of the present invention. Processing of the sample may involve one or more of: filtration, distillation, centrifugation, extraction, concentration, dilution, purification, inactivation of interfering components, addition of reagents, and the like.

The present inventors contemplate negative genomic selection of abundant microbial transcripts such as bacterial (SEQ ID NOs: 1-20) and/or fungal rRNA (SEQ ID NOs: 21-24) prior to the analysis. This increases the fraction of human exonic reads in the sequenced samples. This may be effected on the solid fecal samples or on fecal wash samples.

Examples of additional RNA transcripts that may be depleted include, but are not limited to Eubacterium rectale, Faecalibacterium prausnitzii, Bifidobacterium adolescentis, Ruminococcus sp 5 1 39BFAA, Bifidobacterium longum, Subdoligranulum, Ruminococcus gnavus, Escherichia coli, Ruminococcus torques, Akkermansia muciniphila, Ruminococcus bromii, Dialister invisus, Collinsella aerofaciens, Bacteroides uniformis, Bacteroides vulgatus, Eubacterium hallii, Dorea longicatena, Prevotella copri, Alistipes putredinis and Bifidobacterium bifidum

The present inventors contemplate depletion of at least one, at least two, at least three, at least four or at least 5 of the above identified bacteria. Methods of depleting particular RNAs are known in the art. For example DNA probes may be synthesized to be reverse-complement to the bacterial or fungal transcripts. Next, RNase H enzyme may be used which digests RNA-DNA specific hybrids. This leads to the selective digestion of only RNA molecules targeted by the DNA probes. Lastly, endocucleases such as DNase I enzyme may be used to remove the left over DNA probes and other DNA residues left in the sample after RNA extraction. Another method for depleting particular RNAs is by using nucleic acid probes (which are attached to an affinity tag) that specifically hybridize to the RNAs. Exemplary affinity tags include, but are not limited to hemagglutinin (HA), AviTag™, V5, Myc, T7, FLAG, HSV, VSV-G, His, biotin, or streptavidin

After obtaining the RNA sample, cDNA may be generated therefrom. For synthesis of cDNA, template mRNA may be obtained directly from lysed cells or may be purified from a total RNA or mRNA sample. The total RNA sample may be subjected to a force to encourage shearing of the RNA molecules such that the average size of each of the RNA molecules is between 100- 300 nucleotides, e.g. about 200 nucleotides. To separate the heterogeneous population of mRNA from the majority of the RNA found in the cell, various technologies may be used which are based on the use of oligo(dT) oligonucleotides attached to a solid support. Examples of such oligo(dT) oligonucleotides include: oligo(dT) cellulose/spin columns, oligo(dT)/magnetic beads, and oligo(dT) oligonucleotide coated plates.

According to another embodiment, long-read transcriptome sequencing is carried out, wherein the full length RNA molecule is sequenced (i.e. from the 3’polyA tail to the 5’ cap).

Generation of single stranded DNA from RNA requires synthesis of an intermediate RNA- DNA hybrid. For this, a primer is required that hybridizes to the 3’ end of the RNA. Annealing temperature and timing are determined both by the efficiency with which the primer is expected to anneal to a template and the degree of mismatch that is to be tolerated.

The annealing temperature is usually chosen to provide optimal efficiency and specificity, and generally ranges from about 50 °C to about 80°C, usually from about 55 °C to about 70 °C, and more usually from about 60 °C to about 68 °C. Annealing conditions are generally maintained for a period of time ranging from about 15 seconds to about 30 minutes, usually from about 30 seconds to about 5 minutes.

According to a specific embodiment, the primer comprises a polydT oligonucleotide sequence. Preferably the polydT sequence comprises at least 5 nucleotides. According to another is between about 5 to 50 nucleotides, more preferably between about 5-25 nucleotides, and even more preferably between about 12 to 14 nucleotides.

Following annealing of the primer (e.g. polydT primer) to the RNA sample, an RNA-DNA hybrid is synthesized by reverse transcription using an RNA-dependent DNA polymerase. Suitable RNA-dependent DNA polymerases for use in the methods and compositions of the invention include reverse transcriptases (RTs). Examples of RTs include, but are not limited to, Moloney murine leukemia virus (M-MLV) reverse transcriptase, human immunodeficiency virus (HIV) reverse transcriptase, rous sarcoma virus (RSV) reverse transcriptase, avian myeloblastosis virus (AMV) reverse transcriptase, rous associated virus (RAV) reverse transcriptase, and myeloblastosis associated virus (MAV) reverse transcriptase or other avian sarcoma-leukosis virus (ASLV) reverse transcriptases, and modified RTs derived therefrom. See e.g. U.S. Patent No. 7,056,716. Many reverse transcriptases, such as those from avian myeloblastosis virus (AMV- RT), and Moloney murine leukemia virus (MMLV-RT) comprise more than one activity (for example, polymerase activity and ribonuclease activity) and can function in the formation of the double stranded cDNA molecules.

Additional components required in a reverse transcription reaction include dNTPS (dATP, dCTP, dGTP and dTTP) and optionally a reducing agent such as Dithiothreitol (DTT) and MnCh _.

Following cDNA synthesis, the present inventors contemplate amplifying the cDNA (e.g using a polymerase chain reaction - PCR, details of which are known in the art).

As mentioned, in order to diagnose IBD, the quantity of at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten human RNA determinant of Table 1, 5 or 6 is analyzed. According to another embodiment, no more than 20 RNA, 30, 40 or 50 RNA determinants set forth in Table 1 are analyzed in fecal washes or solid feces of a subject.

In another embodiment, in order to diagnose IBD, the quantity of at least one, at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten human RNA determinant of Table 5 or 6 is analyzed. Particuarly relevant RNAs for diagnosing IBD from a solid fecal sample include CXCR1, CSF2RA, CLEC2B, GBP5, ILIB, FZD3, MMP25 and OSM. According to another embodiment, no more than 20 RNA, 30, 40 or 50 RNA determinants set forth in Tables 5 or 6 are analyzed in feces of a subject. Table 1

Particular combinations of RNAs contemplated by the present invention which may be analyzed are set forth below:

NFKBIA + CASP4; NFKBIA + CFLAR, NFKBIA +MX2, NFKBIA +STAT1, NFKBIA + GK, PELI1 + CASP4, PELI1 + CFLAR, PELI1 + MX2, PELI1 + STAT1, PELI1 + GK, FAM49B + CASP4, FAM49B + CFLAR, FAM49B + MX2, FAM49B + STAT1, FAM49B + GK, CSF3R + CASP4, CSF3R + CFLAR, CSF3R + MX2, CSF3R + STAT1, CSF3R + CSF3R + GK, PTPRE + CASP4, PTPRE + CFLAR, PTPRE +MX2, PTPRE + STAT1 and PTPRE + GK.

More specifically, in order to diagnose IBD, the quantity of at least one human RNA determinant of Table 1, Table 5 or Table 6 is measured in RNA isolated from feces or a fecal wash of the subject. In another embodiment, at least two human RNA determinants of Table 1 are measured in RNA isolated from feces or a fecal wash of the subject. In another embodiment, at least three human RNA determinants of Table 1 are measured in RNA isolated from a solid fecal sample or a fecal wash of the subject.

In another embodiment, at least four human RNA determinants of Table 1 are measured in RNA isolated from feces or a fecal wash of the subject. In another embodiment, at least five human RNA determinants of Table 1, Table 5 or Table 6 are measured in RNA isolated from a solid fecal sample or a fecal wash of the subject.

According to particular embodiments, when the level of the RNA determinant in Table 1, Table 5 or Table 6 is above a predetermined level (e.g. above the level that is present in a control sample derived from a subject that does not have an inflammatory disease of the gut (e.g. a healthy subject); it is indicative that the subject has an inflammatory bowel disease (i.e. an inflammatory bowel disease may be ruled in). In another embodiment, when the level of the RNA determinant in Table 1 is above a predetermined level (e.g. above the level that is present in a control sample derived from a subject that does not have an inflammatory disease of the gut (e.g. a healthy subject); it is indicative of increased inflammation (e.g. corresponding to histological inflammation).

According to particular embodiments, when the level of the RNA determinant in Table 1, Table 5 or Table 6 is above a predetermined level (e.g. above the level that is present in a control sample derived from a previous sample of the subject), it is indicative that the inflammatory bowel disease has become more severe. In one embodiment, when the level of RNA of one of the determinants in Table 1, Table 5 or Table 6 is at least 10 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 1, Table 5 or Table 6 is at least 20 % higher than the amount in the control sample, an IBD is ruled in. In one embodiment, when the level of RNA of one of the determinants in Table 1, Table 5 or Table 6 is at least 30 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 1, Table 5 or Table 6 is at least 40 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 1, Table 5 or Table 6 is at least 50 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 1, Table 5 or Table 6 is at least 100 % higher than the amount in the control sample, an IBD is ruled in.

Alternatively or additionally, when the level of determinant in Table 2 is below a predetermined level, it is indicative that the subject does not have an inflammatory bowel disease. According to a particular embodiment, the RNA determinant is set forth in Tables 2 or 3.

Table 2

More specifically, in order to diagnose IBD, the quantity of at least one human RNA determinant of Table 2 is measured in RNA isolated from a solid fecal sample or a fecal wash of the subject. In another embodiment, at least two human RNA determinants of Table 2 are measured in RNA isolated from a solid fecal sample or a fecal wash of the subject. In another embodiment, at least three human RNA determinants of Table 2 are measured in RNA isolated from feces or a fecal wash of the subject.

In another embodiment, at least four human RNA determinants of Table 2 are measured in RNA isolated from a solid fecal sample or a fecal of the subject. In another embodiment, at least five human RNA determinants of Table 2 are measured in RNA isolated from feces or a fecal wash of the subject.

According to particular embodiments, when the level of the RNA determinant in Table 2 is above a predetermined level (e.g. above the level that is present in a control sample derived from a subject that does not have an inflammatory disease of the gut (e.g. a healthy subject); it is indicative that the subject has an inflammatory bowel disease (i.e. an inflammatory bowel disease may be ruled in). In another embodiment, when the level of the RNA determinant in Table 2 is above a predetermined level (e.g. above the level that is present in a control sample derived from a subject that does not have an inflammatory disease of the gut (e.g. a healthy subject); it is indicative of increased inflammation (e.g. corresponding to histological inflammation).

In one embodiment, when the level of RNA of one of the determinants in Table 2 is at least 10 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 2 is at least 20 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 2 is at least 30 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 2 is at least

40 % higher than the amount in the control sample, an IBD is ruled in. In one embodiment, when the level of RNA of one of the determinants in Table 2 is at least 50 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 2 is at least 100 % higher than the amount in the control sample, an IBD is ruled in.

Alternatively or additionally, when the level of determinant in Table 2 is below a predetermined level, it is indicative that the subject does not have an inflammatory bowel disease.

In order to diagnose IBD, the quantity of at least one human RNA determinant of Table 3 is measured in RNA isolated from a fecal wash of the subject. In another embodiment, in order to diagnose IBD, the quantity of at least two human RNA determinants of Table 3 are measured in RNA isolated from a fecal wash of the subject. In another embodiment, in order to diagnose IBD, the quantity of at least three human RNA determinants of Table 3 are measured in RNA isolated from a fecal wash of the subject. In another embodiment, in order to diagnose IBD, the quantity of at least four human RNA determinants of Table 3 are measured in RNA isolated from a fecal wash of the subject. In another embodiment, in order to diagnose IBD, the quantity of at least five human RNA determinants of Table 3 are measured in RNA isolated from a fecal wash of the subject.

According to particular embodiments, when the level of the RNA determinant in Table 3 is above a predetermined level (e.g. above the level that is present in a control sample derived from a subject that does not have an inflammatory disease of the gut (e.g. a healthy subject); it is indicative that the subject has an inflammatory bowel disease (i.e. an inflammatory bowel disease may be ruled in).

In another embodiment, when the level of the RNA determinant in Table 3 is above a predetermined level (e.g. above the level that is present in a control sample derived from a subject that does not have an inflammatory disease of the gut (e.g. a healthy subject); it is indicative of increased inflammation (e.g. corresponding to histological inflammation).

According to particular embodiments, when the level of the RNA determinant in Table 3 is above a predetermined level (e.g. above the level that is present in a control sample derived from a previous sample of the subject), it is indicative that the inflammatory bowel disease has become more severe.

In one embodiment, when the level of RNA of one of the determinants in Table 3 is at least 10 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 3 is at least 20 % higher than the amount in the control sample, an IBD is ruled in. In one embodiment, when the level of RNA of one of the determinants in Table 3 is at least 30 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 3 is at least 40 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 3 is at least 50 % higher than the amount in the control sample, an IBD is ruled in.

In one embodiment, when the level of RNA of one of the determinants in Table 3 is at least 100 % higher than the amount in the control sample, an IBD is ruled in.

The term “fecal wash” refers to fecal material which is removed from the body in a liquid state. In one embodiment, the fecal fluid is suctioned from the subject during colonoscopy or sigmoidoscopy or gastroscopy, including fluid suctioned from the small or large intestine. Fecal wash can also be obtained via rectal tube suctioning, with or without rectal irrigation. In another embodiment, the fecal wash refers to a liquid stool sample collected by the patient after consumption of a laxative.

Alternatively or additionally, when the level of determinant in Table 3 is below a predetermined level, it is indicative that the subject does not have an inflammatory bowel disease.

The predetermined level of any of the aspects of the present invention may be a reference value derived from population studies, including without limitation, such subjects having a known inflammatory bowel disease, subject having the same or similar age range, subjects in the same or similar ethnic group, or relative to the starting sample of a subject undergoing treatment for a disease. Such reference values can be derived from statistical analyses and/or risk prediction data of populations obtained from mathematical algorithms and computed indices of infection. Reference determinant indices can also be constructed and used using algorithms and other methods of statistical and structural classification.

It will be appreciated that the control sample is the same sample type as the sample being analyzed.

According to this aspect of the present invention, no more than 30 RNA determinants are used in order to diagnose the IBD, no more than 25 RNA determinants are used in order to diagnose the IBD, no more than 20 RNA determinants are used in order to diagnose the IBD, no more than 15 RNA determinants are used in order to diagnose the IBD, no more than 10 RNA determinants are used in order to diagnose the IBD, no more than 5 RNA determinants are used in order to diagnose the IBD, no more than 4 RNA determinants are used in order to diagnose the IBD, no more than 3 RNA determinants are used in order to diagnose the IBD, no more than 2 RNA determinants are used in order to diagnose the IBD.

Methods of analyzing the amount of RNA are known in the art and include Northern Blot analysis, RT-PCR analysis, RNA in situ hybridization stain, DNA microarray, DNA chips, oligonucleotide microarray, RNA sequencing and deep sequencing.

According to one embodiment, the sequencing method comprises deep sequencing.

As used herein, the term “deep sequencing” refers to a sequencing method wherein the target sequence is read multiple times in the single test. A single deep sequencing run is composed of a multitude of sequencing reactions run on the same target sequence and each, generating independent sequence readout.

In a particular embodiment, the RNA sequencing is effected at the single cell level.

It will be appreciated that in order to analyze the amount of an RNA, oligonucleotides may be used that are capable of hybridizing thereto or to cDNA generated therefrom According to one embodiment a single oligonucleotide is used to determine the presence of a particular determinant, at least two oligonucleotides are used to determine the presence of a particular determinant, at least five oligonucleotides are used to determine the presence of a particular determinant, at least four oligonucleotides are used to determine the presence of a particular determinant, at least five or more oligonucleotides are used to determine the presence of a particular determinant.

In one embodiment, the method of this aspect of the present invention is carried out using an isolated oligonucleotide which hybridizes to the RNA or cDNA of any of the determinants listed in Tables 1-2 by complementary base-pairing in a sequence specific manner, and discriminates the determinant sequence from other nucleic acid sequence in the sample. Oligonucleotides (e.g. DNA or RNA oligonucleotides) typically comprises a region of complementary nucleotide sequence that hybridizes under stringent conditions to at least about 8, 10, 13, 16, 18, 20, 22, 25, 30, 40, 50, 55, 60, 65, 70, 80, 90, 100, 120 (or any other number in- between) or more consecutive nucleotides in a target nucleic acid molecule. Depending on the particular assay, the consecutive nucleotides include the determinant nucleic acid sequence.

The term "isolated", as used herein in reference to an oligonucleotide, means an oligonucleotide, which by virtue of its origin or manipulation, is separated from at least some of the components with which it is naturally associated or with which it is associated when initially obtained. By "isolated", it is alternatively or additionally meant that the oligonucleotide of interest is produced or synthesized by the hand of man. In order to identify an oligonucleotide specific for any of the determinant sequences, the gene/transcript of interest is typically examined using a computer algorithm which starts at the 5' or at the 3' end of the nucleotide sequence. Typical algorithms will then identify oligonucleotides of defined length that are unique to the gene, have a GC content within a range suitable for hybridization, lack predicted secondary structure that may interfere with hybridization, and/or possess other desired characteristics or that lack other undesired characteristics.

Following identification of the oligonucleotide it may be tested for specificity towards the determinant under wet or dry conditions. Thus, for example, in the case where the oligonucleotide is a primer, the primer may be tested for its ability to amplify a sequence of the determinant using PCR to generate a detectable product and for its non ability to amplify other determinants in the sample. The products of the PCR reaction may be analyzed on a gel and verified according to presence and/or size.

Additionally, or alternatively, the sequence of the oligonucleotide may be analyzed by computer analysis to see if it is homologous (or is capable of hybridizing to) other known sequences. A BLAST 2.2.10 (Basic Local Alignment Search Tool) analysis may be performed on the chosen oligonucleotide (worldwidewebdotncbidotnlmdotnihdotgov/blast/). The BLAST program finds regions of local similarity between sequences. It compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches thereby providing valuable information about the possible identity and integrity of the ‘query’ sequences.

According to one embodiment, the oligonucleotide is a probe. As used herein, the term "probe" refers to an oligonucleotide which hybridizes to the determinant specific nucleic acid sequence to provide a detectable signal under experimental conditions and which does not hybridize to additional determinant sequences to provide a detectable signal under identical experimental conditions.

The probes of this embodiment of this aspect of the present invention may be, for example, affixed to a solid support (e.g., arrays or beads).

According to particular embodiments, the array does not comprise nucleic acids that specifically bind to more than 50 determinants, more than 40 determinants, 30 determinants, 20 determinants, 15 determinants, 10 determinants, 5 determinants or even 3 determinants.

Methods for immobilization of oligonucleotides to solid-state substrates are well established. Oligonucleotides, including address probes and detection probes, can be coupled to substrates using established coupling methods. According to another embodiment, the oligonucleotide is a primer of a primer pair. As used herein, the term "primer" refers to an oligonucleotide which acts as a point of initiation of a template-directed synthesis using methods such as PCR (polymerase chain reaction) or LCR (ligase chain reaction) under appropriate conditions (e.g., in the presence of four different nucleotide triphosphates and a polymerization agent, such as DNA polymerase, RNA polymerase or reverse-transcriptase, DNA ligase, etc, in an appropriate buffer solution containing any necessary co-factors and at suitable temperature(s)). Such a template directed synthesis is also called "primer extension". For example, a primer pair may be designed to amplify a region of DNA using PCR. Such a pair will include a "forward primer" and a "reverse primer" that hybridize to complementary strands of a DNA molecule and that delimit a region to be synthesized/amplified. A primer of this aspect of the present invention is capable of amplifying, together with its pair (e.g. by PCR) a determinant specific nucleic acid sequence to provide a detectable signal under experimental conditions and which does not amplify other determinant nucleic acid sequence to provide a detectable signal under identical experimental conditions.

According to additional embodiments, the oligonucleotide is about 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 nucleotides in length. While the maximal length of a probe can be as long as the target sequence to be detected, depending on the type of assay in which it is employed, it is typically less than about 50, 60, 65, or 70 nucleotides in length. In the case of a primer, it is typically less than about 30 nucleotides in length. In a specific preferred embodiment of the invention, a primer or a probe is within the length of about 18 and about 28 nucleotides. It will be appreciated that when attached to a solid support, the probe may be of about 30-70, 75, 80, 90, 100, or more nucleotides in length.

The oligonucleotide of this aspect of the present invention need not reflect the exact sequence of the determinant nucleic acid sequence (i.e. need not be fully complementary), but must be sufficiently complementary to hybridize with the determinant nucleic acid sequence under the particular experimental conditions. Accordingly, the sequence of the oligonucleotide typically has atleast70 % homology, preferably at least80 %, 90 %, 95 %, 97 %, 99 % or 100 % homology, for example over a region of at least 13 or more contiguous nucleotides with the target determinant nucleic acid sequence. The conditions are selected such that hybridization of the oligonucleotide to the determinant nucleic acid sequence is favored and hybridization to other determinant nucleic acid sequences is minimized.

By way of example, hybridization of short nucleic acids (below 200 bp in length, e.g. 13- 50 bp in length) can be effected by the following hybridization protocols depending on the desired stringency; (i) hybridization solution of 6 x SSC and 1 % SDS or 3 M TMAC1, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS, 100 mg/ml denatured salmon sperm DNA and 0.1 % nonfat dried milk, hybridization temperature of 1 - 1.5 °C below the Tm, final wash solution of 3 M TMAC1, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS at 1 - 1.5 °C below the Tm (stringent hybridization conditions) (ii) hybridization solution of 6 x SSC and 0.1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS, 100 mg/ml denatured salmon sperm DNA and 0.1 % nonfat dried milk, hybridization temperature of 2 - 2.5 °C below the Tm, final wash solution of 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS at 1 - 1.5 °C below the Tm, final wash solution of 6 x SSC, and final wash at 22 °C (stringent to moderate hybridization conditions); and (iii) hybridization solution of 6 x SSC and 1 % SDS or 3 M TMACI, 0.01 M sodium phosphate (pH 6.8), 1 mM EDTA (pH 7.6), 0.5 % SDS, 100 mg/ml denatured salmon sperm DNA and 0.1 % nonfat dried milk, hybridization temperature at 2.5-3 °C below the Tm and final wash solution of 6 x SSC at 22 °C (moderate hybridization solution).

Oligonucleotides of the invention may be prepared by any of a variety of methods (see, for example, J. Sambrook et ah, "Molecular Cloning: A Laboratory Manual", 1989, 2.sup.nd Ed., Cold Spring Harbour Laboratory Press: New York, N.Y.; "PCR Protocols: A Guide to Methods and Applications", 1990, M. A. Innis (Ed.), Academic Press: New York, N.Y.; P. Tijssen "Hybridization with Nucleic Acid Probes--Laboratory Techniques in Biochemistry and Molecular Biology (Parts I and P)", 1993, Elsevier Science; "PCR Strategies", 1995, M. A. Innis (Ed.), Academic Press: New York, N.Y.; and "Short Protocols in Molecular Biology", 2002, F. M. Ausubel (Ed.), 5.sup.th Ed., John Wiley & Sons: Secaucus, N.J.). For example, oligonucleotides may be prepared using any of a variety of chemical techniques well-known in the art, including, for example, chemical synthesis and polymerization based on a template as described, for example, in S. A. Narang et ah, Meth. Enzymol. 1979, 68: 90-98; E. L. Brown et ah, Meth. Enzymol. 1979, 68: 109-151; E. S. Belousov et ah, Nucleic Acids Res. 1997, 25: 3440-3444; D. Guschin et ah, Anal. Biochem 1997, 250: 203-211; M. J. Blommers et ah, Biochemistry, 1994, 33: 7886-7896; and K. Frenkel et ah, Free Radic. Biol. Med. 1995, 19: 373-380; and U.S. Pat. No. 4,458,066.

In certain embodiments, the detection probes or amplification primers or both probes and primers are labeled with a detectable agent or moiety before being used in amplification/detection assays. In certain embodiments, the detection probes are labeled with a detectable agent. Preferably, a detectable agent is selected such that it generates a signal which can be measured and whose intensity is related (e.g., proportional) to the amount of amplification products in the sample being analyzed.

The association between the oligonucleotide and detectable agent can be covalent or non- covalent. Labeled detection probes can be prepared by incorporation of or conjugation to a detectable moiety. Labels can be attached directly to the nucleic acid sequence or indirectly (e.g., through a linker). Linkers or spacer arms of various lengths are known in the art and are commercially available, and can be selected to reduce steric hindrance, or to confer other useful or desired properties to the resulting labeled molecules (see, for example, E. S. Mansfield et ah, Mol. Cell. Probes, 1995, 9: 145-156).

As shown in the Examples section herein below, the fecal wash of the sigmoid colon and the rectum was found to comprise exfoliated inflammatory cells of the gut, which is highly indicative of disease state (and more specifically diseases associated with inflammation).

Thus, according to another aspect of the present invention there is provided a method of diagnosing a disease of the gastrointestinal tract of a subject comprising analyzing the expression level of at least one gene in a fecal wash of the sigmoid colon or rectum of the subject, wherein the expression level is indicative of the disease of the gastrointestinal tract.

Diseases of the gastrointestinal tract include but are not limited to Irritable bowel syndrome (IBS), colon cancer, celiac disease and inflammatory bowel disease, including ulcerative colitis, Crohn’s disease, microscopic colitis, Bechet’s disease, immune-therapy-induced colitis or ileitis, eosinophilic gastritis/ileitis/colitis and collagenous gastritis / ileitis.

Exemplary genes which are informative on IBD which can be analyzed in fecal washes are provided in Table 2, herein above.

According to this aspect of the present invention, the analysis on the fecal wash samples may be carried out on the RNA level (as described herein above) or on the protein level (as described herein below).

Methods of measuring the levels of proteins are well known in the art and include, e.g., immunoassays based on antibodies to proteins, aptamers or molecular imprints.

The protein determinants can be detected in any suitable manner, but are typically detected by contacting a sample from the subject with an antibody, which binds the determinant and then detecting the presence or absence of a reaction product. The antibody may be monoclonal, polyclonal, chimeric, or a fragment of the foregoing, as discussed in detail above, and the step of detecting the reaction product may be carried out with any suitable immunoassay. In one embodiment, the antibody which specifically binds the determinant is attached (either directly or indirectly) to a signal producing label, including but not limited to a radioactive label, an enzymatic label, a hapten, a reporter dye or a fluorescent label.

According to some embodiments of the invention, diagnosing of the subject for IBD is followed by substantiation of the screen results using gold standard methods.

In some embodiments, once a diagnosis has been obtained, screening for additional diseases may be recommended. For example routine colonoscopy may be recommended to monitor for colorectal cancer, since those with IBD are at a higher risk for developing it.

According to some embodiments of the invention, the method further comprises informing the subject of the diagnosis.

As used herein the phrase “informing the subject” refers to advising the subject that based on the diagnosis the subject should seek a suitable treatment regimen.

Once the diagnosis is determined, the results can be recorded in the subject’s medical file, which may assist in selecting a treatment regimen and/or determining prognosis of the subject.

Optionally, once the diagnosis is confirmed using the methods described herein, the subject can be treated accordingly. IBD may be treated using anti-inflammatory drugs including, but not limited to corticosteroids (e.g. glucocorticoids such as budesonide (Uceris), prednisone (Prednisone Intensol, Rayos), prednisolone (Millipred, Prelone) and methylprednisolone (Medrol, Depo-Medrol)); 5-ASA drugs (aminosalicylates) including but not limited to balsalazide (Colazal), mesalamine (Apriso, Asacol HD, Canasa, Pentasa), olsalazine (Dipentum) and sulfasalazine (Azulfidine), immuno modulators including but not limited to methotrexate (Otrexup, Trexall, Rasuvo), azathioprine (Azasan, Imuran) and mercaptopurine (Purixan).

Other agents suitable for treating IBD include inhibitors of TNF- alpha (including but not limited to adalimumab (Humira), golimumab (Simponi) and infliximab (Remicade). Other biologies for treating IBD include certolizumab (Cimzia); natalizumab (Tysabri); ustekinumab (Stelara) and vedolizumab (Entyvio).

Surgical interventions that can be recommended for treating IBD include strictureplasty to widen a narrowed bowel, closure or removal of fistulas, removal of affected portions of the intestines, for people with Crohn’s disease and removal of the entire colon and rectum, for severe cases of UC.

The present inventors further conceive that the RNAs shown to be associated with the inflammatory status of the disease may be useful for selecting an agent for the treatment of an inflammatory bowel disease. This may be carried out as a method of selecting a known agent for a particular subject (i.e. personalized therapy) or as a more general method for uncovering novel drugs for the treatment of IBD.

Thus, according to another aspect of the present invention, a method of selecting an agent for the treatment of an inflammatory bowel disease (IBD) is provided. The method comprises;

(a) contacting the agent with an RNA sample derived from feces of a subject having the IBD; and

The RNA sample may be derived from solid feces of the subject or a liquid sample, as described herein above.

The contacting is typically carried out ex vivo.

Analysis of RNA is described herein above.

As used herein the term “about” refers to ± 10 %

The terms "comprises", "comprising", "includes", "including", “having” and their conjugates mean "including but not limited to".

The term “consisting of’ means “including and limited to”.

The term "consisting essentially of' means that the composition, method or structure may include additional ingredients, steps and/or parts, but only if the additional ingredients, steps and/or parts do not materially alter the basic and novel characteristics of the claimed composition, method or structure.

As used herein, the singular form "a", "an" and "the" include plural references unless the context clearly dictates otherwise. For example, the term "a compound" or "at least one compound" may include a plurality of compounds, including mixtures thereof.

Throughout this application, various embodiments of this invention may be presented in a range format. It should be understood that the description in range format is merely for convenience and brevity and should not be construed as an inflexible limitation on the scope of the invention. Accordingly, the description of a range should be considered to have specifically disclosed all the possible subranges as well as individual numerical values within that range. For example, description of a range such as from 1 to 6 should be considered to have specifically disclosed subranges such as from 1 to 3, from 1 to 4, from 1 to 5, from 2 to 4, from 2 to 6, from 3 to 6 etc., as well as individual numbers within that range, for example, 1, 2, 3, 4, 5, and 6. This applies regardless of the breadth of the range.

Whenever a numerical range is indicated herein, it is meant to include any cited numeral (fractional or integral) within the indicated range. The phrases “ranging/ranges between” a first indicate number and a second indicate number and “ranging/ranges from” a first indicate number “to” a second indicate number are used herein interchangeably and are meant to include the first and second indicated numbers and all the fractional and integral numerals therebetween.

As used herein the term "method" refers to manners, means, techniques and procedures for accomplishing a given task including, but not limited to, those manners, means, techniques and procedures either known to, or readily developed from known manners, means, techniques and procedures by practitioners of the chemical, pharmacological, biological, biochemical and medical arts.

As used herein, the term “treating” includes abrogating, substantially inhibiting, slowing or reversing the progression of a condition, substantially ameliorating clinical or aesthetical symptoms of a condition or substantially preventing the appearance of clinical or aesthetical symptoms of a condition.

When reference is made to particular sequence listings, such reference is to be understood to also encompass sequences that substantially correspond to its complementary sequence as including minor sequence variations, resulting from, e.g., sequencing errors, cloning errors, or other alterations resulting in base substitution, base deletion or base addition, provided that the frequency of such variations is less than 1 in 50 nucleotides, alternatively, less than 1 in 100 nucleotides, alternatively, less than 1 in 200 nucleotides, alternatively, less than 1 in 500 nucleotides, alternatively, less than 1 in 1000 nucleotides, alternatively, less than 1 in 5,000 nucleotides, alternatively, less than 1 in 10,000 nucleotides.

It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable subcombination or as suitable in any other described embodiment of the invention. Certain features described in the context of various embodiments are not to be considered essential features of those embodiments, unless the embodiment is inoperative without those elements.

Various embodiments and aspects of the present invention as delineated hereinabove and as claimed in the claims section below find experimental support in the following examples. EXAMPLE 1

MATERIALS AND METHODS

Patient population: The study groups included patients with ulcerative colitis or Crohn’ s colitis, or healthy controls. All control patients performed lower endoscopy for screening purposes and IBD patients underwent the procedure due to clinical indications (screening for dysplasia / assessment of disease status). Clinical and demographic parameters were obtained from patients’ computerized files.

Sample collection : Upon endoscopy, biopsies (2 consecutive biopsies per patient - "double bite") from the sigmoid colon were obtained and fecal fluid was suctioned from the sigmoid colon, at the beginning of the procedure before any through- the- scope washing was applied. Samples were snap-frozen in liquid nitrogen and stored at -80 °C until further analysis. In addition, stool samples were obtained from 4 patients (2 IBD patients and 2 controls) for proteomics analysis and stool calprotectin measurements.

Study Outcomes: The primary outcome was to map the transcriptomic profile of fecal washes in different patient groups (control, IBD with or without endoscopic and histological inflammation) and to identify biomarkers for classifying these groups. Secondary outcomes included a comparison of fecal washes to colonic biopsies and inference of the cellular composition of the fecal washes using computational deconvolution based on scRNAseq data.

Exclusion criteria:

• Patients younger than 18.

• Undetermined diagnosis of UC or CD (IBD-unclassified).

• Missing clinical / demographic data.

• Patients with active endoscopic inflammation in the right colon only.

Biomarker measurements: Stool calprotectin was measured using a commercially available Calprosmart home-test ¹⁹.

Definition of clinical remission: Clinical status was determined by HBI (Harvey-Bradshaw index) for Crohn's disease (CD) and by SCCAI (Simple Clinical Colitis Activity Index) for ulcerative colitis (UC) patients. Clinical remission was defined as HBI <5 for CD patients and SCCAK 3 for UC patients ²⁰- ²¹.

Definition of mucosal healing and histological healing: Endoscopic and histological inflammation were graded according to standardized indices and by blinded gastroenterologists and pathologists, Endoscopic scores were determined prospectively during lower endoscopy. Mucosal healing was defined as absence of ulcers or lack of inflammation on endoscopic examination, for CD and UC respectively ²². Histological inflammation was determined by a certified pathologist based on biopsies from the same sigmoid colon region used for the biopsy transcriptomics. Histological healing was retrospectively defined as grade 0 on the Nancy histological index.

RNA extraction: For colonic biopsies - snap frozen tissues (2mm*2mm) were thawed in 300 pi Tri-reagent and mechanically homogenized with bead beating, followed by a short centrifugation step to pull down beads and any tissue left-overs. For colonic washes - Tri-reagent was added at a ratio of 3: 1, samples were allowed to thaw on ice followed by thorough mixing. A first centrifugation step was used (1 minute, 18,000 rpm) to eliminate fecal solids. Following this, ethanol was added in a ratio of 1:1 to the supernatant from the previous step and continued according to the manufacturer instructions of Direct- zol mini and micro prep kit (ZYMO research, R2052) ²³.

RNA sequencing of samples: RNA was processed by the mcSCRBseq protocol ²⁴ with minor modifications. RT reaction was applied on 10 ng of total RNA with a final volume of 10 pi (lx Maxima H Buffer, 1 mM dNTPs, 2 mM TSO* E5V6NEXT, 7.5% PEG8000, 20U Maxima H enzyme, 1 mΐ barcoded RT primer). Subsequent steps were applied as mentioned in the protocol. Library preparation was done using Nextera XT kit (Illumina) on 0.6 ng amplified cDNA. Library final concentration of 2nM was loaded on NextSeq 500/550 (Illumina) sequencing machine aiming at 20 M reads per sample ²³ with the following setting: Readl - 16bp, Indexl - 8bp, Read2 - 66bp.

Proteomic analysis: Fecal samples were lysed in lysis buffer containing 5% SDS, proteins were extracted, digested with trypsin, and tryptic peptides were subjected to LC-MS/MS analysis ²⁵. Acquired raw data was analyzed using the MaxQuant software while searching against the human protein database, and downstream quantitative comparisons were calculated using the Perseus software ²⁶.

Bioinformatics and computational analysis: Illumina output sequencing raw files were converted to FASTQ files using bcl2fastq package. To obtain the UMI counts, FASTQ files were aligned to the human reference genome (GRCh38.91) using zUMI package ²⁷. Statistical analyses were performed with MATLAB R2018b. Mitochondrial genes and non-protein coding genes were removed from the analysis. Protein coding genes were extracted using the annotation in the Ensembl database (BioMart) for reference genome GRch38 version 91, using the R package "biomaRt" (version 2.44.4). Gene expressionfor each sample was consequently normalized by the sum of the UMIs of the remaining genes. Samples with less than 10,000 UMIs over the remaining genes were removed from the analyses. Clustering was performed with the MATLAB function clustergram over the Zscore-transformed expression matrix, using Spearman distances. Differential gene expression was performed using Wilcoxon ranksum tests and Benjamini- Hochberg FDR corrections. Computational deconvolution was performed using CIBERSORTx ²⁸ using signature tables obtained from a single cell atlas of control and UC patients ²⁹. Original cell type annotations were used, but subsequently coarse-grained into small number of cell types. M cells were removed from the analysis due to their low abundance. Receiver Operating Curve analyses were performed using the MATLAB function perfcurve. Gene Set Enrichment Analysis (GSEA) ³⁰ was performed over the Hallmark and Kegg gene sets. Pathway analysis for the top classifying fecal wash genes was performed using EnrichR ³¹. RESULTS

Cohort characteristics: In total, 39 biopsies and 39 matching fecal wash samples were obtained from 16 patients with ulcerative colitis, 3 patients with Crohn’s colitis and 20 control subjects undergoing colonoscopy. Pairs of biopsies and matching washes were obtained concomitantly (Figure 1, Table 4). Table 4

IBD -Inflammatory bowel disease, CD - Crohn’s disease, UC - ulcerative colitis, IQR - interquartile range.

* Out of total CD patients (n-3).

** Clinical remission was defined using the HBI and SCCAI scores for CD and UC respectively Control patients were those undergoing lower endoscopy for screening purposes, recommended over the age of 50, and therefore they were significantly older than the IBD group (p<0.0008), with more comorbidities, other than IBD (p=0.0015, Table 1). Eleven (58%) of all IBD patients were treated with immuno modulator / biological therapy and five (26%) were on concomitant steroids at time of enrollment. Nine (47%) of the patients were in clinical remission, twelve (63%) were in endoscopic remission and seven (37%) achieved histologic remission as determined on the day of the lower endoscopy. Five fecal wash samples were excluded from the analysis due to technical dropouts.

Fecal wash host transcriptome is more informative than biopsy transcriptome in classifying patient disease status: Bulk RNA sequencing of all samples was performed using the UMI-based mcSCRBseq (see Methods) and the reads were mapped to the human genome. Gene expression signatures of colonic biopsies were found to be different from those of colonic washes (Figure 2A, B). Biopsy samples with histological inflammation were not distinct from biopsy samples of patients without histological inflammation in the PCA or clustering analysis (Figure 2C). In contrast, colonic fecal wash samples showed a clear separation between samples with and without histological inflammation (Figure 2D).

The present inventors next sought to quantify the comparative ability of biopsy and fecal wash transcriptomics to inform on histological inflammation. To this end, they examined correlations between gene expression profiles of pairs of samples that both have histological inflammation compared to mixed pairs (one with and one without histological inflammation). There was no significant difference between transcriptomic profiles obtained from biopsies with histological inflammation compared to correlations between mixed biopsies (with or without histological inflammation) (p=0.98). However, fecal washes with histological inflammation were significantly more correlated to each other than mixed washes (Figure IE, p=5.3*10 ¹²). This analysis therefore demonstrates that fecal wash transcriptomics may provide signatures for classifying patients with or without histological inflammation.

When assessing concordance of fecal washes and biopsies with endoscopic, rather than histological inflammation, similarly, fecal washes, rather than biopsies, were associated with endoscopic remission (p=0.004 versus p=0.6 respectively, Figure 7A). Furthermore, statistically higher concordance of fecal wash transcriptomics with histological inflammation status was observed, compared to biopsy transcriptomics when stratifying according to patients’ age or biological therapy (Figures 7B-C). The expression signatures of fecal washes were generally more similar to their matching biopsies than to other biopsies (Figure 8)

Gene expression patterns are significantly different between fecal washes of patients with and without histological inflammation: 1168 genes out of 3999 highly expressed genes were differentially expressed in fecal washes from patients with and without histological inflammation (Figures 3A, B normalized expression above 5*10 ⁵ q-value<0.1, Wilcoxon rank sum tests with Benjamini-Hochberg false discovery rate correction). Genes that were upregulated in inflamed sample washes included S100A8 and S100A9, encoding the subunits of the calprotectin protein, as well as other immune-related genes such as NFKBIA, TNF, TNFRSF1B, CCR1, STAT1 and IFIT3. Using Gene Set Enrichment Analysis (GSEA) ³² it was found that washes from inflamed patients were enriched in genes associated with TNFA signaling, IL6 signaling, chemokine signaling pathway and the JAK STAT pathway, and depleted in epithelial pathways such as glycolysis and glutathione metabolism (Figure 3C).

Inflamed fecal washes exhibit distinct cellular composition: cell compositions among inflamed versus non inflamed fecal washes and biopsies were inferred using CIBERSORTx ²⁸ (Methods / Bioinformatic and computational analysis), using gene expression signatures of human colonic cell types that were parsed based on a recent single cell RNAseq study ²⁹ (Methods, Figure 9). An elevated representation of distinct immune cell subtypes were found in the washes of patients with histological inflammation (Figures 4A-B). Cell types that were elevated in fecal washes from patients with histological inflammation included regulatory T cells (p=2.1*10 ⁴), natural killer (NK) cells (p=5.5*10 ³), inflammatory monocytes (7.5*10 ⁹) and innate lymphoid cells (ILCs, p=1.4*10 ⁶). The increased differential representation of these immune subsets was higher in fecal washes when compared to biopsies (Figure 4B). Non-inflamed washes had a significantly higher representation of enterocytes (p=2.7*10 ³), myofibroblasts (2.1*10 ⁹) and goblet cells (2.8*10 ⁸) compared to inflamed washes.

More genes have expression levels that are highly predictive of histological inflammation in fecal washes compared to biopsies: The present inventors next sought to assess whether expression levels of individual genes can classify samples as belonging to patients with or without histological inflammation. The present inventors performed Receiver Operating Characteristic (ROC) curve analysis for all genes in the biopsies and fecal washes and examined the area under the curve (AUC). NFKBIA is demonstrated as an example (Figure 5A). It was found that in the washes, the expression levels of multiple individual genes were significantly more predictive of histological inflammation compared to the biopsies. This was evident by the significantly higher AUC of the 5% genes with highest AUC levels in both groups (p=1.85*10 ⁷², Figure 5B). Fecal washes included 150 genes with AUC>0.9, whereas biopsies had only 10 such genes (Figure 5C- E). Pathway analysis demonstrated that the 5% genes with the highest AUC in fecal washes were enriched for TNFa signaling via NF-kB, and inflammatory response, interferon a and g signaling pathways, and IL-6/JAK/STAT signaling.

Fecal wash transcriptomics carries distinct information from fecal proteomics: To assess the information contained by the fecal wash transcriptomics measurements in relation to fecal proteomics, Mass Spectrometry Proteomics of 10 fecal samples (6 fecal washes and 4 stool samples) was performed. The six fecal washes had matching fecal wash transcriptomics analyses. Fecal calprotectin levels were measured in the 4 stool samples. Protein expression of S100A8 and S100A9 were correlated with stool calprotectin levels (Figure 6A). Notably, protein and mRNA levels were only weakly correlated (R=0.16, p=1.2*10 ⁴). Genes with discordant mRNA and protein levels included pancreatic proteins, such as the amylase protein AMY2A, and the elastase proteins CELA2A, CELA3A and CELA3B (Figure 6B). These proteins are produced by pancreatic acinar cells and settle on the luminal side of the intestinal epithelium, explaining the lack of mRNAs. Other discordances may represent differential stability of distinct proteins and mRNA species. The fecal host transcriptomics therefore provides information that is distinct from fecal proteomics.

Fecal wash transcriptomics carries information regarding histological inflammation in the ileum.

In 10/11 patients with Crohn’ s disease in the ileum / right (proximal) colon left sided colonic washes demonstrated an inflammatory signature, similar to patients with left sided inflammation. EXAMPLE 2

MATERIALS AND METHODS

Sample collection and storage: Participants were given either a 15ml tube with 5-10 ml of RNAlater, or a shaking 15 ml tube with 2 ml RLT (cell lysis buffer supplied in Qiagen RNeasy kits based on guanidinium thiocyonate) supplemented with DTT in a final concentration of 0.04 M. Collection tubes without the sample were kept at RT. Participants transferred 2 spoonfuls from a fresh stool sample, which has not passed into the toilet, into the collection tube. Sample size was 0.5mm ³ * 2 samples. The tube was shaken manually up and down for 60 seconds and were stored in a vertical position for 24-48 hours until delivery.

Collection tubes containing the samples were kept at 4 °C for at least 24 hours and not more than 48 hours, and were then frozen at -80 °C for at least 2 days. Content from shaking tube was transferred into two 2ml Eppendorf tube prior to freezing (~lml per tube).

RNA extraction: Prior to RNA extraction, samples were thawed on ice. Samples stored in RNAlater were transferred into a 2ml Eppendorf, with as little residues of RNAlater as possible. A volume of 1ml RLT + 0.04M DTT was added to the sample. Samples frozen in RLT were thawed and additional RLT-DTT was added according to consistency.

All 2ml Eppendorf tube containing sample and RLT+DTT were vigorously vortexed. 0.55 mm diameter RNase free zirconium- oxide homogenization (Next Advance) were added in a mass comparable to that of the fecal sample, and samples were homogenized in a Bullet Blender (Next Advance) using speed 8 setting for 3 mins. After the homogenization step, samples were centrifuged (200-500 ref, 1-10 min) and 700 pi from the supernatant were transferred into a new Eppendorf tube. An equal volume of 100% EtOH is added to the sample and tube was vortexed. The content was then loaded on an RNeasy spin column placed in a 2 ml collection tube. RNA extraction steps were according to Qiagen’ s protocol for RNeasy micro kit with DNase I digestion step. Samples were eluted in 30 mΐ nuclease free water.

Samples stored in RNAlater were also transferred in the same manner into cold 700ul TRI- reagent for RNA isolation (Sigma), and following the Bullet Blender homogenization steps described above, supernatant was transferred into a new tube. 100% EtOH was added an equal volume, and after vortexing, samples were loaded into Direct- zol RNA microprep column (Zymo research). RNA extraction was performed as detailed in kit protocol, with a DNase I incubation step. Samples were eluted in 30 mΐ nuclease free water. Negative selection by depletion on non-host RNA+DNA

RNA extracted from stool samples is comprised of transcripts of multiple species, including host (human) and commensal microorganisms such as microbiome, mycobiome and other parasite populations. As these populations outnumber the cells shed from the intestinal tract, only a minority of RNA content is originated in human. In order to enrich the readout from human RNA in an unbiased fashion, RNA molecules highly expressed by bacteria were depleted.

The depletion was carried out in three steps: first, DNA oligos designed and synthesized to be reverse-complement to bacterial transcripts with high expression level (such as bacterial rRNA - 5S, 16S, 23S) were hybridized to the RNA. Next, RNase H enzyme digests and RNA-DNA specific hybrids, which leads to the selective digestion of only RNA molecules targeted by the DNA probes. Lastly, DNase I enzyme endonucleases the left over DNA probes and other DNA residues left in the sample after RNA extraction followed by RNA cleanup.

To deplete bacterial rRNA from our samples, the NEBNext rRNA Depletion Kit (Bacteria) kit and protocol was followed. After RNA purification step, bacterial rRNA depleted RNA is eluted in 8.4 pi nuclease free water and immediately proceeded to mcSCRBseq protocol for library preparation.

RESULTS

Stool samples represent particularly challenging starting biological material, due to their texture, potential long residence time of shed intestinal cells and elevated microbial content. Here, the present inventors have optimized protocol for the enrichment and successful sequencing of host mRNA. The key steps involved are sample acquirement and negative genomic selection of abundant microbial transcripts. This increases the fraction of human exonic reads in the sequenced samples (Figure 11A). When applying this method on stool from both control donors and IBD patients with active inflammation, stool samples are clustered similarly to fecal wash samples, based on their presence of inflammation (Figure 1B-C). Profound enrichment of the inflamed host transcriptomic signature observed in fecal washes (Figure 11D-E) is also observed in stool samples following bacterial rRNA depletion demonstrating the ability of stool host transcriptomics to provide valuable information on IBD disease state.

Figures 10A-E illustrate that bacterial rRNA depleted stool transcriptomics are informative is assessing intestinal inflammation.

Tables 5 and 6 provides a list of genes that were upregulated in the stool sample of subjects with Crohn’s disease as compared to control (healthy subjects). Table 5 lists genes that were upregulated in stool samples and not in fecal washes in subjects with Crohn’ s disease as compared to control (healthy subjects) and Table 6 lists genes that were upregulated in both stool samples and in fecal washes in subjects with Crohn’s disease as compared to control (healthy subjects). log2(fold change) and Kruskal-Wallis p-values are given for stool samples (Table 5) and AUC score, together with log2(fold change) and FDR are given for fecal wash analysis (Table 6).

Table 5

Table 6

Although the invention has been described in conjunction with specific embodiments thereof, it is evident that many alternatives, modifications and variations will be apparent to those skilled in the art. Accordingly, it is intended to embrace all such alternatives, modifications and variations that fall within the spirit and broad scope of the appended claims.

All publications, patents and patent applications mentioned in this specification are herein incorporated in their entirety by reference into the specification, to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present invention. To the extent that section headings are used, they should not be construed as necessarily limiting.

In addition, any priority documents) of this application is/are hereby incorporated herein by reference in its/their entirety.

Previous Patent: QUORUM QUENCHING ENZYME, AND METHODS OF USING SAME

Next Patent: CODON OPTIMIZATION OF NUCLEIC ACIDS