

Title:
MEANS AND METHODS FOR DETECTING BACTERIA IN AN AEROSOL SAMPLE
Document Type and Number:
WIPO Patent Application WO/2010/032245
Kind Code:
A2
Abstract:
This disclosure provides a method for detecting and/or identifying uncultured bacteria. The sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid. The method comprises obtaining absorption spectra (AS) of the sample, extracting and processing the acquired data, thereby detecting and/or identifying the bacteria.

Inventors:
BEN-DAVID MOSHE (IL)
GANNOT GALLYA (IL)
ERUV TOMER (IL)
MARKOWITZ ZVI (IL)
Application Number:
PCT/IL2009/000908
Publication Date:
March 25, 2010
Filing Date:
September 16, 2009
Assignee:
OPTICUL DIAGNOSTICS LTD (IL)
BEN-DAVID MOSHE (IL)
GANNOT GALLYA (IL)
ERUV TOMER (IL)
MARKOWITZ ZVI (IL)
International Classes:
G06K9/46
Domestic Patent References:
WO2005060380A2, 2005-07-07
Foreign References:
US20070086003A1, 2007-04-19
Other References:
"Table of contents" In: Richard O. Duda, Peter E. Hart and David G. Stork: "Pattern classification" 31 December 2001 (2001-12-31), Wiley, XP002570941 ISBN: 0471056693, pages vii-xvi, the whole document
"Table of contents" In: RICHARD G BRERETON: "Multivariate pattern recognition in chemometrics, illustrated by case studies" 31 December 1992 (1992-12-31), Elsevier, XP002570942 ISBN: 0444897844, pages v-viii, the whole document
ROGGO ET AL: "A review of near infrared spectroscopy and chemometrics in pharmaceutical technologies" JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, NEW YORK, NY, US, vol. 44, no. 3, 10 July 2007 (2007-07-10), pages 683-700, XP022145323 ISSN: 0731-7085
POWELL J R ET AL: "An Algorithm for the Reproducible Spectral Subtraction of Water from the FT-IR Spectra of Proteins in Dilute Solutions and Adsorbed Monolayers" APPLIED SPECTROSCOPY, vol. 40, no. 3, 1 March 1986 (1986-03-01), pages 339-344, XP002570943
Attorney, Agent or Firm:
DR. EYAL BRESSLER LTD. (Lazrom House, Ramat Gan, IL)
Claims:

1. A method for detecting and/or identifying specific bacteria within an uncultured sample; said method comprising:
a. obtaining an absorption spectrum (AS) of said uncultured sample;
b. acquiring the n dimensional volume boundaries for said specific bacteria by:
   i. obtaining at least one absorption spectrum (AS2) of known samples containing said specific bacteria;
   ii. extracting x features from said entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one;
   iii. dividing said AS2 into several segments according to said x features;
   iv. calculating y features of each of said segments of said AS2; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer greater than or equal to one;
   v. assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(Sb)/trace(Sw); Sw/(Sb+Sw); Kullback-Leibler divergence; correct classification rate; and any combination thereof;
   vi. defining n dimensional space; n equals the sum of said x features and said y features;
   vii. defining the n dimensional volume in said n dimensional space;
   viii. determining said boundaries of said n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof;
c. data processing said AS by:
   i. noise reducing by using different smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof;
   ii. extracting m features from said entire AS; said m features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m is an integer greater than or equal to one;
   iii. dividing said AS into several segments according to said m features;
   iv. calculating m1 features of each of said segments; said m1 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m1 is an integer greater than or equal to one; and,
d. detecting and/or identifying said specific bacteria if said m1 features and/or said m features are within said n dimensional volume;
wherein said sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid.
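
As an illustration only, the data processing and detection steps of claim 1 can be sketched in a few lines of Python. The sketch assumes the running average as the smoothing technique, the mean, variance, skewness and kurtosis as the extracted features, and a simple axis-aligned box as the n dimensional volume. The function names (running_average, moment_features, inside_box) and the box shape are hypothetical choices made for this example and are not taken from the application.

```python
import math

def running_average(signal, window=5):
    """Smooth a spectrum with a centered running average (window shrinks at the edges)."""
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    smoothed = []
    for i in range(len(signal)):
        lo = max(0, i - half)
        hi = min(len(signal), i + half + 1)
        smoothed.append(sum(signal[lo:hi]) / (hi - lo))
    return smoothed

def moment_features(signal):
    """Return mean, variance, skewness and kurtosis (population moments) of a signal."""
    n = len(signal)
    mean = sum(signal) / n
    variance = sum((v - mean) ** 2 for v in signal) / n
    if variance == 0:
        skewness = kurtosis = 0.0  # a flat signal has no shape information
    else:
        std = math.sqrt(variance)
        skewness = sum(((v - mean) / std) ** 3 for v in signal) / n
        kurtosis = sum(((v - mean) / std) ** 4 for v in signal) / n
    return {"mean": mean, "variance": variance,
            "skewness": skewness, "kurtosis": kurtosis}

def inside_box(features, bounds):
    """Hypothetical volume test: every bounded feature must lie within [low, high]."""
    return all(low <= features[name] <= high
               for name, (low, high) in bounds.items())
```

In practice the bounds would come from the training spectra (AS2) of step (b); here they are simply passed in as a dictionary mapping a feature name to a (low, high) pair.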

2. The method for detecting and/or identifying specific bacteria within an uncultured sample according to claim 1, additionally comprising a step of selecting said x features and/or said y features via algorithms selected from the Chi-squared (χ2) test, the Wilcoxon test, the t-test or any combination thereof.
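
One possible reading of the t-test option is to score each candidate feature by how well it separates spectra with and without the target bacteria, then keep the highest-scoring features. The sketch below uses Welch's unequal-variance t statistic as an assumption (the claim does not say which t-test variant is meant), and the names welch_t and rank_features are hypothetical.

```python
import math

def welch_t(sample_a, sample_b):
    """Welch's t statistic for two samples with possibly unequal variances.

    Each sample needs at least two values, and at least one sample must
    have non-zero variance, otherwise the denominator is zero.
    """
    na, nb = len(sample_a), len(sample_b)
    mean_a = sum(sample_a) / na
    mean_b = sum(sample_b) / nb
    var_a = sum((v - mean_a) ** 2 for v in sample_a) / (na - 1)
    var_b = sum((v - mean_b) ** 2 for v in sample_b) / (nb - 1)
    return (mean_a - mean_b) / math.sqrt(var_a / na + var_b / nb)

def rank_features(class_a, class_b):
    """Rank feature names by |t|, most discriminative first.

    class_a and class_b map a feature name to the list of values measured
    for that feature in each class.
    """
    scores = {name: abs(welch_t(class_a[name], class_b[name]))
              for name in class_a}
    return sorted(scores, key=scores.get, reverse=True)
```

A feature whose values barely differ between the two classes receives a score near zero and falls to the end of the ranking.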

3. The method for detecting and/or identifying specific bacteria within an uncultured sample according to claim 1, wherein said step of acquiring the n dimensional volume boundaries for the specific bacteria additionally comprises a step of calculating the Gaussian distribution and/or Multivariate Gaussian distribution, and/or Rayleigh distribution, and/or Maxwell distribution, and/or estimating the distribution by the Parzen method or by a mixed model, the Gaussian Mixed Model (GMM), for at least one of the n features such that the distributions define the n dimensional volume in the n dimensional space.
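
To make the idea of a distribution defining the volume concrete, the following sketch fits an independent Gaussian to each of the n features and treats the volume as the set of points within a fixed Mahalanobis distance of the mean. The diagonal covariance and the threshold of 3.0 are assumptions made for this example, and all function names are hypothetical.

```python
import math

def fit_diagonal_gaussian(training_vectors):
    """Estimate per-feature means and sample variances from training vectors."""
    n = len(training_vectors)
    dims = len(training_vectors[0])
    means = [sum(v[d] for v in training_vectors) / n for d in range(dims)]
    variances = [sum((v[d] - means[d]) ** 2 for v in training_vectors) / (n - 1)
                 for d in range(dims)]
    return means, variances

def mahalanobis_distance(vector, means, variances):
    """Distance from the mean in units of standard deviation (diagonal covariance)."""
    return math.sqrt(sum((x - m) ** 2 / s
                         for x, m, s in zip(vector, means, variances)))

def inside_volume(vector, means, variances, max_distance=3.0):
    """Assumed volume: all points within max_distance of the training mean."""
    return mahalanobis_distance(vector, means, variances) <= max_distance
```

A full multivariate Gaussian would replace the per-feature variances with a covariance matrix, which also captures correlation between features.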

4. The method for detecting and/or identifying specific bacteria within an uncultured sample according to claim 1, wherein said step (c) of data processing said AS additionally comprises steps of:
   i. calculating at least one o-th derivative of said AS; said o is an integer greater than or equal to 1;
   ii. extracting m2 features from said entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one;
   iii. dividing said o-th derivative into several segments according to said m2 features;
   iv. calculating the m3 features in at least one of said segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; and,
   v. detecting and/or identifying said specific bacteria if said m1 and/or m3 features and/or said m and/or said m2 features are within said n dimensional volume.
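
The o-th derivative of a sampled spectrum can be approximated numerically. The sketch below assumes central finite differences in the interior and one-sided differences at the two ends, applied o times; the application does not specify a differentiation scheme, and the function names are hypothetical.

```python
def first_derivative(x, y):
    """Approximate dy/dx: central differences inside, one-sided at the ends."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have equal length of at least 2")
    dy = [(y[1] - y[0]) / (x[1] - x[0])]
    for i in range(1, len(x) - 1):
        dy.append((y[i + 1] - y[i - 1]) / (x[i + 1] - x[i - 1]))
    dy.append((y[-1] - y[-2]) / (x[-1] - x[-2]))
    return dy

def nth_derivative(x, y, order):
    """Apply first_derivative repeatedly to obtain the derivative of the given order."""
    if order < 1:
        raise ValueError("order must be an integer >= 1")
    result = list(y)
    for _ in range(order):
        result = first_derivative(x, result)
    return result
```

Differentiation amplifies noise, which is one reason the smoothing step normally precedes it; a Savitzky-Golay filter can smooth and differentiate in a single pass.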

5. The method for detecting and/or identifying specific bacteria within an uncultured sample according to any one of claims 1-4, additionally comprising the step of selecting said specific bacteria from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

6. The method for detecting and/or identifying specific bacteria within an uncultured sample according to any one of claims 1-5, wherein said step of obtaining the AS additionally comprises steps of:
a. providing at least one optical cell accommodating said uncultured sample;
b. providing p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers, monochromator; p is an integer equal to or greater than 1; said p light sources are adapted to emit light to said optical cell;
c. providing detecting means for receiving the spectroscopic data of said sample;
d. emitting light from said light sources at different wavelengths to said optical cell; and,
e. collecting said light exiting from said optical cell by said detecting means; thereby obtaining said AS.

7. The method for detecting and/or identifying specific bacteria within an uncultured sample according to claim 6, wherein said step of emitting light is performed at the wavelength range of UV, visible, IR, mid-IR, far-IR and terahertz.

8. The method for detecting and/or identifying specific bacteria within an uncultured sample according to any one of claims 1-7, additionally comprising the step of detecting said bacteria by analyzing said AS in the region of about 3000-3300 cm⁻¹ and/or about 850-1000 cm⁻¹ and/or about 1300-1350 cm⁻¹, and/or about 2836-2995 cm⁻¹, and/or about 1720-1780 cm⁻¹, and/or about 1550-1650 cm⁻¹, and/or about 1235-1363 cm⁻¹, and/or about 990-1190 cm⁻¹ and/or about 1500-1800 cm⁻¹ and/or about 2800-3050 cm⁻¹ and/or about 1180-1290 cm⁻¹.
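
For illustration, restricting the analysis to the wavenumber regions listed in claim 8 amounts to windowing the spectrum. The sketch below stores the eleven regions as (low, high) pairs in cm⁻¹ and computes the area under the spectrum inside one window with the trapezoidal rule. The names REGIONS_CM1, select_region and region_area are hypothetical, and using the area as the per-region feature is only one of the options the claims allow.

```python
# Wavenumber windows from claim 8, in cm^-1 (the claim says "about" each range).
REGIONS_CM1 = [
    (3000, 3300), (850, 1000), (1300, 1350), (2836, 2995),
    (1720, 1780), (1550, 1650), (1235, 1363), (990, 1190),
    (1500, 1800), (2800, 3050), (1180, 1290),
]

def select_region(wavenumbers, absorbances, low, high):
    """Return the (wavenumber, absorbance) points that fall inside [low, high]."""
    return [(w, a) for w, a in zip(wavenumbers, absorbances) if low <= w <= high]

def region_area(points):
    """Area under the spectrum for the given points (trapezoidal rule)."""
    pts = sorted(points)
    return sum(0.5 * (a0 + a1) * (w1 - w0)
               for (w0, a0), (w1, a1) in zip(pts, pts[1:]))
```

Looping over REGIONS_CM1 and collecting one area per window yields a small feature vector for each spectrum.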

9. A method for detecting and/or identifying specific bacteria within an uncultured sample; said method comprising:
a. obtaining an absorption spectrum (AS) of said uncultured sample; said AS containing water influence;
b. acquiring the n dimensional volume boundaries for said specific bacteria by:
   i. obtaining at least one absorption spectrum (AS2) of known samples containing said specific bacteria;
   ii. extracting x features from said AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one;
   iii. calculating at least one derivative of said AS2;
   iv. dividing said AS2 into several segments according to said x features;
   v. calculating the y features of each of said segments; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer greater than or equal to one;
   vi. assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(Sb)/trace(Sw); Sw/(Sb+Sw); Kullback-Leibler divergence; correct classification rate; and any combination thereof;
   vii. defining n dimensional space; n equals the sum of said x features and said y features;
   viii. defining the n dimensional volume in said n dimensional space;
   ix. determining said boundaries of said n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof;
c. eliminating said water influence from said AS by at least one of the following methods: Low pass filter, High pass filter and Water absorption division;
d. data processing said AS without said water influence by:
   i. noise reducing by using different smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof;
   ii. extracting m features from said entire AS; said m features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m is an integer greater than or equal to one;
   iii. dividing said AS into several segments according to said m features;
   iv. calculating the m1 features of at least one of said segments; said m1 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m1 is an integer greater than or equal to one; and,
e. detecting and/or identifying said specific bacteria if said m1 features and/or said m features are within said n dimensional volume;
wherein said sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid.
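
The claims do not define "Water absorption division". One plausible interpretation, shown here purely as an assumption, is a point-by-point division of the sample spectrum by a reference spectrum of water recorded on the same wavenumber grid, which is the usual way to remove a background from transmittance data. The function name divide_by_water and the min_reference guard are hypothetical.

```python
def divide_by_water(sample, water_reference, min_reference=1e-9):
    """Divide a sample spectrum by a water reference spectrum, point by point.

    Assumes both spectra are sampled at the same wavenumbers. Points where the
    reference is (almost) zero cannot be corrected and raise an error instead
    of silently producing huge values.
    """
    if len(sample) != len(water_reference):
        raise ValueError("spectra must have the same number of points")
    corrected = []
    for s, w in zip(sample, water_reference):
        if abs(w) < min_reference:
            raise ValueError("water reference too close to zero")
        corrected.append(s / w)
    return corrected
```

If the spectra are stored as absorbance rather than transmittance, the equivalent correction would be a subtraction of the water spectrum instead of a division.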

10. The method for detecting and/or identifying specific bacteria within an uncultured sample according to claim 9, additionally comprising a step of selecting said x features and/or said y features via algorithms selected from the Chi-squared (χ2) test, the Wilcoxon test, the t-test or any combination thereof.

11. The method for detecting and/or identifying specific bacteria within an uncultured sample according to claim 9, wherein said step of acquiring the n dimensional volume boundaries for the specific bacteria additionally comprises a step of calculating the Gaussian distribution and/or Multivariate Gaussian distribution, and/or Rayleigh distribution, and/or Maxwell distribution, and/or estimating the distribution by the Parzen method or by a mixed model, the Gaussian Mixed Model (GMM), for at least one of the n features such that the distributions define the n dimensional volume in the n dimensional space.

12. The method for detecting and/or identifying specific bacteria within an uncultured sample according to claim 9, wherein said step (c) of data processing said AS without said water influence additionally comprises steps of:
   i. calculating at least one o-th derivative of said AS; said o is an integer greater than or equal to 1;
   ii. extracting m2 features from said entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one;
   iii. dividing said o-th derivative into several segments according to said m2 features;
   iv. calculating the m3 features in at least one of said segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; and,
   v. detecting and/or identifying said specific bacteria if said m1 and/or m3 features and/or said m and/or said m2 features are within said n dimensional volume.

13. The method for detecting and/or identifying specific bacteria within an uncultured sample according to any one of claims 9-12, additionally comprising the step of selecting said specific bacteria from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

14. The method for detecting and/or identifying specific bacteria within an uncultured sample according to any one of claims 9-13, wherein said step of obtaining the AS additionally comprises steps of:
a. providing at least one optical cell accommodating said uncultured sample;
b. providing p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers, monochromator; p is an integer equal to or greater than 1; said p light sources are adapted to emit light to said optical cell;
c. providing detecting means for receiving the spectroscopic data of said sample;
d. emitting light from said light sources at different wavelengths to said optical cell;
e. collecting said light exiting from said optical cell by said detecting means; thereby obtaining said AS.

15. The method for detecting and/or identifying specific bacteria within an uncultured sample according to claim 14, wherein said step of emitting light is performed at the wavelength range of UV, visible, IR, mid-IR, far-IR and terahertz.

16. The method for detecting and/or identifying specific bacteria within an uncultured sample according to any one of claims 9-15, wherein the absorption spectrum is obtained using an instrument selected from the group consisting of a spectrometer, a Fourier transform infrared spectrometer, a fluorometer and a Raman spectrometer.

17. The method for detecting and/or identifying specific bacteria within an uncultured sample according to either one of claims 9-15, wherein said aerosol sample is taken from the human body.

18. A system 1000 adapted to detect and/or identify specific bacteria within an uncultured sample; said system comprising:
a. means 100 for obtaining an absorption spectrum (AS) of said uncultured sample;
b. statistical processing means 200 for acquiring the n dimensional volume boundaries for said specific bacteria; said means 200 are characterized by:
   i. means 201 for obtaining at least one absorption spectrum (AS2) of known samples containing said specific bacteria;
   ii. means 202 for extracting x features from said entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one;
   iii. means 203 for dividing said AS2 into several segments according to said x features;
   iv. means 204 for calculating y features from at least one of said segments; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer greater than or equal to one;
   v. means 205 for assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(Sb)/trace(Sw); Sw/(Sb+Sw); Kullback-Leibler divergence; correct classification rate; and any combination thereof;
   vi. means 206 for defining n dimensional space; n equals the sum of said x features and said y features;
   vii. means 207 for defining the n dimensional volume in the n dimensional space;
   viii. means 208 for determining said boundaries of said n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof;
   ix. means 209 for assigning the n dimensional volume to said specific bacteria;
c. means 300 for data processing said AS; said means 300 are characterized by:
   i. means 301 for noise reducing by using different smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof;
   ii. means 302 for extracting m features from said entire AS; said m features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m is an integer greater than or equal to one;
   iii. means 303 for dividing said AS into several segments according to said m features;
   iv. means 304 for calculating the m1 features of at least one of said segments; said m1 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m1 is an integer greater than or equal to one; and,
d. means 400 for detecting and/or identifying said specific bacteria if said m1 features and/or said m features are within said n dimensional volume;
wherein said sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid.
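
The criterion trace(Sb)/trace(Sw) named among the selection algorithms compares the between-class scatter Sb with the within-class scatter Sw: a larger ratio means the classes are further apart relative to their spread. The sketch below computes only the two traces, which does not require building the full scatter matrices. It assumes the common definition in which each class mean is weighted by its number of samples, and the function name scatter_ratio is hypothetical.

```python
def scatter_ratio(classes):
    """Return trace(Sb)/trace(Sw) for labeled feature vectors.

    classes maps a class label to a list of equal-length feature vectors.
    trace(Sb) sums n_c * |class_mean - global_mean|^2 over the classes;
    trace(Sw) sums |vector - class_mean|^2 over all vectors.
    """
    all_vectors = [v for vectors in classes.values() for v in vectors]
    dims = len(all_vectors[0])
    total = len(all_vectors)
    global_mean = [sum(v[d] for v in all_vectors) / total for d in range(dims)]
    trace_sb = 0.0
    trace_sw = 0.0
    for vectors in classes.values():
        n = len(vectors)
        class_mean = [sum(v[d] for v in vectors) / n for d in range(dims)]
        trace_sb += n * sum((class_mean[d] - global_mean[d]) ** 2
                            for d in range(dims))
        trace_sw += sum((v[d] - class_mean[d]) ** 2
                        for v in vectors for d in range(dims))
    return trace_sb / trace_sw
```

A sequential forward selection could call this function on each candidate feature subset and keep the subset with the largest ratio.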

19. The system 1000 according to claim 18, additionally comprising means for selecting said x features and/or said y features via algorithms selected from the Chi-squared (χ2) test, the Wilcoxon test, the t-test or any combination thereof.

20. The system 1000 according to claim 18, wherein said statistical processing means 200 additionally comprises means 210 for calculating the Gaussian distribution or Multivariate Gaussian distribution, or Rayleigh distribution, or Maxwell distribution, or estimating the distribution by the Parzen method or by a mixed model, the Gaussian Mixed Model (GMM), for at least one of the n features such that the distributions define the n dimensional volume in the n dimensional space.

21. The system 1000 according to claim 18, wherein said means 300 for data processing said AS are additionally characterized by:
   i. means 305 for calculating at least one o-th derivative of said AS; said o is an integer greater than or equal to 1;
   ii. means 306 for extracting m2 features from said entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one;
   iii. means 307 for dividing said o-th derivative into several segments according to said m2 features;
   iv. means 308 for calculating the m3 features in at least one of said segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; and,
   v. means 309 for detecting and/or identifying said specific bacteria if said m1 and/or m3 features and/or said m and/or said m2 features are within said n dimensional volume.

22. The system 1000 according to any one of claims 18-21, wherein said specific bacteria is selected from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

23. The system 1000 according to any one of claims 18-21, wherein said means 100 for obtaining an absorption spectrum (AS) of said sample additionally comprises:
a. at least one optical cell for accommodating said uncultured sample;
b. p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers, monochromator; p is an integer equal to or greater than 1; said p light sources are adapted to emit light at different wavelengths to said optical cell; and,
c. detecting means for receiving the spectroscopic data of said sample exiting from said optical cell.

24. The system 1000 according to claim 23, wherein said p light sources are adapted to emit light at a wavelength range selected from a group consisting of UV, visible, IR, mid-IR, far-IR and terahertz.

25. A system 2000 adapted to detect and/or identify specific bacteria within an uncultured sample; said system 2000 comprising: a. means 100 for obtaining an absorption spectrum (AS) of said uncultured sample; said AS containing water influence; b. statistical processing means 200 for acquiring the n dimensional volume boundaries for said specific bacteria; said means 200 are characterized by: i. means 201 for obtaining at least one absorption spectrum (AS2) of known samples containing said specific bacteria; ii. means 202 for extracting x features from said entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer higher or equal to one; x is an integer greater than or equal to one; iii. means 203 for dividing said AS2 into several segments according to said x features; iv. 
means 204 for calculating the y features of at least one of said segments; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer higher or equal to one; v. means 205 for assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(Sb)/trace(Sw); Sw/(Sb+Sw); Kullback-Leibler divergence; correct classification rate; and any combination thereof; vi. means 206 for defining n dimensional space; n equals the sum of said x features and said y features; vii. means 207 for defining the n dimensional volume in said n dimensional space; viii. means 208 for determining said boundaries of said n dimensional volume by using technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-mean clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof; ix. means 209 for assigning said n dimensional volume to said specific bacteria; c. means 300 for eliminating said water influence from said AS selected from a group consisting of low pass filter, high pass filter and water absorption division; d.
means 400 for data processing said AS without said water influence; said means 400 are characterized by: i. means 401 for noise reducing by using different smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof; ii. means 402 for extracting m features from said entire AS; said m features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m is an integer greater than or equal to one; iii. means 403 for dividing said AS into several segments according to said m features; iv. means 404 for calculating m1 features of at least one of said segments; said m1 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m1 is an integer greater than or equal to one; and, e.
means 500 for detecting and/or identifying said specific bacteria if said m1 features and/or said m features are within said n dimensional volume; wherein said sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid.

26. The system 2000 according to claim 25, additionally comprising means for selecting said x features and/or said y features via algorithms selected from Chi-Squared (χ²) test, Wilcoxon test, and t-test or any combination thereof.

27. The system 2000 according to claim 25, wherein said statistical processing means 200 additionally comprising means 210 for calculating the Gaussian distribution or Multivariate Gaussian distribution, or Rayleigh distribution, or Maxwell distribution, or Estimate the distribution by the Parzen method, or mixed model, the Gaussian Mixed Model (GMM) for at least one of the n features such that the distributions defines the n dimensional volume in the n dimensional space.

28. The system 2000 according to claim 25, wherein said means 400 for data processing said AS without said water influence additionally comprising: i. means 405 for calculating at least one of the o-th derivative of said AS; said o is an integer greater than or equal to 1; ii. means 406 for extracting m2 features from said entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; iii. means 407 for dividing said o-th derivative into several segments according to said m2 features; iv. means 408 for calculating the m3 features from at least one of said segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; and, v. means 409 for detecting and/or identifying said specific bacteria if said m1 and/or m3 features and/or said m and/or said m2 features are within said n dimensional volume.

29. The system 2000 according to either one of claims 25-28, wherein said specific bacteria is selected from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli, or any combination thereof.

30. The system 2000 according to either one of claims 25-29, wherein said means 100 for obtaining an absorption spectrum (AS) of said sample additionally comprising: a. at least one optical cell for accommodating said uncultured sample; b. p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers and monochromator, p is an integer equal to or greater than 1; said p light sources are adapted to emit light at different wavelengths to said optical cell; and, c. detecting means for receiving the spectroscopic data of said sample exiting from said optical cell.

31. The system 2000 according to claim 30, wherein said p light sources are adapted to emit light at a wavelength range selected from a group consisting of UV, visible, IR, mid-IR, far-IR and terahertz.

32. The system according to either one of claims 18-31, wherein the absorption spectra is obtained using an instrument selected from the group consisting of a spectrometer, Fourier transform infrared spectrometer, a fluorometer and a Raman spectrometer.

33. The system according to either one of claims 18-31, wherein said aerosol sample is taken from the human body.

34. The system as any of claims 18-31, additionally comprising means adapted to recommend, after the specific bacteria has been identified, what kind of antibiotics and medicine to take.

35. The methods as any of claims 1-17, additionally comprising step of recommending, after the specific bacteria has been identified, what kind of antibiotics and medicine to take.

36. The system as any of claims 18-31, wherein said sample is an aerosol sample obtained from air moisture and/or contaminations in air condition systems.

37. The methods as any of claims 1-17, wherein said sample is an aerosol sample obtained from air moisture and/or contaminations in air condition systems.

38. The system as any of claims 18-31, wherein the sensitivity of said system is less than 6×10⁶ bacteria/μL.

Description:
MEANS AND METHODS FOR DETECTING BACTERIA IN AN AEROSOL SAMPLE

FIELD OF THE INVENTION

The present invention relates to the field of spectroscopic medical diagnostics of specific bacteria within a sample. More particularly, the present invention provides means and methods for detecting different kinds of bacteria in an aerosol sample by using spectroscopic measurements. The detection can be used for both medical and non-medical applications, such as detecting bacteria in water, beverages, food production lines, sensing for hazardous materials in crowded places, bio-defense etc.

BACKGROUND OF THE INVENTION

The identification of microorganisms is clearly of great importance in the medical field. Furthermore, in recent years the need for efficient and relatively rapid identification techniques has become even more pressing owing to the remarkable expansion of environmental and industrial microbiology. One field in which there is an urgent need is the rapid and accurate identification of bacteria in an aerosol environment.

Respiratory disease is an umbrella term for diseases of the lung, bronchial tubes, trachea and throat. These diseases range from mild and self-limited (coryza, or the common cold) to life-threatening (bacterial pneumonia or pulmonary embolism, for example).

Respiratory diseases can be classified as either obstructive or restrictive. Obstructive conditions impede the rate of flow into and out of the lungs (e.g., asthma); restrictive conditions cause a reduction in the functional volume of the lungs (e.g., pulmonary fibrosis).

Respiratory disease can be further subdivided into upper or lower respiratory tract diseases (a division most commonly used in the context of infectious respiratory disease), parenchymal lung diseases and vascular lung diseases. Infectious respiratory diseases are, as the name suggests, typically caused by one of many infectious agents able to infect the mammalian respiratory system; the etiology can be viral or bacterial (for example, the bacterium Streptococcus pneumoniae).

A patient who suffers from an infectious respiratory disease will usually endure a sore throat and have trouble swallowing. However, these symptoms might also indicate a flu.

Usually a throat culture is taken from a patient who is suspected of having strep throat, in order to correctly diagnose the infection and to give the proper treatment.

The throat culture and bacterial analysis will usually take about three days. Moreover, the test causes some inconvenience to the patient.

The bacterial analysis will determine what is the desired and correct treatment and medication.

Another kind of test is the "rapid" Strep A test. In these tests, a throat swab is inserted into a reagent and the presence of the bacteria is determined according to the chemical reaction between the bacteria and the reagent. Although these tests give fast results (10 to 30 minutes), their sensitivity is very poor and they are not user friendly.

Therefore, they are not commonly used by the medical staff.

Usually the physician wishes to know whether the bacteria are present before prescribing antibiotics. Therefore, it will be beneficial for the doctor and the patient alike to get an immediate response for the throat sample.

An immediate response might be obtained by sampling the exhaled debris (exhaled gases and micro fluids) of coughing or other human fluids (saliva, mucus, etc.) and optically characterizing their content. Optically characterizing the sample will likely be more convenient for the patient than the usual throat culturing.

Some spectroscopic techniques are already known in the art. For example, PCT No. WO 98/41842 to NELSON, Wilfred discloses a system for the detection of bacteria antibody complexes. The sample to be tested for the presence of bacteria is placed in a medium which contains antibodies attached to a surface for binding to specific bacteria to form an antigen-antibody complex. The medium is contacted with an incident beam of light energy. Some of the energy is emitted from the medium as a lower resonance enhanced Raman backscattered energy. The detection of the presence or absence of the microorganism is based on the characteristic spectral peak of said microorganism. In other words, PCT No. WO 98/41842 uses UV resonance Raman spectroscopy.

US patent No. 6,599,715 to Laura A. Vanderberg relates to a process for detecting the presence of viable bacterial spores in a sample and to a spore detection system. The process includes placing a sample in a germination medium for a period of time sufficient for commitment of any present viable bacterial spores to occur. Then the sample is mixed with a solution of a lanthanide capable of forming a fluorescent complex with dipicolinic acid. Lastly, the sample is measured for the presence of dipicolinic acid.

US patent No. 4,847,198 to Wilfred H. Nelson discloses a method for the identification of a bacterium. Firstly, taxonomic markers are excited in a bacterium with a beam of ultraviolet energy. Then, the resonance enhanced Raman backscattered energy is collected substantially in the absence of fluorescence. Next, the resonance enhanced Raman backscattered energy is converted into spectra which correspond to the taxonomic markers in said bacterium. Finally, the spectra are displayed and thus the bacterium may be identified.

US patent No. 6,379,920 to Mostafa A. El-Sayed discloses a method to analyze and diagnose specific bacteria in a biological sample by using spectroscopic means. The method includes obtaining the spectra of a biologic sample of a non-infected patient for use as a reference, subtracting the reference from the spectra of an infected sample, and comparing the fingerprint regions of the resulting differential spectrum with reference spectra of bacteria. Using this diagnostic technique, patent 6,379,920 claims to identify specific bacteria without culturing.

Naumann et al. demonstrated bacteria detection and classification in dried samples using FTIR spectroscopy [Naumann D. et al., "Infrared spectroscopy in microbiology", Encyclopedia of Analytical Chemistry, R. A. Meyers (Ed.) pp. 102-131, John Wiley & Sons Ltd, Chichester, 2000.]. Marshall et al. identified live microbes using FTIR Raman spectroscopy [Marshall et al., "Vibrational spectroscopy of extant and fossil microbes: Relevance for the astrobiological exploration of Mars", Vibrational Spectroscopy 41 (2006) 182-189]. Other methods involve fluorescence spectroscopy or a combination of the above.

None of the prior art literature discloses means and method that can quickly (without culturing) and accurately detect bacteria from a sample, and none demonstrates identification within a wet sample. Furthermore, none of the prior art literature discloses means and method that can eliminate the water influence from the sample so as to better detect the bacteria. Moreover all of the above require a skilled operator and/or the use of reagents or a complicated sample preparation for the detection of bacteria.

Furthermore, none of the above distinguishes among different bacteria in a mixture or within a sample.

Thus, there is a long felt need for means and method for accurate bacteria identification from an uncultured sample and more specifically an aerosol sample without the use of reagents and/or complicated sample preparation.

SUMMARY OF THE INVENTION

It is one object of the present invention to provide a method for detecting and/or identifying specific bacteria within an uncultured sample; said method comprising: a. obtaining an absorption spectrum (AS) of said uncultured sample; b. acquiring the n dimensional volume boundaries for said specific bacteria by i. obtaining at least one absorption spectrum (AS2) of known samples containing said specific bacteria; ii. extracting x features from said entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer higher or equal to one; x is an integer greater than or equal to one; iii. dividing said AS2 into several segments according to said x features; iv. calculating y features of each of said segment of said AS2; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer higher or equal to one; v. 
assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(Sb)/trace(Sw); Sw/(Sb+Sw); Kullback-Leibler divergence; correct classification rate; and any combination thereof; vi. defining n dimensional space; n equals the sum of said x and said y features; vii. defining the n dimensional volume in said n dimensional space; viii. determining said boundaries of said n dimensional volume by using technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-mean clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof; c. data processing said AS; i. noise reducing by using different smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof; ii. extracting m features from said entire AS; said m features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m is an integer higher or equal to one; iii. dividing said AS into several segments according to said m features; iv.
calculating m1 features of each of said segments; said m1 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m1 is an integer greater than or equal to one; and, d. detecting and/or identifying said specific bacteria if said m1 features and/or said m features are within said n dimensional volume; wherein said sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid.
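The processing steps of the method above can be sketched in Python as follows. This is only an illustrative sketch under stated assumptions, not the implementation of the invention: the function names are hypothetical, the running average is one of the smoothing options named above, the segments are taken as equal-length pieces (the text divides the AS according to the m features), and a plain K-nearest neighbor vote stands in for the decision of whether the features fall within the n dimensional volume.

```python
import math

def running_average(signal, window=5):
    """Smooth with a centered running average; the window shrinks at the edges."""
    half = window // 2
    out = []
    for i in range(len(signal)):
        lo, hi = max(0, i - half), min(len(signal), i + half + 1)
        out.append(sum(signal[lo:hi]) / (hi - lo))
    return out

def moment_features(signal):
    """Mean, variance, skewness and kurtosis of a spectrum or segment."""
    n = len(signal)
    mean = sum(signal) / n
    var = sum((v - mean) ** 2 for v in signal) / n
    if var == 0:
        return [mean, 0.0, 0.0, 0.0]
    std = math.sqrt(var)
    skew = sum(((v - mean) / std) ** 3 for v in signal) / n
    kurt = sum(((v - mean) / std) ** 4 for v in signal) / n
    return [mean, var, skew, kurt]

def feature_vector(spectrum, n_segments=2):
    """Features of the entire smoothed AS plus mean/variance of each segment."""
    smoothed = running_average(spectrum)
    feats = moment_features(smoothed)              # "m" features (entire AS)
    seg_len = len(smoothed) // n_segments          # assumption: equal segments
    for s in range(n_segments):
        segment = smoothed[s * seg_len:(s + 1) * seg_len]
        feats.extend(moment_features(segment)[:2])  # "m1" features (per segment)
    return feats

def knn_predict(train, query, k=3):
    """train is a list of (feature_vector, label); returns the majority label."""
    nearest = sorted((math.dist(f, query), label) for f, label in train)[:k]
    votes = {}
    for _, label in nearest:
        votes[label] = votes.get(label, 0) + 1
    return max(votes, key=votes.get)
```

In this sketch the known samples (AS2) would supply the `train` list, and an unknown AS would be passed through `feature_vector` before calling `knn_predict`.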

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, additionally comprising step of selecting said x features and/or said y features via algorithms selected from Chi-Squared (χ²) test, Wilcoxon test, and t-test or any combination thereof.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said step of acquiring the n dimensional volume boundaries for the specific bacteria, additionally comprising step of calculating the Gaussian distribution and/or Multivariate Gaussian distribution, and/or Rayleigh distribution, and/or Maxwell distribution, and/or Estimate the distribution by the Parzen method or mixed model (like the Gaussian Mixed Model known as GMM) for at least one of the n features such that the distributions define the n dimensional volume in the n dimensional space.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said step (c) of data processing said AS additionally comprising steps of: i. calculating at least one of the o-th derivative of said AS; said o is an integer greater than or equal to 1; ii. extracting m2 features from said entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; iii. dividing said o-th derivative into several segments according to said m2 features; iv. calculating the m3 features in at least one of said segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; and, v. detecting and/or identifying said specific bacteria if said m1 and/or m3 features and/or said m and/or said m2 features are within said n dimensional volume.
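As a rough illustration of the derivative step above, the following Python sketch computes an o-th derivative by repeated finite differences and then reads off simple peak features (position and height) from the result. The helper names and the choice of central differences are assumptions made for the example; the text does not prescribe a particular differentiation scheme.

```python
def derivative(spectrum, step=1.0, order=1):
    """o-th derivative by repeated central differences (one-sided at the ends)."""
    current = list(spectrum)
    for _ in range(order):
        n = len(current)
        nxt = [0.0] * n
        for i in range(n):
            if i == 0:
                nxt[i] = (current[1] - current[0]) / step
            elif i == n - 1:
                nxt[i] = (current[-1] - current[-2]) / step
            else:
                nxt[i] = (current[i + 1] - current[i - 1]) / (2 * step)
        current = nxt
    return current

def find_peaks(signal):
    """Indices of strict local maxima."""
    return [i for i in range(1, len(signal) - 1)
            if signal[i - 1] < signal[i] > signal[i + 1]]

def peak_features(signal):
    """(index, height) pairs, two of the peak features listed in the text."""
    return [(i, signal[i]) for i in find_peaks(signal)]
```

Calling `peak_features(derivative(spectrum, order=1))` would then give candidate m2 features for the whole derivative spectrum; applying the same call to a slice of the derivative would give candidate m3 features for one segment.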

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, additionally comprising the step of selecting said specific bacteria selected from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said step of obtaining the AS additionally comprising steps of: a. providing at least one optical cell accommodating said uncultured sample; b. providing p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers and monochromator, p is an integer equal to or greater than 1; said p light sources are adapted to emit light to said optical cell; c. providing detecting means for receiving the spectroscopic data of said sample; d. emitting light from said light sources at different wavelengths to said optical cell; and, e. collecting said light exiting from said optical cell by said detecting means; thereby obtaining said AS.
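The text does not state how the collected light is converted into an absorption spectrum. A minimal Python sketch, assuming the conventional definition of absorbance A = -log10(I/I0), is given below; the reference intensity I0 (for example, a measurement through an empty or blank cell) and the function name are assumptions introduced for the example.

```python
import math

def absorbance(sample_intensity, reference_intensity):
    """Absorbance per wavelength, A = -log10(I / I0).

    sample_intensity:    light collected after the optical cell (I)
    reference_intensity: assumed blank measurement at the same wavelengths (I0)
    """
    if len(sample_intensity) != len(reference_intensity):
        raise ValueError("intensity lists must have the same length")
    spectrum = []
    for i, i0 in zip(sample_intensity, reference_intensity):
        if i <= 0 or i0 <= 0:
            raise ValueError("intensities must be positive")
        spectrum.append(-math.log10(i / i0))
    return spectrum
```

Each list entry corresponds to one emitted wavelength, so the returned list plays the role of the AS in the later processing steps.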

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said step of emitting light is performed at the wavelength range of UV, visible, IR, mid-IR, far-IR and terahertz.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, additionally comprising the step of detecting said bacteria by analyzing said AS in the region of about 3000-3300 cm⁻¹ and/or about 850-1000 cm⁻¹ and/or about 1300-1350 cm⁻¹, and/or about 2836-2995 cm⁻¹, and/or about 1720-1780 cm⁻¹, and/or about 1550-1650 cm⁻¹, and/or about 1235-1363 cm⁻¹, and/or about 990-1190 cm⁻¹ and/or about 1500-1800 cm⁻¹ and/or about 2800-3050 cm⁻¹ and/or about 1180-1290 cm⁻¹.

It is another object of the present invention to provide a method for detecting and/or identifying specific bacteria within an uncultured sample; said method comprising: a. obtaining an absorption spectrum (AS) of said uncultured sample; said AS containing water influence; b. acquiring the n dimensional volume boundaries for said specific bacteria by: i. obtaining at least one absorption spectrum (AS2) of known samples containing said specific bacteria; ii. extracting x features from said AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one; iii. calculating at least one derivative of said AS2; iv. dividing said AS2 into several segments according to said x features; v.
calculating the y features of each of said segment; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer higher or equal to one; vi. assigning at least one of said x features and/ or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(S b )/trace(S w ); S w /(S b +S w ); Kullback-Lieber divergence; correct classification rate; and any combination thereof; vii. defining n dimensional space; n equals the sum of said x features and said y features; viii. defining the n dimensional volume in said n dimensional space; ix. determining said boundaries of said n dimensional volume by using technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-mean clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof; c. eliminating said water influence from said AS by at least one of the following methods: Low pass filter, High pass filter and Water absorption division; d. data processing said AS without said water influence by i. 
noise reducing by using different smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof; ii. extracting m features from said entire AS; said m features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m is an integer greater than or equal to one; iii. dividing said AS into several segments according to said m features; iv. calculating the m1 features of at least one of said segments; said m1 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m1 is an integer greater than or equal to one; and, e. detecting and/or identifying said specific bacteria if said m1 features and/or said m features are within said n dimensional volume; wherein said sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid.
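The water elimination step of the method above names "water absorption division" without defining it at this point. One plausible reading, shown in the Python sketch below purely as an assumption, is a point-by-point division of the sample AS by a reference absorption spectrum of water measured on the same wavenumber grid. The sketch also includes a hypothetical helper that keeps only a wavenumber window, such as the region of about 990-1190 cm⁻¹ mentioned earlier.

```python
def divide_by_water(sample_as, water_as, eps=1e-12):
    """Divide the sample spectrum by a water reference, point by point.

    Points where the water reference is (near) zero are set to 0.0 so the
    division stays defined; that fallback is an assumption of this sketch.
    """
    if len(sample_as) != len(water_as):
        raise ValueError("spectra must share the same wavenumber grid")
    return [s / w if abs(w) > eps else 0.0
            for s, w in zip(sample_as, water_as)]

def select_region(wavenumbers, values, low, high):
    """Keep the values whose wavenumber (cm^-1) lies within [low, high]."""
    return [v for w, v in zip(wavenumbers, values) if low <= w <= high]
```

The corrected spectrum returned by `divide_by_water` would then be passed to the noise reduction and feature extraction steps.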

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, additionally comprising a step of selecting said x features and/or said y features via algorithms selected from the Chi-Squared (χ2) test, the Wilcoxon test, the t-test or any combination thereof.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said step of acquiring the n dimensional volume boundaries for the specific bacteria additionally comprises a step of calculating the Gaussian distribution and/or Multivariate Gaussian distribution, and/or Rayleigh distribution, and/or Maxwell distribution, and/or estimating the distribution by the Parzen method or by a mixed model (such as the Gaussian Mixed Model, known as GMM) for at least one of the n features, such that the distributions define the n dimensional volume in the n dimensional space.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said step (c) of data processing said AS without said water influence additionally comprises the steps of: i. calculating at least one of the o-th derivative of said AS; said o is an integer greater than or equal to 1; ii. extracting m2 features from said entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; iii. dividing said o-th derivative into several segments according to said m2 features; iv. calculating the m3 features in at least one of said segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m3 is an integer greater than or equal to one; and, v. detecting and/or identifying said specific bacteria if said m1 and/or m3 features and/or said m and/or said m2 features are within said n dimensional volume.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, additionally comprising the step of selecting said specific bacteria from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said step of obtaining the AS additionally comprises the steps of: a. providing at least one optical cell accommodating said uncultured sample; b. providing p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers, monochromator; p is an integer equal to or greater than 1; said p light sources are adapted to emit light to said optical cell; c. providing detecting means for receiving the spectroscopic data of said sample; d. emitting light from said light sources at different wavelengths to said optical cell; e. collecting said light exiting from said optical cell by said detecting means; thereby obtaining said AS.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said step of emitting light is performed at the wavelength range of UV, visible, IR, mid-IR, far IR and terahertz.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein the absorption spectra is obtained using an instrument selected from the group consisting of a spectrometer, Fourier transform infrared spectrometer, a fluorometer and a Raman spectrometer.

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, wherein said aerosol sample is taken from the human body.

It is another object of the present invention to provide a system 1000 adapted to detect and/or identify specific bacteria within an uncultured sample; said system comprising: a. means 100 for obtaining an absorption spectrum (AS) of said uncultured sample; b. statistical processing means 200 for acquiring the n dimensional volume boundaries for said specific bacteria; said means 200 are characterized by: i. means 201 for obtaining at least one absorption spectrum (AS2) of known samples containing said specific bacteria; ii. means 202 for extracting x features from said entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer higher or equal to one; iii. means 203 for dividing said AS2 into several segments according to said x features; iv. means 204 for calculating y features from at least one of each of said segment; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer higher or equal to one; v. 
means 205 for assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(Sb)/trace(Sw); Sw/(Sb+Sw); Kullback-Leibler divergence; correct classification rate; and any combination thereof; vi. means 206 for defining n dimensional space; n equals the sum of said x features and said y features; vii. means 207 for defining the n dimensional volume in the n dimensional space; viii. means 208 for determining said boundaries of said n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof; ix. means 209 for assigning the n dimensional volume to said specific bacteria; c. means 300 for data processing said AS; said means 300 are characterized by i. means 301 for noise reducing by using different smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof; ii. 
means 302 for extracting m features from said entire AS; said m features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m is an integer greater than or equal to one; iii. means 303 for dividing said AS into several segments according to said m features; iv. means 304 for calculating the m1 features of at least one of said segments; said m1 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m1 is an integer greater than or equal to one; and, d. means 400 for detecting and/or identifying said specific bacteria if said m1 features and/or said m features are within said n dimensional volume; wherein said sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid.

It is another object of the present invention to provide the system as defined above, additionally comprising means for selecting said x features and/or said y features via algorithms selected from the Chi-Squared (χ2) test, the Wilcoxon test, the t-test or any combination thereof.

It is another object of the present invention to provide the system 1000 as defined above, wherein said statistical processing means 200 additionally comprise means 210 for calculating the Gaussian distribution, or Multivariate Gaussian distribution, or Rayleigh distribution, or Maxwell distribution, or for estimating the distribution by the Parzen method or by a mixed model (such as the Gaussian Mixed Model, known as GMM) for at least one of the n features, such that the distributions define the n dimensional volume in the n dimensional space.

It is another object of the present invention to provide the system 1000 as defined above, wherein said means 300 for data processing said AS are additionally characterized by: i. means 305 for calculating at least one of the o-th derivative of said AS; said o is an integer greater than or equal to 1; ii. means 306 for extracting m2 features from said entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; iii. means 307 for dividing said o-th derivative into several segments according to said m2 features; iv. means 308 for calculating the m3 features in at least one of said segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m3 is an integer greater than or equal to one; and, v. means 309 for detecting and/or identifying said specific bacteria if said m1 and/or m3 features and/or said m and/or said m2 features are within said n dimensional volume.

It is another object of the present invention to provide the system 1000 as defined above, wherein said specific bacteria is selected from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

It is another object of the present invention to provide the system 1000 as defined above, wherein said means 100 for obtaining an absorption spectrum (AS) of said sample additionally comprise: a. at least one optical cell for accommodating said uncultured sample; b. p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers, monochromator; p is an integer equal to or greater than 1; said p light sources are adapted to emit light at different wavelengths to said optical cell; and, c. detecting means for receiving the spectroscopic data of said sample exiting from said optical cell.

It is another object of the present invention to provide the system 1000 as defined above, wherein said p light sources are adapted to emit light at a wavelength range selected from a group consisting of UV, visible, IR, mid-IR, far-IR and terahertz.

It is another object of the present invention to provide a system 2000 adapted to detect and/or identify specific bacteria within an uncultured sample; said system 2000 comprising: a. means 100 for obtaining an absorption spectrum (AS) of said uncultured sample; said AS containing water influence; b. statistical processing means 200 for acquiring the n dimensional volume boundaries for said specific bacteria; said means 200 are characterized by: i. means 201 for obtaining at least one absorption spectrum (AS2) of known samples containing said specific bacteria; ii. means 202 for extracting x features from said entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one; iii. means 203 for dividing said AS2 into several segments according to said x features; iv. 
means 204 for calculating the y features of at least one of said segments; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer greater than or equal to one; v. means 205 for assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(Sb)/trace(Sw); Sw/(Sb+Sw); Kullback-Leibler divergence; correct classification rate; and any combination thereof; vi. means 206 for defining n dimensional space; n equals the sum of said x features and said y features; vii. means 207 for defining the n dimensional volume in said n dimensional space; viii. means 208 for determining said boundaries of said n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof; ix. means 209 for assigning said n dimensional volume to said specific bacteria; c. means 300 for eliminating said water influence from said AS selected from a group consisting of Low pass filter, High pass filter and Water absorption division; d. 
means 400 for data processing said AS without said water influence; said means 400 are characterized by: i. means 401 for noise reducing by using different smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof; ii. means 402 for extracting m features from said entire AS; said m features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m is an integer greater than or equal to one; iii. means 403 for dividing said AS into several segments according to said m features; iv. means 404 for calculating m1 features of at least one of said segments; said m1 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m1 is an integer greater than or equal to one; and, e. 
means 500 for detecting and/or identifying said specific bacteria if said m1 features and/or said m features are within said n dimensional volume; wherein said sample is an aerosol sample selected from a group consisting of cough, sneeze, saliva, mucus, bile, urine, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, serum, blood and spinal fluid.

It is another object of the present invention to provide the system as defined above, additionally comprising means for selecting said x features and/or said y features via algorithms selected from the Chi-Squared (χ2) test, the Wilcoxon test, the t-test or any combination thereof.

It is another object of the present invention to provide the system 2000 as defined above, wherein said statistical processing means 200 additionally comprise means 210 for calculating the Gaussian distribution, or Multivariate Gaussian distribution, or Rayleigh distribution, or Maxwell distribution, or for estimating the distribution by the Parzen method or by a mixed model (such as the Gaussian Mixed Model, known as GMM) for at least one of the n features, such that the distributions define the n dimensional volume in the n dimensional space.

It is another object of the present invention to provide the system 2000 as defined above, wherein said means 400 for data processing said AS without said water influence additionally comprise: i. means 405 for calculating at least one of the o-th derivative of said AS; said o is an integer greater than or equal to 1; ii. means 406 for extracting m2 features from said entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; iii. means 407 for dividing said o-th derivative into several segments according to said m2 features; iv. means 408 for calculating the m3 features from at least one of said segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m3 is an integer greater than or equal to one; and, v. means 409 for detecting and/or identifying said specific bacteria if said m1 and/or m3 features and/or said m and/or said m2 features are within said n dimensional volume.

It is another object of the present invention to provide the system 2000 as defined above, wherein said specific bacteria is selected from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli, or any combination thereof.

It is another object of the present invention to provide the system 2000 as defined above, wherein said means 100 for obtaining an absorption spectrum (AS) of said sample additionally comprise: a. at least one optical cell for accommodating said uncultured sample; b. p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers, monochromator; p is an integer equal to or greater than 1; said p light sources are adapted to emit light at different wavelengths to said optical cell; and, c. detecting means for receiving the spectroscopic data of said sample exiting from said optical cell.

It is another object of the present invention to provide the system 2000 as defined above, wherein said p light source are adapted to emit light at wavelength range selected from a group consisting of UV, visible, IR, mid-IR, far-IR and terahertz.

It is another object of the present invention to provide the system 2000 as defined above, wherein the absorption spectra is obtained using an instrument selected from the group consisting of a spectrometer, Fourier transform infrared spectrometer, a fluorometer and a Raman spectrometer.

It is another object of the present invention to provide the system 2000 as defined above, wherein said aerosol sample is taken from the human body.

It is another object of the present invention to provide the system 2000 as defined above, additionally comprising means adapted to recommend, after the specific bacteria has been identified, what kind of antibiotics and medicine to take.

It is another object of the present invention to provide the methods as defined above, additionally comprising step of recommending, after the specific bacteria has been identified, what kind of antibiotics and medicine to take.

It is another object of the present invention to provide the system 2000 as defined above, wherein said sample is an aerosol sample obtained from air moisture and/or contaminations in air-conditioning systems.

It is another object of the present invention to provide the methods as defined above, wherein said sample is an aerosol sample obtained from air moisture and/or contaminations in air-conditioning systems.

It is another object of the present invention to provide the system 2000 as defined above, wherein the sensitivity of said system is less than 6x10^6 bacteria/μL.

BRIEF DESCRIPTION OF THE FIGURES

For a better understanding of the invention and to see how it may be implemented in practice, a plurality of embodiments will now be described, by way of non-limiting example only, with reference to the accompanying drawings, in which:

Figs. 1-2 illustrate systems 1000 and 2000, respectively, for detecting and/or identifying bacteria within an aerosol sample according to preferred embodiments of the present invention.

Figs. 3-4 illustrate an absorption spectrum prior to the water influence elimination (figure 3) and after the water influence elimination (figure 4) whilst using the first method.

Figs. 5-7 illustrate the second method for eliminating the water influence.

Figs. 8-9 illustrate the third method for eliminating the water influence.

Figs. 10-11 illustrate the Streptococcus Type A (Streptococcus Pyogenes) aerosol spectrum and the Streptococcus Bovis aerosol spectrum, respectively.

Figure 12 illustrates the absorption signal of a sample containing 25% streptococcus pyogenes and 75% streptococcus Bovis prior to and after the noise was reduced (recorded signal vs. smoothed signal).

Figure 13 illustrates the signal's first derivative of a sample containing 25% streptococcus pyogenes and 75% streptococcus Bovis prior to and after the noise was reduced (recorded signal vs. smoothed signal).

Figure 14 illustrates the boundaries of a two-dimensional area which enables the identification of bacteria.

Figures 15a and 15b illustrate the bacterial spectral signal at the 1237 cm^-1 region for different bacteria concentrations (figure 15a) and the absorbance as a function of the bacteria concentration (figure 15b).

Figures 16a and 16b illustrate the bacterial spectral signal at the 1084 cm^-1 region for different bacteria concentrations (figure 16a) and the absorbance as a function of the bacteria concentration (figure 16b).

Figure 17 illustrates the spectrum of the coughed aerosols taken from a patient suspected to have Strep A.

Figure 18 illustrates the classification results and separation between patients that were Strep. A. positive and those who were Strep. A. negative.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

The following description is provided, alongside all chapters of the present invention, so as to enable any person skilled in the art to make use of said invention and sets forth the best modes contemplated by the inventor of carrying out this invention. Various modifications, however, will remain apparent to those skilled in the art, since the generic principles of the present invention have been defined specifically to provide means and methods for detecting bacteria within a sample by using Spectroscopic measurements.

Spectroscopic measurements, whether absorption, fluorescence, Raman or scattering, are the basis of all optical sensing devices. In order to identify a hazardous material (for example, bacteria), an aerosol sample that might contain the material is placed inside a spectrometer, and the absorption spectrum of the sample is then analyzed to verify whether the spectral signature of the hazardous material is recognized.

The present invention provides means and methods for detection or identification of bacteria by analyzing the absorption spectra of a sample which might contain bacteria.

The term "sample" refers herein to an aerosol sample. The present invention provides accurate detection means that enable the detection of bacteria in aerosol samples. The detection means can be used for medical or non-medical applications. Furthermore, the detection means can be used, for example, in detecting bacteria in water, beverages and food production, in sensing for hazardous materials in crowded places, etc. The aerosol sample will be obtained from coughing, sneezing, saliva, bile, mucus, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs or serum. For urine, blood, blood serum and spinal fluid, the aerosol will be produced using a spray after sample collection.

Furthermore, the aerosol samples will be obtained from air moisture (hazardous materials such as soot, metals) and contaminations in air-conditioning and ventilation systems.

The present invention also provides means and methods for detecting hazardous materials such as anthrax and chemical agents such as VX, sarin, et cetera, by sampling the air in suspected places.

The term "High-pass filter (HPF)" refers hereinafter to a filter that passes high frequencies well, but attenuates (reduces the amplitude of) frequencies lower than a cutoff frequency.

The term "Low-pass filter (LPF)" refers hereinafter to a filter that passes low-frequency signals but attenuates (reduces the amplitude of) signals with frequencies higher than a cutoff frequency.
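The two filter definitions above can be illustrated with a short sketch. The disclosure does not specify a filter design, so the centered moving average used here as the low-pass filter, the complementary high-pass (signal minus its low-pass part), the function names and the `window` parameter are all assumptions made for illustration only.

```python
def moving_average_low_pass(signal, window=5):
    """Assumed low-pass filter: mean of a centered window around each sample.

    Slow variations pass through; sample-to-sample fluctuations are attenuated.
    The window is truncated at the ends of the signal.
    """
    if window < 1 or window % 2 == 0:
        raise ValueError("window must be a positive odd integer")
    half = window // 2
    n = len(signal)
    smoothed = []
    for i in range(n):
        lo = max(0, i - half)
        hi = min(n, i + half + 1)
        smoothed.append(sum(signal[lo:hi]) / (hi - lo))
    return smoothed


def complementary_high_pass(signal, window=5):
    """Assumed high-pass filter: what remains after removing the low-pass part."""
    low = moving_average_low_pass(signal, window)
    return [s - l for s, l in zip(signal, low)]
```

A constant (zero-frequency) signal is returned unchanged by the low-pass sketch and is removed entirely by the high-pass sketch, which matches the behaviour the definitions describe.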

The term "Chi-Squared (χ2) test" refers hereinafter to any statistical hypothesis test in which the sampling distribution of the test statistic is a chi-square distribution when the null hypothesis is true, or any in which this is asymptotically true, meaning that the sampling distribution (if the null hypothesis is true) can be made to approximate a chi-square distribution as closely as desired by making the sample size large enough.

The term "Pearson's correlation coefficient" refers hereinafter to the correlation between two variables that reflects the degree to which the variables are related. Pearson's correlation reflects the degree of linear relationship between two variables. It ranges from +1 to -1. A correlation of -1 means that there is a perfect negative linear relationship between the variables. A correlation of 0 means there is no linear relationship between the two variables. A correlation of +1 means there is a perfect positive linear relationship between the two variables.

A commonly used formula for computing Pearson's correlation coefficient r is the following one:
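One standard form of the coefficient, for N paired values (x_i, y_i) with means x̄ and ȳ, is r = Σ(x_i − x̄)(y_i − ȳ) / sqrt(Σ(x_i − x̄)² · Σ(y_i − ȳ)²). The Python sketch below implements that standard form. The function name is hypothetical, and the choice of this particular algebraic form is an assumption, since several equivalent forms are in common use.

```python
import math


def pearson_r(x, y):
    """Pearson's correlation coefficient of two equally long sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)
```

In the context of this disclosure, `x` could hold a measured spectrum and `y` a reference spectrum sampled at the same wavelengths.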

The term "about" refers hereinafter to a range of 25% below or above the referred value.

The term "segments" refers hereinafter to wavelength ranges within the absorption spectrum.

The term "n dimensional volume" refers hereinafter to a volume in an n dimensional space that is especially adapted to identify the bacteria under consideration. The n dimensional volume is constructed by extracting features and correlations from the absorption spectrum or its derivatives.

The term "n dimensional space" refers hereinafter to a space where each coordinate is a feature extracted from the bacteria spectral signature or calculated from the spectrum and its derivatives or from a segment of the spectrum and/or its derivatives.

The term "n dimensional volume boundaries" refers hereinafter to a range that includes about 95% of the possible feature and correlation values of the bacteria under consideration.
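By way of non-limiting illustration, the boundaries can be pictured as one interval per feature that covers about 95% of the values observed in known samples. The Python sketch below assumes an axis-aligned box built from the 2.5th and 97.5th percentiles of each feature. The box shape, the percentile rule and all names are assumptions made for illustration only.

```python
def percentile(sorted_vals, q):
    """Linearly interpolated percentile q (0..100) of pre-sorted values."""
    pos = (len(sorted_vals) - 1) * q / 100.0
    lo = int(pos)
    hi = min(lo + 1, len(sorted_vals) - 1)
    return sorted_vals[lo] + (sorted_vals[hi] - sorted_vals[lo]) * (pos - lo)


def volume_boundaries(training_vectors, coverage=95.0):
    """Per-feature (low, high) bounds covering `coverage` percent of values."""
    tail = (100.0 - coverage) / 2.0
    bounds = []
    for column in zip(*training_vectors):  # one column per feature
        vals = sorted(column)
        bounds.append((percentile(vals, tail), percentile(vals, 100.0 - tail)))
    return bounds


def inside_volume(vector, bounds):
    """True if every feature of `vector` lies within its bounds."""
    return all(lo <= v <= hi for v, (lo, hi) in zip(vector, bounds))
```

Note that bounding each feature separately at 95% generally covers less than 95% of the joint distribution, so a practical implementation might widen the per-feature coverage or use one of the classifiers named in the disclosure instead.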

The term "trace(S_b)/trace(S_w)" refers hereinafter to the ratio between the traces of the interclass and intraclass covariance matrices. It is a measure of the separability of two classes and relates to the ability to achieve a high rate of correct classification in a designed classifier. In the following disclosure S_b is the covariance matrix reflecting the distance between two classes, and S_w is the covariance matrix reflecting the distance within a class.
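By way of non-limiting illustration, the ratio can be computed without forming the matrices explicitly, because the trace of each matrix equals a sum of squared distances. The Python sketch below assumes the usual scatter-matrix convention (unnormalized sums over samples); the function names and that convention are assumptions.

```python
def mean_vector(vectors):
    """Component-wise mean of a list of equally long feature vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def trace_ratio(classes):
    """trace(S_b)/trace(S_w) for `classes`, a list of lists of vectors."""
    grand_mean = mean_vector([v for cls in classes for v in cls])
    trace_sb = 0.0
    trace_sw = 0.0
    for cls in classes:
        mu = mean_vector(cls)
        # Between-class term: class size times distance of class mean to grand mean.
        trace_sb += len(cls) * sq_dist(mu, grand_mean)
        # Within-class term: spread of the samples around their own class mean.
        trace_sw += sum(sq_dist(v, mu) for v in cls)
    if trace_sw == 0:
        raise ValueError("within-class scatter is zero; ratio is undefined")
    return trace_sb / trace_sw
```

A larger ratio indicates that the class means lie far apart relative to the spread inside each class, which is the situation in which a feature set separates the classes well.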

The term "Correlation" refers hereinafter to any of the following: correlation between the aerosol bacteria spectrum and a reference bacteria spectrum which is already known; correlation between the bacteria spectrum without the water influence and a reference bacteria spectrum which is already known; correlation between the o-th derivative of the aerosol bacteria spectrum and a reference bacteria spectrum which is already known; and correlation between the o-th derivative of the bacteria spectrum without the water influence and a reference bacteria spectrum which is already known, where o is an integer greater than or equal to 1. The above correlations are calculated on the whole spectrum and/or on segments of the spectrum and/or on their derivatives.

The present invention provides methods and means for bacteria detection that utilize the unique spectroscopic signature of microbes, bacteria and hazardous materials and thus enable their detection within a sample.

Reference is now made to figure 1, illustrating a system 1000 adapted to detect and/or identify specific bacteria within a sample according to one preferred embodiment of the present invention. System 1000 comprises:

a. means 100 for obtaining an absorption spectrum (AS) of the sample;
b. statistical processing means 200 for acquiring the n dimensional volume boundaries of at least one specific bacteria, having:
   i. means 201 for obtaining at least one absorption spectrum (AS2) of known samples containing the specific bacteria;
   ii. means 202 for extracting x features from the entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one;
   iii. means 203 for dividing the AS2 into several segments according to at least one of the x features;
   iv. means 204 for extracting y features from at least one of said segments; said y features are selected from the same group as said x features; y is an integer greater than or equal to one;
   v. means 205 for assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(S_b)/trace(S_w); S_w/(S_b+S_w); Kullback-Leibler divergence; correct classification rate; and any combination thereof;
   vi. means 206 for defining the n dimensional space; n equals the sum of x and y;
   vii. means 207 for defining the n dimensional volume in the n dimensional space;
   viii. means 208 for determining the boundaries of the n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof;
   ix. means 209 for assigning the n dimensional volume to the specific bacteria;
c. means 300 for data processing the AS, having:
   i. means 301 for noise reduction by using smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof;
   ii. means 302 for extracting m features from the entire AS; said m features are selected from the same group as said x features; m is an integer greater than or equal to one;
   iii. means 303 for dividing the AS into several segments according to the m features;
   iv. means 304 for extracting m1 features from at least one of said segments; said m1 features are selected from the same group as said x features; m1 is an integer greater than or equal to one; and,
d. means 400 for detecting and/or identifying the specific bacteria if the m1 and/or m features are within the n dimensional volume.
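By way of non-limiting illustration, a few of the listed features (peak's wavelength and height, and the mean, variance, skewness and kurtosis of the signal) can be extracted from a sampled spectrum as sketched below in Python. The function names, the simple local-maximum peak rule and the population (divide-by-N) form of the moments are assumptions made for illustration.

```python
def moment_features(values):
    """Mean, variance, skewness and kurtosis of a signal (population form)."""
    n = len(values)
    mean = sum(values) / n
    centered = [v - mean for v in values]
    variance = sum(c ** 2 for c in centered) / n
    if variance == 0:
        # A constant signal has no defined shape moments; report zeros.
        return {"mean": mean, "variance": 0.0, "skewness": 0.0, "kurtosis": 0.0}
    skewness = sum(c ** 3 for c in centered) / n / variance ** 1.5
    kurtosis = sum(c ** 4 for c in centered) / n / variance ** 2
    return {"mean": mean, "variance": variance,
            "skewness": skewness, "kurtosis": kurtosis}


def peak_features(wavelengths, absorbance):
    """(wavelength, height) of every interior local maximum."""
    peaks = []
    for i in range(1, len(absorbance) - 1):
        if absorbance[i - 1] < absorbance[i] >= absorbance[i + 1]:
            peaks.append((wavelengths[i], absorbance[i]))
    return peaks
```

The values returned by these two functions could be concatenated into a single feature vector and then compared against the n dimensional volume of a specific bacteria.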

According to another embodiment of the present invention, the system as defined above additionally comprises means for selecting said x features and/or said y features via algorithms selected from the Chi-Squared (χ²) test, Wilcoxon test and t-test, or any combination thereof.
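By way of non-limiting illustration, a t-test based selection can be sketched as ranking candidate features by the magnitude of a two-sample t statistic computed between samples that contain the specific bacteria and samples that do not. The Python sketch below assumes Welch's unequal-variance form of the statistic and ranks features without computing p-values; the names and both of those choices are assumptions.

```python
import math
from statistics import mean, variance


def welch_t(a, b):
    """Welch's two-sample t statistic for two sequences of measurements."""
    standard_error = math.sqrt(variance(a) / len(a) + variance(b) / len(b))
    return (mean(a) - mean(b)) / standard_error


def rank_features(class_a, class_b):
    """Feature indices ordered from most to least discriminating by |t|."""
    scores = []
    columns = zip(zip(*class_a), zip(*class_b))  # pair up per-feature columns
    for idx, (col_a, col_b) in enumerate(columns):
        scores.append((abs(welch_t(col_a, col_b)), idx))
    return [idx for _, idx in sorted(scores, reverse=True)]
```

The highest-ranked indices would then be retained as the x and/or y features.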

According to another embodiment of the present invention, the statistical processing means 200 additionally comprises means 210 (not illustrated in the figures) for calculating the Gaussian distribution, Multivariate Gaussian distribution, Rayleigh distribution or Maxwell distribution, or for estimating the distribution by the Parzen method or by a mixed model (such as the Gaussian Mixed Model, known as GMM), for at least one of the n features, such that the distributions define the n dimensional volume in the n dimensional space.

According to another embodiment of the present invention, means 300 (in system 1000) for data processing the AS is additionally characterized by:

i. means 305 (not illustrated in the figures) for calculating at least one o-th derivative of the AS; o is an integer greater than or equal to 1;
ii. means 306 (not illustrated in the figures) for extracting m2 features from the entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one;
iii. means 307 (not illustrated in the figures) for dividing the o-th derivative into several segments according to the m2 features;
iv. means 308 (not illustrated in the figures) for extracting m3 features from at least one of said segments; said m3 features are selected from the same group as said m2 features; m3 is an integer greater than or equal to one; and,
v. means 309 (not illustrated in the figures) for detecting and/or identifying the specific bacteria if the m1 and/or m3 and/or the m and/or the m2 features are within the n dimensional volume.
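By way of non-limiting illustration, the o-th derivative of a sampled spectrum can be approximated by repeated finite differences, as sketched below in Python. The function name and the forward-difference scheme evaluated at interval midpoints are assumptions; a smoothing derivative filter could be used instead.

```python
def spectrum_derivative(wavelengths, values, order=1):
    """Approximate the derivative of the given order by finite differences.

    Returns (midpoint_wavelengths, derivative_values); each pass shortens
    the sequences by one sample.
    """
    xs, ys = list(wavelengths), list(values)
    for _ in range(order):
        if len(ys) < 2:
            raise ValueError("not enough samples for the requested order")
        ys = [(ys[i + 1] - ys[i]) / (xs[i + 1] - xs[i])
              for i in range(len(ys) - 1)]
        xs = [(xs[i] + xs[i + 1]) / 2 for i in range(len(xs) - 1)]
    return xs, ys
```

Features such as peak positions or moments can then be extracted from the returned derivative values in the same way as from the original spectrum.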

According to yet another embodiment of the present invention, the specific bacteria to be identified by system 1000 is selected from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

According to another embodiment of the present invention, the means 100 for obtaining an absorption spectrum (AS) of the sample (in system 1000) additionally comprises:

a. at least one optical cell for accommodating the sample;
b. p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers and monochromator; p is an integer equal to or greater than 1; the p light sources are adapted to emit light at different wavelengths to the optical cell; and,
c. detecting means for receiving the spectroscopic data of the sample exiting from the optical cell.

According to yet another embodiment of the present invention, the p light sources (in system 1000) are adapted to emit light at a wavelength range selected from a group consisting of UV, visible, IR, mid-IR, far-IR and terahertz.

Reference is now made to figure 2, illustrating a system 2000 adapted to detect and/or identify specific bacteria within a sample, according to another preferred embodiment of the present invention. System 2000 comprises:

a. means 100 for obtaining an absorption spectrum (AS) of the sample; the AS containing water influence;
b. statistical processing means 200 for acquiring the n dimensional volume boundaries for at least one specific bacteria, having:
   i. means 201 for obtaining at least one absorption spectrum (AS2) of known samples containing the specific bacteria;
   ii. means 202 for extracting x features from the entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one;
   iii. means 203 for dividing the AS2 into several segments according to at least one of the x features;
   iv. means 204 for extracting y features from at least one of said segments; said y features are selected from the same group as said x features; y is an integer greater than or equal to one;
   v. means 205 for assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(S_b)/trace(S_w); S_w/(S_b+S_w); Kullback-Leibler divergence; correct classification rate; and any combination thereof;
   vi. means 206 for defining the n dimensional space; n equals the sum of x and y;
   vii. means 207 for defining the n dimensional volume in said n dimensional space;
   viii. means 208 for determining the boundaries of the n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof;
   ix. means 209 for assigning the n dimensional volume to the specific bacteria;
c. means 300 for eliminating the water influence from the AS, selected from a group consisting of Low pass filter, High pass filter and Water absorption division;
d. means 400 for data processing the AS without the water influence, characterized by:
   i. means 401 for noise reduction by using smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof;
   ii. means 402 for extracting m features from the entire AS; said m features are selected from the same group as said x features; m is an integer greater than or equal to one;
   iii. means 403 for dividing the AS into several segments according to the m features;
   iv. means 404 for extracting m1 features from at least one of said segments; said m1 features are selected from the same group as said x features; m1 is an integer greater than or equal to one; and,
e. means 500 for detecting and/or identifying the specific bacteria if the m1 and/or m features are within the n dimensional volume.
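By way of non-limiting illustration, one reading of "Water absorption division" is a point-by-point division of the measured spectrum by a reference absorption spectrum of water sampled at the same wavelengths. The Python sketch below follows that reading; the interpretation, the function name and the `eps` guard against near-zero reference values are all assumptions.

```python
def divide_by_water(sample_as, water_as, eps=1e-12):
    """Divide a sample spectrum by a water reference spectrum, point by point.

    Both sequences are assumed to be sampled at identical wavelengths.
    """
    if len(sample_as) != len(water_as):
        raise ValueError("spectra must have the same number of samples")
    corrected = []
    for s, w in zip(sample_as, water_as):
        if abs(w) < eps:
            raise ValueError("water reference is too close to zero")
        corrected.append(s / w)
    return corrected
```

Where the sample behaves like pure water the result is close to 1, so deviations from 1 indicate contributions other than water.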

According to another embodiment of the present invention, the system as defined above additionally comprises means for selecting said x features and/or said y features via algorithms selected from the Chi-Squared (χ²) test, Wilcoxon test and t-test, or any combination thereof.

According to another embodiment of the present invention, the statistical processing means 200 (in system 2000) additionally comprises means 210 (not illustrated in the figures) for calculating the Gaussian distribution, Multivariate Gaussian distribution, Rayleigh distribution or Maxwell distribution, or for estimating the distribution by the Parzen method or by a mixed model (such as the Gaussian Mixed Model, known as GMM), for at least one of the n features, such that the distributions define the n dimensional volume in the n dimensional space.
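By way of non-limiting illustration, the simplest of the listed options fits an independent Gaussian distribution to each feature and takes the central interval of each fitted distribution as that feature's boundary. The Python sketch below uses `statistics.NormalDist` from the standard library (Python 3.8 or later); the function name, the independence of the features and the 95% coverage value are assumptions.

```python
from statistics import NormalDist


def gaussian_bounds(training_vectors, coverage=0.95):
    """Per-feature central interval of a Gaussian fitted to known samples."""
    tail = (1.0 - coverage) / 2.0
    bounds = []
    for column in zip(*training_vectors):  # one column per feature
        dist = NormalDist.from_samples(column)
        bounds.append((dist.inv_cdf(tail), dist.inv_cdf(1.0 - tail)))
    return bounds
```

For a feature with sample mean 10 and sample standard deviation 1, the interval is roughly 10 ± 1.96. A multivariate Gaussian or a mixed model would additionally capture correlations between features, which this sketch ignores.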

According to another embodiment of the present invention, means 400 (in system 2000) for data processing the AS without the water influence additionally comprises:

i. means 405 (not illustrated in the figures) for calculating at least one o-th derivative of the AS; o is an integer greater than or equal to 1;
ii. means 406 (not illustrated in the figures) for extracting m2 features from the entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one;
iii. means 407 (not illustrated in the figures) for dividing the o-th derivative into several segments according to the m2 features;
iv. means 408 (not illustrated in the figures) for extracting m3 features from at least one of said segments; said m3 features are selected from the same group as said m2 features; m3 is an integer greater than or equal to one; and,
v. means 409 (not illustrated in the figures) for detecting and/or identifying the specific bacteria if the m1 and/or m3 and/or the m and/or the m2 features are within the n dimensional volume.

According to another embodiment of the present invention, the specific bacteria (in system 2000) is selected from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

According to another embodiment of the present invention, means 100 for obtaining an absorption spectrum (AS) of the sample additionally comprises:

a. at least one optical cell for accommodating the sample;
b. p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers and monochromator; p is an integer equal to or greater than 1; the p light sources are adapted to emit light at different wavelengths to the optical cell; and,
c. detecting means for receiving the spectroscopic data of the sample exiting from the optical cell.

According to yet another embodiment of the present invention, the p light sources are adapted to emit light at a wavelength range selected from a group consisting of UV, visible, IR, mid-IR, far-IR and terahertz.

Yet another object of the present invention is to provide a method for detecting and/or identifying specific bacteria within a sample. The method comprises steps selected inter alia from:

a. obtaining an absorption spectrum (AS) of the sample;
b. acquiring the n dimensional volume boundaries for the specific bacteria by:
   i. obtaining at least one absorption spectrum (AS2) of samples containing the specific bacteria;
   ii. extracting x features from the entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one;
   iii. dividing the AS2 into several segments according to the x features;
   iv. extracting y features from each of the segments of AS2; said y features are selected from the same group as said x features; y is an integer greater than or equal to one;
   v. assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(S_b)/trace(S_w); S_w/(S_b+S_w); Kullback-Leibler divergence; correct classification rate; and any combination thereof;
   vi. defining the n dimensional space; n equals the sum of the x features and/or the y features;
   vii. defining the n dimensional volume in said n dimensional space;
   viii. determining the boundaries of the n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof;
c. data processing the AS by:
   i. noise reduction by using smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof;
   ii. extracting m features from the entire AS; said m features are selected from the same group as said x features; m is an integer greater than or equal to one;
   iii. dividing the AS into several segments according to the m features;
   iv. calculating the m1 features of at least one of the segments; said m1 features are selected from the same group as said x features; m1 is an integer greater than or equal to one; and,
d. detecting and/or identifying the specific bacteria if the m1 and/or the m features are within the n dimensional volume.
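By way of non-limiting illustration, the step of dividing a spectrum into segments according to previously extracted features can be sketched as cutting the spectrum at the lowest point between neighbouring peaks and then computing a per-segment feature such as the peak's area. In the Python sketch below, the cutting rule, the trapezoidal area estimate and all names are assumptions made for illustration.

```python
def split_at_valleys(values, peak_indices):
    """Return (start, end) index pairs, one segment per peak.

    Cuts are placed at the minimum value between each pair of adjacent peaks.
    """
    cuts = [0]
    for left, right in zip(peak_indices, peak_indices[1:]):
        cuts.append(min(range(left, right + 1), key=lambda i: values[i]))
    cuts.append(len(values) - 1)
    return list(zip(cuts, cuts[1:]))


def trapezoid_area(xs, ys, start, end):
    """Area under ys between sample indices start and end (trapezoid rule)."""
    return sum((xs[i + 1] - xs[i]) * (ys[i] + ys[i + 1]) / 2
               for i in range(start, end))
```

Each (start, end) pair delimits a wavelength range from which further features, for example the mean or the variance of the signal, can be calculated in the same manner.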

It is another object of the present invention to provide the method for detecting and/or identifying specific bacteria within an uncultured sample as defined above, additionally comprising a step of selecting said x features and/or said y features via algorithms selected from the Chi-Squared (χ²) test, Wilcoxon test and t-test, or any combination thereof.

It should be pointed out that in each of the systems or methods as described above (either 1000 or 2000), the statistical processing means 200 is used only once for each specific bacteria. Once the boundaries have been provided by the statistical processing means 200, the determination whether the specific bacteria is present in a sample is performed by verifying whether the m and/or m2 features are within the boundaries. Furthermore, once the boundaries have been provided, there is no need to repeat the statistical processing for the same specific bacteria.

It should be further pointed out that according to one embodiment of the present invention, either one of the systems (1000 and/or 2000) as defined above can additionally comprise means adapted to recommend to a physician, after the specific bacteria has been identified, which antibiotics and medicines to use.

Yet another object of the present invention is to provide a method for detecting and/or identifying specific bacteria within a sample. The method comprises steps selected inter alia from:

a. obtaining an absorption spectrum (AS) of the sample; the AS containing water influence;
b. acquiring the n dimensional volume boundaries for the specific bacteria by:
   i. obtaining at least one absorption spectrum (AS2) of known samples containing the specific bacteria;
   ii. extracting x features from the entire AS2; said x features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer greater than or equal to one;
   iii. dividing the AS2 into several segments according to the x features;
   iv. extracting y features from each of the segments of AS2; said y features are selected from the same group as said x features; y is an integer greater than or equal to one;
   v. assigning at least one of said x features and/or at least one of said y features to said specific bacteria by algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(S_b)/trace(S_w); S_w/(S_b+S_w); Kullback-Leibler divergence; correct classification rate; and any combination thereof;
   vi. defining the n dimensional space; n equals the sum of the x features and/or the y features;
   vii. determining the boundaries of the n dimensional volume by using a technique selected from a group consisting of Bayes classifier, Support Vector Machine (SVM), Linear discriminant functions and Fisher's linear discriminant, Gaussian Mixed Model (GMM), C4.5 algorithm tree, K-nearest neighbor, Weighted K-nearest neighbor, Hierarchical clustering algorithm, K-means clustering algorithm, Ward's clustering algorithm, Minimum least square, Neural-Network or any combination thereof;
c. eliminating the water influence from the AS by at least one of the following methods: Low pass filter, High pass filter and Water absorption division;
d. data processing the AS without the water influence by:
   i. noise reduction by using smoothing techniques selected from a group consisting of running average, Savitzky-Golay, low pass filter or any combination thereof;
   ii. extracting m features from the entire AS; said m features are selected from the same group as said x features; m is an integer greater than or equal to one;
   iii. dividing the AS into several segments according to the m features;
   iv. calculating the m1 features of each of the segments; said m1 features are selected from the same group as said x features; m1 is an integer greater than or equal to one; and,
e. detecting and/or identifying the specific bacteria if the m1 and/or the m features are within the n dimensional volume.

In each of the methods as described above, the statistical processing is used only once for each specific bacteria. Once the boundaries have been provided by the statistical processing, the determination whether the specific bacteria is present in a sample is performed by verifying whether the mi and/or said m features are within the boundaries. Furthermore, once the boundaries have been provided, there is no need to repeat the statistical processing for the same specific bacteria.

Furthermore, the methods can comprise an additional step of selecting said x features and/or said y features via algorithms selected from the Chi-Squared (χ2) test, Wilcoxon test, and t-test, or any combination thereof.

According to another embodiment of the present invention, the step of acquiring the n dimensional volume boundaries for the specific bacteria in each of the methods as defined above additionally comprises a step of calculating the Gaussian distribution, and/or Multivariate Gaussian distribution, and/or Rayleigh distribution, and/or Maxwell distribution, and/or estimating the distribution by the Parzen method or by a mixed model (like the Gaussian Mixed Model known as GMM), for at least one of the n features, such that the distributions define the n dimensional volume in the n dimensional space.

According to another embodiment of the present invention, step (c) of data processing the AS, in the methods as described above, additionally comprises the steps of: i. calculating at least one of the o-th derivative of the AS; o is an integer greater than or equal to 1; ii. extracting m2 features from the entire o-th derivative spectrum; said m2 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m2 is an integer greater than or equal to one; iii. dividing the o-th derivative into several segments according to the m2 features; iv. calculating the m3 features in at least one of the segments; said m3 features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; m3 is an integer greater than or equal to one; and, v. detecting and/or identifying the specific bacteria if the mi and/or m3 features and/or the m and/or the m2 features are within the n dimensional volume.

According to another embodiment of the present invention, the methods as described above additionally comprise the step of selecting the specific bacteria from a group consisting of Streptococcus Pyogenes, Group B, C and G beta-hemolytic streptococci, Corynebacterium haemolyticum pseudodiphtheriticum, Diphtheria and Ulcerans, Neisseria Gonorrhoeae, Mycoplasma Pneumoniae, Yersinia Enterocolitica, Mycobacterium tuberculosis, Chlamydia Trachomatis and Pneumoniae, Bordetella Pertussis, Legionella spp, Pneumocystis Carinii, Nocardia, Histoplasma Capsulatum, Coccidioides Immitis, Haemophilus influenza group A beta hemolytic, Streptococcus Viridans, Streptococcus Pneumonia, Staph epidermidis, Corynebacterium, Moraxella catarrhalis, Klebsiella, Escherichia Coli, Staphylococcus Aureus, Streptococcus Bovis, Streptococcus Agalactiae, Streptococcus pneumonia, Staphylococcus epidermidis, Klebsiella pneumonia, E. coli or any combination thereof.

According to another embodiment of the present invention, the step of obtaining the AS, in the methods as described above, additionally comprises the following steps: a. providing at least one optical cell that accommodates the sample; b. providing p light sources selected from a group consisting of laser, lamp, LEDs, tunable lasers, monochromator; p is an integer equal to or greater than 1; the p light sources are adapted to emit light to the optical cell; c. providing detecting means for receiving the spectroscopic data of the sample; d. emitting light from the light source at different wavelengths to the optical cell; and, e. collecting the light exiting from the optical cell by the detecting means; thereby obtaining the AS.

According to another embodiment of the present invention, the step of emitting light is performed at the wavelength range of UV, visible, IR, mid-IR, far-IR and terahertz.

According to another embodiment of the present invention, the methods as defined above additionally comprise the step of detecting the bacteria by analyzing the AS in the region of about 3000-3300 cm⁻¹ and/or about 850-1000 cm⁻¹ and/or about 1300-1350 cm⁻¹, and/or about 2836-2995 cm⁻¹, and/or about 1720-1780 cm⁻¹, and/or about 1550-1650 cm⁻¹, and/or about 1235-1363 cm⁻¹, and/or about 990-1190 cm⁻¹ and/or about 1500-1800 cm⁻¹ and/or about 2800-3050 cm⁻¹ and/or about 1180-1290 cm⁻¹.

According to yet another embodiment of the present invention, the absorption spectra, in any of the systems (1000 or 2000) or for any of the methods as described above, is obtained using an instrument selected from the group consisting of a spectrometer, Fourier transform infrared spectrometer, a fluorometer and a Raman spectrometer.

According to yet another embodiment of the present invention, the uncultured sample, in any of the systems (1000 or 2000) or for any of the methods as described above, is selected from fluid originated from the human body such as blood, saliva, urine, bile, vaginal secretions, middle ear aspirate, pus, pleural effusions, synovial fluid, abscesses, cavity swabs, mucous, and serum.

It should be further pointed out that according to one embodiment of the present invention, either one of the methods as described above can additionally comprise step of recommending, after the specific bacteria has been identified, what kind of antibiotics and medicine to take.

In the foregoing description, embodiments of the invention, including preferred embodiments, have been presented for the purpose of illustration and description. They are not intended to be exhaustive or to limit the invention to the precise form disclosed. Obvious modifications or variations are possible in light of the above teachings. The embodiments were chosen and described to provide the best illustration of the principles of the invention and its practical application, and to enable one of ordinary skill in the art to utilize the invention in various embodiments and with various modifications as are suited to the particular use contemplated. All such modifications and variations are within the scope of the invention as determined by the appended claims when interpreted in accordance with the breadth to which they are fairly, legally, and equitably entitled.

EXAMPLES

Examples are given in order to prove the embodiments claimed in the present invention. The examples describe the manner and process of the present invention and set forth the best mode contemplated by the inventors for carrying out the invention, but are not to be construed as limiting the invention.

EXAMPLE 1 - Water influence

One of the major problems in identifying bacteria from a fluid sample's spectrum (and especially an aerosol spectrum) is the water influence (i.e., the water noise which masks the desired spectrum by the water spectrum).

The water molecule may vibrate in a number of ways. In the gas state, the vibrations involve combinations of symmetric stretch (ν1), asymmetric stretch (ν3) and bending (ν2) of the covalent bonds. The water molecule has a very small moment of inertia on rotation which gives rise to rich combined vibrational-rotational spectra in the vapor containing tens of thousands to millions of absorption lines. The water molecule has three vibrational modes x, y and z. The following table (table 1) illustrates the water vibrations, wavelength and the assignment of each vibration:

Table 1 : water vibrations, wavelength and the assignment of each vibration

The present invention provides a method for significantly reducing and even eliminating the water influence within the absorption spectra.

Reference is now made to figures 3 and 4 which illustrate an absorption spectrum of a sample with and without the water influence.

The present invention provides three main methods for eliminating the water influence.

The first method

The first method for eliminating the water influence uses Water absorption division and contains the following steps:

First the absorption spectrum was divided into several segments (i.e., wavenumber ranges). The spectrum was divided into the following segments: about 1800 cm⁻¹ to about 2650 cm⁻¹, about 1400 cm⁻¹ to about 1850 cm⁻¹, about 1100 cm⁻¹ to about 1450 cm⁻¹, about 950 cm⁻¹ to about 1100 cm⁻¹, and about 550 cm⁻¹ to about 970 cm⁻¹.

The segments were determined according to (i) different intensity peaks within the water's absorption spectrum; and, (ii) the signal's trends.

Next, the water influence was eliminated from each segment according to the following protocol:

(a) providing the absorption intensity at each wavenumber (x) within the absorption spectrum (referred to hereinafter as Sig_with_water(x));

(b) calculating the correction factors (CF) at each wavenumber (referred to hereinafter as x) within each segment (referred to hereinafter as CF(x));

(c) acquiring from the absorption spectrum at least one absorption intensity that is mainly influenced by water (referred to hereinafter as Sig_water_only(x1)) at the corresponding wavenumbers (x1);

(d) calculating at least one correction factor of the water (CF_water_only(x1)) at said at least one wavenumber (x1);

(e) dividing at least one Sig_water_only(x1) by at least one CF_water_only(x1) (i.e., Sig_water_only(x1) / CF_water_only(x1)) at said at least one wavenumber (x1);

(f) calculating the average of the results of step (e) (referred to hereinafter as AVG[Sig_water_only(x1) / CF_water_only(x1)]);

(g) multiplying AVG[Sig_water_only(x1) / CF_water_only(x1)] by CF(x) for each wavenumber (x); and,

(h) subtracting each result of step (g) from Sig_with_water(x) for each (x).

In other words, the water influence is eliminated from each absorption intensity within the spectrum according to the following equation:

Sig_without_water(x) = Sig_with_water(x) - AVG[Sig_water_only(x1) / CF_water_only(x1)] × CF(x)

Calculating the correction factors

The correction factors (CF) depend on the wavenumber range, the water absorption peak's shape at each wavenumber, peak's width, peak's height, absorption spectrum trends and any combination thereof. The following series were used as correction factors (x denotes the wavenumber in cm⁻¹):

1. Wavenumber range 1846 cm⁻¹ to 2613 cm⁻¹

Coefficients: a11 = 137.2; b11 = 2170; c11 = 224.3; a21 = 19.02; b21 = 2063; c21 = 37.53; a31 = 0.7427; b31 = 2224; c31 = 13; a41 = 98.33; b41 = 2124; c41 = 109.8; a51 = -4.988; b51 = 2192; c51 = 33.87; a61 = 20.19; b61 = 1998; c61 = 40.22; a71 = 228.3; b71 = 1496; c71 = 1329; a81 = 6.751e+012; b81 = -1226; c81 = 592.1;

2. Wavenumber range 1461 cm⁻¹ to 1846 cm⁻¹

Coefficients: a12 = -300.2; b12 = 1650; c12 = 13.65; a22 = -51.65; b22 = 1665; c22 = 6.48; a32 = 142.4; b32 = 1623; c32 = 7.584; a42 = 1450; b42 = 1649; c42 = 32.62; a52 = 96.34; b52 = 1617; c52 = 2.387; a62 = 608; b62 = 1470; c62 = 369.3; a72 = 0; b72 = 1873; c72 = 2.625; a82 = 1037; b82 = 1644; c82 = 76.21;

3. Wavenumber range 1111 cm⁻¹ to 1461 cm⁻¹

Coefficients: a13 = 1368; b13 = 2167; c13 = 767; a23 = 80.67; b23 = 1356; c23 = 68.83; a33 = 36.85; b33 = 1307; c33 = 33.79; a43 = 142.5; b43 = 1244; c43 = 67.19; a53 = 260.4; b53 = 1130; c53 = 88.91; a63 = 66.54; b63 = 1093; c63 = 31; a73 = 7.126; b73 = 1345; c73 = 20.9; a83 = 4.897; b83 = 1280; c83 = 11.05;

4. Wavenumber range 961 cm⁻¹ to 1111 cm⁻¹

Coefficients: a14 = 692.6; b14 = 952; c14 = 31.04; a24 = 48.46; b24 = 983.2; c24 = 15.72; a34 = 287.5; b34 = 994.6; c34 = 27.98; a44 = 434.9; b44 = 1032; c44 = 40.86; a54 = 17.05; b54 = 1052; c54 = 13.55; a64 = 48.61; b64 = 1068; c64 = 16.56; a74 = 70.71; b74 = 1086; c74 = 21.23; a84 = 497.3; b84 = 1124; c84 = 64.42;

5. Wavenumber range 570 cm⁻¹ to 961 cm⁻¹

Coefficients: a15 = -2877; b15 = 36.23; c15 = 29.09; a25 = 0; b25 = -124.3; c25 = 22.09; a35 = -190.7; b35 = 18.97; c35 = 16.45; a45 = 1.589e+004; b45 = -3.427; c45 = 56.25; a55 = -1.352e+004; b55 = -5.861; c55 = 40.75; a65 = 476.7; b65 = 82.38; c65 = 17.29; a75 = 1286; b75 = 62.29; c75 = 180.3; a85 = 802.9; b85 = 102.8; c85 = 18.79;

Absorption intensity mainly influenced by water

Reference is made again to figure 3, which illustrates the absorption spectrum prior to eliminating the water influence.

As can be seen from the figure, the absorption intensity that is mainly influenced by the water is in the wavenumber region of 2000 cm⁻¹ and above. The intensity in that region is about 0.2 absorption units. In the present example, x1 is 2000 and Sig_water_only

Reference is made again to figure 4, which illustrates the absorption spectrum of a sample after the influence of the water was eliminated.

It should be pointed out that for the purpose of obtaining a better resolution both graphs (3 and 4) are normalized to 2 (i.e., multiplied by 2).
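By way of a non-limiting illustration, steps (a) to (h) of the first method can be sketched in Python as follows. The function name `remove_water` and its arguments are hypothetical. The sketch assumes that the water correction factor at x1 is simply CF(x) evaluated at x1, and it takes the CF values as given inputs, because the functional form of the series built from the coefficients above is not stated in this text.

```python
from statistics import mean

def remove_water(sig_with_water, cf, water_idx):
    """Water absorption division for one segment (hypothetical helper).

    sig_with_water: measured absorbance at each wavenumber x of the segment
    cf:             correction factor CF(x) at the same wavenumbers
    water_idx:      indices of the wavenumbers x1 dominated by water
    """
    # Steps (c)-(f): average ratio of the water-only signal to its correction factor.
    # Assumption: CF_water_only(x1) is CF(x) evaluated at x1.
    scale = mean(sig_with_water[i] / cf[i] for i in water_idx)
    # Steps (g)-(h): scale CF(x) by that average and subtract it from the signal.
    return [s - scale * c for s, c in zip(sig_with_water, cf)]

# Hypothetical 4-point segment: water contributes 0.1 * CF(x) everywhere and
# the bacteria add 0.05 absorbance units at the second point only.
cf = [1.0, 2.0, 4.0, 2.0]
measured = [0.1, 0.25, 0.4, 0.2]
corrected = remove_water(measured, cf, water_idx=[0, 3])
```

In this toy segment the corrected signal is close to zero everywhere except at the second point, where the assumed bacterial contribution of 0.05 remains.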

The second method

The second method uses a low pass filter, LPF. The method comprises the following steps:

1. Selecting the entire spectrum or at least one sub-region of the fully-hydrated bacteria spectrum.

2. Computing a water-baseline spectrum estimate by filtering the selected fully-hydrated bacteria spectrum by a Low-Pass-Filter (LPF).

3. Subtracting the water-baseline spectrum estimate from the selected fully-hydrated bacteria spectrum to obtain the non-smoothed sole bacteria spectrum.

4. Obtaining a smoothed version of the sole bacteria spectrum by applying any smoothing operator, such as, but not limited to, Savitzky-Golay, to the non-smoothed sole bacteria spectrum.

All the steps described above (in the second method) are illustrated in figures 5-7. Figure 5 illustrates steps 1-4. Figure 6 illustrates the subtracted non-smoothed signal and the subtracted smoothed signal. Figure 7 illustrates the Finite-Impulse-Response (FIR) used to generate the LPF coefficients.
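By way of a non-limiting illustration, steps 1-4 of the second method can be sketched in Python as follows. The filter design (a Hamming-windowed sinc FIR), the number of taps, the cutoff and the edge handling are assumptions made for the sketch only, since the actual FIR coefficients of figure 7 are not reproduced in this text. The final smoothing uses a running average, one of the smoothing techniques named above, in place of Savitzky-Golay.

```python
import math

def fir_lowpass(num_taps, cutoff):
    """Hamming-windowed sinc low-pass taps (num_taps odd, cutoff in cycles/sample)."""
    mid = (num_taps - 1) / 2
    taps = []
    for n in range(num_taps):
        t = n - mid
        sinc = 2 * cutoff if t == 0 else math.sin(2 * math.pi * cutoff * t) / (math.pi * t)
        window = 0.54 - 0.46 * math.cos(2 * math.pi * n / (num_taps - 1))
        taps.append(sinc * window)
    total = sum(taps)
    return [t / total for t in taps]  # unit gain for a constant input

def filter_same(signal, taps):
    """Apply a symmetric odd-length FIR; edges are padded by repeating end samples."""
    half = len(taps) // 2
    padded = [signal[0]] * half + list(signal) + [signal[-1]] * half
    return [sum(t * padded[i + k] for k, t in enumerate(taps))
            for i in range(len(signal))]

def remove_water_lpf(spectrum, num_taps=31, cutoff=0.02, smooth_taps=5):
    # Step 2: the low-pass output is taken as the water-baseline estimate.
    baseline = filter_same(spectrum, fir_lowpass(num_taps, cutoff))
    # Step 3: subtract the baseline to obtain the non-smoothed bacteria spectrum.
    raw = [s - b for s, b in zip(spectrum, baseline)]
    # Step 4: smooth the result (running average used here as the smoothing operator).
    smoothed = filter_same(raw, [1.0 / smooth_taps] * smooth_taps)
    return baseline, raw, smoothed
```

The cutoff controls which slowly varying components are attributed to water; a value that is too high would also remove broad bacterial bands, so in practice it would have to be tuned against reference spectra.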

The third method

The third method uses a high pass filter, HPF. The method comprises the following steps:

1. Selecting the entire spectrum or a sub-region of the fully-hydrated bacteria spectrum.

2. Computing the sole bacteria spectrum by filtering the selected fully-hydrated bacteria spectrum by a High-Pass-Filter (HPF).

3. Subtracting the sole bacteria spectrum from the entire spectrum to obtain the non-smoothed sole bacteria spectrum.

4. Obtaining a smoothed version of the sole bacteria spectrum by applying any smoothing operator, such as, but not limited to, Savitzky-Golay, to the non-smoothed sole bacteria spectrum.

All the steps described above (in the third method) are illustrated in figures 8-9. Figure 8 illustrates steps 1-4. Figure 9 illustrates Finite-Impulse-Response (FIR) used to generate the HPF coefficients.
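By way of a non-limiting illustration, the high-pass filtering of step 2 of the third method can be sketched in Python as follows. The high-pass taps are derived here by spectral inversion of a hypothetical 9-point moving-average low-pass kernel; this is an assumption for the sketch, as the FIR coefficients of figure 9 are not reproduced in this text.

```python
def highpass_taps(lowpass_taps):
    """Spectral inversion: unit impulse minus a unit-gain, odd-length low-pass kernel."""
    taps = [-t for t in lowpass_taps]
    taps[len(taps) // 2] += 1.0
    return taps

def apply_fir(signal, taps):
    """Apply a symmetric odd-length FIR; edges are padded by repeating end samples."""
    half = len(taps) // 2
    padded = [signal[0]] * half + list(signal) + [signal[-1]] * half
    return [sum(t * padded[i + k] for k, t in enumerate(taps))
            for i in range(len(signal))]

# Hypothetical low-pass kernel; the resulting high-pass taps sum to zero,
# so a flat (water-like) baseline is rejected while narrow peaks pass through.
lowpass = [1.0 / 9] * 9
hpf = highpass_taps(lowpass)

# Toy spectrum: flat baseline of 1.0 with one narrow peak of height 0.9.
spectrum = [1.0] * 20
spectrum[10] += 0.9
bacteria_estimate = apply_fir(spectrum, hpf)
```

Because the high-pass kernel equals the unit impulse minus the low-pass kernel, its output is the same as subtracting a low-pass baseline from the spectrum, which is how the second and third methods relate to each other in this sketch.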

EXAMPLE 2 - Bacteria's absorption spectrum

Each type of bacteria has a unique spectral signature. Although many types of bacteria have similar spectral signatures, there are still some spectral differences that are due to different proteins on the cell membrane and differences in the DNA/RNA structure. The following protocol was used:

1. Strep. β hemolytic (ATCC 19615) were purchased from HY labs.

2. Collecting the content of one full plate that was grass seeded with Strep. pyo. by adding 800 μL of ddH2O to the plate and transferring the content into one Eppendorf tube (500 μL).

3. Centrifuging the tube for 5 min at 14,000 rpm.

4. Discarding the supernatant.

5. Adding 30 μL of ddW solution.

6. Mixing the content;

7. Reference reading of the empty optical cell

8. Putting 500 μL of the tube in a 3mL spray bottle

9. Spraying one practice squeeze into an eppendorf tube and discarding the tube

10. Spraying 2 squeezes: one on one side, and the other in the other side of the optical cell.

11. Placing the optical cell to the optical system and reading the spectral signature in the optical system.

The same protocol was used for the other bacteria as well.

The following figures show the absorption spectrum of bacteria in aerosols.

Reference is now made to figures 10-11 illustrating the Streptococcus Type A (Streptococcus Pyogenes) aerosol spectrum and the Streptococcus Bovis aerosol spectrum, respectively.

EXAMPLE 3 - Distinguishing between two bacteria in an aerosol sample

The following in-vitro example illustrates a method to distinguish between two bacteria within an aerosol mixture of Streptococcus pyogenes and Streptococcus Bovis, and to identify and/or determine whether Streptococcus pyogenes is present within the aerosol sample.

The following protocol was used:

1. Strep. β hemolytic (ATCC 19615) and Streptococcus bovis (ATCC 9809) were purchased from HY labs.

2. The content of two full plates of Strep pyo. is added with 800 μL of ddH2O to each plate and the content is placed into eppendorf tube. The procedure is repeated twice (collecting the content of 6 full plates to 3 eppendorf tubes).

3. Step 2 is repeated for S. bovis, collecting the content of 8 full plates into 4 Eppendorf tubes.

4. Centrifuging the 4 tubes for 3 min at 9,000 rpm.

5. Discarding the supernatant.

6. Weighing the four Eppendorf tubes.

7. Transferring the bacteria pellets with 1 ml ddH2O to each tube (one for S. pyogenes and one for S. bovis).

8. Centrifuging the tubes for 3 min at 9,000 rpm.

9. Discarding the supernatant into two Eppendorf tubes.

10. Weighing the 2 Eppendorf tubes with bacteria pellet.

11. Calculating weight of bacteria pellet as can be seen for example in the following table.

Table 2: bacteria pellet's weight.

12. Adding 917μl of ddH2O to S.pyogenes tube and the same amount to S.bovis tube. The S. pyogenes and S.bovis concentration: 1x10 /μl.

12. Mixing the content.

13. Preparing mixtures of S. pyogenes and S. bovis in 3 ml spray bottle according for example to the following table.

Table 3: different mixtures of S. pyogenes and S. bovis.

13. Reference reading of the optical cell.

14. Spraying two practice squeezes into an Eppendorf tube and discarding the tube.

15. Spraying two squeezes into the optical cell from each side of the cell in a biological hood.

16. placing the optical cell to the optical system.

17. Reading the spectral signature in the optical system.

The identification and/or detection of specific bacteria was as follows:

(a) the water influence was eliminated using a method selected from, but not limited to, low pass filter, high pass filter, and water absorption division, to obtain the dry bacteria spectrum estimate;

(b) the noise in each of the absorption spectra (without the water influence) was reduced by using Savitzky-Golay smoothing;

(c) m features such as, but not limited to, Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof were extracted from the spectra. A total of m features were extracted; m is an integer higher than or equal to 1;

(d) the signal was divided into several regions (segments, i.e., several wavenumber regions) according to said m features;

(e) mi features were extracted from at least one of the spectrum's regions, said mi features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; mi is an integer greater than or equal to one.

(f) the m features and the mi features were examined and checked whether they are within the n dimensional volume boundaries (which were acquired by the statistical processing);

(g) the identification of the specific bacteria was determined as positive if the m features and/or the mi features are within the n dimensional volume boundaries.

Statistical processing

The statistical processing is especially adapted to provide the n dimensional volume boundaries. For each specific bacterium the statistical processing was performed only once, for obtaining the boundaries. Once the boundaries were provided, the determination whether the specific bacteria is present in a sample was as explained above (i.e., verifying whether the feature vector is within the boundaries).

The statistical processing for each specific bacterium is performed in the following manner:

(a) obtaining several absorption spectra (AS2) of known samples containing the specific bacteria;

(b) extracting x features from the signal; said x features are selected from a group consisting of, but not limited to, Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; x is an integer higher than or equal to one;

(c) dividing the signal into several regions (segments) according to said x features;

(d) Calculating y features for at least one of the segments within the absorption spectrum; said y features are selected from a group consisting of Correlation, peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof; y is an integer higher or equal to one;

(e) assigning at least one of said x features and/or at least one of said y features to said specific bacteria using algorithms selected from a group consisting of Sequential Backward Selection, Sequential Forward Selection, Sequential Forward Floating Selection (SFFS), Max-Min algorithm, trace(Sb)/trace(Sw); Sw/(Sb+Sw); Kullback-Leibler divergence; correct classification rate; and any combination thereof;

(f) Defining n dimensional space, n equals the sum of the x features and y features;

(g) Assigning and/or interlinking each one of the x and y features, to the specific bacteria which its identification is required;

(h) Optionally calculating the statistical distribution for each of the x and y features (thus, defining the n dimensional volume), and,

(i) Determining the boundaries of each volume by using a classifier or a combination of classifiers (for example k nearest neighbor, Bayesian classification et cetera).
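By way of a non-limiting illustration, one of the feature selection criteria listed in step (e), trace(Sb)/trace(Sw), can be sketched in Python as follows. The function name `scatter_ratio` and the sample values are hypothetical, and the sketch assumes the common definition in which Sb is the between-class scatter matrix (class means around the overall mean, weighted by class size) and Sw is the within-class scatter matrix (samples around their own class mean).

```python
def scatter_ratio(classes):
    """trace(Sb)/trace(Sw) for a list of classes, each a list of feature vectors."""
    dim = len(classes[0][0])
    rows = [row for cls in classes for row in cls]
    grand = [sum(r[j] for r in rows) / len(rows) for j in range(dim)]
    trace_sb = 0.0
    trace_sw = 0.0
    for cls in classes:
        mu = [sum(r[j] for r in cls) / len(cls) for j in range(dim)]
        # Between-class scatter: class mean versus overall mean, weighted by class size.
        trace_sb += len(cls) * sum((mu[j] - grand[j]) ** 2 for j in range(dim))
        # Within-class scatter: each sample versus its own class mean.
        trace_sw += sum((r[j] - mu[j]) ** 2 for r in cls for j in range(dim))
    return trace_sb / trace_sw

# Hypothetical one-dimensional features measured on two classes of samples.
feature_a = [[[1.0], [1.2]], [[3.0], [3.2]]]   # classes well separated
feature_b = [[[1.0], [3.0]], [[1.2], [3.2]]]   # classes overlap
score_a = scatter_ratio(feature_a)
score_b = scatter_ratio(feature_b)
```

A larger ratio indicates that the classes are far apart relative to their internal spread, so in this toy case feature A would be preferred over feature B.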

It should be pointed out that the assignment of at least one of the x features and/ or at least one of the y features to the specific bacteria is performed by method of feature selection and classification.

It should be pointed out that the method can additionally comprise a step of selecting said x features and/or said y features via algorithms selected from the Chi-Squared (χ2) test, Wilcoxon test, and t-test, or any combination thereof.

It should be further pointed out that the method can comprise calculating the Gaussian distribution, or Multivariate Gaussian distribution, or Rayleigh distribution, or Maxwell distribution, or estimating the distribution by the Parzen method or by a mixed model (like the Gaussian Mixed Model known as GMM), for at least one of the n features, such that the distributions define the n dimensional volume in the n dimensional space.

It should be further emphasized that all the above mentioned steps could be performed on at least one of the o-th derivative of the absorption spectrum; o is an integer greater than or equal to 1. For example, the features are extracted from the o-th derivative instead of the signal. If the features (extracted from the spectrum and/or its derivatives) are within the n dimensional volume boundaries, the specific bacteria is identified. Otherwise the bacteria are not identified.

Alternatively or additionally, each of the x and/or y features is given a weighting factor. The weighting factor is determined by examining how each feature improves the bacteria detection prediction (for example by using maximum likelihood or Bayesian estimation). Once the weighting factor is assigned to each one of the x and y features, the boundaries are determined for the features having the most significant contribution to the bacteria prediction.

Alternatively or additionally, the AS2 and its derivatives are smoothed by reducing the noise. The noise reduction is obtained by different smoothing techniques selected from a group consisting of running average, Savitzky-Golay or any combination thereof.

The following is an illustration of the two dimensional boundary based on the two best features from a segment of the spectrum.

Smoothing of the spectrum

Reference is now made to figure 12 illustrating the absorption signal of a sample containing 25% streptococcus pyogenes and 75% streptococcus Bovis prior to and after the noise was reduced (recorded signal vs. smoothed signal).

Reference is now made to figure 13 illustrating the signal's first derivative of a sample containing 25% streptococcus pyogenes and 75% streptococcus Bovis prior to and after the noise was reduced (recorded signal vs. smoothed signal).

The m features extracted from the spectrum

The following m features were extracted: peak's wavelength, peak's height, peak's width, peak's cross section, peak's area, at least one of the coefficients of a fitted polynomial curve, the total sum of areas under at least two peaks of the signal, linear prediction coefficient (LPC), mean value of the signal, Variance value of the signal, Skewness value, Kurtosis value, Gaussians' set of parameters (μ,σ,Ai), different peaks' intensity ratios, wavelet coefficients or any combination thereof, m is an integer greater or equal to one.
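By way of a non-limiting illustration, several of the listed features (mean, variance, skewness, kurtosis, the highest peak's position and height, and the area under the signal) can be computed from a sampled spectrum as sketched below in Python. The function name `spectrum_features`, the population-moment definitions and the trapezoidal area rule are assumptions of the sketch, not definitions given in this text, and the sample values are hypothetical.

```python
def spectrum_features(wavenumbers, absorbance):
    """Return a dict of simple statistical and peak features of one spectrum segment."""
    n = len(absorbance)
    mean = sum(absorbance) / n
    variance = sum((a - mean) ** 2 for a in absorbance) / n
    std = variance ** 0.5
    # Standardized third and fourth moments (population definitions).
    skewness = sum(((a - mean) / std) ** 3 for a in absorbance) / n
    kurtosis = sum(((a - mean) / std) ** 4 for a in absorbance) / n
    # Highest peak: position and height of the maximum sample.
    peak_index = max(range(n), key=lambda i: absorbance[i])
    # Area under the signal by the trapezoidal rule.
    area = sum((absorbance[i] + absorbance[i + 1]) / 2
               * abs(wavenumbers[i + 1] - wavenumbers[i]) for i in range(n - 1))
    return {
        "mean": mean,
        "variance": variance,
        "skewness": skewness,
        "kurtosis": kurtosis,
        "peak_wavenumber": wavenumbers[peak_index],
        "peak_height": absorbance[peak_index],
        "area": area,
    }

# Hypothetical five-point segment with a single peak at 1020 cm^-1.
features = spectrum_features([1000, 1010, 1020, 1030, 1040], [0.0, 1.0, 4.0, 1.0, 0.0])
```

The resulting dictionary can be flattened into a feature vector, one entry per feature, which is the form assumed by the boundary check described in this example.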

The features were extracted from (i) the dried bacteria spectrum (i.e., after the water influence was eliminated), (ii) the first derivative of the wet bacteria spectrum (prior to the water influence elimination), (iii) the second derivative of the wet bacteria spectrum, (iv) the first derivative of the dried bacteria spectrum (i.e., after the water influence was eliminated), (v) the second derivative of the dried bacteria spectrum estimate (i.e., after the water influence was eliminated), and (vi) Correlation.

Other features that were extracted were the peak's wavelength and height of the wet bacteria spectrum, the peak's wavelength and height of the dried bacteria spectrum estimate, the peak width from a peak's wavelength of the wet bacteria spectrum, the peak width from a peak's wavelength of the dried bacteria spectrum estimate, the peak width from a specified wavenumber of the wet bacteria spectrum, and the peak width from a specified wavenumber of the dried bacteria spectrum estimate.

The signal and the signal's first derivative were divided into the following segments: 3000-3300 cm⁻¹, about 850-1000 cm⁻¹, about 1300-1350 cm⁻¹, about 2836-2995 cm⁻¹, about 1720-1780 cm⁻¹, about 1550-1650 cm⁻¹, about 1235-1363 cm⁻¹, about 990-1190 cm⁻¹, about 1500-1800 cm⁻¹, about 2800-3050 cm⁻¹ and about 1180-1290 cm⁻¹, according to said features, due to the fact that in these regions there were differences between the specific bacteria to be detected (i.e., streptococcus pyogenes) and other bacteria (e.g., streptococcus bovis).

The mi features were extracted from at least one of the above mentioned spectrum segments.

The two most significant features were found to be the wavelet transform coefficients calculated on the wavenumber region [990-1170] (cm⁻¹), where the wavelet family was the Daubechies Wavelets (db2).

Feature #1 is coefficient #7 (denoted cA3(7)) in the approximation of level 3 with the db2 wavelet transform, where db2 is the Daubechies family wavelet of order 2 (denoted as column X in the following table), and Feature #2 is coefficient #6 (denoted cD3(6)) in the detail of level 3 with the db2 wavelet transform (denoted as column X in the following table). The selection of these features stems from the fact that they yield the best discrimination power in distinguishing between the fully-hydrated bovis bacteria and the fully-hydrated mixed-strep-with-bovis bacteria.
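By way of a non-limiting illustration, a level-3 db2 decomposition yielding the approximation coefficients cA3 and the detail coefficients cD3 can be sketched in Python as follows. The sketch assumes periodic signal extension and a segment length that is a multiple of 8; the boundary handling and the software actually used are not stated in this text, so the numbering of individual coefficients may differ from the numbering used for the two features above. The function names and the sample segment are hypothetical.

```python
import math

SQRT3 = math.sqrt(3.0)
NORM = 4.0 * math.sqrt(2.0)
# db2 (4-tap Daubechies) scaling filter and the matching wavelet filter.
LOW = [(1 + SQRT3) / NORM, (3 + SQRT3) / NORM, (3 - SQRT3) / NORM, (1 - SQRT3) / NORM]
HIGH = [LOW[3], -LOW[2], LOW[1], -LOW[0]]

def dwt_step(x):
    """One db2 analysis level with periodic extension; len(x) must be even."""
    n = len(x)
    approx = [sum(LOW[k] * x[(2 * i + k) % n] for k in range(4)) for i in range(n // 2)]
    detail = [sum(HIGH[k] * x[(2 * i + k) % n] for k in range(4)) for i in range(n // 2)]
    return approx, detail

def wavedec3(x):
    """Return (cA3, cD3), the level-3 approximation and detail coefficients."""
    approx, detail = list(x), []
    for _ in range(3):
        approx, detail = dwt_step(approx)
    return approx, detail

# Hypothetical 64-sample segment standing in for the 990-1170 cm^-1 region.
segment = [math.sin(0.2 * i) + 0.1 * i for i in range(64)]
cA3, cD3 = wavedec3(segment)
feature_1 = cA3[6]  # 7th approximation coefficient, assuming 1-based numbering
feature_2 = cD3[5]  # 6th detail coefficient, assuming 1-based numbering
```

With 64 input samples each of cA3 and cD3 holds 8 coefficients, which is enough to address a 7th approximation coefficient and a 6th detail coefficient.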

The following table lists different samples containing different amounts of Strep-Payo bacteria and Strep-Bovis bacteria. It should be pointed out that the number preceding the bacteria name is the percentage of that bacteria in the mixture of strep bacteria and bovis bacteria; for instance 25Payo75Bovis means that the underlying sample contains 25% Strep-Payo bacteria and 75% Strep-Bovis bacteria.

Table 4: different samples containing different amounts of Strep-Payo bacteria and Strep-Bovis

Boundaries calculations

As explained above, the boundaries are calculated according to the features which had the most significant contribution to the specific bacteria identification in the sample. Reference is now made to figure 14, which illustrates the boundaries of a two dimensional area which enable the identification of bacteria. As mentioned above, the boundaries were calculated based on the two features having the most significant contribution to the bacteria prediction, which are coefficient #7 and coefficient #6, using a 1-Nearest-Neighbor classifier.

As can be seen from the figure 14, when streptococcus is present in the sample, it is possible to optically determine and identify its presence within the sample.

Verification whether the features or correlation are within the boundaries

Once a sample for detection is obtained (for example, a sample containing 50% Strep. pyo.), the absorption signal is read, the water influence is eliminated and the features are extracted. Then, according to the features, one can determine whether Strep. pyo. is present in the sample.

As can be seen from the above and from the table, samples that contain streptococcus fall in the region to the left of the division line.

For example, let us look at a sample containing 50% streptococcus pyogenes and 50% streptococcus Bovis. The wavelet coefficients are -0.6264 and -0.5753 for the first and second features respectively. This point falls on the left side of the line (boundary) in the graph. Therefore, Strep. pyo. is identified within the sample.

As another example, let us look at a sample containing 100% streptococcus Bovis (i.e., one that does not contain streptococcus pyogenes). The wavelet coefficients are 1.9373 and 0.2952 for the first and second features respectively. This point falls on the right side of the line in the graph. Therefore, Strep. pyo. is not present in the sample.
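By way of a non-limiting illustration, the 1-Nearest-Neighbor decision used for the boundaries can be sketched in Python as follows. Only the two feature points quoted in the text above are used as labelled references; the full training set of table 4 is not reproduced here, and the query points are hypothetical.

```python
def nearest_neighbor(train, query):
    """Return the label of the training point closest to the query (Euclidean distance)."""
    def squared_distance(point):
        return (point[0] - query[0]) ** 2 + (point[1] - query[1]) ** 2
    return min(train, key=lambda item: squared_distance(item[0]))[1]

# (feature #1, feature #2) pairs quoted in the text above.
train = [
    ((-0.6264, -0.5753), "strep pyo. present"),  # 50% S. pyogenes / 50% S. bovis
    ((1.9373, 0.2952), "strep pyo. absent"),     # 100% S. bovis
]

# Hypothetical feature vectors of two new samples.
label_left = nearest_neighbor(train, (-0.4, -0.5))
label_right = nearest_neighbor(train, (1.5, 0.3))
```

With only two reference points the decision boundary is the perpendicular bisector of the segment joining them; with the complete set of labelled samples the boundary would be piecewise linear, as is usual for a 1-Nearest-Neighbor classifier.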

It should be pointed out that the present invention detects bacteria as whole and not just single proteins on the membrane.

EXAMPLE 4 - Sensitivity measurements

Sensitivity at 1237 cm⁻¹

One of the most important characteristics of the system is its sensitivity.

The term "sensitivity" refers hereinafter to the ability to detect diluted amounts of bacteria.

We measured the spectral signature of the bacteria at different bacterial solution concentrations and computed the system sensitivity. At each concentration we sprayed about 40 μL of bacteria solution into the optical cell in the form of an aerosol.

The aerosols occupy 0.03% of the optical cell volume.

Figures 15a and 15b illustrate the bacterial spectral signal in the 1237 cm⁻¹ region for different bacteria concentrations (figure 15a) and the absorbance as a function of the bacteria concentration (figure 15b).

As can be seen from the figures the absorbance increases with the concentration. This is due to a higher number of bacteria that absorb light.

It is possible to compute the current experimental setup sensitivity to bacteria concentration (with the aid of figure 15a). As described above, the sensitivity is defined as the minimal bacteria concentration that can be detected using the current experimental setup.

Mathematically (to a first approximation), it is the point where the linear graph intersects the x-axis. Since there is a signal bias, the intersection is taken with the 0.0075 absorbance line (about 5% above the noise level). In order to compute the sensitivity of the current experimental setup, the linear (least squares) fit of the graph and the point where it intersects the x-axis were calculated. The measured sensitivity at 1234 cm⁻¹ is 4.741 μg/μL or 4.8×10⁶ bacteria/μL.
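The computation described above, a least-squares line through the absorbance versus concentration points followed by its intersection with the 0.0075 absorbance line, can be sketched as follows. The calibration values are invented for illustration and are not the measured data; the function names `linear_fit` and `detection_limit` are hypothetical.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares fit y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def detection_limit(xs, ys, threshold=0.0075):
    """Concentration at which the fitted line crosses `threshold` absorbance."""
    slope, intercept = linear_fit(xs, ys)
    if slope <= 0:
        raise ValueError("absorbance must increase with concentration")
    return (threshold - intercept) / slope


# Hypothetical calibration data (invented, NOT the measured values).
concentrations = [10.0, 20.0, 40.0, 80.0]    # ug/uL
absorbances = [0.010, 0.015, 0.025, 0.045]   # absorbance units
limit = detection_limit(concentrations, absorbances)
```

With these made-up points the fitted line is 0.0005·c + 0.005, so the sketch yields a limit of about 5 μg/μL; the figures quoted in the text come from the actual measurements, not from this example.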

Sensitivity at 1084 cm⁻¹

The following figures (figures 16a and 16b) illustrate the bacterial spectral signal in the 1084 cm⁻¹ region for different bacteria concentrations (figure 16a) and the absorbance as a function of the bacteria concentration (figure 16b).

Again, the absorbance increases with the concentration. The same analysis was applied to this wavelength region. The measured sensitivity at 1084 cm⁻¹ is 6.095 μg/μL or 6.1×10⁶ bacteria/μL.

EXAMPLE 5 - detection of Strep throat in an aerosol sample

The term "Strep throat" or "streptococcal pharyngitis" or "Streptococcal Sore Throat" refers hereinafter to a group A streptococcal infection that affects the pharynx.

The system and method of the present invention were tested on 13 patients suspected of having Strep throat.

Figure 17 illustrates the spectrum of the coughed aerosols taken from a patient suspected to have Strep.

After the above-described method was implemented, a graph demonstrating the boundaries between patients having Strep. A and patients not having Strep. A was obtained.

Figure 18 illustrates the classification results and separation between patients that were Strep. A. positive and those who were Strep. A. negative.

The features that were selected were:

Feature #1: cD1(17), which is coefficient #17 in the approximation of level #1 with the db2 wavelet transform, where db2 is the Daubechies family wavelet of order 2.

Feature #2: First derivative value at 954.0295 cm⁻¹ after water removal.

As can be seen from the figure, patients having Strep. A are identified.
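The two selected features can be illustrated with the following sketch, which rests on several assumptions. It assumes that cD1 denotes the level-1 coefficient vector usually called the detail coefficients in wavelet toolboxes, that coefficient #17 is counted from 1, and that the signal is extended periodically at its ends. A wavelet library may use a different boundary mode, filter orientation or index shift, so the coefficient picked here need not coincide with the one used in the described method. The spectrum is synthetic and the function names are hypothetical.

```python
import math

SQRT3 = math.sqrt(3.0)
NORM = 4.0 * math.sqrt(2.0)
# Daubechies order-2 (db2, four-tap) scaling filter.
H = [(1 + SQRT3) / NORM, (3 + SQRT3) / NORM,
     (3 - SQRT3) / NORM, (1 - SQRT3) / NORM]
# Matching wavelet (high-pass) filter: g[n] = (-1)^n * h[3 - n].
G = [H[3], -H[2], H[1], -H[0]]


def dwt_db2_level1(signal):
    """One db2 decomposition level with periodic extension (assumption).

    Returns (approximation, detail), each of length len(signal) // 2.
    """
    n = len(signal)
    if n < 4 or n % 2:
        raise ValueError("signal length must be even and at least 4")
    approx, detail = [], []
    for k in range(n // 2):
        window = [signal[(2 * k + j) % n] for j in range(4)]
        approx.append(sum(h * x for h, x in zip(H, window)))
        detail.append(sum(g * x for g, x in zip(G, window)))
    return approx, detail


def first_derivative_at(wavenumbers, absorbance, target):
    """Central-difference derivative at the interior grid point nearest `target`."""
    i = min(range(1, len(wavenumbers) - 1),
            key=lambda j: abs(wavenumbers[j] - target))
    return ((absorbance[i + 1] - absorbance[i - 1])
            / (wavenumbers[i + 1] - wavenumbers[i - 1]))


# Hypothetical water-corrected spectrum (synthetic Gaussian band, invented).
wavenumbers = [900.0 + 2.0 * i for i in range(64)]
absorbance = [math.exp(-((w - 960.0) / 15.0) ** 2) for w in wavenumbers]

_, detail = dwt_db2_level1(absorbance)
feature_1 = detail[16]  # coefficient #17, assuming 1-based numbering
feature_2 = first_derivative_at(wavenumbers, absorbance, 954.0295)
```

The pair (`feature_1`, `feature_2`) would then be compared against the classification boundaries, for instance with a nearest-neighbor rule.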

EXAMPLE 6 - Non medical applications

According to another embodiment of the present invention, the method as described above can be used to detect bacteria such as anthrax (AVA and Next Generation), smallpox, ricin, equine encephalitis, Clostridium botulinum (bacteria), Francisella tularemia (bacterial disease), viral hemorrhagic fevers and Yersinia pestis, as well as hazardous materials such as mercury, pharmaceuticals, radiologicals, sterilants and disinfectants, cleaning chemicals, laboratory chemicals, pesticides and bioaccumulative toxics.

This can be used in:

(i) environmental monitoring - hazardous material and bacteria located in crowded places such as airports, trains, planes, cruise ships, stadiums etc.

(ii) Ventilation systems - checking ventilation systems for hazardous materials.

According to a preferred embodiment, the ventilation system can be monitored in hospitals, cruise ships etc.

(iii) Water reservoirs, water systems etc. - Coliform and E coli;

(iv) Food and beverage production lines - Aeromonas caviae, Aeromonas hydrophila, Aeromonas sobria, Bacillus cereus, Campylobacter jejuni, Citrobacter spp., Clostridium botulinum, Clostridium perfringens, Enterobacter spp., Enterococcus spp., Escherichia coli enteroinvasive strains, Escherichia coli enteropathogenic strains, Escherichia coli enterotoxigenic strains, Escherichia coli O157:H7, Klebsiella spp. (as illustrated in figure 20), Listeria monocytogenes, Plesiomonas shigelloides, Salmonella spp., Shigella spp., Staphylococcus aureus (as illustrated in figure 19), Streptococcus spp., Vibrio cholerae, Yersinia enterocolitica;

(v) Bio defense and terror - detecting airborne bacteria, chemical agents etc.