

Title:
AN APPARATUS AND METHOD FOR IDENTIFYING PLANT VARIETIES FROM LEAF SAMPLES TAKEN WHILST IN THE FIELD.
Document Type and Number:
WIPO Patent Application WO/2015/035448
Kind Code:
A1
Abstract:
An apparatus for identifying plant varieties from leaf or flower samples taken whilst in the field comprises a scanning device having a backlight so as to enable a detailed image of a sample to be recorded digitally, a computer for uploading the image for analysis, a computer program which allocates user prescribed parameters such as leaf venation, leaf shape, base position and shape and leaf curvature to the image, and utilising the data produced by the computer program and applying an algorithm to it for matching the data against a database of plant varieties to determine the highest match probability.

Inventors:
WARREN JAMES (AU)
Application Number:
PCT/AU2014/000891
Publication Date:
March 19, 2015
Filing Date:
September 11, 2014
Assignee:
AVICENNIA INVEST PTY LTD (AU)
International Classes:
G06F19/00; G06T7/40; G06V10/145; G06T7/60; G06V10/46
Foreign References:
US20080059076A1 2008-03-06
CN102072882A 2011-05-25
Other References:
WU, Q. ET AL.: "Feature Extraction and Automatic Recognition of Plant Leaf Using Artificial Neural Network", ADVANCES IN ARTIFICIAL INTELLIGENCE, 2006, MEXICO, XP055327172
DU, J.-X. ET AL.: "Leaf shape based plant species recognition", APPLIED MATHEMATICS AND COMPUTATION, vol. 185, no. 2, 2007, pages 883 - 893, XP005908428
MOUINE, S. ET AL.: "Advanced shape context for plant species identification using leaf image retrieval", 2ND ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, June 2012 (2012-06-01), HONG KONG, CHINA, XP055327173
See also references of EP 3044711A4
Attorney, Agent or Firm:
SULMAN, Matthew, James (63 Waldheim Street, Annerley QLD 4103, AU)
Claims:
CLAIMS

What is claimed is:

1. An apparatus for identifying plant varieties from leaf or flower samples taken whilst in the field comprising:

a scanning device having a backlight so as to enable a detailed image of a sample to be recorded digitally; a computer for uploading the image for analysis;

a computer program which allocates user prescribed parameters such as leaf venation, leaf shape, base position and shape and leaf curvature to the image;

utilising the data produced by the computer program and applying an algorithm to it for matching the data against a database of plant varieties to determine the highest match probability.

2. The apparatus of claim 1 wherein the apparatus is a hand-held computer.

3. The apparatus of claim 1 wherein the backlight on the scanner is an LED light.

4. A method for identifying plant varieties from leaf or flower samples taken whilst in the field including the steps:

(i) acquiring a scanned image of a leaf or flower sample;

(ii) applying an image manipulation algorithm to the scanned image to enhance venation data recorded;

(iii) producing a venation line drawing;

(iv) cross-referencing venation line drawing data with a set of identification data;

(v) comparing venation and identification data with known samples stored in a database;

(vi) choosing and displaying the most probable match for plant variety according to the sample analysis.

5. A method for identifying plant varieties from leaf or flower samples taken whilst in the field including the steps:

(i) harvesting a leaf or flower sample;

(ii) taking a photographic image of the leaf or flower sample using a backlit scanning device;

(iii) extracting a set of identification parameters from the photographic image including leaf venation, leaf shape, base position and shape and leaf curvature;

(iv) applying an identification algorithm to the extracted identification parameters;

(v) presenting the results of the algorithmic analysis on a screen of a computer or hand held device;

(vi) comparing the sample with illustrations of known leaf samples;

(vii) choosing and displaying the most probable match for plant variety according to the sample analysis.

Description:
Title

An apparatus and method for identifying plant varieties from leaf samples taken whilst in the field.

Background

Botanists typically spend a large amount of time in the field collecting samples for their research. Historically, botanists have relied upon textbooks and compendiums of plant samples to correctly identify plant varieties from leaf and flower samples. The identification of plant varieties from leaf samples is based upon a close examination of leaf venation patterns, the configuration of leaf shapes and other identifying features, and upon matching observed but unknown features with known specimens, photographs, illustrations or descriptions.

Recent developments in computer and electronic technological fields have meant that access to digitally stored data is more readily available. Further, computer chip processing speeds and memory storage capacities have increased significantly, such that it is now possible to store a large volume of digital data within a relatively small device, or to access remotely stored data using wireless communication means. The integration of such technology with plant identification and classification algorithms is now possible and desirable as a means of making it easier for botanists to accurately identify plant samples whilst engaged in field work.

It would be advantageous to develop an apparatus and method for correctly identifying plant varieties whilst in the field based upon an examination of leaf samples which employs a suitably designed and configured electronic device. This could greatly improve the productivity of botanists engaged in field research and reduce the overall costs of undertaking such research. Such an apparatus and method may have other uses outside the field of botanical research, for example, in correctly identifying a plant type in the case of a patient experiencing an allergic reaction to a plant which may potentially be life threatening. In such circumstances the apparatus and method for identifying plant varieties may be very valuable in preserving life or in accurately diagnosing a particular medical condition and prescribing a suitable prophylaxis or remedy.

There are a number of prior art apparatuses and methods for identifying plants through leaf venation. On the website www.leafsnap.com, for example, there is described an application suitable for use on an iPhone for identifying leaves, which involves taking photographs of leaf samples and matching the photograph of the sample to existing photographs stored within a database. The application compares the outline of the sample leaf or flower, which is not sufficiently accurate to enable correct identification of plant species to the degree required of a botanist. The application is suitable for hobbyists.

US Patent 20080059076 describes a method for classifying leaves utilizing venation features. The method includes taking a sample venation image using a Curvature Scale Space Corner Detection Algorithm. The image is then treated to thicken the venation and increase the contrast through the retrieval unit. Canny Edge Detection technology is then applied to detect the feature, branching and end points where the calculated curvature angle is a local maximum. The distribution of the feature points of the extracted venation is calculated by applying a Parzen Window non-parametric estimation method.

Existing methods of identifying plant varieties from leaf and flower samples, including those referred to above, however suffer from a lack of accuracy and ease of use, particularly in field situations. The method described in US 20080059076, for instance, focuses on the process of categorising leaf venation into 4 categories: pinnate, first parallel, second parallel and palmate. It assumes that the image has been captured and does not mention how the exact type of leaf is determined once it has been classified into one of the four categories employed. There is a lack of cross-reference to other characteristic features of the samples examined and no overriding means of enhancing the accuracy of data captured.

It would be advantageous to provide an apparatus and method for identifying plant varieties from leaf samples which overcomes at least some of the problems of prior art devices and which provides for greater accuracy in identification of samples.

Accordingly there is provided an apparatus for identifying plant varieties from leaf samples taken whilst in the field comprising:

a scanning device having a backlight so as to enable a detailed image of a sample to be recorded digitally;

a computer for uploading the image for analysis;

a computer program which allocates user prescribed parameters such as leaf venation, leaf shape, base position and shape and leaf curvature to the image;

utilising the data produced by the computer program and applying an algorithm to it for matching the data against a database of plant varieties to determine the highest match probability.

In some preferred embodiments the apparatus is a hand-held computer.

In some preferred embodiments of the invention the apparatus includes a scanner which has a backlight which is an LED light.

There is provided a method for identifying plant varieties from leaf samples taken whilst in the field including the steps:

(i) acquiring a scanned image of a leaf sample;

(ii) applying an image manipulation algorithm to the scanned image to enhance venation data recorded;

(iii) producing a venation line drawing;

(iv) cross-referencing venation line drawing data with a set of identification data;

(v) comparing venation and identification data with known samples stored in a database;

(vi) choosing and displaying the most probable match for plant variety according to the sample analysis.

There is also provided a method for identifying plant varieties from leaf samples taken whilst in the field including the steps:

(i) harvesting a leaf sample;

(ii) taking a photographic image of the leaf sample using a backlit scanning device;

(iii) extracting a set of identification parameters from the photographic image including leaf venation, leaf shape, base position and shape and leaf curvature;

(iv) applying an identification algorithm to the extracted identification parameters;

(v) presenting the results of the algorithmic analysis on a screen of a computer or hand held device;

(vi) comparing the sample with illustrations of known leaf samples;

(vii) choosing and displaying the most probable match for plant variety according to the sample analysis.

Drawings

Figure 1 shows a top-level block diagram of a preferred aspect of the pre-defined database.

Figure 2 shows typical leaf sample shapes that may be used within the Binary Shape Matching Tool.

Figure 3 shows the function employed in the Binary Shape Matching Tool software.

Figure 4 shows a block diagram of the leaf margin test which identifies either spikes or lobes in the sample.

Figure 5 shows a block diagram of the base vein testing step.

Figure 6 shows the front screen of the software interface.

Figure 7 shows the venation of a sample.

Figure 8 shows an enhanced venation image produced using a circular edge scanning detection technique.

Figure 9 shows the user selection window.

Table 1 shows the sample results after analysis using the five successive tests incorporated in the method of the present invention.

Figure 10 shows samples taken with a backlit scanner and the resulting venation line drawings processed.

Figure 11 shows a block diagram of the image processing steps that may be used to generate a venation line drawing.

Description

The present invention is directed to providing an apparatus and method for correctly identifying plant varieties from leaf samples whilst the user is engaged in field research. It has been found by the inventor that botanists using a camera, whether in the field or in the office, were unable to obtain a consistent environment so as to enable an accurate assessment of the venation of the sample leaves or flowers. The inability to accurately record leaf or flower characteristics, particularly venation, was a result of limitations of lighting, distance from sample to lens and camera angle.

It was found therefore that research conducted utilising a backlit scanner produced a much more consistent environment for the accurate collection of leaf and flower samples, which could then be more effectively utilised to correctly identify and classify the leaf samples with respect to known plant varieties. When combined with an identification algorithm also developed by the inventor, it was found that the collected sample images could be used to identify plant varieties to an accuracy of between 90 and 95%. This represents a significant improvement in accuracy compared to existing methods of identification and classification that use photographic sample analysis.

The algorithm recognises a number of leaf characteristics, including leaf shape, margin and venation, and records the data as a series of parameters. These are then compared to data contained within a database of line drawings of known leaf samples, which may be based upon the common general knowledge in the field set out in existing reference books, for example, "Trees & Shrubs in Rainforests of New South Wales and Southern Queensland" by Gwen Harden, which is relevant to the identification of plant species within that geographical region. Preferably all known reference data sources would be accessible.

A standard document or image scanner may be used to record the leaf or flower sample images for analysis and identification using the "slide" feature, which provides the backlight required to record the sample detail sufficiently to enable analysis. It has been found that a standard scanner is suitable for use with smaller leaf and flower samples; however, in the case of large samples it has been necessary to construct and utilise a purpose-built scanning device to obtain sufficiently accurate identification and classification.

Images recorded by a backlit scanner are analysed using a series of defined parameters which are cross-referenced against known plant identification parameters to identify sample plant varieties. The recorded parameters are utilised to make high-percentage estimations of identification of plant varieties derived from an algorithmic comparative analysis of the recorded data and existing identification data, which may be stored on a computer server located remotely from the scanner.

Various parts or sections of the tested leaf sample are analysed. Features for comparison may include, by way of non-limiting example:

leaf shape - elliptic, peltate, etc;

margin - smooth, lobed, etc;

base veins;

base position - mid or edge;

leaf curvature - low, mid, high.

Samples may be recorded using any suitable image recording software, for example, National Instruments Image Acquisition Module (IMAQ). Recorded parameters of the sample can be compared against a pre-defined database and test result comparisons can be made to approximate the identity of the sample. An example of a pre-defined database suitable for the present purposes is National Instruments LabVIEW, which utilises a visual (graphical) programming language, although other database programming software may be suitable for use without departing from the scope of invention. Figure 1 shows a top-level block diagram of a preferred aspect of the pre-defined database.
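By way of illustration only, the comparison of recorded parameters against a pre-defined database might be sketched as follows. The description elsewhere states that the database is a comma separated values file; the column names, the placeholder variety names and the simple "fraction of matching features" score are all assumptions made for this sketch and are not taken from the specification.

```python
import csv
import io

# Hypothetical column names for the five recorded features.
FEATURES = ["shape", "margin", "base_veins", "base_position", "curvature"]

# Hypothetical database contents; real entries would come from reference data.
SAMPLE_DB = """name,shape,margin,base_veins,base_position,curvature
Variety A,elliptic,smooth,1,edge,low
Variety B,ovate,lobed,3,edge,mid
Variety C,obtuse,smooth,5,mid,low
"""


def load_database(text):
    """Parse CSV text into a list of dictionaries, one per known variety."""
    return list(csv.DictReader(io.StringIO(text)))


def rank_matches(sample, database):
    """Return (score, name) pairs, best first; score is the fraction of matching features."""
    ranked = []
    for row in database:
        hits = sum(1 for f in FEATURES if str(sample[f]) == row[f])
        ranked.append((hits / len(FEATURES), row["name"]))
    ranked.sort(key=lambda pair: (-pair[0], pair[1]))
    return ranked


sample = {
    "shape": "ovate",
    "margin": "lobed",
    "base_veins": 3,
    "base_position": "edge",
    "curvature": "high",
}
ranked = rank_matches(sample, load_database(SAMPLE_DB))
best_score, best_name = ranked[0]
print(f"Most probable match: {best_name} ({best_score:.0%})")
```

A weighted score, for example one giving venation features more influence than shape, would be a natural refinement, but no weighting scheme is given in the description.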

The database software (for example LabVIEW) includes a function called the Binary Shape Matching Tool which enables comparison of recorded shapes with a list of pre-defined binary template shapes. Only those binary shape templates that are relevant to a recorded sample are used. Figure 2 shows typical leaf sample shapes that may be used within the Binary Shape Matching Tool.

Figures 2a - 2f indicate the following leaf shapes respectively:

2a acuminate

2b aristate

2c elliptic

2d obtuse

2e ovate

2f peltate
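The idea of comparing a recorded binary shape with pre-defined binary templates can be sketched as below. This is not the LabVIEW Binary Shape Matching Tool itself: the intersection-over-union score and the tiny 3 x 5 templates are assumptions chosen purely to show the principle.

```python
def overlap_score(a, b):
    """Intersection-over-union of two equal-sized binary grids (lists of 0/1 rows)."""
    inter = union = 0
    for row_a, row_b in zip(a, b):
        for pa, pb in zip(row_a, row_b):
            inter += 1 if (pa and pb) else 0
            union += 1 if (pa or pb) else 0
    return inter / union if union else 0.0


def best_shape(sample, templates):
    """Return the best-matching template name and the score for every template."""
    scores = {name: overlap_score(sample, grid) for name, grid in templates.items()}
    return max(scores, key=scores.get), scores


# Hypothetical, heavily simplified templates (real ones would be full-resolution images).
TEMPLATES = {
    "elliptic": [[0, 1, 1, 1, 0],
                 [1, 1, 1, 1, 1],
                 [0, 1, 1, 1, 0]],
    "ovate":    [[0, 0, 1, 1, 0],
                 [1, 1, 1, 1, 1],
                 [1, 1, 1, 1, 0]],
}

sample_shape = [[0, 1, 1, 1, 0],
                [1, 1, 1, 1, 0],
                [0, 1, 1, 1, 0]]

shape_name, shape_scores = best_shape(sample_shape, TEMPLATES)
print(shape_name, shape_scores)
```

In practice the sample would first need to be scaled and rotated to the same frame as the templates before any pixel-wise comparison is meaningful.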

Figure 3 shows the function employed in the Binary Shape Matching Tool software.

Once the Binary Shape Matching Tool has been used to approximate the shape of a leaf sample, a leaf margin test is applied to the sample. The margin test is manually programmed and utilises a sweeping line edge detector, horizontally at first and then vertically, to detect leaf edges. The test uses a pre-set threshold distance between crevices: small crevices, which appear at distances below the threshold, identify spikes, and large crevices, which appear at distances larger than the threshold, identify lobes. For example, a Glochidion ferdinandi leaf yielded 6 small crevice particles with an average size of 165 px and 2 large crevice particles with an average size of 121 px. By contrast, a leaf sample that is both spiky and lobed yielded 99 small crevice particles with an average size of 226 px and 56 large crevice particles with an average size of 1440 px. The detected crevice spots can then be adjusted using binary morphology and manipulation according to the user's requirements. Figure 4 shows a block diagram of the leaf margin test which identifies either spikes or lobes in the sample.
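The threshold logic of the margin test might be sketched as follows. The sketch assumes the edge detector has already produced a list of crevice positions along the margin (in pixels); it then counts the gaps between neighbouring crevices as "small" or "large" relative to a pre-set threshold. The function names, the threshold value and the four-way label are hypothetical.

```python
def classify_crevices(positions, threshold):
    """Count gaps between neighbouring crevices that fall below or above the threshold."""
    ordered = sorted(positions)
    small = large = 0
    for previous, current in zip(ordered, ordered[1:]):
        if current - previous <= threshold:
            small += 1  # closely spaced crevices suggest spikes
        else:
            large += 1  # widely spaced crevices suggest lobes
    return small, large


def describe_margin(small, large):
    """Turn the two counts into a coarse margin label."""
    if small and large:
        return "spiky and lobed"
    if small:
        return "spiky"
    if large:
        return "lobed"
    return "smooth"


# Hypothetical crevice positions along the margin, in pixels.
crevice_positions = [0, 5, 10, 15, 100, 200]
small_count, large_count = classify_crevices(crevice_positions, threshold=20)
print(describe_margin(small_count, large_count))
```

A real implementation would probably also ignore very small counts as noise, as the Glochidion ferdinandi figures above suggest that even a largely smooth leaf produces a handful of detected crevices.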

The sample data is then subjected to further testing to determine the number of veins attached to the base of the leaf sample, the site of the petiole attachment. In order to detect the number of base veins, a detailed and accurate image needs to be extracted from the original sample image. A binary image of the venation is extracted from the sample and then a circular edge detector is used to detect any attached edges or veins from the base. Recorded data is subjected to an averaging algorithm to provide a good estimation of leaf base veins. Figure 5 provides a block diagram of the base vein testing step. Figure 6 shows the front screen of the software interface wherein the Binary Shape Matching Tool indicates the binary matching template path, leaf database path, image file name and the identification name of the leaf along with the binary image of the sample being processed.
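One way to picture the circular edge detector is to walk around a circle centred on the leaf base and count how many times the path enters a vein pixel in the binary venation image. The sketch below does this for several radii and averages the counts; the choice of radii, the number of angular samples and the use of a simple mean as the "averaging algorithm" are assumptions.

```python
import math


def count_veins_on_circle(image, center, radius, samples=360):
    """Count background-to-vein transitions along a circle around the base."""
    rows, cols = len(image), len(image[0])
    cy, cx = center
    values = []
    for k in range(samples):
        angle = 2 * math.pi * k / samples
        y = int(round(cy + radius * math.sin(angle)))
        x = int(round(cx + radius * math.cos(angle)))
        inside = 0 <= y < rows and 0 <= x < cols
        values.append(1 if inside and image[y][x] else 0)
    # values[k - 1] wraps around at k == 0, so a vein straddling angle 0 is counted once.
    return sum(1 for k in range(samples) if values[k] and not values[k - 1])


def average_vein_count(image, center, radii):
    """Average the per-circle counts over several radii and round to a whole number."""
    counts = [count_veins_on_circle(image, center, r) for r in radii]
    return round(sum(counts) / len(counts))


# Hypothetical 21 x 21 binary venation image with three veins leaving the base at (10, 10).
size = 21
venation = [[0] * size for _ in range(size)]
for i in range(size):
    venation[i][10] = 1          # one vein upwards and one downwards
for x in range(10, size):
    venation[10][x] = 1          # one vein to the right

base_veins = average_vein_count(venation, center=(10, 10), radii=[3, 5, 8])
print(base_veins)
```

Averaging over several radii makes the estimate less sensitive to a single noisy ring, which may be the purpose of the averaging step mentioned above.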

Using specific binary and greyscale morphology techniques a figure such as that depicted in Figure 7 is extracted showing venation of the sample. The base of the sample is then located by user selection and translated using edge detection with a circular scanning method. Figure 8 shows a venation image produced using a circular edge scanning detection technique.

The base and tip positions of the sample are detected by asking the user to select the points from the original binary leaf image recorded. The recorded image is then manipulated and rotated so that all leaves are aligned the same way. Using the information gained from the binary leaf image supplied by the Binary Shape Matching test, together with the selected base location, the base position can easily be detected. In this first stage, the base is positioned either on the edge or in the middle of the leaf sample. The user selection window is represented in Figure 9.

The curvature of some leaves is more pronounced than others and can therefore be readily differentiated. The base and tip positions selected by the user from the original leaf image sample are directly compared against the pixel location of the 'centre of mass' (CoM) of the leaf's binary image. If the difference is negligible, the curvature is very small. If the difference between the CoM and the base and tip is large, the leaf's curvature is estimated to be large also. The use of known algorithms to assist in identifying sample leaf curvature may increase accuracy of identification. Suitable algorithms may involve "Curvature Scale Space" (CSS), which can be used to find corner and interest points, edges and leaf shape, to detect margins, etc., although other known algorithms may be suitable for the purpose herein envisaged.

The database used in the present invention is a comma separated values file; however, other file types may be suitable for use without departing from the scope of invention. Table 1 shows the sample results after analysis using the five successive tests incorporated in the method of the present invention.

It has been found that the use of a backlit scanner has several advantages over digital still photographs; for example, a scanner provides a controlled environment with controlled white light balance. The backlight may be, for example, an LED torch; however, a slide scanner which activates a backlight for illuminating the veins of the sample is preferable. A backlit scanner highlights the sample venation structure, which permits greater accuracy in sample identification. A scanner permits controlled measurements, size and number of pixels, and there is none of the warping, angular or barrel distortion that is common with the use of a lens. Digital still photographic images provide neither sufficient venation detail nor constant white balance across multiple images to permit a high degree of accuracy from sample analysis. A BearPaw 2448TA Plus is a suitable backlit scanner; however, it suffered problems with the ability to accurately scan larger samples. Accordingly, other scanner models may also be suitable or more suitable for adaptation and use without departing from the scope of invention.
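The centre of mass comparison used for the curvature estimate can be sketched as follows. The description does not say exactly how the "difference" between the CoM and the base and tip is measured; this sketch assumes it is the perpendicular distance from the CoM to the straight line joining base and tip, divided by the base-to-tip length. The low/mid/high cut-off values are likewise hypothetical.

```python
import math


def centre_of_mass(image):
    """Mean (row, column) position of all foreground pixels in a binary image."""
    total = sum_y = sum_x = 0
    for y, row in enumerate(image):
        for x, pixel in enumerate(row):
            if pixel:
                total += 1
                sum_y += y
                sum_x += x
    if total == 0:
        raise ValueError("image contains no foreground pixels")
    return sum_y / total, sum_x / total


def curvature_score(image, base, tip):
    """Distance of the CoM from the base-tip line, relative to the base-tip length."""
    cy, cx = centre_of_mass(image)
    by, bx = base
    ty, tx = tip
    length = math.hypot(ty - by, tx - bx)
    if length == 0:
        raise ValueError("base and tip must be different points")
    cross = abs((tx - bx) * (cy - by) - (ty - by) * (cx - bx))
    return cross / (length * length)


def classify_curvature(score, low=0.05, high=0.15):
    """Map the score onto the low / mid / high categories (assumed cut-offs)."""
    if score < low:
        return "low"
    if score < high:
        return "mid"
    return "high"


# Hypothetical one-pixel-wide leaves: a straight one and a strongly arched one.
straight_leaf = [[0] * 9, [1] * 9, [0] * 9]
bent_leaf = [[0] * 9 for _ in range(5)]
for y, x in [(0, 0), (1, 1), (2, 2), (3, 3), (4, 4), (3, 5), (2, 6), (1, 7), (0, 8)]:
    bent_leaf[y][x] = 1

straight_label = classify_curvature(curvature_score(straight_leaf, (1, 0), (1, 8)))
bent_label = classify_curvature(curvature_score(bent_leaf, (0, 0), (0, 8)))
print(straight_label, bent_label)
```

Normalising by the base-to-tip length keeps the score comparable between small and large leaves scanned at the same resolution.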

Correct identification of venation structure is the most important aspect of the present invention. Additional parameters that may be employed in the method of the invention include testing the sample vein veins, the number of branch veins, amount of cross-venulation, identifying opposed or alternate branching, vein angles and curvature, the propensity of veins to reach the sample leaf edge and/or curve away. Improved scanning capability and image resolution allows more consistent and superior sample identification results as a result of the improved accuracy of venation pattern line drawings. Figure 10 shows samples taken with a backlit scanner and the resulting venation line drawings processed. Figure 11 shows a block diagram of the image processing steps that may be used to generate a venation line drawing.