Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SYSTEMS AND METHODS FOR SOIL MAPPING
Document Type and Number:
WIPO Patent Application WO/2022/204809
Kind Code:
A1
Abstract:
Systems and methods for training a soil mapping model are provided, as well as systems and methods for generating soil maps with such trained soil mapping models. The soil mapping models predict soil characteristics (e.g. carbon content) across an area of interest. The training method can involve refining the parameters of a soil map model by estimating the soil mapping model's uncertainty at various locations across an area of interest, selecting relatively high- uncertainty areas, identifying sampling locations within those areas for further samples to be collected, and further training the parameters of the soil mapping model based on the newly-collected samples.

More Like This:
Inventors:
GOOD TRAVIS GODWIN (CA)
ROTHWELL AUSTIN CAULFIELD (CA)
Application Number:
PCT/CA2022/050475
Publication Date:
October 06, 2022
Filing Date:
March 30, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
TERRAMERA INC (CA)
International Classes:
G01V9/00; G06N20/00; G06V20/13
Foreign References:
CA3120370A12020-06-18
US20200163272A12020-05-28
Other References:
TAGHIZADEH-MEHRJARDI RUHOLLAH, EMADI MOSTAFA, CHERATI ALI, HEUNG BRANDON, MOSAVI AMIR, SCHOLTEN THOMAS: "Bio-Inspired Hybridization of Artificial Neural Networks: An Application for Mapping the Spatial Distribution of Soil Texture Fractions", REMOTE SENSING, MOLECULAR DIVERSITY PRESERVATION INTERNATIONAL (MDPI), CH, vol. 13, no. 5, 8 March 2021 (2021-03-08), CH , pages 1025, XP055976038, ISSN: 2072-4292, DOI: 10.3390/rs13051025
Attorney, Agent or Firm:
MARLES, Jennifer A. et al. (CA)
Download PDF:
Claims:
WHAT IS CLAIMED IS:

1. A method for training a soil mapping model, the method performed by a processor and comprising: training a set of initial parameters for a soil mapping model relating one or more soil characteristics to one or more covariates over an area of interest based on an initial training dataset; determining an uncertainty map for the soil mapping model over at least a portion of the area of interest based on the initial parameters, the uncertainty map mapping each of a plurality of locations in the area of interest to a measure of uncertainty in at least one of the one or more soil characteristics; determining one or more sampling locations in the area of interest based on the uncertainty map; receiving one or more sample measurements corresponding to the one or more sampling locations; and refining the set of initial parameters of the soil mapping model based on the one or more sample measurements to generate a first set of refined parameters for the soil mapping model.

2. The method according to claim 1 comprising mapping at least one location in the area of interest to at least one predicted value of at least one of the one or more soil characteristics, the at least one predicted value based on at least one of: the first set of refined parameters and a further set of refined parameters based on the first set of refined parameters for the soil mapping model.

3. The method according to claim 1 comprising iteratively generating one or more further sets of refined parameters, each set of refined parameters generated by: determining a further uncertainty map for the soil mapping model based on at least one of the first set of refined parameters and the one or more further sets of refined parameters; determining one or more further sampling locations in the area of interest based on the further uncertainty map; receiving one or more further sample measurements corresponding to the one or more further sampling locations; and refining at least one of the first set of refined parameters and at least one further sets of refined parameters based on the one or more sample measurements to generate a further set of refined parameters for the soil mapping model.

4. The method according to claim 3 comprising mapping at least one location in the area of interest to at least one predicted value of at least one of the one or more soil characteristics, the at least one predicted value based on at least one of the further sets of refined parameters for the soil mapping model.

5. The method according to claim 3 wherein iteratively generating the one or more further sets of refined parameters comprises determining a measure of model uncertainty based on at least one of the uncertainty maps; and halting generation of further refined parameters based on the measure of model uncertainty.

6. The method according to claim 5 wherein the measure of model uncertainty comprises an average of measures of uncertainty for the plurality of locations in the area of interest.

7. The method according to claim 5 wherein halting generation of further refined parameters based on the measure of model uncertainty comprises determining that the measure of model uncertainty is less than a threshold uncertainty value.

8. The method according to claim 1 wherein: the plurality of locations mapped by the uncertainty map comprises a plurality of regions in the area of interest; and determining the one or more sampling locations comprises at least one of: random sampling, uniform sampling, stratified sampling, and conditioned Latin hypercube sampling, and orthogonal sampling in the at least one region.

9. The method according to claim 8 wherein the method determining the one or more sampling locations comprises determining, for each one of the plurality of regions, a measure of aggregate uncertainty for the region; and selecting the at least one region based on the measures of aggregate uncertainty.

10. The method according to claim 1 wherein determining one or more sampling locations in the area of interest comprises selecting a first one of a plurality of candidate sampling locations, the first candidate sampling location being mapped to a measure of uncertainty by the uncertainty map at least as great as measures of uncertainty mapped to by any other candidate sampling location of the plurality of candidate sampling locations.

11. The method according to claim 1 wherein receiving the one or more sample measurements corresponding to the one or more sampling locations comprises generating a request for the one or more sample measurements of soil at the one or more sampling locations.

12. The method according to claim 1 comprising measuring soil at the one or more sampling locations to generate the one or more sample measurements.

13. The method according to claim 12 wherein measuring soil at the one or more sampling locations comprises generating the one or more sample measurements by proximal sensing.

14. The method according to claim 13 wherein generating the one or more sample measurements by proximal sensing comprises inserting a proximal sensor into soil at the one or more sampling locations.

15. The method according to claim 1 wherein the one or more soil characteristics comprise a measure of soil carbon content.

16. The method according to claim 15 wherein the measure of soil carbon content comprises a measure of soil organic carbon content.

17. The method according to claim 1 wherein the one or more covariates comprise at least one of: soil type, soil elevation, one or more climate measurements, one or more land uses, one or more farming practices, and one or more optical measurements.

18. The method according to claim 17 wherein the one or more optical measurements comprise satellite imagery.

19. The method according to claim 1 wherein: the soil mapping model comprises a machine learning model, the machine learning model configured to receive the one or more covariates as input and to generate predictions for the one or more soil characteristics as output, the predictions associated with a measure of confidence; and determining an uncertainty map for the soil mapping model comprises, for each of the plurality of locations, generating a prediction for the location and determining the measure of uncertainty for the location based on the measure of confidence for the prediction.

20. The method according to claim 19 wherein the machine learning model comprises at least one of: a neural network and a random forest.

21. The method according to claim 1 wherein: the soil mapping model comprises an ensemble of soil sub-models; determining an uncertainty map for the soil mapping model comprises, for each of the plurality of locations, generating a prediction for the one or more soil characteristics at the location by each of the soil sub-models, thereby generating a plurality of predictions for the location, and determining the measure of uncertainty for the location based on the plurality of predictions.

22. The method according to claim 21 wherein a first soil sub-model of the ensemble comprises initial parameters trained over a first training dataset and a second soil sub-model of the ensemble comprises initial parameters trained over a second training dataset, the first and second datasets being disjoint.

23. The method according to claim 22 wherein the first training dataset comprises images of the area of interest captured from an altitude of no more than 100 km and the second training dataset comprises images of the area of interest captured from an altitude of no less than 100 km.

24. The method according to claim 23 wherein the first training dataset comprises images captured by aircraft and the second training dataset comprises images captured by satellite.

25. The method according to claim 22 wherein the first training dataset comprises images having a first spatial density and the second training dataset comprises images having a second spatial density less than the first spatial density, such that an element of the second training dataset corresponds spatially to a plurality of elements of the first training dataset.

26. A method for mapping soil characteristics, the method performed by a processor and comprising: mapping at least one location in an area of interest to at least one predicted value of at least one soil characteristics by a soil mapping model, the at least one predicted value based on a set of refined parameters for the soil mapping model trained according to any one of claims 1 to 25.

27. A computer system comprising: one or more processors; and a memory storing instructions which cause the one or more processors to perform operations comprising the method of any one of claims 1 to 26.

Description:
SYSTEMS AND METHODS FOR SOIL MAPPING

[0001] This application claims priority to, and the benefit of, United States provisional patent application No. 63/169035 filed 31 March 2021, the entirety of which is incorporated by reference herein for all purposes.

Technical Field

[0002] The present disclosure relates generally to systems and methods for soil mapping, and particularly to systems and methods for determining sampling locations for measuring soil characteristics.

Background

[0003] Soil maps provide information about soil properties over an area of interest and are used for purposes such as land evaluation, agricultural planning, environmental protection assessments, etc. Developing a soil map generally involves characterizing soil properties across the area of interest based on a collection of soil measurements (e.g. physical soil samples rigorously characterized via laboratory analysis) and/or environmental data (e.g. elevation, climate, satellite imagery, and so on). The former tend to provide more accurate measures of soil properties but can be challenging to obtain in sufficient quantity and quality across a large area of interest.

[0004] Although soil maps can be derived directly from high-quality, quantitative soil measurements, they are more often created using digital soil mapping techniques in which environmental variables are correlated with a small number of soil measurements. The environmental variables may then be used to predict soil characteristics across the area of interest relatively more expediently and/or at lower cost than using solely high-quality, quantitative soil measurements, although generally at the expense of accuracy of the resulting soil map; for instance, soil maps are often inaccurate in areas dissimilar to those where quantitative soil data was collected. Models for correlating soil characteristics and environmental variables include kriging, geographically weighted regression, random forest, and neural networks.

[0005] Some soil mapping techniques use specific sampling approaches to determine locations from which to collect a small number of soil measurements to form a basis for generating predictions from environmental data. Sampling approaches include random sampling, uniform sampling, stratified sampling, conditioned Latin hypercube sampling, and orthogonal sampling.

A suitable sampling approach can, in appropriate circumstances, reduce the number of high- quality, quantitative soil measurements required to achieve a certain level of performance and/or improve accuracy of the soil map relative to less suitable sampling measurements.

[0006] There is a general desire for systems and methods for determining sampling locations for measuring soil characteristics to generate soil maps efficiently and/or accurately.

|0007] The foregoing examples of the related art and limitations related thereto are intended to be illustrative and not exclusive. Other limitations of the related art will become apparent to those of skill in the art upon a reading of the specification and a study of the drawings.

Summary

[0008] The following embodiments and aspects thereof are described and illustrated in conjunction with systems, tools and methods which are meant to be exemplary and illustrative, not limiting in scope. In various embodiments, one or more of the above-described problems have been reduced or eliminated, while other embodiments are directed to other improvements.

[0009] One aspect of the invention provides systems and methods for training a soil mapping model. The system comprises one or more processors and a memory storing instructions which cause the one or more processors to perform operations comprising the method. The method is performed by a processor and comprises: training a set of initial parameters for a soil mapping model relating one or more soil characteristics to one or more covariates over an area of interest based on an initial training dataset; determining an uncertainty map for the soil mapping model over at least a portion of the area of interest based on the initial parameters, the uncertainty map mapping each of a plurality of locations in the area of interest to a measure of uncertainty in at least one of the one or more soil characteristics; determining one or more sampling locations in the area of interest based on the uncertainty map; receiving one or more sample measurements corresponding to the one or more sampling locations; and refining the set of initial parameters of the soil mapping model based on the one or more sample measurements to generate a first set of refined parameters for the soil mapping model.

[0010] In some embodiments, the method comprises mapping at least one location in the area of interest to at least one predicted value of at least one of the one or more soil characteristics, the at least one predicted value based on at least one of: the first set of refined parameters and a further set of refined parameters based on the first set of refined parameters for the soil mapping model.

[0011] In some embodiments, the method comprises iteratively generating one or more further sets of refined parameters, each set of refined parameters generated by: determining a further uncertainty map for the soil mapping model based on at least one of the first set of refined parameters and the one or more further sets of refined parameters; determining one or more further sampling locations in the area of interest based on the further uncertainty map; receiving one or more further sample measurements corresponding to the one or more further sampling locations; and refining at least one of the first set of refined parameters and at least one further sets of refined parameters based on the one or more sample measurements to generate a further set of refined parameters for the soil mapping model.

[0012] In some embodiments, the method comprises mapping at least one location in the area of interest to at least one predicted value of at least one of the one or more soil characteristics, the at least one predicted value based on at least one of the further sets of refined parameters for the soil mapping model.

[0013] In some embodiments, iteratively generating the one or more further sets of refined parameters comprises determining a measure of model uncertainty based on at least one of the uncertainty maps; and halting generation of further refined parameters based on the measure of model uncertainty. In some embodiments, the measure of model uncertainty comprises an average of measures of uncertainty for the plurality of locations in the area of interest. In some embodiments, halting generation of further refined parameters based on the measure of model uncertainty comprises determining that the measure of model uncertainty is less than a threshold uncertainty value.

[0014] In some embodiments the plurality of locations mapped by the uncertainty map comprises a plurality of regions in the area of interest; and determining the one or more sampling locations comprises at least one of: random sampling, uniform sampling, stratified sampling, and conditioned Latin hypercube sampling, and orthogonal sampling in the at least one region.

[0015] In some embodiments, the method determining the one or more sampling locations comprises determining, for each one of the plurality of regions, a measure of aggregate uncertainty for the region; and selecting the at least one region based on the measures of aggregate uncertainty. |0016] In some embodiments, determining one or more sampling locations in the area of interest comprises selecting a first one of a plurality of candidate sampling locations, the first candidate sampling location being mapped to a measure of uncertainty by the uncertainty map at least as great as measures of uncertainty mapped to by any other candidate sampling location of the plurality of candidate sampling locations.

[0017] In some embodiments, receiving the one or more sample measurements corresponding to the one or more sampling locations comprises generating a request for the one or more sample measurements of soil at the one or more sampling locations. In some embodiments, the one or more soil characteristics comprise a measure of soil carbon content. In some embodiments, the measure of soil carbon content comprises a measure of soil organic carbon content. In some embodiments, the one or more covariates comprise at least one of: soil type, soil elevation, one or more climate measurements, and one or more optical measurements. In some embodiments, the one or more optical measurements comprise satellite imagery.

[0018] In some embodiments the soil mapping model comprises a machine learning model, the machine learning model is configured to receive the one or more covariates as input and to generate predictions for the one or more soil characteristics as output, and the predictions are associated with a measure of confidence. Determining an uncertainty map for the soil mapping model comprises, for each of the plurality of locations, generating a prediction for the location and determining the measure of uncertainty for the location based on the measure of confidence for the prediction.

[0019] In some embodiments, the machine learning model comprises at least one of: a neural network and a random forest.

[0020] In some embodiments: the soil mapping model comprises an ensemble of soil sub models; determining an uncertainty map for the soil mapping model comprises, for each of the plurality of locations, generating a prediction for the one or more soil characteristics at the location by each of the soil sub-models, thereby generating a plurality of predictions for the location, and determining the measure of uncertainty for the location based on the plurality of predictions. In some embodiments, a first soil sub-model of the ensemble comprises initial parameters trained over a first training dataset and a second soil sub-model of the ensemble comprises initial parameters trained over a second training dataset, the first and second datasets being disjoint. |0021 ] In some embodiments, the first training dataset comprises images of the area of interest captured from an altitude of no more than 100 km and the second training dataset comprises images of the area of interest captured from an altitude of no less than 100 km. In some embodiments, the first training dataset comprises images captured by aircraft and the second training dataset comprises images captured by satellite. In some embodiments, the first training dataset comprises images having a first spatial density and the second training dataset comprises images having a second spatial density less than the first spatial density, such that an element of the second training dataset corresponds spatially to a plurality of elements of the first training dataset.

[0022] One aspect of the invention provides systems and methods for mapping soil characteristics. The system comprises one or more processors and a memory storing instructions which cause the one or more processors to perform operations comprising the method. The method is performed by a processor and comprises: mapping at least one location in an area of interest to at least one predicted value of at least one soil characteristics by a soil mapping model, the at least one predicted value based on a set of refined parameters for the soil mapping model trained according to any one of the training methods disclosed herein.

[0023] In addition to the exemplary aspects and embodiments described above, further aspects and embodiments will become apparent by reference to the drawings and by study of the following detailed descriptions.

[0024] Exemplary embodiments are illustrated in referenced figures of the drawings. It is intended that the embodiments and figures disclosed herein are to be considered illustrative rather than restrictive.

[0025] Figure 1 A shows schematically an exemplary system for training a soil mapping model in a first mode of operation for training a set of initial parameters for the soil mapping model.

[0026] Figure IB shows schematically the exemplary system of Figure 1A in a second mode of operation for refining the parameters of the soil mapping model.

[0027] Figure 2 is a flowchart of an exemplary method for training a soil mapping model, such as the soil mapping model trained by the system of Figures 1A and IB. |0028] Figure 3 shows a first exemplary operating environment that includes at least one computing system for performing methods described herein, such as the method of Figure 2.

|0029] Throughout the following description specific details are set forth in order to provide a more thorough understanding to persons skilled in the art. However, well known elements may not have been shown or described in detail to avoid unnecessarily obscuring the disclosure. Accordingly, the description and drawings are to be regarded in an illustrative, rather than a restrictive, sense.

|0030] Aspects of the present disclosure provide systems and methods for training a soil mapping model, as well as systems and methods for generating soil maps with such trained soil map models which predict soil characteristics (e.g. carbon content) across an area of interest. In at least one embodiment, the training method involves refining the parameters of a soil map model by estimating the soil mapping model’s uncertainty at various locations across an area of interest, selecting high-uncertainty areas, identifying sampling locations within those areas for further samples to be collected, and further training the parameters of the soil mapping model based on the newly-collected samples. Several techniques and suitable models for estimating uncertainty are disclosed.

[0031] By incorporating measures of uncertainty into the training process and collecting soil samples responsively to uncertainty, model performance can, in at least some circumstances, be improved with relatively fewer samples than might otherwise be obtained. This can be advantageous in many contexts; one context of particular interest is the collection of physical soil samples and analysis of such samples in a laboratory. Such samples can greatly reduce uncertainty but are often costly to obtain; the present disclosure’s models and training techniques can allow for the targeting of such samples to high-uncertainty areas, with consequent effects on the quantity of training data required and/or on the performance of the soil mapping model.

[0032] The terms “uncertainty”, “measure of uncertainty”, “uncertainty map”, and related terms, as used in this disclosure, include corresponding terms relating to certainty, confidence, or the like. For example, an embodiment may make use of a “measure of certainty” or generate a “confidence map” without departing from the present disclosure. Where uncertainty is compared between two things, such as where something is said to have “high uncertainty” or “greater uncertainty”, this includes the meaning of having “low certainty” or “less certainty”, respectively (and/or “low confidence” or “less confidence” or the like). Similarly, terms such as “low uncertainty” or “less uncertainty” include the meaning of having “high certainty” or “greater certainty”, respectively (and/or “high confidence” or “greater confidence” or the like). To aid readability, and to avoid repetitively reminding the reader that (e.g.) measures of certainty, confidence, or the like may alternatively or additionally be used, the present disclosure and appended claims generally use “uncertainty” and related terms without loss of generality, except where the context requires otherwise.

A System for Training a Soil Mapping Model

[0033] Figure 1A shows schematically an exemplary system 100 for training a soil mapping model 124 in a first mode of operation for training a set of initial trained parameters 122a (sometimes called “initial parameters” herein for convenience). Figure IB shows system 100 in a second mode of operation for refining parameters of a soil mapping model such as soil mapping model 124. Figures 1A and IB are collectively and individually referred to herein as “Figure 1”. System 100 comprises a computing system as described in greater detail elsewhere herein. System 100 may interact with various inputs and outputs (shown in dashed lines), which are not necessarily part of system 100, although in some embodiments some or all of these inputs and outputs are part of system 100 (e.g. in an example embodiment, trained parameters 122a and/or 122b are part of system 100).

[0034] System 100 comprises a sampler 110a and a soil modeler 120, which in the first mode of operation shown in Figure 1A train a set of initial trained parameters 122a for soil mapping model 124. Sampler 110a receives a representation of an area of interest 102 and determines one or more sampling locations from which to obtain sample measurements 114a. Any suitable sampling approach may be used, such as random sampling, uniform sampling, stratified sampling, and/or conditioned Latin hypercube sampling. For example, sampler 110a may receive a map of an area together with ancillary variables describing features of the map (e.g. elevation, slope, etc.), and generate from that data one or more sampling locations based on a conditioned Latin hypercube sampling approach, e.g. as described by Minasny et al, A conditioned Latin hyper cube method for sampling in the presence of ancillary information (2006), Comput.

Geosci. 32, 9 (November, 2006), 1378-1388, doi:10.1016/j.cageo.2005.12.009. Sampler 110a may generate a request for one or more sample measurements 114a at the one or more sampling locations and receive sample measurements 114a from a user, another system, a sensor, etc. Sample measurements 114a may comprise any suitable measurements for use by soil modeler 120, such as spectral samples (e.g. obtained by satellite, aircraft, and/or terrestrial spectral sensors, such as imaging devices), physical measurements of physical soil samples (e.g. which may be chemically or otherwise analyzed to obtain ground-truth values for properties of interest such as carbon content, soil type, moisture, etc.).

[0035] Sample measurements 114a may be obtained by proximal and/or remote sensing. For example, sample measurements 114a may comprise physical measurements of physical soil samples, spectral data obtained by a proximal sensor (e.g. on a hand-held device, mounted on a terrestrial vehicle such as a rover, and/or otherwise proximally obtained), and/or other proximal sensor data of soil. Proximal sensor data may comprise data obtained of surface soil (e.g. by collecting surface soil and/or placing a sensor proximal to surface soil to collect measurements) and/or of sub-surface soil (e.g. by collecting sub-surface soil and/or by inserting a sensor into soil to collect measurements). Proximal and/or remote sensing may comprise, for example, the systems and methods described in greater detail in US provisional patent applications nos. 63/285,942 and 63/291,151, which are incorporated herein by reference for all purposes.

[0036] Soil modeler 120 receives sample measurements 114a and generates a soil mapping model 124 having trained parameters 122a. Soil modeler 120 may generate any suitable type of soil mapping model 124 for relating one or more soil characteristics (e.g. carbon content, soil type, moisture) to one or more covariates over an area of interest. Soil characteristics may include, for example, features such as soil carbon content (e.g. soil organic carbon content), soil type, and/or moisture. Covariates may include, for example, slope, elevation, soil type, climate, image data (e.g. satellite imagery, drone imagery, terrestrial imagery, visual spectrum imagery, and/or infrared imagery), land use (e.g. agricultural, residential, forest, etc.), and/or farming practices (e.g. cover cropping, crop rotations, etc.). For example, soil modeler 120 may generate a linear estimation model which transforms covariate values to predicted values for the one or more soil characteristics based on weights trained by soil modeler 120 via a suitable training technique such as kriging and/or regression (e.g. geographically weighted regression). Alternatively, or in addition, soil modeler 120 may generate a non-linear model such as a neural network and/or a random forest having weights trained by soil modeler 120 via a suitable training technique such as backpropagation and/or induction/pruning. In at least some embodiments trained parameters 122a comprise such model weights. Examples of soil mapping models 124 and techniques for generating such models are provided by Mcbratney et al, On Digital Soil Mapping (2003), Geoderma, 117, 3-52, doi:10.1016/S0016-7061(03)00223-4. |0037] System 100 refines trained parameters 122 (which may comprise initial parameters 122a and/or previously-refined parameters such as refined parameters 122b) of soil mapping model 124 in the second mode of operation, thereby training a set of refined trained parameters 122b. System 100 further comprises an inference engine 130 for generating an uncertainty map 134 based on soil mapping model 124 (e.g. based on the parameters of model 124, e.g. initial parameters 122a). Uncertainty map 134 maps locations in area of interest 102 to a measure of uncertainty.

|0038] Uncertainty may be estimated in any suitable way. In some embodiments, uncertainty is estimated based on a distribution in the output of the model. For example, in an embodiment where soil mapping model generates a predicted value for a soil characteristic at a given location, inference engine 130 may generate an uncertainty map 134 which maps that location to a measure of uncertainty in the predicted value for the soil characteristic. For instance, if soil mapping model 124 predicts a value c for soil carbon content at a location x, and the value c is associated with (e.g. a mean and/or mode of) a distribution of predicted values having variance v, then uncertainty map 134 may map location to a value based on v - e.g. v itself and/or a combination of variances for a plurality of predicted values for soil characteristics (e.g. soil carbon content, moisture, soil type, etc.). As another example, uncertainty may be estimated by varying input parameters and measuring changes in model output (e.g. predictions); uncertainty may be estimated based on such variation, e.g. based on variance, standard deviation, average change, and/or any other suitable estimate. As yet a further example, uncertainty may be estimated via Monte Carlo methods, e.g. in the case where the model comprises a least-squares model.

[0039] As another example, soil mapping model 124 may comprise an ensemble of distinct soil mapping sub-models which produce different predicted values for a given soil characteristic; inference engine 130 may estimate uncertainty at a given location based on a variance (e.g. based on a standard deviation) of such values at the location. In some embodiments, the sub-models have distinct structure; e.g. one may comprise a linear model trained via kriging and another may comprise a neural network; such models may be trained over the same or different training datasets. In some embodiments, the sub-models are trained over different training datasets, such as where one is trained over images captured by a drone (e.g. at an altitude of no more than 100 km) and another is trained over images captured by satellite (e.g. at an altitude of no less than 100 km), and/or where one is trained over a first dataset having a certain spatial density and another is trained over a second dataset having a relatively denser spatial density (e.g. as may be the case when training one sub-model over high-resolution drone imagery with high spatial density, e.g. ~10 cmVpixel, and training another sub-model over satellite imagery with relatively lower spatial density, e.g. 10 m 2 /pixel); such models may comprise the same or different model structures (e.g. all neural networks, all linear estimation models, other models, and/or combinations thereof). In some embodiments, an ensemble soil mapping model 124 comprises a random forest model with decision trees for sub-models; optionally, a sub-model may comprise a random forest.

|0040] Sampler 110b determines one or more sampling locations in the area of interest based on the uncertainty map. In some embodiments sampler 110a and sampler 110b are provided by a single sampler and/or sampler 110b may comprise sampler 110a. Sampler 110a and sampler 110b may be referred to collectively and individually as “sampler 110”.

[00 1 ] In some embodiments, area of interest 102 is divided into regions and sampler 110b selects one or more regions based on the uncertainty map - e.g. by selecting one or more regions having the greatest uncertainty relative to other regions. Area of interest 102 may be divided into regions by sampler 110b, by another element of system 100, and/or via a predetermined division. In some embodiments area of interest 102 is divided into rectilinear regions of predetermined number and/or size. In some embodiments area of interest 102 is divided into regions by clustering locations in area of interest 102 based on predicted soil characteristics, measures of uncertainty, and/or covariate measurements such that each region comprises a cluster. Sampler 110b may determine the uncertainty of a region based on uncertainty of one or more locations within the region; for instance, sampler 110b may determine the uncertainty of the region based on an aggregate measure, such as the mean, median or modal uncertainty of locations within the region. Sampler 110b may determine one or more sampling locations within the selected region(s) based on any suitable method. For example, sampler 110b of Figure IB may optionally comprise sampler 110a of Figure 1A and/or may determine one or more sampling locations within the region substantially as described above with respect to sampler 110a, e.g. via random sampling, uniform sampling, stratified sampling, conditioned Latin hypercube sampling, and/or orthogonal sampling.

[0042] In some embodiments, sampler 110b determines one or more sampling locations by selecting sampling locations with high uncertainty. For example, sampler 110b may select one or more sampling locations from a set of candidate sampling locations, such as the set of all sampling locations in area of interest 102, sampling locations in certain regions (e.g. selected as described above), certain locations in each region (e.g. the location with greatest uncertainty in each region), and/or any other suitable set of candidate sampling locations. Sampler 110b may select from the set of candidate sampling locations one or more locations which are mapped to a measure of uncertainty by uncertainty map 134 which is at least as great as measures of uncertainty mapped to by any other (non-selected) candidate sampling location.

[0043] Sampler 110b may generate a request for one or more sample measurements 114b at the one or more sampling locations determined by sampler 110b. System 100 may receive sample measurements 114a from a user, another system, a sensor, etc. Sample measurements 114b may comprise any suitable measurements for use by soil modeler 120, as described elsewhere herein (e.g. with reference to sample measurements 114a). Sample measurements 114b characterize locations in area of interest 102 of high uncertainty (relative to other locations in area of interest 102) and are used by soil modeler 120 to refine parameters of soil mapping model 124 as described in greater detail below and elsewhere herein. In suitable circumstances, such refinement can efficiently reduce uncertainty in uncertainty map 134 and correspondingly improve accuracy and/or performance of soil mapping model 124.

[0044] Soil modeler 120 receives sample measurements 114b and refines parameters 122 to generate refined parameters 122b for soil mapping model 124. For example, soil modeler 120 may train soil mapping model 124 “from scratch” over a combination of sample measurements 114a and 114b to yield refined parameters 122b, e.g. by recalculating the weights of a linear estimation model over all sample measurements 114a and 114b collected thus far. As another example, soil modeler 120 may modify trained parameters 122 of soil mapping model 124 (which may comprise, e.g., initial parameters 122a and/or refined parameters 122b generated by a previous iteration of refinement) based on sample measurements 114b, e.g. by backpropagating over the weights of a neural network soil mapping model 124 based on sample measurements 114b and/or by otherwise retraining soil mapping model 124. For the purposes of this disclosure and the appended claims, all such generation of refined parameters 122b is considered to be “refining” parameters 122 whether or not parameters 122 are explicitly used by soil modeler 120, since samples 114b are themselves based on parameters 122 (via uncertainty map 134).

[0045] System 100 may iteratively refine parameters of soil model 124, e.g. by further refining refined parameters 122b. For instance, refined parameters 122b may take the position of parameters 122 in Figure IB, such that inference engine 130 determines a further uncertainty map 134 based on refined parameters 122b, sampler 110b determines one or more further sampling locations in area of interest 102 based on the further uncertainty map 134, and soil modeler 120 receives one or more further sample measurements 114b corresponding to the one or more further sampling locations and generates further refined parameters 122b based on further sample measurements 114b. Such further refined parameters 122b may optionally be further refined as described herein.

[0046] In some embodiments, system 100 iteratively refines parameters 122 until a halting criterion is met. The halting criterion may comprise reaching a predetermined number of iterations, reaching a threshold measure of model uncertainty, and/or any other suitable halting criterion. For example, system 100 may determine a measure of model uncertainty based on at least one uncertainty maps 134, e.g. based on an average uncertainty across at least a portion of an uncertainty map. (For instance, measures of uncertainty at all locations in an uncertainty map 134 may be averaged to yield a measure of model uncertainty for soil mapping model 124 under trained parameters 122.) In some embodiments, system 100 halts iteration upon the measure of model uncertainty falling to below a threshold. In some embodiments, system 100 halts iteration upon a determination that the measure of model uncertainty is converging (e.g. remaining substantially unchanged between iterations, which may comprise changing by less than a threshold value, such as 1%, between iterations).

A Method for Training a Soil Mayyins Model

[0047] Figure 2 is a flowchart of an exemplary method 200 for training a soil mapping model, such as soil mapping model 124 trained by system 100 of Figures 1A and IB. Method 200 is performed by a processor, such as a processor of computing system 300, described elsewhere herein.

[0048] Acts 202, 204, 206, and 208 are optional steps which generally relate to training a set of initial parameters for the soil mapping model. In some embodiments, the area of interest, initial sampling locations, initial training dataset, and/or initial parameters are predetermined or otherwise made available to the processor, in which case one or more of acts 202, 204, 206, and 208 may be omitted. In some such embodiments, the method commences at act 212 (and/or act 216, e.g. if the uncertainty map is also provided to the processor). Acts 202, 204, 206, and 208 are described in greater detail below.

[0049] At act 202, the processor defines an area of interest, such as by receiving a representation of the area of interest from a user, retrieving such a representation from a data store and/or from another system, generating the area of interest based on user input, and/or otherwise defining the area of interest. At act 204, the processor determines one or more initial sampling locations in the area of interest, e.g. by determining sampling locations by sampler 110a as described elsewhere herein. For instance, the one or more initial sampling locations may be determined according to conditioned Latin hypercube optimization across the area of interest. At act 206, the processor determines ground-truth soil characteristics at the one or more initial sampling locations, for example by obtaining and analyzing physical soil samples, imagery, and/or other information for the sampling locations, e.g. as described above with reference to sample measurements 114a, thereby providing an initial training dataset. At act 208, the processor trains a set of initial parameters for the soil mapping model based on the initial training dataset, e.g. by training via soil modeler 120 as described above. The soil mapping model is trained to relate one or more soil characteristics in the initial training dataset to one or more covariates over the area of interest.

[0050] At act 210, the processor refines parameters of the soil mapping model based on new samples collected at locations selected based on uncertainty in the soil mapping model so as to reduce such uncertainty. Act 210 may comprise various acts, e.g. as shown in Figure 2. For instance, act 210 may comprise act 212. At act 212, the processor determines an uncertainty map for the soil mapping model over at least a portion of the area of interest. The uncertainty map is determined based on the current parameters of the soil mapping model (e.g. initial parameters determined at act 208 and/or otherwise obtained). The uncertainty map maps locations in the area of interest to a measure of uncertainty in at least one of the one or more soil characteristics (e.g. soil carbon). For instance, the processor may generate the uncertainty map as described above with reference to inference engine 130 and the uncertainty map may map uncertainty as described above with reference to uncertainty map 134.

[0051] For the sake of example, let us consider a non-limiting implementation where the soil mapping model comprises a neural network with a set of trained parameters (e.g weights) which generates, for each location in the area of interest, a prediction for a soil characteristic, namely (in this example) soil carbon content. The prediction comprises a probability distribution over values describing the soil characteristic, e.g. via a softmax and/or other suitable transformation. The processor determines a measure of uncertainty for the location based on the probability distribution, e.g. based on a standard deviation, variance, width of a 95% confidence interval, and/or other suitable measure. The locations are, in this example, stored in memory in such a way as to be associated with their corresponding measures of uncertainty, e.g. as key-value pairs in a dictionary data structure, thereby providing an uncertainty map.

[0052] At act 214, the processor optionally decides whether to halt refinement. The processor may determine a halting criterion, e.g. as described above, and halt or continue accordingly. If the processor halts refinement, the method may proceed to act 230, described below. Otherwise, the method continues act 210, e.g. by proceeding to act 216.

[0053] At act 216, the processor determines one or more new sampling locations in the area of interest based on the uncertainty map. For instance, the processor may determine sampling locations as described above with reference to sampler 110b. At act 218, the processor receives one or more new sample measurements corresponding to the one or more new sampling locations, e.g. as described above with reference to sample measurements 114b.

[0054] At act 220, the processor refines the parameters of the soil mapping model (e.g. initial parameters determined at 208, previously refined via earlier iterations of act 220, and/or otherwise obtained) based on the one or more new sample measurements to generate a set of refined parameters for the soil mapping model. For instance, the processor may refine the parameters of the soil mapping model as described above with reference to soil modeler 120, soil model 124, and refined parameters 122b.

[0055] In some embodiments (including the depicted embodiment of Figure 2), method 200 returns from act 220 to act 212. For instance, method 200 may return from act 220 to act 212, determine a new uncertainty map, and (at act 214) determine whether to halt or continue iterating act 210 based on the new uncertainty map. In some embodiments, method 200 proceeds from act 220 to act 230. For instance, the processor may halt iteration of act 210 after a predetermined number of iterations and may check whether the predetermined number of iterations has elapsed following act 220.

[0056] At act 230, the processor maps at least one location in the area of interest to at least one predicted value of at least one of the one or more soil characteristics based on the refined parameters. Act 230 is optional while training the soil mapping model, and may be performed separately from the training acts of method 200 (e.g. all preceding acts). Aspects of act 230 are described in greater detail below. Generating Soil Mays with Trained Soil Mapping Models

[0057] The soil mapping model may be used to map the area of interest based on the refined parameters. For example, a processor of a computing system may receive one or more covariate values and transform the covariate values with the soil mapping model, based on refined parameters of the soil mapping model (e.g. as generated at act 220), into a predicted soil characteristic value. Covariates may include any suitable covariates such as those described elsewhere herein, e.g. elevation and/or imagery (e.g. infrared reflectance imagery). Soil characteristics may include any suitable characteristics such as those described elsewhere herein, e.g. soil carbon content. The processor may generate predicted values for soil characteristics at one, some, and/or all locations in the area of interest and/or portions thereof in any manner suitable for the soil mapping model. For instance, the predicted values may be generated via inference with a neural network soil mapping model, via linear transformation with a linear estimation model, etc. The processor may, but does not necessarily, generate an uncertainty map as part of this mapping procedure. The processor does not necessarily perform any of the training acts of method 200 (i.e. all acts other than act 230), although these may optionally also be performed.

Example System Implementation

[0058] Figure 3 illustrates a first exemplary operating environment 300 that includes at least one computing system 302 for performing methods described herein. System 302 may be any suitable type of electronic device, such as, without limitation, a mobile device, a personal digital assistant, a mobile computing device, a smart phone, a cellular telephone, a handheld computer, a server, a server array or server farm, a web server, a network server, a blade server, an Internet server, a work station, a mini-computer, a mainframe computer, a supercomputer, a network appliance, a web appliance, a distributed computing system, multiprocessor systems, or combination thereof. System 302 may be configured in a network environment, a distributed environment, a multi-processor environment, and/or a stand-alone computing device having access to remote or local storage devices.

[0059] A computing system 302 may include one or more processors 304, a communication interface 306, one or more storage devices 308, one or more input and output devices 312, and a memory 310. A processor 304 may be any commercially available or customized processor and may include dual microprocessors and multi-processor architectures. The communication interface 306 facilitates wired or wireless communications between the computing system 302 and other devices. A storage device 308 may be a computer-readable medium that does not contain propagating signals, such as modulated data signals transmitted through a carrier wave. Examples of a storage device 308 include without limitation RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital versatile disks (DVD), or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage. In at least some embodiments such embodiments of storage device 308 do not contain propagating signals, such as modulated data signals transmitted through a carrier wave. There may be multiple storage devices 308 in the computing system 302. The input/output devices 312 may include a keyboard, mouse, pen, voice input device, touch input device, display, speakers, printers, etc., and any combination thereof.

[0060] The memory 310 may be any non-transitory computer-readable storage media that may store executable procedures, applications, and data. The computer-readable storage media does not pertain to propagated signals, such as modulated data signals transmitted through a carrier wave. It may be any type of non-transitory memory device (e.g., random access memory, read only memory, etc.), magnetic storage, volatile storage, non-volatile storage, optical storage,

DVD, CD, floppy disk drive, etc. that does not pertain to propagated signals, such as modulated data signals transmitted through a carrier wave. The memory 310 may also include one or more external storage devices or remotely located storage devices that do not pertain to propagated signals, such as modulated data signals transmitted through a carrier wave.

[00 1] The memory 310 may contain instructions, components, and data. A component is a software program that performs a specific function and is otherwise known as a module, program, engine, and/or application. The memory 310 may include an operating system 314, a sampler 316, a covariate modeler 318, an inference engine 319, training data 320 (e.g. one or more plant images and/or ground-truth structural representations), trained parameters 322 (e.g. comprising parameters 122), one or more input images 324 (e.g. plant images), and other applications and data 330. Depending on the embodiment, some such elements may be wholly or partially omitted. For example, an embodiment intended for inference (e.g. via method 200 and/or system 302) and which has trained parameters 322 might omit training data 320, decoder 318, discriminator 319, and/or some or all trained parameters 322 other than those required by sampler 316. As another example, memory 310 may include no images 324 prior to starting inference (e.g. via system 302 and/or method 200) and may receive images (one-by-one and/or in batches) via an input device 312 and/or from a storage device 308. |0062] While a number of exemplary aspects and embodiments have been discussed above, those of skill in the art will recognize certain modifications, permutations, additions and sub combinations thereof. It is therefore intended that the following appended claims and claims hereafter introduced are interpreted to include all such modifications, permutations, additions and sub-combinations as are within their true spirit and scope.