Title:
METHOD AND SYSTEM FOR ACCURATE LONG TERM SIMULTANEOUS LOCALIZATION AND MAPPING WITH ABSOLUTE ORIENTATION SENSING
Document Type and Number:
WIPO Patent Application WO/2018/089703
Kind Code:
A1
Abstract:
Described herein are embodiments of a method and system that uses a vertical or upward facing imaging sensor to compute vehicle attitude, orientation, or heading and combines the computed vehicle attitude, orientation, or heading with range bearing measurements from an imaging sensor, LiDAR, sonar, etc., to features in the vicinity of the vehicle to compute accurate position and map estimates.

Inventors:
AGARWAL SAURAV (US)
CHAKRAVORTY SUMAN (US)
Application Number:
PCT/US2017/060954
Publication Date:
May 17, 2018
Filing Date:
November 09, 2017
Assignee:
TEXAS A & M UNIV SYS (US)
International Classes:
B60R11/04; G01C21/16; G01C21/32; G05D1/00
Domestic Patent References:
WO2016130719A2 2016-08-18
WO2016077703A1 2016-05-19
Foreign References:
US20160070265A1 2016-03-10
US20130048722A1 2013-02-28
US20160033272A1 2016-02-04
US20040168148A1 2004-08-26
Attorney, Agent or Firm:
FALESKI, Thaddeus, J. (US)
Claims:
CLAIMS

What is claimed is:

1. A mapping system comprising:

an upward facing sensor;

a range bearing sensor; and

a processor in communication with the upward facing sensor and the range bearing sensor, the processor configured to:

determine an attitude of the mapping system based upon first data received from the upward facing sensor;

determine a location of local landmarks based upon second data received from the range bearing sensor; and

determine a location of the mapping system based upon the attitude and the location of the local landmarks.

2. The mapping system of claim 1, wherein the first data comprises bearing measurements to one or more features.

3. The mapping system of claim 2, wherein the upward facing sensor comprises a camera.

4. The mapping system of claim 1 further comprising an inertial sensor, wherein the processor is further configured to determine a relative pose based upon a scan match, wherein the scan match comprises inputs of the second data and a third data received from the inertial sensor.

5. The mapping system of claim 4, wherein the processor configured to determine the location of the mapping system comprises the processor configured to fuse a result of the scan match with the attitude.

6. The mapping system of claim 5, wherein the processor configured to fuse the result of the scan match with the attitude comprises the processor configured to input the result of the scan match and the attitude to a Kalman filter.

7. The mapping system of claim 1, wherein the processor is further configured to update a map based upon the location of the mapping system and the location of the local landmarks.

8. An autonomous vehicle comprising:

an upward facing sensor;

a range bearing sensor; and

a mapping system in communication with the upward facing sensor and the range bearing sensor, the mapping system comprising a processor configured to:

determine an attitude of the autonomous vehicle based upon first data received from the upward facing sensor;

determine a location of local landmarks based upon second data received from the range bearing sensor; and

determine a location of the autonomous vehicle based upon the attitude and the location of the local landmarks.

9. The autonomous vehicle of claim 8, wherein the first data comprises bearing measurements to one or more features.

10. The autonomous vehicle of claim 9, wherein the upward facing sensor comprises a camera.

11. The autonomous vehicle of claim 8 further comprising an inertial sensor, wherein the processor is further configured to determine a relative pose based upon a scan match, wherein the scan match comprises inputs of the second data and a third data received from the inertial sensor.

12. The autonomous vehicle of claim 11, wherein the processor configured to determine the location of the autonomous vehicle comprises the processor configured to fuse a result of the scan match with the attitude.

13. The autonomous vehicle of claim 12, wherein the processor configured to fuse the result of the scan match with the attitude comprises the processor configured to input the result of the scan match and the attitude to a Kalman filter.

14. The autonomous vehicle of claim 8, wherein the processor is further configured to update a map based upon the location of the autonomous vehicle and the location of the local landmarks.

15. A method for mapping, the method comprising:

determining an attitude of a mapping system based upon first data received from an upward facing sensor;

determining a location of local landmarks based upon second data received from a range bearing sensor; and

determining a location of the mapping system based upon the attitude and the location of the local landmarks.

16. The method of claim 15, wherein the first data comprises bearing measurements to one or more features.

17. The method of claim 15 further comprising determining a relative pose based upon a scan match, wherein the scan match comprises inputs of the second data and a third data received from an inertial sensor.

18. The method of claim 17, wherein determining the location of the mapping system comprises fusing a result of the scan match with the attitude.

19. The method of claim 18, wherein fusing the result of the scan match with the attitude comprises inputting the result of the scan match and the attitude to a Kalman filter.

20. The method of claim 15 further comprising updating a map based upon the location of the mapping system and the location of the local landmarks.

Description:
METHOD AND SYSTEM FOR ACCURATE LONG TERM SIMULTANEOUS LOCALIZATION AND MAPPING WITH ABSOLUTE ORIENTATION SENSING

CROSS-REFERENCE TO RELATED APPLICATIONS

[0001] The present application claims benefit of U.S. Provisional Patent Application No. 62/419,624 filed November 9, 2016 by Saurav Agarwal, et al., and entitled, "Method and System for Accurate Long Term Localization and Navigation Using Onboard Sensors," which is incorporated herein by reference as if reproduced in its entirety.

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

[0002] This disclosure was made with U.S. Government support under HS-1217991 awarded by the National Science Foundation. The government has certain rights in the embodiments of the disclosure.

TECHNICAL FIELD

[0003] The present disclosure relates to robotic mapping and, in particular, to accurate long term simultaneous localization and mapping with absolute orientation sensing.

BACKGROUND

[0004] When a robot is not given a priori knowledge of its environment, it must use its sensory data and actions to concurrently build a map of its environment and localize itself within its stochastic map; this is referred to as Simultaneous Localization and Mapping (SLAM). In SLAM, estimation errors tend to build up during exploratory motion and are usually negated by revisiting previously seen locations. When a robot returns to a previously visited location, this may be referred to as loop closure. Certain mobile robotics applications, e.g., autonomous cars, precision farming, planetary exploration, etc., may require robots to accurately traverse long trajectories without revisiting previous locations. During long-term point-to-point navigation when global localization information (e.g., GPS) is unavailable or degraded, SLAM is often used. Conventional visual-inertial SLAM localization methods exhibit error of ~0.3%-0.5% of the distance travelled, which may be unsuitable for precision tasks; e.g., for a 25 km trajectory, such error results in 75 m-125 m position error.

[0005] Some robots may be operated indoors where GPS is unavailable or degraded. For example, material handling robots that move goods (boxes, pallets, etc.) in large warehouses and distribution centers do not have access to GPS satellites. Installing beacons, markers, or guide cables is expensive, and robots are often expected to start without prior knowledge of the map of their operating environment. Further, warehouse environments are highly dynamic due to a mix of industrial vehicles (pallet jacks, forklifts, etc.) and people moving rapidly across large floor spaces; thus a robot must continuously update its knowledge of the map and react to changes in its vicinity.
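The drift figures quoted above for conventional visual-inertial methods can be checked with a few lines of arithmetic. The following is a minimal sketch; the function name `position_error_m` is hypothetical and is used only for illustration.

```python
def position_error_m(trajectory_km: float, drift_percent: float) -> float:
    """Position error in metres for drift given as a percentage of distance travelled."""
    return trajectory_km * 1000.0 * drift_percent / 100.0


# Figures quoted in the text: 0.3%-0.5% drift over a 25 km trajectory.
low = position_error_m(25.0, 0.3)
high = position_error_m(25.0, 0.5)
print(f"{low:.0f} m to {high:.0f} m")
```

The same function may be applied to any trajectory length and drift rate to judge whether a given localization method is suitable for a precision task.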

[0006] In some approaches, robots may be driven manually to gather and store measurements (e.g., laser scans, visual landmarks etc.). This data is then processed offline to build detailed maps. Computed maps are subsequently used for positioning and navigation by taking measurements to known features in the world.

[0007] Existing SLAM techniques correct estimation drift by relying on loop closure, i.e., revisiting previously seen locations and correctly associating sensor information to data previously stored in the map. There are at least two problems associated with loop closure. First, loop closure is sensitive to data association accuracy: wrong data association can lead to catastrophic failure of the SLAM system. Second, data association reliability is limited by localization accuracy. Thus, localization drift may cause map quality to degrade as the scale of the environment increases.

SUMMARY

[0008] Described herein are embodiments of a method and system that uses a vertical (e.g., upward facing) imaging sensor to compute vehicle attitude (e.g., orientation or heading) and combines the computed vehicle attitude with range bearing measurements (from an imaging sensor, LiDAR, sonar, etc.) to features in the vicinity of the vehicle to compute accurate position and map estimates.

[0009] In an embodiment, a mapping system is described. The mapping system may comprise an upward facing sensor; a range bearing sensor; and a processor in communication with the upward facing sensor and the range bearing sensor. The processor may be configured to determine an attitude of the mapping system based upon first data received from the upward facing sensor; determine a location of local landmarks based upon second data received from the range bearing sensor; and determine a location of the mapping system based upon the attitude and the location of the local landmarks. The first data may comprise bearing measurements to one or more features. The upward facing sensor may be a camera. The mapping system may further comprise an inertial sensor, wherein the processor may be further configured to determine a relative pose based upon a scan match, wherein the scan match comprises inputs of the second data and a third data received from the inertial sensor. The processor configured to determine the location of the mapping system may comprise the processor configured to fuse a result of the scan match with the attitude. The processor configured to fuse the result of the scan match with the attitude may comprise the processor configured to input the result of the scan match and the attitude to a Kalman filter. The processor may be further configured to update a map based upon the location of the mapping system and the location of the local landmarks.

[0010] In an embodiment, an autonomous vehicle is described. The autonomous vehicle may comprise an upward facing sensor; a range bearing sensor; and a mapping system in communication with the upward facing sensor and the range bearing sensor. The mapping system may comprise a processor configured to determine an attitude of the autonomous vehicle based upon first data received from the upward facing sensor; determine a location of local landmarks based upon second data received from the range bearing sensor; and determine a location of the autonomous vehicle based upon the attitude and the location of the local landmarks. The first data may comprise bearing measurements to one or more features. The upward facing sensor may be a camera. The autonomous vehicle may further comprise an inertial sensor, wherein the processor may be further configured to determine a relative pose based upon a scan match, wherein the scan match comprises inputs of the second data and a third data received from the inertial sensor. The processor configured to determine the location of the autonomous vehicle may comprise the processor configured to fuse a result of the scan match with the attitude. The processor configured to fuse the result of the scan match with the attitude may comprise the processor configured to input the result of the scan match and the attitude to a Kalman filter. The processor may be further configured to update a map based upon the location of the autonomous vehicle and the location of the local landmarks.

[0011] In an embodiment, a method for mapping is described. The method may comprise determining an attitude of a mapping system based upon first data received from an upward facing sensor; determining a location of local landmarks based upon second data received from a range bearing sensor; and determining a location of the mapping system based upon the attitude and the location of the local landmarks. The first data may comprise bearing measurements to one or more features. The method may further comprise determining a relative pose based upon a scan match, wherein the scan match comprises inputs of the second data and a third data received from an inertial sensor. Determining the location of the mapping system may comprise fusing a result of the scan match with the attitude. Fusing the result of the scan match with the attitude may comprise inputting the result of the scan match and the attitude to a Kalman filter. The method may further comprise updating a map based upon the location of the mapping system and the location of the local landmarks.

BRIEF DESCRIPTION OF THE DRAWINGS

[0012] Further embodiments of the disclosure will be described with respect to the following figures, in which:

[0013] Figure 1 is a diagram of an embodiment of a robot configured for long term SLAM with absolute orientation sensing;

[0014] Figure 2 is a flow diagram of an embodiment of a method for long term SLAM with absolute orientation sensing;

[0015] Figure 3A is a diagram of an embodiment of robot to feature relative measurement by a robot;

[0016] Figure 3B is a diagram of an embodiment of feature to feature relative measurement by a robot;

[0017] Figure 4 is a diagram of an embodiment of a robot viewing a feature from two poses in robot to feature relative measurement;

[0018] Figure 5A is a graph of error growth in estimate of a last pose as a function of how far a robot moves of an embodiment described herein;

[0019] Figure 5B is a graph of reduction in error growth rate as a number of landmarks increases of an embodiment described herein;

[0020] Figure 5C is a graph of localization error for a feature bank most distant from a starting location before and after loop closure of an embodiment described herein;

[0021] Figure 5D is a graph of localization error in a last pose after loop closure as the trajectory length (number of banks mapped) increases of an embodiment described herein;

[0022] Figure 6A is a diagram of an embodiment of a robot starting to make observations to a first bank of features;

[0023] Figure 6B is a diagram of an embodiment of a robot starting to make relative observations between a second bank of features and a third bank of features;

[0024] Figure 6C is a diagram of an embodiment of a robot moving towards its start location;

[0025] Figure 6D is a diagram of an embodiment of a robot re-observing the first bank of features;

[0026] Figure 7A is a graph of linear error growth in estimate of a last pose as a function of how far a robot moves away from its start of an embodiment described herein;

[0027] Figure 7B is a graph of reduction in error growth rate as a number of landmarks increases of an embodiment described herein;

[0028] Figure 7C is a graph of localization error for a feature bank most distant from a starting location before and after loop closure of an embodiment described herein;

[0029] Figure 7D is a graph of localization error in a last bank after loop closure as the trajectory length (number of banks mapped) increases of an embodiment described herein;

[0030] Figure 8A is a diagram of an embodiment of robot trajectory over 5 square kilometers (km) with a 25.9 km trajectory;

[0031] Figure 8B is a diagram of an embodiment of robot trajectory over 10 square km with a 107.9 km trajectory;

[0032] Figure 9A is a graph of average terminal pose localization error as the bank size increases with the 25.9 km trajectory of an embodiment described herein;

[0033] Figure 9B is a graph of average terminal pose localization error as the bank size increases with the 107.9 km trajectory of an embodiment described herein;

[0034] Figure 10A is a diagram of an embodiment of a camera view of a ceiling; and

[0035] Figure 10B is a diagram of an embodiment of a thresholded image of the ceiling.

DETAILED DESCRIPTION

[0036] The following discussion is directed to various embodiments of the disclosure. Although one or more of these embodiments may be preferred, the embodiments disclosed should not be interpreted, or otherwise used, as limiting the scope of the disclosure, including the claims. In addition, one skilled in the art will understand that the following description has broad application, and the discussion of any embodiment is meant only to be exemplary of that embodiment, and not intended to intimate that the scope of the disclosure, including the claims, is limited to that embodiment.

[0037] Estimation drift during exploration may be caused by robot heading uncertainty. In some approaches, reliable absolute orientation measurements may not be available in SLAM. These approaches may rely on odometry and relative pose or feature measurements to estimate robot orientation and position. Embodiments of the present disclosure may attain an accuracy (i.e., error in position as percentage of distance travelled) of 0.0016% for a 107.9 km trajectory without loops using absolute orientation sensor technology. An instantaneous location and/or heading of a robot may be referred to herein as a pose.
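As a worked illustration of the accuracy figure quoted above, the stated percentage can be converted into an absolute position error for the stated trajectory length. The symbol $e$ is introduced here only for this illustration.

```latex
e = \frac{0.0016}{100} \times 107.9\ \text{km}
  = 1.7264 \times 10^{-3}\ \text{km}
  \approx 1.7\ \text{m}
```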

[0038] A robot using SLAM with a heading sensor in Extended Kalman Filter-based SLAM (EKF-SLAM) may move much further into unknown areas with consistent estimates. Consistency in a filter implies that the estimation uncertainty captures the true error; conversely an inconsistent filter does not capture true error and may give a false sense of confidence in the robot's belief.

[0039] Two methods of SLAM may be used: filtering-based methods and graph-based methods. Filtering-based methods may maintain a recursive estimate using current robot pose and map. Graph-based methods use robot poses as nodes of a graph and constraints as edges. Graph-based SLAM may use a two-pronged approach: 1) a front-end which maintains an estimate of the robot pose and computes data association between current and past observations, and 2) a back-end which solves the non-linear optimization to compute the history of robot poses.

[0040] Analysis of EKF-SLAM shows that heading, i.e., robot orientation, may be unobservable in the EKF-SLAM formulation. Analysis of the consistency of EKF-SLAM further shows that heading estimation errors may be a cause of inconsistency due to erroneous Jacobian computations. In the case of a stationary robot with zero process noise observing one feature, the EKF-SLAM heading estimate may drift. The EKF-SLAM filter may be overconfident, i.e., uncertainty estimates may not reflect the true error, and heading uncertainty may be the major cause of inconsistency.

[0041] In graph-based SLAM, non-linear optimization techniques may be used to solve for the maximum likelihood estimate. Graph-based SLAM techniques may rely on an initial guess to bootstrap the optimizer. This initial guess may be based on odometry and may be arbitrarily bad, leading to local minima. A special property of SLAM is that when robot orientation is known, SLAM may be posed as a linear estimation problem in position. Some approaches have exploited this structural property with the aim of decoupling non-linearities that arise due to orientation.

[0042] Estimating orientation as the first step and using these estimates to initialize pose graph optimization may result in a robust solution. The separation of orientation and position may be extended to feature-based SLAM. Estimating orientation first may avoid catastrophic failure, e.g. local minima.

[0043] The embodiments described herein include autonomous robot localization and mapping that fuses absolute orientation sensing (using cameras that track stable structural features) with local measurements using a LIDAR, camera, or some other imaging or sensing technique. In some embodiments a robot or autonomous vehicle includes a long term SLAM with absolute orientation sensing system. The robot or autonomous vehicle may be configured to travel over land, through air, water, or any other medium of travel. The system may include a sensor array. The sensor array may include one or more orientation sensors, odometry sensors (such as inertial measurement units, wheel encoders, etc.), and at least one exteroceptive sensor. The orientation sensor may be, for example, a star tracker, a sun sensor, a magnetometer, a gyrocompass, or an upward facing camera. The inertial sensor may include a combination of accelerometer and gyroscope which measure the vehicle's acceleration and angular rates. The odometry sensors may include a rotary encoder coupled to a wheel of the vehicle, or other device for determining distance traveled by the vehicle over a time interval. The exteroceptive sensor (i.e., a range bearing sensor) may include a camera, a LIDAR system, a RADAR system, a SONAR system, or other system for providing distance and bearing measurements to features in view of the vehicle. For example, features outdoors may include stars when a star tracker is used; and features indoors may include light fixtures or other fixtures on the ceiling. The system includes a processor (e.g., a microprocessor, digital-signal-processor, etc.) coupled to the sensor array. The processor receives measurements from the various sensors of the sensor array. The navigation system also includes memory (e.g., volatile or non-volatile semiconductor memory) coupled to the processor. The processor may store the measurements received from the sensor array in the memory. The memory may also store instructions that can be executed by the processor to process the measurements and provide the long term SLAM with absolute orientation sensing functionality described herein. For example, the processor may process the measurements to determine a location of the vehicle as described herein, to control a motor or other propulsion system of the vehicle, and/or to control a steering system of the vehicle based on the determined location of the vehicle.

[0044] Other sensors such as magnetometers and gyrocompasses may be used to provide absolute orientation. Magnetometers may function adequately when the Earth's magnetic field is not corrupted by external influences. Gyrocompasses measure the planet's rotation to determine accurate heading with respect to geographic north. Microelectromechanical Systems (MEMS)-based gyrocompasses have been proposed and may be useful for providing absolute orientation. The methods presented in this disclosure may use a magnetometer or gyrocompass for orientation sensing alone or in combination with a star tracker or sun sensor or other upward facing sensor.

[0045] Figure 1 is a diagram of an embodiment of a vehicle 100 with a long term SLAM with absolute orientation sensing system. The long term SLAM with absolute orientation sensing system may comprise an upward facing sensor 110, a range bearing sensor 120, a camera 130, an inertial measurement unit (IMU) 140, wheel encoders 150, and a processor 160. The processor 160 may receive inputs from one or more of the upward facing sensor 110, the range bearing sensor 120, the camera 130, the inertial measurement unit 140, or the wheel encoders 150. Upward facing sensor 110 may be used to determine an orientation of the vehicle 100 based upon landmarks above the vehicle 100. For example, in an indoor situation, the upward facing sensor 110 may be a camera that captures images of the ceiling of a building where the vehicle 100 is located. In another example, the upward facing sensor 110 may be a star tracker that captures images of stars or other landmarks in the sky. The orientation determined by the upward facing sensor 110 may be used with data from one or more of the range bearing sensor 120, the camera 130, the inertial measurement unit 140, or the wheel encoders 150 by the processor 160 to determine a location and orientation of the vehicle 100. This information may be used to create or update a virtual map of the environment where the vehicle is operating. One or more of the range bearing sensor 120, the camera 130, the inertial measurement unit 140, or the wheel encoders 150 may be optional in some embodiments. The range bearing sensor 120 may be a LiDAR, sonar, or some other sensor that detects objects in the environment surrounding the vehicle 100. Camera 130 may be a stereo camera or some other image capture device that detects objects in the environment surrounding the vehicle 100. The IMU 140 may be configured to measure the movement of the vehicle 100. Likewise, wheel encoders 150 may count revolutions of the wheels and determine a distance traveled by vehicle 100.
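The distance computation attributed to the wheel encoders 150 can be illustrated with a short sketch. The function name, the encoder resolution, and the wheel diameter below are assumptions chosen only for illustration; the disclosure does not specify them.

```python
import math


def encoder_distance_m(ticks: int, ticks_per_rev: int, wheel_diameter_m: float) -> float:
    """Distance rolled by a wheel, from encoder ticks and wheel geometry."""
    revolutions = ticks / ticks_per_rev  # whole and fractional wheel revolutions
    return revolutions * math.pi * wheel_diameter_m  # one revolution = one circumference


# Hypothetical values: a 1024-tick encoder on a 0.2 m diameter wheel.
distance = encoder_distance_m(ticks=4096, ticks_per_rev=1024, wheel_diameter_m=0.2)
print(f"{distance:.3f} m")
```

Such a computation ignores wheel slip, which is one reason odometry alone tends to drift over long trajectories.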

[0046] Figure 2 is a flow diagram of an embodiment of a method for long term SLAM with absolute orientation sensing 200. The embodiment may be implemented by a mapping system. At block 210, an orientation sensing camera, e.g., upward facing sensor 110, may determine a heading (e.g., attitude or orientation) of a vehicle, e.g., vehicle 100. The heading may be provided to a LOGO slam solver 260. The LOGO slam solver 260 may be executed by a processor, e.g., processor 160. The heading may also be provided to a Kalman filter 250. A scan matcher 240 may determine a relative pose based on input from a range finding sensor 220 and/or a movement sensor 230. Range finding sensor 220 may be a LiDAR, sonar, radar, or some other sensor configured to detect objects and their range from the vehicle. Movement sensor 230 may be an IMU, a wheel encoder, or any other sensor configured to detect movement of the vehicle and estimate a distance traveled based on the movement detected. The relative pose information from scan matcher 240 may be provided to the Kalman filter 250. The Kalman filter 250 may fuse the relative pose information with the heading to determine a location of the vehicle. If the mapping system determines with a predetermined accuracy that the vehicle is revisiting a previously visited location, then the mapping system may assume loop closure has occurred. If loop closure has occurred, the LOGO slam solver 260 may use the output of the scan matcher 240 along with the heading to update the map. If loop closure has not been detected, the mapping system may perform incremental mapping. While the embodiment of Figure 2 is described in terms of a mapping system on a vehicle, the method may be implemented by a mapping system on a handheld device or any other device comprising the elements described in the description of Figure 2. Further, the vehicle may be autonomous or manually controlled.
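The fusion step performed by the Kalman filter 250 can be illustrated with a deliberately simplified, one-dimensional sketch that tracks heading only. This is not the filter of the disclosure: the state here is a single angle, and every function name and noise variance is an assumption made for illustration. The prediction step applies a relative rotation such as a scan matcher might report, and the update step corrects with an absolute heading measurement.

```python
import math


def wrap_angle(angle: float) -> float:
    """Wrap an angle in radians to the interval [-pi, pi]."""
    return math.atan2(math.sin(angle), math.cos(angle))


def predict(heading: float, variance: float, delta_heading: float, delta_variance: float):
    """Propagate the heading with a relative rotation; uncertainty grows."""
    return wrap_angle(heading + delta_heading), variance + delta_variance


def update(heading: float, variance: float, measured_heading: float, measurement_variance: float):
    """Correct the heading with an absolute measurement; uncertainty shrinks."""
    gain = variance / (variance + measurement_variance)  # scalar Kalman gain
    innovation = wrap_angle(measured_heading - heading)
    return wrap_angle(heading + gain * innovation), (1.0 - gain) * variance


# Hypothetical numbers: a coarse relative rotation, then a precise absolute heading.
heading, variance = 0.0, 0.01
heading, variance = predict(heading, variance, delta_heading=0.10, delta_variance=0.004)
heading, variance = update(heading, variance, measured_heading=0.12, measurement_variance=0.0001)
print(heading, variance)
```

Because the absolute measurement is assumed far more precise than the relative one, the fused heading lands close to the measured value and the variance falls below that of the measurement itself, which illustrates why absolute orientation sensing bounds heading drift.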

Outdoor Embodiments

[0047] The following embodiments describe systems that operate in an outdoor environment where the upward facing sensor 110 detects objects in the sky. Some of the techniques in these embodiments may be used in either the indoor or outdoor embodiments. Star trackers may be automated camera-based devices that compute inertial attitude with high accuracy. Some star trackers may deliver RMS error down to 10 arcseconds or 0.0028° by using measurements to known celestial bodies and comparing them to star charts. Star trackers may rely on measurements to persistent beacons in space whose trajectories across the sky relative to Earth or other planets may be fixed with great precision based on long-term astronomical observations. Star trackers may be used when GPS is unavailable during both day and night operation. The method of the present disclosure may combine any combination of proprioceptive (odometry) sensors (e.g., inertial sensors, wheel odometer, etc.) and/or exteroceptive sensors (e.g., camera, LIDARs, etc.) with star tracking or sun sensors for accurate global navigation.
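The star tracker accuracy quoted above can be related to position error with a short sketch. The unit conversion follows directly from the text; the lateral-error function is an added illustration that assumes straight-line travel with a constant heading error, which is a simplification not stated in the disclosure. Both function names are hypothetical.

```python
import math


def arcsec_to_deg(arcsec: float) -> float:
    """Convert arcseconds to degrees (3600 arcseconds per degree)."""
    return arcsec / 3600.0


def lateral_error_m(distance_m: float, heading_error_deg: float) -> float:
    """Cross-track error after straight-line travel with a constant heading error."""
    return distance_m * math.sin(math.radians(heading_error_deg))


rms_deg = arcsec_to_deg(10.0)
print(round(rms_deg, 4))  # 0.0028
print(lateral_error_m(1000.0, rms_deg))  # cross-track error after 1 km, in metres
```

Under these assumptions a 10 arcsecond heading error contributes only a few centimetres of cross-track error per kilometre travelled.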

[0048] In an embodiment using a star tracker or sun sensor, let $x_k \in \mathbb{X}$, $u_k \in \mathbb{U}$, and $z_k \in \mathbb{Z}$ represent the system state, control input, and observation at time step $k$ respectively, where $\mathbb{X}$, $\mathbb{U}$, $\mathbb{Z}$ denote the state, control, and observation spaces respectively. A keyframe pose is designated as ${}^{K}x$. The state evolution model $f$ is denoted as $x_{k+1} = f(x_k, u_k) + w_k$, where $w_k \sim \mathcal{N}(0, Q_k)$ is zero-mean Gaussian process noise. The robot belief is defined as the probability distribution over all possible states of the robot. Let the belief at time $t_k$ be $b_k \in \mathbb{B}$ where $\mathbb{B}$ is the belief space. Then $b_k$ represents the posterior over the history of the robot's actions and observations, thus:

$b_k = p(x_k \mid x_0, Z_k, U_{k-1})$ (1)

where:

$Z_k$ is the history of observations up to time $t_k$, and

$U_{k-1}$ is the history of actions up to time $t_{k-1}$.

[0049] The measurement model $h$ is denoted as $z_k = h(x_k) + v_k$, where $v_k \sim \mathcal{N}(0, R_k)$ is zero-mean Gaussian measurement noise. The map (unknown at $t_0$) is a set of landmarks (i.e., features) distributed throughout the environment. The $j$-th landmark is defined as $l_j$ and $\hat{l}_j$ as the estimate of $l_j$. The observation for landmark $l_j$ at time $t_k$ is denoted by $z_k^{j} \in z_k$. The inverse measurement model is denoted by $g$ such that for a given measurement $z_k^{j}$ and the state $x_k$ at which it was made, $g$ computes the landmark location $l_j = g(x_k, z_k^{j})$. ${}^{l}d_k^{ij}$ is the relative feature measurement from feature $l_i$ to $l_j$ in the local frame of the robot at time $t_k$. In the framework of the present disclosure, a relative feature measurement is an estimate of the displacement vector from one feature to another. The local relative measurement is computed as ${}^{l}d_k^{ij} = {}^{l}\Delta_k^{j} - {}^{l}\Delta_k^{i}$, where ${}^{l}\Delta_k^{i}$, ${}^{l}\Delta_k^{j}$ are relative positions of features $l_i$ and $l_j$ respectively with respect to the robot in its local frame. Thus it is linear in positions of the two features in the local frame. Let $C(\theta_k)$ denote the direction cosine matrix (DCM) of the robot orientation at state $x_k$. $C$ is a function of the robot orientation parameter $\theta$ (e.g., Euler angles, quaternions, etc.). A local measurement in the robot frame can be projected into a fixed world frame (i.e., a global frame, which may be an earth centered inertial frame; frame here refers to a Cartesian coordinate frame of reference) as:

${}^{w}\Delta_k^{i} = C(\theta_k)^{T}\, {}^{l}\Delta_k^{i} = l_i - p_k$ (2)

where $l_i$ and $p_k$ are the feature and robot positions in the world frame respectively.

Thus, when robot orientation $\theta$ is known, the position estimation problem may be linear.
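The projection of a local measurement into the world frame can be illustrated in two dimensions. The sketch below assumes the convention that the DCM maps world-frame vectors into the robot frame, so that its transpose maps robot-frame measurements back into the world frame; the helper names and the numeric values are hypothetical.

```python
import math


def dcm(theta: float):
    """2D direction cosine matrix: maps world-frame vectors into the robot frame."""
    c, s = math.cos(theta), math.sin(theta)
    return [[c, s], [-s, c]]


def transpose(m):
    """Transpose of a 2x2 matrix."""
    return [[m[0][0], m[1][0]], [m[0][1], m[1][1]]]


def matvec(m, v):
    """Product of a 2x2 matrix and a 2-vector."""
    return [m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1]]


# Hypothetical ground truth: robot position, heading, and one feature position.
robot_position = [2.0, 1.0]
robot_heading = math.radians(30.0)
feature_position = [5.0, 4.0]

# Simulate what the robot would measure in its own (local) frame.
world_offset = [feature_position[0] - robot_position[0],
                feature_position[1] - robot_position[1]]
local_delta = matvec(dcm(robot_heading), world_offset)

# With the heading known, the local measurement is rotated into the world frame,
# where it equals the feature position minus the robot position.
world_delta = matvec(transpose(dcm(robot_heading)), local_delta)
print(world_delta)
```

Once the rotation has been applied, the remaining relationship between the measurement, the feature position, and the robot position involves only additions and subtractions, which is the linearity the text refers to.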

[0050] Embodiments disclosed herein may include various approaches for long-term localization. Two such approaches are the Robot to Feature Relative Measurement Model (R2F) and the Feature to Feature Relative Measurement Model (F2F). R2F may be designed for systems where a robot moves continuously and has access to odometry, an orientation sensor, and exteroceptive sensing (e.g., LiDAR, cameras, etc.). R2F may convert local relative measurements from the robot to features at each pose to global frame measurements as shown in Figure 3A. These measurements may then be used to solve a linear estimation problem of the robot and feature positions to attain a high degree of accuracy. F2F may extend R2F for systems where extremely high precision is required. When extremely high precision is required, odometry may not provide reliable information and time budgets may be relatively large. In this case, independent measurements are made for relative displacements between features, which are then chained together to estimate the map and robot position as shown in Figure 3B.

[0051] In the scenario of Figures 3A and 3B, a robot may observe four banks of features prior to the final pose, i.e., the last keyframe. There may be two features in each bank. For example, the first bank indicated by the dashed ellipse may include $l_1$ and $l_2$. A bank may be described as the set of features observed at a particular pose. Figure 3A shows how R2F makes relative measurements from robot to features. The relative measurements are indicated by the lines with arrowheads. The keyframes are indicated by the black and white triangles and indicate positions where the robot may make relative measurements.
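The F2F idea of chaining relative displacements between features can be sketched as a running sum. This is an illustration under simplifying assumptions: the displacements are taken to be already expressed in the world frame, measurement noise and its weighting are ignored, and the function name and values are hypothetical rather than taken from the disclosure.

```python
def chain_features(first_feature, displacements):
    """Estimate feature positions by summing feature-to-feature displacements."""
    positions = [tuple(first_feature)]
    for dx, dy in displacements:
        x, y = positions[-1]
        positions.append((x + dx, y + dy))  # next feature = previous feature + displacement
    return positions


# Hypothetical world-frame displacements between four consecutive features.
estimated = chain_features((0.0, 0.0), [(4.0, 0.5), (3.5, -1.0), (5.0, 2.0)])
print(estimated)
```

Because each position depends on every displacement before it, errors in the individual displacements accumulate along the chain, which is why the accuracy of the orientation used to express them in the world frame matters.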

[0052] The R2F approach includes the following steps. 1) Range bearing measurements to features are converted into relative displacement vectors from robot to features at each pose as the robot moves, as shown in Figure 4. In Figure 4, the robot may detect the landmark $l_i$ from two poses $x_1$ and $x_2$. The transformation of local relative measurements to the global frame may be used to solve for robot and feature positions. 2) At keyframe poses, a linear estimation problem may be solved for the robot pose and features using the recorded data. 3) Once the linear estimation problem is solved, correlations between keyframe poses and features observed at keyframe poses may be maintained, and intermediate feature and pose estimates may be dropped. 4) An upper threshold may be set on the number of keyframes to keep in the map, after which the oldest keyframe is deleted. The first pose may be saved even if it is the oldest keyframe.

[0053] The R2F approach will now be explained in greater detail. Let $z_k = \{z_k^{1}, z_k^{2}, \dots, z_k^{n}\}$ be the set of range bearing observations to the set of landmarks visible at time $t_k$. Using the inverse measurement model $g$ (described above), the position of landmark $l_i$ in the robot's local frame is ${}^{l}\Delta_k^{i} = {}^{l}g(z_k^{i})$. Hence, the vector of local robot to feature relative measurements is:

$${}^{l}\Delta_k = \begin{bmatrix} {}^{l}\Delta_k^{1} \\ \vdots \\ {}^{l}\Delta_k^{n} \end{bmatrix} = {}^{l}g(z_k) \tag{3}$$

[0054] Thus, ${}^{l}\hat{\Delta}_k \sim N\left({}^{l}\Delta_k,\; {}^{l}R_{\Delta_k} = \nabla {}^{l}g|_{z_k}\, R_{z_k}\, \nabla {}^{l}g^{T}|_{z_k}\right)$, where $\nabla {}^{l}g|_{z_k}$ is the Jacobian of the measurement function in Eq. 3. At $t_0$, embodiments have keyframe ${}^{K}x_0$ whose prior belief $b_0 \sim N(\hat{x}_0 = [\hat{p}_0^{T}, \hat{\theta}_0]^{T}, P_0)$ is known. The robot starts moving and collecting odometry, orientation, and exteroceptive measurements at each timestep. At some future timestep $t_k$, embodiments have keyframe ${}^{K}x_k$. Abusing notation slightly, let ${}^{l}\hat{\Delta}_{0:k} \sim N({}^{l}\Delta_{0:k}, {}^{l}R_{\Delta})$ be the vector of all local relative position measurements (including robot to feature and translational odometry) recorded at poses $x_{0:k}$. At each $t_k$ embodiments have a noisy unbiased orientation measurement, which gives the vector $\hat{\theta}_{0:k} \sim N(\theta_{0:k}, R_{\theta})$. Dropping the time subscript for readability, local relative measurements ${}^{l}\hat{\Delta}$ may be transformed to the world frame as:

$${}^{w}\hat{\Delta} = \hat{C}^{T}\,{}^{l}\hat{\Delta} \tag{4}$$

where $\hat{C} = C(\hat{\theta})$ is the corresponding composition of DCM matrices parametrized by the heading measurements $\hat{\theta}$.

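[0055] Based on equation (4), the transformed global measurements at each pose $x_k$ may be correlated with the heading measurement $\hat{\theta}_k$. Heading error covariances may be propagated appropriately in the feature and robot position estimation. Embodiments set up a new measurement model $\beta = h_{\beta}(\hat{p}_0, {}^{l}\hat{\Delta}, \hat{\theta}) + v_{\beta}$ by stacking the transformed odometry between poses, robot to feature displacement, heading measurements, and the prior estimate of ${}^{K}x_0$ (given). This gives the following linear problem:

$$\beta = \begin{bmatrix} \hat{p}_0 \\ \hat{C}^{T}\,{}^{l}\hat{\Delta} \\ \hat{\theta} \end{bmatrix} = \begin{bmatrix} A' & 0 \\ 0 & I \end{bmatrix} \begin{bmatrix} p \\ l \\ \theta \end{bmatrix} + v_{\beta} \tag{5}$$

where $[p^{T}, l^{T}, \theta^{T}]^{T}$ is the vector of robot poses, landmarks, and heading, $A'$ is a matrix multiplying the stacked positions $[p^{T}, l^{T}]^{T}$ with each row containing elements of the set $\{-1, 0, +1\}$, and $v_{\beta} \sim N(0, R_{\beta})$ is the measurement noise vector with covariance:

$$R_{\beta} = \nabla h_{\beta} \begin{bmatrix} P_0 & 0 & 0 \\ 0 & {}^{l}R_{\Delta} & 0 \\ 0 & 0 & R_{\theta} \end{bmatrix} \nabla h_{\beta}^{T} \tag{6}$$

$P_0$ is the prior error covariance of ${}^{K}x_0$ and $\nabla h_{\beta}$ is the Jacobian of the measurement function $h_{\beta}$ given by

$$\nabla h_{\beta} = \begin{bmatrix} I & 0 & 0 \\ 0 & \hat{C}^{T} & M_{\Delta} \\ 0 & 0 & I \end{bmatrix}, \quad \text{where } M_{\Delta} = \frac{\partial\, (C^{T}\,{}^{l}\Delta)}{\partial \theta} \tag{7}$$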

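[0056] The solution to the linear problem in equation (5) is given by the weighted least-squares estimate:

$$\begin{bmatrix} p^{*} \\ l^{*} \\ \theta^{*} \end{bmatrix} = \left(A^{T} R_{\beta}^{-1} A\right)^{-1} A^{T} R_{\beta}^{-1}\, \beta \tag{8}$$

where $A$ denotes the stacked matrix that multiplies $[p^{T}, l^{T}, \theta^{T}]^{T}$ in equation (5).

[0057] From equation (8), $[p^{*T}, l^{*T}]^{T}$ are of interest. At a future keyframe, embodiments may replace $\hat{p}_0$ in Eq. 5 with estimates of past keyframes and the corresponding landmarks observed at those keyframes.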
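The structure of the R2F linear problem can be sketched numerically. The example below is a simplified illustration, not the disclosed implementation: it assumes a planar world, two poses and one landmark, world-frame measurements that have already been rotated using known headings (so the heading block of equation (5) is omitted), and arbitrarily chosen noise variances.

```python
import numpy as np

I2 = np.eye(2)
Z2 = np.zeros((2, 2))

# State ordering: [p0, p1, l1], each a 2D position in the world frame.
# Every block row contains only -1, 0, +1 (times a 2x2 identity).
A = np.block([
    [ I2,  Z2, Z2],   # prior on the first keyframe position p0
    [-I2,  I2, Z2],   # translational odometry: p1 - p0
    [-I2,  Z2, I2],   # feature seen from pose 0: l1 - p0
    [ Z2, -I2, I2],   # feature seen from pose 1: l1 - p1
])

# Hypothetical ground truth used to synthesize noise-free measurements.
p0, p1, l1 = np.array([0.0, 0.0]), np.array([1.0, 0.0]), np.array([2.0, 1.0])
x_true = np.concatenate([p0, p1, l1])
beta = A @ x_true

# Assumed measurement variances (prior, odometry, two feature measurements).
R = np.diag(np.repeat([1e-4, 1e-2, 2.5e-3, 2.5e-3], 2))
R_inv = np.linalg.inv(R)

# Weighted least squares: x* = (A^T R^-1 A)^-1 A^T R^-1 beta.
omega = A.T @ R_inv @ A                      # information matrix
x_hat = np.linalg.solve(omega, A.T @ R_inv @ beta)
cov = np.linalg.inv(omega)                   # error covariance of the estimate
```

With noise-free inputs the estimate reproduces the assumed ground truth; with noisy inputs, `cov` gives the predicted uncertainty of each position.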

[0058] The R2F approach may be analyzed for location accuracy and the effect of loop closure as a robot explores an unknown map. For the sake of clarity, it is assumed that the robot makes independent measurements of global frame displacement from robot to feature and that the error covariance of every global relative measurement is R a . In practice, independence may be achieved by capturing heading observations such that the same heading observation is not used to transform all local measurements to the world frame. The first and last poses of the robot are considered keyframes, with the first pose known.

[0059] In an exploration task, a robot may move into unknown areas and make measurements to a bank of features at each timestep as shown in Figure 3A. Figures 5A-5D show an analysis of the pose and map estimation error as the robot moves and the effect of loop closure on the estimates using the R2F approach. Figure 5A shows that the error growth is linear as the final pose moves further away from its start. An interesting point arises in Figure 5B: the rate of error growth drops as the bank size increases, where $n_f$ is the number of features in each bank. This implies that for long-term autonomy it is beneficial to make high-quality observations to more than one feature, but hundreds or thousands of features may not always be needed, because additional features above a certain threshold yield diminishing improvement in accuracy. Thus, the localization error for a given task may be predicted, and a determination made of how many features to observe to balance computational complexity and memory resources against accuracy.

[0060] Loop closure may be considered a necessary action in SLAM to limit error growth. However, in the case of long-term point-to-point navigation, loop closure may not be possible. The effect of loop closure on the bank of features farthest from the start location in a loop and on the last pose is analyzed in Figures 5C and 5D. For the error in farthest bank, Figure 5C shows that the effect of loop closure diminishes as the bank size increases as shown by the gap between solid and dashed lines, e.g., after mapping 9 feature banks, with 1 feature in each bank error drops to 60.49% and with 4 features drops to 66.14% after loop closure. Note that the ratio of error growth rate after and before loop closure is 0.5185, i.e., loop closure approximately halves the error growth in the farthest bank. In point-to-point navigation, the same effect may be achieved by taking twice the number of measurements to each feature. Figure 5D shows that error in last pose after loop closure converges to a fixed value as the trajectory length increases. This indicates that estimation error in the last pose is dominated by the relative measurement to the first bank, i.e., as the trajectory length grows, the effect of the longer "pathway" from start may have almost no effect on estimation accuracy.

[0061] In the F2F approach, the mapping algorithm may transform range bearing observations from robot to features into relative position measurements between features by fusing them with heading estimates from a heading sensor. Figure 3B shows how F2F estimates keyframe poses and may neglect odometry between poses. Localization accuracy may be a function of how far the robot has moved and the number of features in each bank. The key steps of F2F include the following. 1) Range bearing measurements to features may be converted into relative displacement vectors between the features at each pose as the robot moves. 2) Robot to feature relative position measurements may be acquired at keyframes, then a linear estimation problem may be solved for keyframe poses and map features using the recorded data. 3) Once the linear estimation problem is solved, correlations between keyframe poses and features observed at keyframes may be maintained and intermediate feature estimates may be dropped. 4) An upper threshold may be set on the number of keyframes to keep in the map, after which the oldest keyframe is deleted. The first pose may be saved even if it is the oldest keyframe.

[0062] The F2F approach will now be explained in detail. At time $t_k$, the positions of features $l_i$ and $l_j$ in the robot's local frame are ${}^{l}\Delta_k^{i}$ and ${}^{l}\Delta_k^{j}$, respectively. Thus, embodiments may compute the displacement vector from $l_i$ to $l_j$ in the robot frame as

$${}^{l}d_k^{ij} = {}^{l}\Delta_k^{j} - {}^{l}\Delta_k^{i} \tag{9}$$
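Equation (9) can be checked numerically with a short sketch. It assumes a planar world and a DCM that maps world-frame vectors into the robot frame; the poses, feature positions, and function names are illustrative assumptions. The sketch shows that the displacement is formed from the two local measurements alone, and that once rotated by the heading it equals the world-frame feature displacement regardless of where the robot stands.

```python
import numpy as np

def dcm_2d(theta):
    """Planar DCM C(theta): maps world-frame vectors into the robot frame."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, s], [-s, c]])

def relative_feature_displacement(delta_i, delta_j):
    """Equation (9): displacement from feature i to feature j in the robot frame."""
    return delta_j - delta_i

# Hypothetical feature positions and robot poses (position, heading).
l_i, l_j = np.array([4.0, 1.0]), np.array([6.0, 3.5])
poses = [(np.array([0.0, 0.0]), 0.0),
         (np.array([1.5, -2.0]), np.deg2rad(40.0)),
         (np.array([-3.0, 5.0]), np.deg2rad(-110.0))]

world_displacements = []
for p_k, theta_k in poses:
    C = dcm_2d(theta_k)
    delta_i = C @ (l_i - p_k)          # local measurement of feature i
    delta_j = C @ (l_j - p_k)          # local measurement of feature j
    d_local = relative_feature_displacement(delta_i, delta_j)
    # The robot position cancels in d_local; rotating by the heading
    # recovers the world-frame displacement l_j - l_i.
    world_displacements.append(C.T @ d_local)
```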

[0063] Equation (9) shows that ${}^{l}d_k^{ij}$ is independent of robot position $p_k$ and orientation $\theta_k$. Hence, the vector of local relative measurements is as follows:

$${}^{l}d_k = {}^{l}g_d(z_k) \tag{10}$$

[0064] Note in Equation (10) that though measurements to each feature are independent, the set of relative feature measurements may be correlated due to the correlations between relative measurements using the same range-bearing measurement. This is where a difference from some approaches arises. There may be a benefit of capturing independent measurements of relative feature displacements. Independence may be achieved by capturing heading and range bearing observations by stopping the robot at certain times (e.g., keyframes).

[0065] Let ${}^{l}\hat{d}_k \sim N\left({}^{l}d_k,\; {}^{l}R_{d_k} = \nabla {}^{l}g_d|_{z_k}\, R_{z_k}\, \nabla {}^{l}g_d^{T}|_{z_k}\right)$ be the vector of independent relative feature measurements at time $t_k$ with error covariance ${}^{l}R_{d_k}$, where $\nabla {}^{l}g_d|_{z_k}$ is the Jacobian of the local relative measurement function in Equation (10). At $t_0$, embodiments have keyframe ${}^{K}x_0$ whose prior belief $b_0 \sim N(\hat{x}_0 = [\hat{p}_0^{T}, \hat{\theta}_0]^{T}, P_0)$ is known. The robot starts moving and collecting orientation and exteroceptive measurements at each timestep. At some future timestep $t_k$, embodiments have keyframe ${}^{K}x_k$. Let ${}^{l}\hat{d}_{1:k-1} \sim N({}^{l}d_{1:k-1}, {}^{l}R_{d_{1:k-1}})$ be the vector of local relative feature measurements captured from time $t_1$ to $t_{k-1}$. At keyframes ${}^{K}x_0$ and ${}^{K}x_k$ embodiments have robot to feature relative measurements ${}^{l}\hat{\Delta}_{0,k} \sim N\left({}^{l}\Delta_{0,k} = [{}^{l}\Delta_0^{T}, {}^{l}\Delta_k^{T}]^{T},\; {}^{l}R_{\Delta_{0,k}} = \mathrm{blkdiag}({}^{l}R_{\Delta_0}, {}^{l}R_{\Delta_k})\right)$. At each pose embodiments have a noisy unbiased heading measurement, which gives the vector of orientation estimates $\hat{\theta}_{0:k} \sim N(\theta_{0:k}, R_{\theta_{0:k}})$.

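Dropping the time subscript, the vector of local relative measurements ${}^{l}\hat{d}$ can be transformed to the world frame as:

$${}^{w}\hat{d} = \hat{C}^{T}\,{}^{l}\hat{d} \tag{11}$$

where $\hat{C} = C(\hat{\theta})$ is the corresponding composition of DCM matrices. Heading error covariances may be propagated appropriately in the feature and robot position estimation. Thus, embodiments set up a new measurement model $\gamma = h_{\gamma}(\hat{x}_0, {}^{l}\hat{\Delta}, {}^{l}\hat{d}, \hat{\theta}) + v_{\gamma}$ by stacking the transformed relative position and heading measurements. This gives the following linear problem:

$$\gamma = \begin{bmatrix} \hat{p}_0 \\ \hat{C}^{T}\,{}^{l}\hat{d} \\ \hat{C}^{T}\,{}^{l}\hat{\Delta} \\ \hat{\theta} \end{bmatrix} = \begin{bmatrix} A' & 0 \\ 0 & I \end{bmatrix} \begin{bmatrix} p \\ l \\ \theta \end{bmatrix} + v_{\gamma} \tag{12}$$

where $[p^{T}, l^{T}, \theta^{T}]^{T}$ is the vector of robot poses, landmarks and heading, $A'$ is a matrix multiplying the stacked positions $[p^{T}, l^{T}]^{T}$ with each row containing elements of the set $\{-1, 0, +1\}$, and $v_{\gamma} \sim N(0, R_{\gamma})$ is the measurement noise vector where:

$$R_{\gamma} = \nabla h_{\gamma} \begin{bmatrix} P_0 & 0 & 0 & 0 \\ 0 & {}^{l}R_{\Delta} & 0 & 0 \\ 0 & 0 & {}^{l}R_{d} & 0 \\ 0 & 0 & 0 & R_{\theta} \end{bmatrix} \nabla h_{\gamma}^{T} \tag{13}$$

[0066] $\nabla h_{\gamma}$ is the Jacobian of the measurement function $h_{\gamma}$ given by

$$\nabla h_{\gamma} = \begin{bmatrix} I & 0 & 0 & 0 \\ 0 & 0 & \hat{C}^{T} & M_{d} \\ 0 & \hat{C}^{T} & 0 & M_{\Delta} \\ 0 & 0 & 0 & I \end{bmatrix}, \quad \text{where } M_{d} = \frac{\partial\, (C^{T}\,{}^{l}d)}{\partial \theta}, \; M_{\Delta} = \frac{\partial\, (C^{T}\,{}^{l}\Delta)}{\partial \theta} \tag{14}$$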

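[0067] The solution to the linear problem in equation (12) is given by the weighted least-squares estimate:

$$\begin{bmatrix} p^{*} \\ l^{*} \\ \theta^{*} \end{bmatrix} = \left(A^{T} R_{\gamma}^{-1} A\right)^{-1} A^{T} R_{\gamma}^{-1}\, \gamma \tag{15}$$

where $A$ denotes the stacked matrix that multiplies $[p^{T}, l^{T}, \theta^{T}]^{T}$ in equation (12).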

[0068] In some embodiments, the localization error grows as a robot explores an unknown map. It may be assumed that the robot makes independent relative feature measurements and the error covariance of each global relative measurement is R a . The first and last poses of the robot are considered keyframes, with the first pose known.

[0069] Consider the case depicted in Figure 3B, in which the robot maps four banks of two features each ($n_f = 2$) followed by a final keyframe pose, and inspect the estimation error in the final keyframe. Let the vector of relative measurements be $\gamma$. Further, odometry may be discarded as it is superfluous to map estimation when global orientation measurements are available and the robot observes common features between successive keyframes (recall that relative feature displacements in the world frame are independent of robot position). Embodiments can set up a linear estimation problem for landmark and keyframe pose estimation for Figure 3B as:

$$\gamma = A\,x + v_{\gamma} \tag{16}$$

where $x$ stacks the landmark positions and the keyframe position, and $v_{\gamma} \sim N(0, R_{\gamma})$.

[0070] Solving the linear least squares problem in equation (16), the information matrix $\Omega$ is then given by:

$$\Omega = \Sigma^{-1} = A^{T} R_{\gamma}^{-1} A \tag{17}$$

[0071] Ω has a symmetric tridiagonal structure that permits an analytical inversion to compute the error covariance matrix. An analytical solution of the error covariance matrix may allow prediction of feature localization uncertainty at the goal given certain environment characteristics, e.g., the number of features in each bank and how many banks the robot may map as it traverses to the goal. The capability to predict future localization uncertainty implies that given a desired goal accuracy, active sensing to control error growth may be applied.
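The tridiagonal structure and the resulting error growth can be illustrated on a deliberately reduced toy model. The sketch below is an assumption-laden simplification rather than the disclosed analysis: it uses a 1D chain with one feature per bank, a prior on the first feature, and independent relative measurements between consecutive features with equal variance. The function name and variances are hypothetical.

```python
import numpy as np

def chain_information_and_covariance(num_features, prior_var, rel_var):
    """Build Omega = A^T R^-1 A for a 1D chain of features and invert it."""
    n = num_features
    A = np.zeros((n, n))
    A[0, 0] = 1.0                      # prior on the first feature
    for i in range(1, n):
        A[i, i - 1] = -1.0             # relative measurement l_i - l_(i-1)
        A[i, i] = 1.0
    R_inv = np.diag([1.0 / prior_var] + [1.0 / rel_var] * (n - 1))
    omega = A.T @ R_inv @ A            # symmetric tridiagonal information matrix
    sigma = np.linalg.inv(omega)       # error covariance
    return omega, sigma

omega, sigma = chain_information_and_covariance(6, prior_var=0.01, rel_var=0.04)
variances = np.diag(sigma)             # uncertainty of each feature along the chain
```

In this toy chain the variance of each feature grows by a fixed increment per link, which is one simple way to see why error growth can be predicted ahead of time from the number of banks to be traversed.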

[0072] The linear estimation problem of equation (12) may be analytically solved and the error covariance $\Sigma$ computed for multiple cases by varying the bank size $n_f$ and the number of banks that the robot maps. Figures 7A, 7B, 7C, and 7D show results of the analysis of the feature mapping and localization error as the robot moves and the effect of loop closure. Figure 7A shows that localization error grows linearly as the robot moves away from the start location. Further, the error growth rate shown in Figure 7B is inversely proportional to the square of the size of each feature bank, i.e., $\propto 1/n_f^{2}$, where $n_f$ is the number of features in one bank. Thus error growth may be controlled by the number of features mapped in each bank.

[0073] Figures 6A, 6B, 6C, and 6D show a simple graphical depiction of loop closure. The left half of each figure shows the robot 600 making range bearing measurements and the right half shows the map being built. Fig. 6D concerns the estimation error of the farthest feature bank (encircled by ellipse 610) and the last pose (encircled by ellipse 620). Thus, Figures 6A, 6B, 6C, and 6D depict loop closure when a robot makes relative feature measurements while moving in a circular trajectory. The exercise of solving equation (16) is repeated, this time with loop closure, and the error covariance $\Sigma$ is computed. The results of this analysis are plotted in Figures 7C and 7D. The ratio of the error growth rate with loop closure to the error growth rate without loop closure is a constant value of 0.5181 for all values of $n_f$, computed as the ratio of the slopes of the curves plotted in Figure 7C before and after loop closure. Figure 7D shows that the error in the last pose after loop closure converges to a fixed value as the trajectory length (number of banks) increases.

[0074] Looking at Fig. 6D (right half), there are two "pathways" that link the farthest bank of features to the first bank. Prior to loop closure there is only one path for the relative measurements to constrain feature estimates to the first bank, however, after loop closure there is a second pathway from the opposite direction. An interesting point to be made is that if two observations ( R a is halved) were taken for each relative feature displacement, embodiments would effectively end up with the same estimation error at the farthest bank were the robot not to close the loop. The second observation shows that the estimation error in the last bank of features is dominated by the relative measurement to the first bank, i.e., as the trajectory length grows, the effect of the longer "pathway" from start has almost no effect on estimation accuracy.

[0075] Figures 8A and 8B present simulation scenarios for waypoint following in a 2D environment. These simulations study the case of a long term exploration task where a robot may not visit prior locations. The robot may be equipped with three sensors: a star tracker with noise standard deviation $\sigma_{\theta} = 0.005°$ (18 arcseconds); a range bearing sensor with a 360° field-of-view, a range of 20 m, and noise standard deviation of $\sigma_{r} = 0.1$ m in range and $\sigma_{\phi} = 1.0°$ in bearing; and wheel odometry with noise standard deviation $\sigma_{\delta x}, \sigma_{\delta y} = 0.1$ m in translation and $\sigma_{\delta\theta}$ = \ 2 ° in rotation. The robot may move at a speed of 10 m/s and the simulation time step is 0.05 s. Figure 8A shows a scenario with a 2D world (5 km x 5 km) with a trajectory of length 25.9 km. Figure 8B shows a scenario with a 2D world (10 km x 10 km) with a trajectory of length 107.9 km. Both scenarios are obstacle free and landmarks are randomly placed along the trajectory. Both trajectories terminate far from the start location and there are no loop closures.

[0076] Using the R2F approach, for the scenarios of Figures 8A and 8B, multiple versions of the environment were generated varying the number of features visible at each pose. Using the F2F approach, for the scenarios of Figures 8A and 8B, multiple versions of the environment were likewise generated varying the number of features visible at each pose. Figures 9A and 9B respectively compare the average terminal localization error obtained using the R2F approach in each of the scenarios of Figures 8A and 8B with the average terminal localization error obtained using the F2F approach in the same scenarios.

[0077] Trigonometric functions of robot orientation are the primary source of non-linearity in SLAM, which makes predicting long-term error growth difficult. Analysis shows that, given unbiased heading measurements, localization error growth is linear as a robot moves away from its start location. The error growth rate may be controlled by the quality and number of measurements. Further, loop closure may be avoided when absolute heading is available, as the same effect may be achieved by taking prolonged measurements. Feature estimates may be consistent due to the linear nature of the problem, which may lead to a global minimizer. Consistent feature estimates may lead to better localization as the robot has a reliable notion of uncertainty about its estimated mean.

Indoor Embodiments

[0078] The following embodiments describe systems that operate in an indoor environment where the upward facing sensor detects objects on or attached to the ceiling. Some of the techniques in these embodiments may be used in both the indoor and outdoor embodiments. In indoor settings the system may use stable structural features as structural cues. For example, in a warehouse building, the ceiling corrugation or ceiling lights are usually aligned along one direction. Thus, the orientation of the building may be fixed along the ceiling direction, and a vehicle may estimate its orientation with respect to the building by observing the ceiling.

[0079] In an embodiment, let $x = \{x_0, \dots, x_n\}$ be a set of $n + 1$ poses, describing the robot position and orientation at each time $k$. In 2D (planar) problems, $x_k = [p_k^{T}, \theta_k]^{T} \in SE(2)$, where $p_k \in \mathbb{R}^{2}$ is the position and $\theta_k \in SO(2)$ is the heading. Let $\xi_{ij}$ be a measurement of pose $j$ with respect to pose $i$; then

$$\xi_{ij} = \begin{bmatrix} R_i\,(p_j - p_i) \\ \theta_j - \theta_i \end{bmatrix}$$

where $R_i$ is the rotation matrix composed by $\theta_i$. In the general setting, $\xi_{ij}$ is corrupted by noise, thus $\hat{\xi}_{ij} = \xi_{ij} + v_{ij}$, where $v_{ij}$ is zero-mean Gaussian. Let ${}^{l}\Delta$ be the vector of relative position measurements in the local frame at each pose. If the robot orientation $\theta^{*}$ is known at each pose, then the SLAM problem becomes

$${}^{w}\Delta = R(\theta^{*})^{T}\,{}^{l}\Delta = A'p$$

[0080] Here $A'$ is a matrix composed of elements in the set $\{-1, 0, 1\}$ and $p$ is the vector of robot positions in the global frame. Thus, when the robot orientation $\theta$ is known, the position estimation problem is linear. Moreover, when unbiased global heading measurements are available, the problem may be very accurately linearized. This may provide localization accuracy leading to robustness in the front-end and enabling computationally low-cost linear least-squares solutions for the back-end. The same concept may be used with 3D problems where the relative orientation measurement $\theta_j - \theta_i$ may be represented as $R_j R_i^{T}$.

[0081] The indoor approach includes three aspects. 1) Sensing absolute orientation of the robot using structural cues. 2) Fusing absolute orientation measurements into the front-end, i.e., a scan matching algorithm. 3) Solving the batch optimization problem to compute global estimates at loop closure.

[0082] Independent absolute orientation estimates of the robot heading may be determined. In an indoor environment the relative heading of the robot with respect to the building's true north may be used as the absolute heading. The orientation sensing method may detect structural features of the environment. In most indoor environments, e.g., offices, factories, warehouses, etc., the ceiling structure usually has straight line features. For example, ceiling corrugation or ceiling lights in most industrial buildings are aligned along one direction, which may be detected by a ceiling facing camera. Ceiling direction may be estimated as follows.

[0083] First, extract line features from a ceiling image. The line features may be an edge of a light fixture or a part of a corrugated ceiling. Next, compute the orientation of the line features in the image frame. The system may then create a histogram of the orientation data with bins of width $b$ in the range $[0, 2\pi)$. The system may then create a window of width $W$ around the bin with the highest frequency, i.e., the bin with the most observations. The system then computes the weighted mean of the observations in the window. The ceiling direction may then be computed as an angle $\theta_c \in [0, \pi)$.
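The histogram steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the disclosed implementation: line orientations are taken as already extracted (in radians), the weights default to uniform because the source does not specify them, the window is assumed to be at least one bin wide, and wrap-around of the window at the ends of the range is ignored. The function name and sample angles are hypothetical.

```python
import numpy as np

def ceiling_direction(angles, bin_width, window_width, weights=None):
    """Estimate the dominant line direction from line orientations (radians)."""
    angles = np.mod(np.asarray(angles, dtype=float), 2.0 * np.pi)
    if weights is None:
        weights = np.ones_like(angles)          # assumed: uniform weights
    else:
        weights = np.asarray(weights, dtype=float)

    # Histogram of orientations with bins of width b over [0, 2*pi).
    n_bins = int(round(2.0 * np.pi / bin_width))
    counts, edges = np.histogram(angles, bins=n_bins, range=(0.0, 2.0 * np.pi))

    # Window of width W centered on the bin with the most observations.
    peak = int(np.argmax(counts))
    center = 0.5 * (edges[peak] + edges[peak + 1])
    in_window = np.abs(angles - center) <= 0.5 * window_width

    # Weighted mean of the observations in the window, reported in [0, pi).
    mean_angle = np.average(angles[in_window], weights=weights[in_window])
    return float(np.mod(mean_angle, np.pi))

# Hypothetical line orientations: a cluster near 45 degrees plus outliers.
line_angles = np.deg2rad([44.8, 45.0, 45.2, 45.1, 10.0, 120.0, 90.3])
theta_c = ceiling_direction(line_angles,
                            bin_width=np.deg2rad(0.5),
                            window_width=np.deg2rad(2.0))
```

Restricting the mean to a window around the histogram peak keeps outlier lines (for example, fixture edges perpendicular to the dominant direction) from biasing the estimate.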

[0084] Line direction may be ambiguous, i.e., it may be difficult to differentiate north from south. Therefore, gyro data may be used in the intermediate time between absolute orientation measurements. Gyros may provide data at >100 Hz and therefore may be used to account for the angle wrap-around issue in absolute orientation detection. To estimate the robot heading, initial heading at time t 0 is assumed to be known.

[0085] To a regular scan matching based front-end we add a Kalman filter after the scan match step. This Kalman filter fuses relative orientation estimates from scan matching with absolute orientation estimates as computed previously (using a ceiling facing camera and gyro). Let $\tilde{\theta}_k \sim N(\theta_k, \sigma^{2}_{\tilde{\theta}_k})$ be the absolute orientation measurement at time $t_k$ and $\hat{\theta}_{k-1}$ be the heading estimate of the robot at $t_{k-1}$. Let $\delta\hat{\theta}_{k-1,k} \sim N(\delta\theta_{k-1,k}, \sigma^{2}_{\delta\theta_{k-1,k}})$ be the relative orientation measurement from pose $x_{k-1}$ to $x_k$ as computed by scan matching. Then $\hat{\theta}_k$ is computed as follows.

[0086] First, the relative orientation estimate is used to compute a prediction:

$$\hat{\theta}_k^{-} = \hat{\theta}_{k-1} + \delta\hat{\theta}_{k-1,k}$$

[0087] Then, the absolute orientation measurement is used to compute the update as follows:

$$y_k = \tilde{\theta}_k - \hat{\theta}_k^{-}, \qquad \hat{\theta}_k = \hat{\theta}_k^{-} + K_k\, y_k$$

where $K_k$ is the Kalman gain.
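The prediction and update steps can be sketched as a scalar Kalman filter. The variance propagation and gain below are the standard scalar Kalman equations and are an assumption here, since the source text gives only the mean prediction and update; the function names and numbers are illustrative.

```python
import math

def wrap_to_pi(angle):
    """Wrap an angle to the interval [-pi, pi)."""
    return (angle + math.pi) % (2.0 * math.pi) - math.pi

def fuse_heading(theta_prev, var_prev, dtheta, var_dtheta, theta_abs, var_abs):
    """One predict/update cycle fusing a relative and an absolute heading measurement."""
    # Prediction from the scan-matching relative orientation.
    theta_pred = theta_prev + dtheta
    var_pred = var_prev + var_dtheta

    # Update with the absolute orientation measurement.
    innovation = wrap_to_pi(theta_abs - theta_pred)
    gain = var_pred / (var_pred + var_abs)      # scalar Kalman gain
    theta_new = wrap_to_pi(theta_pred + gain * innovation)
    var_new = (1.0 - gain) * var_pred
    return theta_new, var_new

# Hypothetical values in radians and radians squared.
theta_k, var_k = fuse_heading(theta_prev=0.10, var_prev=0.01,
                              dtheta=0.05, var_dtheta=0.01,
                              theta_abs=0.16, var_abs=0.02)
```

Wrapping the innovation keeps the update well behaved when the prediction and the absolute measurement straddle the ±π boundary.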

[0088] Fusing orientation information may add robustness to the front-end. Small errors in relative orientation measurements may add up over time to create a super linear growth in localization error. This problem may arise from the non-linear nature of the orientation.

[0089] In an embodiment, the SLAM back-end may use the graph generated by the front-end along with absolute orientation data and solve a two-step optimization problem. The first step may be the estimation of robot orientation using the absolute orientation and relative orientation measurements followed by a second step in which a linearized least-squares optimization problem may be solved for the robot position.

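[0090] The robot orientation satisfies $\theta \in (-\pi, \pi]$; thus, as the robot navigates, the relative orientation measurements may not provide information about the angle wrap-around. Let $\delta\theta_{ij}$ be the relative orientation measurement from pose $x_i$ to $x_j$; then

$$\delta\theta_{ij} = \varphi(\theta_j - \theta_i)$$

where $\varphi$ is the modulo operator such that $\varphi(\theta) \in (-\pi, \pi]$. Thus the relative orientation measurement satisfies

$$\delta\theta_{ij} = \theta_j - \theta_i + 2 k_{ij} \pi$$

and the regularized relative orientation measurement is $\overline{\delta\theta}_{ij} = \delta\theta_{ij} - 2 k_{ij} \pi$.

[0091] Here $k_{ij}$ is the integer ambiguity. In our approach, since absolute orientation measurements are available, the integer ambiguity can simply be calculated as:

$$k_{ij} = \mathrm{round}\left(\frac{\delta\theta_{ij} - (\hat{\theta}_j - \hat{\theta}_i)}{2\pi}\right)$$

where $\hat{\theta}_i$ and $\hat{\theta}_j$ are the absolute orientation measurements at poses $x_i$ and $x_j$.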
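The integer ambiguity calculation can be sketched in a few lines. The function name and sample angles are hypothetical; the example assumes absolute headings reported in (-π, π] and a slightly noisy relative measurement.

```python
import math

def regularize_relative_orientation(dtheta_ij, theta_i_abs, theta_j_abs):
    """Return (k_ij, regularized measurement) using absolute heading measurements."""
    k_ij = round((dtheta_ij - (theta_j_abs - theta_i_abs)) / (2.0 * math.pi))
    return k_ij, dtheta_ij - 2.0 * math.pi * k_ij

# Hypothetical case: the robot turns by about +0.48 rad across the +/-pi boundary,
# so the absolute headings are 3.0 rad and -2.8 rad.
theta_i_abs, theta_j_abs = 3.0, -2.8
dtheta_measured = 0.49                 # wrapped relative measurement with small noise

k_ij, dtheta_regularized = regularize_relative_orientation(
    dtheta_measured, theta_i_abs, theta_j_abs)
```

After regularization the relative measurement is consistent with the difference of the absolute headings, so it can enter a linear model of the form $\theta_j - \theta_i$ without wrap-around terms.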

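[0092] Once the regularization constants are computed, we formulate a linear estimation problem by stacking together the absolute orientation measurement vector $\hat{\theta}$ and the regularized relative orientation measurement vector $\overline{\delta\theta}$ as

$$\beta = \begin{bmatrix} \hat{\theta} \\ \overline{\delta\theta} \end{bmatrix} = B\,\theta + \begin{bmatrix} v_{\theta} \\ v_{\delta\theta} \end{bmatrix}$$

where $B$ stacks an identity block for the absolute measurements over rows with entries in $\{-1, 0, +1\}$ for the relative measurements. This can be solved for the global orientation estimate as

$$\theta^{*} = \left(B^{T} R_{\beta}^{-1} B\right)^{-1} B^{T} R_{\beta}^{-1}\, \beta$$

and the estimate error covariance is $\Sigma_{\theta} = \left(B^{T} R_{\beta}^{-1} B\right)^{-1}$.

[0093] Once a global orientation estimate $\hat{\theta}$ is computed, we proceed to compute the robot position at each pose. We know that a relative pose measurement from pose $x_i$ to $x_j$ contains a relative position measurement ${}^{l}\Delta_{ij}$ as

$${}^{l}\Delta_{ij} = R_i\,(p_j - p_i)$$

where ${}^{l}\Delta_{ij}$ is the displacement measured in the local frame of pose $x_i$ and $p_i$, $p_j$ are the 2D positions. Let ${}^{l}\hat{\Delta} \sim N({}^{l}\Delta, {}^{l}R_{\Delta})$ be the vector of all local relative position measurements. The vector of local relative measurements can be transformed to the global frame. Thus the linear estimation problem may be formulated as:

$${}^{w}\hat{\Delta} = \hat{R}^{T}\,{}^{l}\hat{\Delta} = A'p + {}^{w}v_{\Delta}$$

where $\hat{R} = R(\hat{\theta})$ is the corresponding composition of rotation matrices parametrized by the estimated heading $\hat{\theta}$, $p$ is the vector of robot positions, $A'$ is a matrix with each row containing elements of the set $\{-1, 0, +1\}$, and ${}^{w}v_{\Delta} \sim N(0, {}^{w}R_{\Delta} = \hat{R}^{T}\,{}^{l}R_{\Delta}\,\hat{R})$ is the noise vector.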
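The two-step back-end can be sketched on a three-pose example. This is a simplified illustration under stated assumptions: measurements are noise-free, the covariances are arbitrary, a prior row on the first position is added to anchor the map, and the propagation of heading uncertainty into the position step is omitted. All names and values are hypothetical.

```python
import numpy as np

def dcm(theta):
    """Planar rotation R(theta) mapping world-frame vectors into the local frame."""
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, s], [-s, c]])

def weighted_least_squares(A, R, y):
    """Return the estimate (A^T R^-1 A)^-1 A^T R^-1 y and its covariance."""
    R_inv = np.linalg.inv(R)
    omega = A.T @ R_inv @ A
    return np.linalg.solve(omega, A.T @ R_inv @ y), np.linalg.inv(omega)

# Hypothetical ground truth for three poses and two relative-pose edges.
theta_true = np.array([0.0, 0.4, 0.9])
p_true = np.array([[0.0, 0.0], [1.0, 0.2], [1.8, 0.9]])
edges = [(0, 1), (1, 2)]

# Step 1: orientation. Stack absolute and regularized relative measurements.
D = np.zeros((len(edges), 3))
for row, (i, j) in enumerate(edges):
    D[row, i], D[row, j] = -1.0, 1.0
B = np.vstack([np.eye(3), D])
beta = B @ theta_true
R_beta = np.diag([1e-4] * 3 + [1e-3] * 2)       # assumed variances
theta_hat, theta_cov = weighted_least_squares(B, R_beta, beta)

# Step 2: position. Rotate local displacements to the world frame, then solve.
local_deltas = [dcm(theta_true[i]) @ (p_true[j] - p_true[i]) for i, j in edges]
world_deltas = [dcm(theta_hat[i]).T @ d for (i, _), d in zip(edges, local_deltas)]
A_pos = np.vstack([
    np.hstack([np.eye(2), np.zeros((2, 4))]),   # assumed prior on p0
    np.kron(D, np.eye(2)),                      # rows with entries -1, 0, +1
])
y_pos = np.concatenate([p_true[0]] + world_deltas)
R_pos = np.eye(6) * 1e-2                        # assumed variances
p_hat, p_cov = weighted_least_squares(A_pos, R_pos, y_pos)
p_hat = p_hat.reshape(3, 2)
```

Both steps are linear solves, which is the property that makes the back-end inexpensive once the headings have been estimated.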

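[0094] After the orientation estimates $\hat{\theta}$ are computed, they are stacked with the transformed global relative position measurements to give a new measurement vector $\gamma$:

$$\gamma = h_w({}^{l}\hat{\Delta}, \hat{\theta}) + v_w = \begin{bmatrix} \hat{R}^{T}\,{}^{l}\hat{\Delta} \\ \hat{\theta} \end{bmatrix} = \underbrace{\begin{bmatrix} A' & 0 \\ 0 & I \end{bmatrix}}_{A} \begin{bmatrix} p \\ \theta \end{bmatrix} + \begin{bmatrix} {}^{w}v_{\Delta} \\ v_{\theta} \end{bmatrix}$$

[0095] The error covariance $R_{\gamma}$ of the measurement vector $\gamma$ is given by:

$$R_{\gamma} = \nabla h_w \begin{bmatrix} {}^{l}R_{\Delta} & 0 \\ 0 & \Sigma_{\theta} \end{bmatrix} \nabla h_w^{T}$$

where $\nabla h_w$ is the Jacobian of the measurement function $h_w$ given by:

$$\nabla h_w = \begin{bmatrix} \hat{R}^{T} & M \\ 0 & I \end{bmatrix}, \quad \text{where } M = \frac{\partial\, (R^{T}\,{}^{l}\Delta)}{\partial \theta}$$

Thus:

$$R_{\gamma} = \begin{bmatrix} {}^{w}R_{\Delta} + M \Sigma_{\theta} M^{T} & M \Sigma_{\theta} \\ \Sigma_{\theta} M^{T} & \Sigma_{\theta} \end{bmatrix}$$

[0096] Thus, the solution to the linear estimation problem is given by:

$$\begin{bmatrix} p^{*} \\ \theta^{*} \end{bmatrix} = \left(A^{T} R_{\gamma}^{-1} A\right)^{-1} A^{T} R_{\gamma}^{-1}\, \gamma$$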

[0097] Note that the above involves the inversion of a large sparse matrix $R_{\gamma}$, which may not be suitable for implementation due to complexity and potential numerical issues. However, this inversion may be avoided by analytically computing the information matrix $\Omega_{\gamma} = R_{\gamma}^{-1}$ using block-matrix inversion rules as:

$$\Omega_{\gamma} = \begin{bmatrix} {}^{w}R_{\Delta}^{-1} & -\,{}^{w}R_{\Delta}^{-1} M \\ -M^{T}\,{}^{w}R_{\Delta}^{-1} & \Sigma_{\theta}^{-1} + M^{T}\,{}^{w}R_{\Delta}^{-1} M \end{bmatrix}$$

[0098] In an embodiment, a robot may be configured with a LiDAR with 360° sensing, a monocular ceiling facing camera, an IMU, and a processor. The robot may be deployed in a warehouse where the GPS signal is degraded or unavailable. The ceiling of the warehouse may be equipped with rectangular light fixtures at regular intervals, which may be leveraged for orientation estimation. In some embodiments, the processor may threshold the image such that a binary image is created. Thus the ceiling lights may appear as rectangular bright spots while the rest of the image appears black. Figure 10A shows the ceiling camera's view and Figure 10B shows the thresholded binary image. Heading estimates may be computed at 30 Hz. A first image captured by the camera may be used to determine an orientation of the robot. For example, the edge of the light fixture in the picture may be determined to be 0 degrees. As the robot travels and takes a subsequent image of the ceiling, the angle of the light fixture in the subsequent image may be determined relative to the 0 degree angle. This determination may be used to determine the absolute orientation. The processor may use a bin size b = 0.5 degrees for the histogram operation used for orientation estimation. The processor may create a map of the warehouse based on inputs from the LiDAR, camera, and IMU using the techniques described above.
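The thresholding step can be sketched with a synthetic image. The threshold value, image size, and function name are assumptions for illustration; the source does not state how the threshold is chosen.

```python
import numpy as np

def threshold_image(gray, threshold=200):
    """Return a binary image: 1 where the pixel is at least as bright as the threshold."""
    return (np.asarray(gray) >= threshold).astype(np.uint8)

# Synthetic grayscale ceiling image: dark background with one bright rectangular light.
ceiling = np.full((6, 8), 30, dtype=np.uint8)
ceiling[2:4, 1:7] = 255

binary = threshold_image(ceiling)
```

Line features such as the long edges of the bright rectangle could then be extracted from `binary` and passed to the orientation histogram.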

[0099] In the following discussion and in the claims, the terms "including" and "comprising" are used in an open-ended fashion, and thus should be interpreted to mean "including, but not limited to... ." In addition, the term "couple" or "couples" is intended to mean either an indirect or a direct connection. Thus, if a first device couples to a second device, that connection may be through a direct connection, or through an indirect connection accomplished via other devices and connections. Further, the term "software" includes any executable code capable of running on a processor, regardless of the media used to store the software. Thus, code stored in memory (e.g., non-volatile memory), and sometimes referred to as "embedded firmware," is included within the definition of software. The recitation "based on" is intended to mean "based at least in part on." Therefore, if X is based on Y, X may be based on Y and any number of other factors.

[00100] The above discussion is meant to be illustrative of the principles and various embodiments of the disclosure. Numerous variations and modifications will become apparent to those skilled in the art once the above disclosure is fully appreciated. For example, while some embodiments have been described herein with respect to facility inspection and risk assessment, those skilled in the art will understand that the principles disclosed herein are applicable to assessing risk associated with a wide variety of tangible inspection subjects or units. It is intended that the following claims be interpreted to embrace all such variations and modifications.