STABILIZING VIDEO TO REDUCE CAMERA AND FACE MOVEMENT

Title:

STABILIZING VIDEO TO REDUCE CAMERA AND FACE MOVEMENT

Document Type and Number:

WIPO Patent Application WO/2019/212749

Kind Code:

Abstract:

The subject matter described in this disclosure can be embodied in methods and systems for stabilizing video. A computing system determines a stabilized location of a facial feature in a frame of video accounting for its location in a previous frame. The computing system determines a physical camera pose in virtual space and maps the frame into virtual space. The computing system determines an optimized virtual camera pose using an optimization process that determines (1) a difference between the stabilized location of the facial feature and a location of the facial feature when viewed from a potential virtual camera pose, (2) a difference between the potential virtual camera pose and a previous virtual camera pose, and (3) a difference between the potential virtual camera pose and the physical camera pose. The computing system generates the stabilized view of the frame using the optimized virtual camera pose.

More Like This:

JP6147116	An imaging device, its control method, and a control program
JPH09168113	AUTOMATIC FOCUS DEVICE
WO/2019/039099	CONTROL DEVICE, CONTROL SYSTEM, CONTROL METHOD, PROGRAM AND RECORDING MEDIUM

Inventors:

LIANG CHIA-KAI (US)
SHI FUHAO (US)

Application Number:

PCT/US2019/027934

Publication Date:

November 07, 2019

Filing Date:

April 17, 2019

Export Citation:

Click for automatic bibliography generation Help

Assignee:

GOOGLE LLC (US)

International Classes:

H04N5/232

Domestic Patent References:

WO2008114264A2

2008-09-25

Foreign References:

EP2219364A1

2010-08-18

Other References:

D. FLEET ET AL. (EDS): "ECCV 2014, Part IV, LNCS 8692", 6 September 2014, SPRINGER, ISBN: 978-3-642-17318-9, article BELL STEVEN ET AL: "A Non-Linear Filter for Gyroscope-Based Video Stabilization", pages: 294 - 308, XP047296575
MATTHIAS GRUNDMANN ET AL: "Auto-directed video stabilization with robust L1 optimal camera paths", COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011 IEEE CONFERENCE ON, IEEE, 20 June 2011 (2011-06-20), pages 225 - 232, XP032038027, ISBN: 978-1-4577-0394-2, DOI: 10.1109/CVPR.2011.5995525
YINGLAN MA ET AL: "Video Stabilization and Face Saliency-based Retargeting", 1 October 2016 (2016-10-01), XP055600506, Retrieved from the Internet [retrieved on 20190628]

Attorney, Agent or Firm:

DOMMER, Andrew (US)

Download PDF:

View/Download PDF PDF Help

Claims:

WHAT IS CLAIMED IS:

1. A computer-implemented video stabilization method, comprising:

receiving, by a computing system, a video stream that includes multiple frames and that was captured by a physical camera;

determining, by the computing system and in a frame of the video stream that was captured by the physical camera, a location of a facial feature of a face that is depicted in the frame;

determining, by the computing system and using information received from a movement or orientation sensor coupled to the physical camera, a pose of the physical camera in a virtual space;

mapping, by the computing system, the frame of the video stream that was captured by the physical camera into the virtual space;

determining, by the computing system, an optimized pose of a virtual camera viewpoint in the virtual space from which to generate a stabilized view of the frame, using an optimization process that:

(i) determines a difference between the stabilized location of the facial feature and a location of the facial feature in a stabilized view of the frame viewed from a potential pose of the virtual camera viewpoint;

(ii) determines a difference between the potential pose of the virtual camera viewpoint in the virtual space and a previous pose of the virtual camera viewpoint in the virtual space; and

(iii) determines a difference between the potential pose of the virtual camera viewpoint in the virtual space and the pose of the physical camera in the virtual space; and

generating, by the computing system, the stabilized view of the frame using the optimized pose of the virtual camera viewpoint in the virtual camera space.

2. The computer-implemented video stabilization method of claim 1 , further comprising:

presenting, by the computing system, the stabilized view of the frame on a display of the computing system.

3. The computer-implemented video stabilization method of claim 1 or 2, wherein the movement or orientation sensor comprises a gyroscope.

4. The computer-implemented video stabilization method of any one claims 1 to 3, wherein:

the computing system determines the location the facial feature of the face that is depicted in the frame based on locations of multiple respective facial landmarks that are depicted in the frame; and

the computing system determines the difference between the stabilized location of the facial feature and the location of the facial feature in the stabilized view of the frame by measuring deviations between locations of the multiple facial landmarks in the stabilized view of the frame and the stabilized location of the facial feature.

5. The computer-implemented video stabilization method of claim 1 , wherein the optimization process comprises minimizing a value for a pose parameter being based on at least one of the following variables:

- a deviation between a landmark in the stabilized view of the frame to the stabilized location of the facial feature;

- a difference between the potential pose of the virtual camera viewpoint for the frame and the pose of the virtual camera viewpoint for the previous frame;

- a difference between a camera rotation in the virtual space for the

frame and a real camera rotation in the virtual space for the frame;

- a spherical angle between a camera rotation in the virtual space and a real camera rotation in the virtual space;

- a change of offsets to a virtual principal point between the frame and a previous frame; and - an amount of undefined pixels in the stabilized view of the frame that is generated using the potential pose of the virtual camera view point in the virtual space.

6. The computer-implemented video stabilization method of any one of the preceding claims, wherein the optimization process comprises a non-linear computational solver that optimizes values for multiple variables.

7. The computer-implemented video stabilization method of any one of the preceding claims, wherein the optimization process

determines an amount of undefined pixels in the stabilized view of the frame that is generated using the potential pose of the virtual camera view point in the virtual space.

8. The computer-implemented video stabilization method of any one of the preceding claims, wherein the optimization process:

determines a difference between:

(a) an offset of a principal point of the stabilized view of the frame that is generated using the potential pose of the virtual camera view point in the virtual space, and

(b) an offset of a previous principal point of a previous stabilized view of the frame that was generated using the previous pose of the virtual camera viewpoint in the virtual space.

9. The computer-implemented video stabilization method of any one of the preceding claims, wherein generating the stabilized view of the frame includes mapping a subset of scanlines of the frame to perspectives viewed from the optimized pose of the virtual camera viewpoint, and interpolating other of the scanlines of the frame.

10. The computer-implemented video stabilization method of any one of the preceding claims, wherein determining the stabilized location of the facial feature comprises using a location optimization process that:

(i) determines a difference between a potential stabilized location of the facial feature and an actual location of the facial feature in the frame;

(ii) determines a difference between the potential stabilized location of the facial feature and a location of the facial feature in a previous frame; and

(iii) accounts for a contraint on a distance between the potential stabilized location of the facial feature and a location of the facial feature in the frame.

1 1. The computer-implemented video stabilization method of any one of the preceding claims, further comprising:

selecting the face that is depicted in the frame of the video stream that was captured by the physical camera as a face to track from among multiple faces depicted in the frame of the video stream that was captured by the physical camera by:

selecting the face based on sizes of each of the multiple faces, selecting the face based on distances of each of the multiple faces to a center of the frame, or

selecting the face based on distances between a face selected for tracking in a previous frame and each of the multiple faces.

12. The computer-implemented video stabilization method of claim 1 , wherein the optimized pose of the virtual camera viewpoint has a different location and rotation in the virtual space than the pose of the physical camera.

13. The computer-implemented video stabilization method of any one of the preceding claims, wherein the frame of the video stream in the stabilized view is warped based on the optimized pose so that the frame appears to have been captured from the optimized pose of the virtual camera viewpoint rather than from the pose of the physical camera in the physical space.

14. A computerized system, comprising:

a camera;

a motion or orientation sensor physically coupled to the camera;

one or more processors;

one or more non-transitory computer-readable devices including instructions that, when executed by the one or more processors, cause performance of operations that include: