A METHOD, AN APPARATUS AND A COMPUTER PROGRAM PRODUCT FOR CODING A 360-DEGREE PANORAMIC VIDEO

Title:

A METHOD, AN APPARATUS AND A COMPUTER PROGRAM PRODUCT FOR CODING A 360-DEGREE PANORAMIC VIDEO

Document Type and Number:

WIPO Patent Application WO/2017/051072

Kind Code:

Abstract:

There are disclosed various methods, apparatuses and computer program products for video encoding. In some embodiments the method comprises reconstructing a 360-degree panoramic source picture for inter-layer prediction; deriving an inter- layer reference picture from the 360-degree panoramic source picture, wherein the deriving comprises one or both of: upsampling at leasta part of the 360-degree panoramic source picture, wherein said upsampling comprises filtering samples of a border region of the 360-degree panoramic source picture using at least partly one or more sample values of an opposite side border region and/or one or more variable values associated with one or more blocks of the opposite side border region; determining a reference region that crosses a picture boundary of the 360-degree panoramic source picture, and including in the reference region one or both of the following: one or more sample values of an opposite side border region; one or more variable values associated with one or more blocks of the opposite side border region.

Inventors:

HANNUKSELA MISKA (FI)

Application Number:

PCT/FI2016/050653

Publication Date:

March 30, 2017

Filing Date:

September 21, 2016

Export Citation:

Click for automatic bibliography generation Help

Assignee:

NOKIA TECHNOLOGIES OY (FI)

International Classes:

H04N19/597; H04N5/232; H04N19/105

Foreign References:

US20060034374A1	2006-02-16
JP4258879B2	2009-04-30
US20060034374A1	2006-02-16
JP4258879B2	2009-04-30

Other References:

CHEN L. ET AL.: "Disparity-compensated Inter- layer Motion Prediction Using Standardized HEVC Extensions", IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS, 24 May 2015 (2015-05-24), pages 2776 - 2779, XP055373261
BOYCE J. ET AL.: "Draft high efficiency video coding (HEVC) version 2, combined format range extensions (RExt), scalability (SHVC), and multi-view (MV-HEVC) extensions", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG 16 WP 3 AND ISO/IEC JTC 1/SC 29/WG 11 18TH MEETING, 30 June 2014 (2014-06-30), SAPPORO, JP, XP009509397
CHEN L. ET AL.: "Disparity-compensated Inter- layer Motion Prediction Using Standardized HEVC Extensions", IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS, 24 May 2015 (2015-05-24), pages 2776 - 2779, XP055373261
BOYCE J. ET AL.: "Draft high efficiency video coding (HEVC) version 2, combined format range extensions (RExt), scalability (SHVC), and multi-view (MV-HEVC) extensions", JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC) OF ITU-T SG 16 WP 3 AND ISO/IEC JTC 1/SC 29/WG 11 18TH MEETING, 30 June 2014 (2014-06-30), Sapporo, JP
See also references of EP 3354029A4

Attorney, Agent or Firm:

NOKIA TECHNOLOGIES OY et al. (FI)

Download PDF:

View/Download PDF PDF Help

Claims:

A method comprising

- reconstructing a 360-degree panoramic source picture for inter-layer prediction;

- deriving an inter-layer reference picture from the 360-degree panoramic source picture, wherein the deriving comprises one or both of:

o determining a reference region that crosses a picture boundary of the 360- degree panoramic source picture, and including in the reference region one or both of the following:

• one or more sample values of an opposite side border region;

• one or more variable values associated with one or more blocks of the opposite side border region.

The method according to claim 1, further comprising one or more of the following:

- deriving a disparity value;

- setting syntax elements for the reference region left offset and right offset equal to the disparity value; and

- setting syntax elements for the reference region top offset and bottom offset equal to zero.

The method according to claim 2, further comprising deriving the disparity value from pictures of one or more access unit using one or more of the following:

- deriving an average disparity from one or more depth maps associated with said pictures:

- deriving the disparity from camera parameters applying said pictures;

- deriving an average disparity between views, using a stereo matching algorithm;

- deriving an average disparity from inter- view motion vector applying between picture of different views. The method according to any of the previous claims 1 to 3, further comprising one or more of the following:

- deriving an average disparity value for one or more pictures;

- setting syntax elements for a scaled reference layer left offset and a scaled reference layer right offset equal to the average disparity value; and

- setting syntax element for a scaled reference layer top offset and a scaled reference layer bottom offset equal to zero.

The method according to any of the previous claims 1 to 4, further comprising;

- creating two occurrences of a base- view picture in reference pictures lists, wherein first occurrence is a conventional inter-layer reference picture and the second occurrence is a resampled picture, and

- optionally indicating the creation of the two occurrences in the bitstream.

A method comprising

- encoding samples of a border region of a 360-degree panoramic picture, wherein the encoding comprises utilizing one or both of the following

• one or more sample values of an opposite side border region:

• one or more variable values associated with one or more blocks of the opposite side border region

in processing of the samples of the border region;

- wherein said processing of the samples is one or both of the following: prediction of the samples of the border region, reconstruction of the samples of the border region, and wherein the processing comprises one or more of the following:

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

• the one or more sample values of the opposite side border region;

• the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy coding on the basis of one or both of the

following:

• the one or more sample values of an opposite side border region;

• the one or more variable values associated with one or more blocks of the opposite side border region. A method comprising

- decoding samples of a border region of a 360-degree panoramic picture, wherein the decoding comprises utilizing one or both of the following

• one or more sample values of an opposite side border region:

• one or more variable values associated with one or more blocks of the opposite side border region

in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

• the one or more sample values of the opposite side border region;

• the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy decoding on the basis of one or both of the

following:

• the one or more sample values of an opposite side border region;

• the one or more variable values associated with one or more blocks of the opposite side border region.

An apparatus comprising at least one processor; at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following:

- to reconstruct a 360-degree panoramic source picture for inter-layer prediction;

- to derive an inter-layer reference picture from the 360-degree panoramic source picture, wherein the deriving comprises one or both of:

o upsampling at least a part of the 360-degree panoramic source picture, wherein said upsampling comprises filtering samples of a border region of the 360-degree panoramic source picture using at least partly one or more sample values of an opposite side border region and/or one or more variable values associated with one or more blocks of the opposite side border region; determining a reference region that crosses a picture boundary of the 360-degree panoramic source picture, and including in the reference region one or both of the following:

• one or more sample values of an opposite side border region;

• one or more variable values associated with one or more blocks of the opposite side border region.

9. The apparatus according to claim 8, further comprising computer program code configured to cause the apparatus to perform at least the following:

- to derive a disparity value;

- to set syntax elements for the reference region left offset and right offset equal to the disparity value; and

- to set syntax elements for the reference region top offset and bottom offset equal to zero.

10. The apparatus according to claim 9, further comprising computer program code configured to cause the apparatus to perform at least the following: to derive the disparity value from pictures of one or more access unit by using one or more of the following:

- deriving an average disparity from one or more depth maps associated with said pictures:

- deriving the disparity from camera parameters applying said pictures;

- deriving an average disparity between views, using a stereo matching algorithm;

- deriving an average disparity from inter- view motion vector applying between picture of different views.

11. The apparatus according to any of the previous claims 8 to 10, further comprising computer program code configured to cause the apparatus to perform at least the following:

- deriving an average disparity value for one or more pictures;

- setting syntax elements for a scaled reference layer left offset and a scaled reference layer right offset equal to the average disparity value; and

- setting syntax element for a scaled reference layer top offset and a scaled reference layer bottom offset equal to zero.

12. The apparatus according to any of the previous claims 8 to 11, further comprising computer program code configured to cause the apparatus to perform at least the following: - creating two occurrences of a base- view picture in reference pictures lists, wherein first occurrence is a conventional inter-layer reference picture and the second occurrence is a resampled picture, and

- optionally indicating the creation of the two occurrences in the bitstream.

13. An apparatus comprising at least one processor; at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following:

- to encode samples of a border region of a 360-degree panoramic picture, wherein the encoding comprises utilizing one or both of the following

• one or more sample values of an opposite side border region:

• one or more variable values associated with one or more blocks of the opposite side border region

in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

• the one or more sample values of the opposite side border region;

• the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy coding on the basis of one or both of the following:

• the one or more sample values of an opposite side border region;

• the one or more variable values associated with one or more blocks of the opposite side border region.

14. An apparatus comprising at least one processor; at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following:

- to decode samples of a border region of a 360-degree panoramic picture, wherein the decoding comprises utilizing one or both of the following

• one or more sample values of an opposite side border region: • one or more variable values associated with one or more blocks of the opposite side border region

in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

• the one or more sample values of the opposite side border region;

• the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy decoding on the basis of one or both of the

following:

• the one or more sample values of an opposite side border region;

• the one or more variable values associated with one or more blocks of the opposite side border region. 15. An apparatus comprising

- means for processing;

- means for reconstructing a 360-degree panoramic source picture for inter- layer prediction;

- means for deriving an inter- layer reference picture from the 360-degree panoramic source picture, wherein the means for deriving is configured to perform one or both of

o determining a reference region that crosses a picture boundary of the 360-degree panoramic source picture, and including in the reference region one or both of the following:

• one or more sample values of an opposite side border region;

• one or more variable values associated with one or more blocks of the opposite side border region.

16. An apparatus comprising - means for processing;

- means for encoding samples of a border region of a 360-degree panoramic picture, wherein the means for encoding is configured to utilize one or both of the following

• one or more sample values of an opposite side border region:

· one or more variable values associated with one or more blocks of the opposite side border region

o in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

· the one or more sample values of the opposite side border region;

• the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy coding on the basis of one or both of the

following:

· the one or more sample values of an opposite side border region;

• the one or more variable values associated with one or more blocks of the opposite side border region.

17. An apparatus comprising

- means for processing;

- means for decoding samples of a border region of a 360-degree panoramic picture, wherein the means for decoding is configured to utilize one or both of the following

• one or more sample values of an opposite side border region:

• one or more variable values associated with one or more blocks of the opposite side border region

o in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values; o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

• the one or more sample values of the opposite side border region;

• the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy decoding on the basis of one or both of the

following:

• the one or more sample values of an opposite side border region;

• the one or more variable values associated with one or more blocks of the opposite side border region.

18. A computer program product comprising a computer-readable medium bearing computer program code embodied therein for use with a computer, the computer program code comprising:

- code for reconstructing a 360-degree panoramic source picture for inter-layer prediction;

- code for deriving an inter-layer reference picture from the 360-degree panoramic source picture, wherein the deriving comprises one or both of

o code for upsampling at least a part of the 360-degree panoramic source picture, wherein said upsampling comprises filtering samples of a border region of the 360- degree panoramic source picture using at least partly one or more sample values of an opposite side border region and/or one or more variable values associated with one or more blocks of the opposite side border region;

o code for determining a reference region that crosses a picture boundary of the 360- degree panoramic source picture, and including in the reference region one or both of the following:

• one or more sample values of an opposite side border region;

• one or more variable values associated with one or more blocks of the opposite side border region. 19. A computer program product comprising a computer-readable medium bearing computer program code embodied therein for use with a computer, the computer program code comprising:

- code for encoding samples of a border region of a 360-degree panoramic picture, wherein the encoding comprises utilizing one or both of the following

• one or more sample values of an opposite side border region:

• one or more variable values associated with one or more blocks of the opposite side border region o in processing of the samples of the border region;

wherein said processing of the samples is one or both of the following: prediction of the samples of the border region, reconstruction of the samples of the border region, and wherein the processing comprises one or more of the following:

o code for obtaining a prediction block for intra prediction based on the one or more sample values;

o code for filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

• the one or more sample values of the opposite side border region;

• the one or more variable values associated with the one or more blocks of the opposite side border region;

o code for tuning context-adaptive entropy coding on the basis of one or both of the following:

• the one or more sample values of an opposite side border region;

• the one or more variable values associated with one or more blocks of the opposite side border region.

20. A computer program product comprising a computer-readable medium bearing computer program code embodied therein for use with a computer, the computer program code comprising:

- code for decoding samples of a border region of a 360-degree panoramic picture, wherein the decoding comprises utilizing one or both of the following

• one or more sample values of an opposite side border region:

• one or more variable values associated with one or more blocks of the opposite side border region

in processing of the samples of the border region;

o code for obtaining a prediction block for intra prediction based on the one or more sample values;

o code for filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

• the one or more sample values of the opposite side border region;

· the one or more variable values associated with the one or more blocks of the opposite side border region; code for tuning context-adaptive entropy decoding on the basis of one or both of the following:

• the one or more sample values of an opposite side border region;

• the one or more variable values associated with one or more blocks of the opposite side border region.

Description:

A METHOD, AN APPARATUS AND A COMPUTER PROGRAM PRODUCT FOR CODING

A 360-DEGREE PANORAMIC VIDEO

TECHNICAL FIELD

[0001 ] The present embodiments relate to coding of 360-degree panoramic video.

BACKGROUND

[0002] This section is intended to provide a background or context to the invention that is recited in the claims. The description herein may include concepts that could be pursued, but are not necessarily ones that have been previously conceived or pursued. Therefore, unless otherwise indicated herein, what is described in this section is not prior art to the description and claims in this application and is not admitted to be prior art by inclusion in this section.

[0003] 360-degree panoramic images and video cover horizontally the full 360-degree field-of-view around the capturing position. 360-degree panoramic video content can be acquired e.g. by stitching pictures of more than one camera sensor to a single 360-degree panoramic image. Also, a single image sensor can be used with an optical arrangement to generate 360-degree panoramic image.

SUMMARY

[0004] Some embodiments provide a method and an apparatus for implementing the method for encoding and decoding 360-degree panoramic video.

[0005] Various aspects of examples of the invention are provided in the detailed description.

[0006] According to a first aspect, there is provided a method comprising:

reconstructing a 360-degree panoramic source picture for inter-layer prediction;

deriving an inter-layer reference picture from the 360-degree panoramic source picture, wherein the deriving comprises one or both of:

o determining a reference region that crosses a picture boundary of the 360-degree panoramic source picture, and including in the reference region one or both of the following:

^■ one or more sample values of an opposite side border region;

■ one or more variable values associated with one or more blocks of the opposite side border region.

[0007] According to a second aspect, there is provided a method comprises encoding samples of a border region of a 360-degree panoramic picture, wherein the encoding comprises utilizing one or both of the following

^■ one or more sample values of an opposite side border region:

^■ one or more variable values associated with one or more blocks of the

opposite side border region

in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

^■ the one or more sample values of the opposite side border region; - the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy coding on the basis of one or both of the following:

^■ the one or more sample values of an opposite side border region;

^■ the one or more variable values associated with one or more blocks of the opposite side border region.

According to a third aspect, there is provided a method comprises

decoding samples of a border region of a 360-degree panoramic picture, wherein the decoding comprises utilizing one or both of the following

^■ one or more sample values of an opposite side border region:

■ one or more variable values associated with one or more blocks of the

opposite side border region

in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

■ the one or more sample values of the opposite side border region;

^■ the one or more variable values associated with the one or more blocks of the opposite side border region; o tuning context-adaptive entropy decoding on the basis of one or both of the following:

^■ the one or more sample values of an opposite side border region;

^■ the one or more variable values associated with one or more blocks of the opposite side border region.

[0009] According to a fourth aspect, there is provided an apparatus comprising at least one processor; at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following:

reconstructing a 360-degree panoramic source picture for inter-layer prediction;

deriving an inter-layer reference picture from the 360-degree panoramic source picture, wherein the deriving comprises one or both of:

o determining a reference region that crosses a picture boundary of the 360-degree panoramic source picture, and including in the reference region one or both of the following:

^■ one or more sample values of an opposite side border region;

^■ one or more variable values associated with one or more blocks of the opposite side border region.

[0010] According to a fifth aspect, there is provided an apparatus comprising at least one processor; at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following:

encoding samples of a border region of a 360-degree panoramic picture, wherein the encoding comprises utilizing one or both of the following

^■ one or more sample values of an opposite side border region:

^■ one or more variable values associated with one or more blocks of the

opposite side border region

in processing of the samples of the border region;

^■ the one or more sample values of the opposite side border region;

^■ the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy coding on the basis of one or both of the following:

^■ the one or more sample values of an opposite side border region;

^■ the one or more variable values associated with one or more blocks of the opposite side border region.

[001 1] According to a sixth aspect, there is provided an apparatus comprising at least one processor; at least one memory including computer program code, the at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus to perform at least the following:

decoding samples of a border region of a 360-degree panoramic picture, wherein the decoding comprises utilizing one or both of the following

^■ one or more sample values of an opposite side border region:

^■ one or more variable values associated with one or more blocks of the

opposite side border region

in processing of the samples of the border region;

- wherein said processing of the samples is one or both of the following: prediction of the

samples of the border region, reconstruction of the samples of the border region, and wherein the processing comprises one or more of the following:

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

^■ the one or more sample values of the opposite side border region;

^■ the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy decoding on the basis of one or both of the following:

^■ the one or more sample values of an opposite side border region;

^■ the one or more variable values associated with one or more blocks of the opposite side border region.

[0 12] According to a seventh aspect, there is provided an apparatus comprises

- means for processing;

means for reconstructing a 360-degree panoramic source picture for inter- layer prediction; means for deriving an inter-layer reference picture from the 360-degree panoramic source picture, wherein the means for deriving is configured to perform one or both of

o determining a reference region that crosses a picture boundary of the 360-degree panoramic source picture, and including in the reference region one or both of the following:

^■ one or more sample values of an opposite side border region;

^■ one or more variable values associated with one or more blocks of the opposite side border region.

According to an eighth aspect, there is provided an apparatus comprises

means for processing;

means for encoding samples of a border region of a 360-degree panoramic picture, wherein the means for encoding is configured to utilize one or both of the following

^■ one or more sample values of an opposite side border region:

^■ one or more variable values associated with one or more blocks of the

opposite side border region

in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

^■ the one or more sample values of the opposite side border region;

^■ the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy coding on the basis of one or both of the following:

^■ the one or more sample values of an opposite side border region;

^■ the one or more variable values associated with one or more blocks of the opposite side border region.

According to a ninth aspect, there is provided an apparatus comprises

means for processing; means for decoding samples of a border region of a 360-degree panoramic picture, wherein the means for decoding is configured to utilize one or both of the following

^■ one or more sample values of an opposite side border region:

^■ one or more variable values associated with one or more blocks of the

opposite side border region

in processing of the samples of the border region;

o obtaining a prediction block for intra prediction based on the one or more sample values;

o filtering intermediate reconstructed samples of the border region on the basis of one or both of the following:

^■ the one or more sample values of the opposite side border region;

^■ the one or more variable values associated with the one or more blocks of the opposite side border region;

o tuning context-adaptive entropy decoding on the basis of one or both of the following:

^■ the one or more sample values of an opposite side border region;

^■ the one or more variable values associated with one or more blocks of the opposite side border region.

[0015] According to a tenth aspect, there is provided a computer program product comprising a computer-readable medium bearing computer program code embodied therein for use with a computer, the computer program code comprising:

code for reconstructing a 360-degree panoramic source picture for inter-layer prediction; code for deriving an inter-layer reference picture from the 360-degree panoramic source picture, wherein the deriving comprises one or both of

o code for upsampling at least a part of the 360-degree panoramic source picture, wherein said upsampling comprises filtering samples of a border region of the 360-degree panoramic source picture using at least partly one or more sample values of an opposite side border region and/or one or more variable values associated with one or more blocks of the opposite side border region;

o code for determining a reference region that crosses a picture boundary of the 360- degree panoramic source picture, and including in the reference region one or both of the following:

^■ one or more sample values of an opposite side border region;

^■ one or more variable values associated with one or more blocks of the opposite side border region. [0016] According to an eleventh aspect, there is provided a computer program product comprising a computer-readable medium bearing computer program code embodied therein for use with a computer, the computer program code comprising:

code for encoding samples of a border region of a 360-degree panoramic picture, wherein the encoding comprises utilizing one or both of the following