Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
A METHOD AND SYSTEM FOR REDUCING VIDEO RESOLUTION
Document Type and Number:
WIPO Patent Application WO/2015/144899
Kind Code:
A1
Abstract:
A method for reducing video resolution, comprising the steps of processing frames of the video sequence by: creating a details bitmap (D), taking into account, for each pixel of the details bitmap (D), the values of neighboring pixels in order for the bitmap (D) to have values of pixels dependent on the amount of details at a particular area of the video frame; creating a net (N) of regularly distributed nodes and deforming the position of nodes according to the contents of the details bitmap (D) such that the nodes are brought closer to each other at places where the details bitmap (D) indicates higher amount of details; morphing the frame to a modified frame (ΡΒ') using morphing based on the net N whereas the areas with high frequencies are stretched and areas without high frequencies are compacted; and scaling down the modified frame (ΡΒ') to obtain a reduced frame (PB").

Inventors:
PACZKOWSKI JACEK (PL)
KRAMEK KRZYSZTOF (PL)
NALEWA TOMASZ (PL)
Application Number:
PCT/EP2015/056763
Publication Date:
October 01, 2015
Filing Date:
March 27, 2015
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
PATENTS FACTORY LTD SP Z O O (PL)
International Classes:
G06T9/00; H04N19/132; H04N19/182; H04N19/59; H04N19/85
Domestic Patent References:
WO2013017565A12013-02-07
Foreign References:
US5253059A1993-10-12
Other References:
SALEMBIER P ET AL: "VERY LOW BIT RATE VIDEO CODING USING ACTIVE TRIANGULAR MESH", 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP). ATLANTA, MAY 7 - 10, 1996; [IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS. (ICASSP)], NEW YORK, IEEE, US, vol. CONF. 21, 7 May 1996 (1996-05-07), pages 2060 - 2063, XP000681654, ISBN: 978-0-7803-3193-8, DOI: 10.1109/ICASSP.1996.545723
HYUNGSUK OH ET AL: "Video Processing for Human Perceptual Visual Quality-Oriented Video Coding", IEEE TRANSACTIONS ON IMAGE PROCESSING, IEEE SERVICE CENTER, PISCATAWAY, NJ, US, vol. 22, no. 4, 1 April 2013 (2013-04-01), pages 1526 - 1535, XP011498252, ISSN: 1057-7149, DOI: 10.1109/TIP.2012.2233485
MUSMANN H G ET AL: "Object-oriented analysis-synthesis coding of moving images", SIGNAL PROCESSING. IMAGE COMMUNICATION, ELSEVIER SCIENCE PUBLISHERS, AMSTERDAM, NL, vol. 1, no. 2, 1 October 1989 (1989-10-01), pages 117 - 138, XP024241727, ISSN: 0923-5965, [retrieved on 19891001], DOI: 10.1016/0923-5965(89)90005-2
KAWAMURA S ET AL: "Estimation of motion using motion blur for tracking vision system", WORLD AUTOMATION CONGRESS, 2002. PROCEEDINGS OF THE 5TH BIANNUAL JUNE 9-13, 2002, PISCATAWAY, NJ, USA,IEEE, vol. 13, 9 June 2002 (2002-06-09), pages 371 - 376, XP010616447, ISBN: 978-1-889335-18-6
P. SALEMBIER ET AL.: "Very low bit rate video coding using active triangular mesh", IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING - PROCEEDINGS (ICASSP, 7 May 1996 (1996-05-07), pages 2060 - 2063, XP000681654, DOI: doi:10.1109/ICASSP.1996.545723
Attorney, Agent or Firm:
PAWLOWSKI, Adam (ul. Zeligowskiego 3/5, Lodz, PL)
Download PDF:
Claims:
CLAIMS

1 . A method for reducing video resolution, comprising the steps of processing frames of the video sequence by:

- creating a details bitmap (D), taking into account, for each pixel of the details bitmap (D), the values of neighboring pixels in order for the bitmap (D) to have values of pixels dependent on the amount of details at a particular area of the video frame;

- creating a net (N) of regularly distributed nodes and deforming the position of nodes according to the contents of the details bitmap (D) such that the nodes are brought closer to each other at places where the details bitmap (D) indicates higher amount of details;

- morphing the frame to a modified frame (ΡΒ') using morphing based on the net N whereas the areas with high frequencies are stretched and areas without high frequencies are compacted; and

- scaling down the modified frame (ΡΒ') to obtain a reduced frame (PB").

2. The method according to claim 1 , further comprising the steps of:

- analyzing the input frame (P) of the video sequence to create a motion bitmap (M) having the same resolution as the input frame (P) and values of pixels proportional to the speed of movement of pixels of the input frame (P);

- blurring the input frame (P) such that the amount of blurring particular pixels depends on the values of corresponding pixels of motion bitmap (M) to obtain a blurred frame (PB); and

- creating the details bitmap (D) based on the blurred frame (PB).

3. The method according to claim 1 , wherein the details bitmap (D) is created using a formula:

wherein: n is a size of a matrix;

Hjj is a matrix of FFT coefficients;

a,,j is a matrix comprising weight coefficients. 4. The method according to claim 1 , wherein the modified frame (ΡΒ') is scaled down to obtain a reduced frame (PB") using a scaling coefficient (kx) for x axis different from a scaling coefficient (ky) for the y axis.

5. The method according to claim 1 , further comprising compressing the reduced bitmap (PB") to obtain a compressed bitmap PBC.

6. A computer program comprising program code means for performing all the steps of the computer-implemented method according to any of claims 1 -5 when said program is run on a computer.

7. A computer readable medium storing computer-executable instructions performing all the steps of the computer-implemented method according to any of claims 1 -5 when executed on a computer. 8. A system for s reducing video resolution, the system comprising a controller (303) configured to process frames of the video sequence by:

- creating a details bitmap (D), taking into account, for each pixel of the details bitmap (D), the values of neighboring pixels in order for the bitmap (D) to have values of pixels dependent on the amount of details at a particular area of the video frame;

- creating a net (N) of regularly distributed nodes and deforming the position of nodes according to the contents of the details bitmap (D) such that the nodes are brought closer to each other at the places wherein the details bitmap (D) indicates higher amount of details;

- morphing the frame to a modified frame (ΡΒ') using morphing based on the net N whereas the areas with high frequencies are stretched and areas without high frequencies are compacted; and WO 2015/144899 <| <| PCT/EP2015/056763

- scaling down the modified frame (ΡΒ') to obtain a reduced frame (PB").

Description:
A METHOD AND SYSTEM FOR REDUCING VIDEO RESOLUTION

DESCRIPTION TECHNICAL FIELD

The present invention relates to a method and system for reducing video resolution.

BACKGROUND

There are known various methods for processing images, in particular to reduce image resolution.

A PCT application WO2013017565 presents a method for transmitting a high resolution video stream comprising high resolution video frames as low resolution video stream comprising low resolution video frames, comprising the steps of: defining a size of a region for dividing the high resolution video frames into equally-sized adjacent regions, processing the high resolution video frames in sets, each set containing a number of frames equal to the number of pixels within the region, wherein within the set: converting the first high resolution video frame of the set to a first low resolution video frame by copying the first pixel of each region to the first low resolution video frame to a position defined by a split function dependent on the pixel position in the high resolution video frame and the size of the region, converting the next high resolution video frame of the set to a next low resolution video frame by copying the next pixel of each region to the next low resolution video frame to a position defined by the split function, until all high resolution video frames of the set are converted to low resolution video frames; and transmitting the low resolution video stream comprising the low resolution video frames as a representation of the high resolution video stream comprising the high resolution video frames. The conversion is carried out in the same manner for all frames.

A publication "Very low bit rate video coding using active triangular mesh" by P. Salembier et al (1996 IEEE International conference on acoustics, speech, and signal processing - proceedings (ICASSP), Atlanta, May 7-10, 1996, pages 2060-2063) discloses a video coding approach, which relies on the notion of active meshes and can be viewed as a particular case of region-based coding. The approach is not related to transforming a video frame to reduce the high frequencies in the image before downscaling the frame. It is related to a use of morphing to transmit information about motion within the video image, i.e. morphing is used to code motion.

It would be advantageous to provide an alternative method for reducing resolution of an image, that takes into account the contents of the frame. SUMMARY

The object of the invention is a method for reducing video resolution, comprising the steps of processing frames of the video sequence by: creating a details bitmap (D), taking into account, for each pixel of the details bitmap (D), the values of neighboring pixels in order for the bitmap (D) to have values of pixels dependent on the amount of details at a particular area of the video frame; creating a net (N) of regularly distributed nodes and deforming the position of nodes according to the contents of the details bitmap (D) such that the nodes are brought closer to each other at places where the details bitmap (D) indicates higher amount of details; morphing the frame to a modified frame (ΡΒ') using morphing based on the net N whereas the areas with high frequencies are stretched and areas without high frequencies are compacted; and scaling down the modified frame (ΡΒ') to obtain a reduced frame (PB").

Preferably, the method further comprises analyzing the input frame (P) of the video sequence to create a motion bitmap (M) having the same resolution as the input frame (P) and values of pixels proportional to the speed of movement of pixels of the input frame (P); blurring the input frame (P) such that the amount of blurring particular pixels depends on the values of corresponding pixels of motion bitmap (M) to obtain a blurred frame (PB); and creating the details bitmap (D) based on the blurred frame (PB).

Preferably, the details bitmap (D) is created using a formula:

wherein:

n is a size of a matrix;

Hi is a matrix of FFT coefficients;

a, ,j is a matrix comprising weight coefficients.

Preferably, the modified frame (ΡΒ') is scaled down to obtain a reduced frame (PB") using a scaling coefficient (kx) for x axis different from a scaling coefficient (ky) for the y axis.

Preferably, the method further comprises compressing the reduced bitmap (PB") to obtain a compressed bitmap PBC.

The object of the invention is also a computer program comprising program code means for performing all the steps of the computer-implemented method as described above when said program is run on a computer, as well as a computer readable medium storing computer-executable instructions performing all the steps of the computer-implemented method as described above when executed on a computer.

Another object of the present invention is a computer-implemented system for reducing video resolution, the system comprising a controller configured to process frames of the video sequence by: creating a details bitmap (D), taking into account, for each pixel of the details bitmap (D), the values of neighboring pixels in order for the bitmap (D) to have values of pixels dependent on the amount of details at a particular area of the video frame; creating a net (N) of regularly distributed nodes and deforming the position of nodes according to the contents of the details bitmap (D) such that the nodes are brought closer to each other at the places wherein the details bitmap (D) indicates higher amount of details; morphing the frame to a modified frame (ΡΒ') using morphing based on the net N whereas the areas with high frequencies are stretched and areas without high frequencies are compacted; and scaling down the modified frame (ΡΒ') to obtain a reduced frame (PB"). BRIEF DESCRIPTION OF FIGURES

These and other objects presented herein are accomplished by providing a method and a system for reducing resolution of an image. Further details and features of the present invention, its nature and various advantages will become more apparent from the following detailed description of the preferred embodiments shown in a drawing, in which:

Fig. 1 presents an example embodiment of a method for reducing resolution of an image.

Fig. 2 presents an example embodiment of a method for displaying the reduced image,

Fig. 3 present a system for reducing resolution of a video.

Fig. 4A-4F present examples of matrices for creating the bitmap D.

Fig. 5A, 5B present a net N. NOTATION AND NOMENCLATURE

Some portions of the detailed description which follows are presented in terms of data processing procedures, steps or other symbolic representations of operations on data bits that can be performed on computer memory. Therefore, a computer executes such logical steps thus requiring physical manipulations of physical quantities.

Usually these quantities take the form of electrical or magnetic signals capable of being stored, transferred, combined, compared, and otherwise manipulated in a computer system. For reasons of common usage, these signals are referred to as bits, packets, messages, values, elements, symbols, characters, terms, numbers, or the like.

Additionally, all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Terms such as "processing" or "creating" or "transferring" or "executing" or "determining" or "detecting" or "obtaining" or "selecting" or "calculating" or "generating" or the like, refer to the action and processes of a computer system that manipulates and transforms data represented as physical (electronic) quantities within the computer's registers and memories into other data similarly represented as physical quantities within the memories or registers or other such information storage.

DETAILED DESCRIPTION

Fig. 1 shows an example embodiment of a method for reducing resolution of an image. The procedure is applied for consecutive frames the video sequence.

First, in step 101 , there is performed motion analysis in the current video frame represented by image (bitmap) P, i.e. analysis of which pixels have changed their position with respect to the previous frame. Next, in step 102, a motion bitmap M is created, having the same resolution as the input frame P, wherein the values of pixels of M are proportional to the dynamics of movement of that pixels. For example, the value of pixel is proportional to the speed of movement of object to which the pixel belongs, wherein the speed of movement is calculated based on analysis of movement of that object on the current frame compared to one or more previous frames. Alternatively, the dynamics can be calculated as a function of both speed and acceleration of movement of the object between frames. If a pixel is static, then its value on bitmap M is 0. Next, in step 103, the input bitmap P is blurred to generate a blurred bitmap PB. The amount of blurring of particular pixels of bitmap P depends on the values of corresponding pixels of motion bitmap M - the faster motion at a particular position, the more this position is blurred. Next, in step 104, the blurred image PB is analyzed to detect the areas with high amount of details, by performing detection of high frequencies. In step 105, a details bitmap D is created, having values of pixels dependent on the amount of details at a particular area.

For example, each pixel of bitmap D is created by taking into account the values of neighboring pixels of that pixels on the PB bitmap, for example 24 neighboring pixels as shown in Fig. 4A. This fragment of the PB bitmap is then transformed using Fast Fourier Transform (FFT) to a H matrix, shown in Fig. 4B.

Next, the values of pixels of bitmap D are calculated using formula: i=l j=l

wherein a, j is a matrix comprising weight coefficients which promote the high frequency components of the H matrix. Fig. 4C shows an example of the a,j matrix with linear increase of coefficients, and Fig. 4D shows an example of the a,j matrix wherein the values are squared.

If a matrix of dimensions of 5x5 is used, then the first 2 rows and columns of the D bitmap can be determined by copying there the values of row no. 3 and column no. 3 calculated according to the above formula. Similarly, the last 2 rows and columns of the D bitmap can be determined by copying there the values of row no. last-3 and column no. last-3 calculated according to the above formula. Fig. 4E presents an example of an input frame P and fig. 4F presents an example of the corresponding bitmap D.

It will be understood that matrices of different sized, such as 3x3, 7x7 etc. can be used as well.

Next, in step 106, a net N is created, which is a table defining position of net nodes, which are regularly distributed across the image. For example, for a 1920x1080 frame, the net N may have 21 x21 nodes. It means that a node will be placed at every 96 th pixel in the horizontal axis and at every 54 th pixel in the vertical axis. Fig. 5A presents an example of the net N.

In step 107 the net N is deformed according to the contents of the bitmap D. The nodes are brought closer to each other at the places wherein the bitmap D has large values and further away from each other at the places wherein the bitmap D has small values. Fig. 5B presents an example of the deformed net N.

In step 108 the frame PB is converted to a modified frame PB' having the same resolution, using a morphing technique (in particular, using warping) based on the net N. The modified frame PB' shall have less high frequencies than frame PB, because the areas with high frequencies are stretched and areas without high frequencies are compacted.

Next, in step 109, the modified frame PB' is scaled down to obtain a reduced frame PB". Scaling can be performed with different factors for x and y axis. The scaling coefficient kx, ky is selected depending on the contents of bitmap D in the x and y direction and the expected quality after decoding.

In step 1 10 the reduced bitmap PB" is compressed to obtain a compressed bitmap PBC and the definition of net N and scaling coefficients kx, ky are added to and stored as a scaled-down representation f_PBC.

Fig. 2 shows the procedure for decoding the compressed representation f_PBC. First, the compressed bitmap PBC is read in step 201 . Next, PBC is decompressed in step 202 to obtain the reduced frame PB". In step 203 the reduced frame PB" is scaled up using coefficients kx, ky to obtain the modified frame PB'. In step 204 re-morphing of the modified frame PB' is performed using the net N to obtain the frame PB. Next, frame PB is displayed in step 205.

Fig. 3 shows a system for reducing video resolution. The system may be implemented by using programmable logic circuits such as PGA (Programmable Gate Array), FPGA (Field-Programmable Gate Array) or ASIC (Application-Specific Integrated Circuit) or SoC (System on the Chip). The system comprises a suitable data bus 301 for communicating, preferably bidirectionally, all circuits of the system. Further, the system comprises a memory 302 for storing required software for the controller 303 and any temporary data needed for operation of the system, such as the video stream and image data bitmaps. The controller 303 is configured to execute software that allows for reducing image resolution, wherein such software is configured to execute the method steps of Fig. 1 . A user interface (304) allows the user to control the process.

The presented method and system can be used in applications such as video surveillance systems, digital television systems, streaming video systems etc.

It can be easily recognized, by one skilled in the art, that the aforementioned method for reducing image resolution may be performed and/or controlled by one or more computer programs. Such computer programs are typically executed by utilizing the computing resources in a computing device such as personal computers, personal digital assistants, cellular telephones, dedicated controllers or the like. Applications are stored on a non-transitory medium. An example of a non-transitory medium is a non-volatile memory, for example a flash memory or volatile memory, for example RAM. The computer instructions are executed by a suitable processor.

These memories are exemplary recording media for storing computer programs comprising computer-executable instructions performing all the steps of the computer-implemented method according the technical concept presented herein.

While the invention presented herein has been depicted, described, and has been defined with reference to particular preferred embodiments, such references and examples of implementation in the foregoing specification do not imply any limitation on the invention. It will, however, be evident that various modifications and changes may be made thereto without departing from the broader scope of the technical concept. The presented preferred embodiments are exemplary only, and are not exhaustive of the scope of the technical concept presented herein.

Accordingly, the scope of protection is not limited to the preferred embodiments described in the specification, but is only limited by the claims that follow.