Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
IMAGE ENCODING
Document Type and Number:
WIPO Patent Application WO/1986/004757
Kind Code:
A1
Abstract:
Actual picture points of a picture area are compared with interpolated values derived from selected points. If the differences are small, data for only the selected points are transmitted, otherwise the area is subdivided and each sub-area processed in the same way, so that the number of points selected for transmission is greatest in detailed areas of the image.

Inventors:
SPRIGGS HUGH (GB)
NIGHTINGALE CHARLES (GB)
TURKINGTON ROGER D (GB)
Application Number:
PCT/GB1986/000060
Publication Date:
August 14, 1986
Filing Date:
February 03, 1986
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
BRITISH TELECOMM (GB)
International Classes:
G06T9/00; G06T9/40; H04N7/14; H04N7/26; H04N19/593; (IPC1-7): H04N7/137
Other References:
Proceedings of IEEE, Volume 68, No. 3, March 1980, IEEE New York (US) A.N. NETRAVALI et al.: "Picture coding: a Review", pages 366-406, see page 397, left-hand column, lines 37-43
Download PDF:
Claims:
CLAIMS
1. A method of image transmission comprising repetitively: (a) generating an estimate of a picture area by interpolation from selected sample points thereof; (b) comparing the estimated samples with the actual samples; (c) if the estimated and actual samples differ by less than a threshold criterion, transmitting picture data in respect of the said selected sample points, and if they do not, notionally dividing the area under consideration into two or more subareas and subjecting each subarea to steps (a), (b) and (c) until a minimum subarea size is reached. A method according to Claim 1 in which the said area and subareas are rectangular or square and the estimate is generated by a twodimensional interpolation process based on the four corner points of the square or rectangular area. A method according to claim lor 2 in which, in step (c), the notional division of the area is into four substantially equal parts. A method according to claim 1, 2 or 3 in which the picture data transmitted in respect of the selected points after the first are the differences between the actual sample values of those points and the estimated values for those points. A method according to any one of the preceding Claims in which the data transmitted comprises a sequence of division codes each indicating whether or not the corresponding area is divided; each division code being followed by the codes corresponding to parts of the respective area.
2. A method according to any one of the preceding Claims in which the said samples represent the differences between samples of the picture to be transmitted and those of a previously transmitted picture. 7 An apparatus for carrying out the method of Claim 1, comprising an interpolation means for generating an estimate of a picture area based on a subset of its samples, means for comparing the estimate with the actual sample values, and means arranged in dependence on whether the comparison indicates a difference of more or less than a threshold (a) to select the subset of samples for transmission or (b) to notionally divide the picture area into two or more subareas and to process each subarea in like manner.
3. n apparatus according to claim 7 in which the said area and subareas are rectangular or square and the estimate is generated by a twodimensional interpolation process based on the four corner points of the square or rectangular area. • • 9 An apparatus according to claim 7 or.
4. n which, in step (c), the notional division of the area is into four substantially equal parts.
5. 10 An apparatus according to of claim 7, 8 or 9 in which the picture data transmitted in respect of the selected points after the first are the differences between the actual sample values of those points and the estimated values for those points.
6. 11 An apparatus according to any one of claims 7 to 10 in which the data transmitted comprises a sequence of division, codes each indicating whether or not the corresponding area is divided; each division code being followed by the codes corresponding to parts of the respective one.
7. 12 An apparatus for image transmission comprising means for generating differences betweeen current image sample values and sample values of a previously transmitted image, comprising apparatus according to any one of claims 7 to 11 for encoding the said differences.
8. 13 A decoder for receiving an image encoded by the method of claim 1, comprising a frame store, processing means arranged to compute from the received division codes the addresses of image points in respect of which data is transmitted and to interpolate values for the remaining points.
Description:
IMAGE ENCOOING

The present invention concerns methods and apparatus for image coding and transmission.

The proposals are suitable for the coding of both still pictures and moving pictures, and are particularly, though not exclusively, aimed at low bit rate video coding schemes for applications such as photovideotex and video conferencing.

They aim to reduce some of the drawbacks associated with transform coding whilst achieving a similar or better compression for a given picture quality.

According to one aspect of the invention, there is provided a method of image transmission comprising repetitively:

(a) generating an estimate of a picture area by interpolation from selected sample points thereof;

(b) comparing the estimated samples with the actual samples;

(c) if the estimated and actual samples differ by less than a threshold criterion, transmitting picture data and respect of the said selected sample points, and if they do not, notionally dividing the area under consideration into two or more sub-areas and subjecting each sub-area to steps (a), (b) and (c) until a minimum sub-area size is reached. A transmitter for such transmission may comprise interpolation means for generating an estimate of a picture area based on a subset of- its samples, means for comparing the estimate with the actual sample values, and means arranged in dependence on whether the comparison indicates a difference of more or less than a threshold

(a) to select the subset of samples for transmission or

(b) to notionally divide the picture area into two or more sub-areas and to process each sub-area in like manner.

The subset of samples may be the four comer points of the area, and the interpolation process may be simple two-dimensional linear interpolation, but this is not essential. The division into sub-areas may conveniently consist of division into four substantially equal parts.

The data transmitted may comprise a sequence of division codes each indicating whether or not the corresponding area is divided; each division code being followed by the codes corresponding to parts of the respective one. In one preferred arrangement the sample point picture data are transmitted in groups of one or more samples, each group containing data in respect of the untransmitted selected sample(s) of a respective area, the sequence of groups being the same as that of the division codes which indicate that the relevant areas are undivided, whilst in another the sample point picture data

• are transmitted in groups, each group containing .data in respect of those selected samples generated by a respective division, the sequence* of -groups being the same as that of the division codes which indicate the relevant division.

As indicated the invention may be applied directly to the coding of individual pictures. However, further economy in transmission may be combining it with inter-frame differential coding and applying the present coding method being applied to the difference signal (within the predictor loop).

It will be seen that the principle of the method is a non-uniform sample structure in which non-transmitted pels are interpolated.

It is based on the idea of avoiding the use of irrelevant block structures and frequency-derived transforms whose raisons d'etre are inappropriate mathematical models (Markov processes). One embodiment of the invention will now be described with reference to the accompanying .drawings, in which

Figure 1, 2 and 3 illustrate diagrammatically an image area at various stages of the coding procedure;

Figure A is a flowchart for coder operation; Figure 5 is a block diagram of a coder; Figure 6 is a flowchart for decoder operation; Figure 7 is a block diagram of a decoder; Figure 8 illustrates a typical coder sequence; and

Figure 9 is a block diagram of an inter-frame coder. Operation 1

Referring to Figure 1, a ' block whose corner points are ABCD is schematically illustrated. The first step in coding is to calculate a new block in which all picture elements (pels) are represented by values linearly interpolated from the corner values at A, B, C and D. This new block is compared with the orginal and if no differences are found in excess of a certain threshold, t, then the process moves to operation 2. Operation 2

The addresses and values of the points A, B, C and D are transmitted. In the receiver the whole block can be reconstructed by interpolation from these points in the knowledge that the resulting block will be a good

approximation to the original. If the picture includes more than one block the process then moves to a new block and repeats operation 1. Although the process may be started with the full frame as the first block so that large inactive blocks may be and often are transmitted, the process could, of course, start from an initial sub-division.

In the case that differences found during operation 1 exceed the threshold t then operation 3 is performed. Operation 3

The block ABCO is subdivided as shown in fig 2 and one of the newly obtained subblocks, AP R for example, is selected to repeat the process from operation 1.

As the process proceeds a structure like that shown in Figure 3 will appear where the greatest number of sub-divisions wil occur at edges or over fine detail.

The process may continue until sub-division is no longer necessary, or no longer possible, but if .desired a minimum block size may be defined. It may be supposed that the addressing overhead for such a scheme would be very large, even prohibitive, but this is not the case, and a system which fully expoints the highly recursive nature of the process and reduces the address data two approximately one bit per corner point will now be described.

Starting with the full frame ABCO (figure 1), the addresses of the corner points are known in advance by the receiver (or are transmitted if not known in advance).. Operation 1 is then carried out. If it is decided that the block can be interpolated from its corner values, a "0" is transmitted. If it is decided that the block must

be sub-divided, a "1" is transmitted. The block is sub-divided into four by cutting it in half horizontally and vertically. Therefore the transmission of the "1" is sufficient to determine the addresses of the five new points needed in the sub-division. (The lengths of the sides of the block do not need to be powers of 2, provided the receiver and transmitter use the same rule for dividing odd numbers by 2). The addresses of the corner points of the four new blocks are placed on a "stack" in a predetermined order (eg as indicated by "4, 3, 2, 1" in figure 2, where 4 is at the top of the stack). Each layer of the stack consists of information on one block, its corner points and the values at those points.

The block at the top of the stack is now processed in exactly the same way. If the block can be interpolated then it is simply removed from the stack and the next block down is processed. If it has to be sub-divided it is removed from the stack and replaced by the four, new blocks. The receiver can work out, from the "0"'s and "l'- ' -s which are transmitted as above, the addresses of all the sample points and the order in which they are to be received. It works out that this addressing information is approximately one bit per sample point. Transmission of the actual sample data may be interspersed with the 1/0 sub-division information sequence, or may follow it. It will be appreciated that the addressing scheme described establishes an order of priority of the blocks, and sample data conveniently can follow this order.

One possible method would be to transmit sample data for the corners of each block not sub-divided, (ie a set of data for each "0 *1 of the sequence).

In many instances each block will abut blocks to the left and above it (with the stack priority sequence given) whose data have already been transmitted - or implied - and thus only one sample value, the bottom left-hand corner, needs to be sent.

Alternatively, data may be regarded as associated with the act of dividing a block into four ie one set of data for each "1" of the sequence. This implies that A, B, C, D, have already been sent and that P, Q, R, S, T are required, though P and R will usually, have been sent (or interpolated at the receiver) as S ' or T of a preceding block.

A transmitter for coding in accordance with these methods may comprise a frame store, means to enter image samples and processing means arranged to perform the coding. A flowchart for carrying out the second sequence mentioned above is shown in figure 4. This could be carried out by a suitably programmed microprocessor, though " for real-time' processing of moving pictures, dedicated hardware would probably be necessary to achieve the desired speed of operation.

Figure 5 shows a coder with digital to analogue converter 10 (with address control), frame store 12, processor 14 and data output port 16. Figure 6 shows an image area, together with the resulting coded output. SA, SB etc. indicate sample values corresponding to points A, B etc. Letters in brackets indicate the areas to which the division codes 1, 0 correspond; this information does not, however have to be transmitted since it can be deduced by the receiver.

Figure 7 shows a receiver with data, input port 20, processor 22, frame store 24 and output D to A converter (and address control) 26. Receiver operation is illustrated in the flowchart of Figure 8.

There are a number of methods of achieving and/or varying the degree of compression.

(a) The threshold may be increased so that larger blocks will be found to satisfy the criterion for transmission. This leads to fewer blocks and hence fewer corner points to be transmitted.

(b) The minimum allowable block size can be increased to prevent a myriad of very small blocks from being " generated with consequent saving on transmitted points.

(c) The number of bits used to represent a transmitted comer value can be reduced since contouring (sudden steps in brightness) is impossible with a scheme based on interpolation. (d) Some values may not need to be transmitted since they can be interpolated from already transmitted values.

(e) Other transmitted actual values can be replaced by differences between actual and interpolated values with consequent reduction of variance and decrease in the number of bits required for transmission of some corner points.

Apart from (d) these all introduce some degradation in picture quality, and their application must be tempered with the experience of practice. Because of the novel nature of the scheme the types of degradation seen are themselves novel and their subjective effects and their effects on interfra e coding are not yet well understood.

By way of clarification of possibilities (d) and (e) above, there follows a description of such a differential coding method. When it has been decided that the block

ABCD must be sub-divided, the values of- five (but usually three) new points are required. Each of the new points is tested to see whether its value can be approximated by

interpolation from points already known; eg the interpolation value at P is midway between the values at A and B. If the interpolation is sufficiently accurate a "0" is transmitted. If it is not sufficiently accurate a "l" is transmitted, followed by the actual value, suitably coded. At present the preferred coding scheme is to transmit the difference between the actual value and the interpolated value. This normally gives a distribution of values with a small variance centred around zero, and therefore entropy coding can be used to reduce the average number of bits needed to be transmitted per sample value.

At the receiver It is simply necessary to calculate the interpolation for each sub-area transmitted, the sample values within that area, and enter the values into a frame store which, when the entire picture has been built up in this way, can then be read out.

Figure 9 shows a block diagram of a transmitter for transmitting inter-frame differences using the proposed method, the encoder and regenerator operating as described above. That part of the circuit below the dotted line is in fact a receiver (input A, output B). At the transmitter, in fact, the interpolated signals could be extracted from the encoder rather than generated by a regenerator.