Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD AND APPARATUS FOR CODING INFORMATION
Document Type and Number:
WIPO Patent Application WO/1996/025005
Kind Code:
A1
Abstract:
A method of encoding invisible identification code into an image, which is highly resistant to degradation across communications links, and which does not require the presence of the original image for decoding, comprises analysing the image and determining strongly featured regions such as edges, and inserting code into such regions by altering the structure of the image in a predictable manner, as for example a concave elliptical insert centered on and aligned with an edge, which alteration is not visible to the eye. When decoding, areas of concavity are determined, and a correlation is performed with a predicted insert function to assess whether code has been inserted. A hardware embodiment is described.

Inventors:
TODD MARTIN PETER (GB)
Application Number:
PCT/GB1996/000246
Publication Date:
August 15, 1996
Filing Date:
February 05, 1996
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
CENTRAL RESEARCH LAB LTD (GB)
TODD MARTIN PETER (GB)
International Classes:
G06T1/00; G06T9/00; H04N1/32; H04N1/387; H04N5/913; H04N7/08; H04N7/081; (IPC1-7): H04N7/08; G11B20/00
Other References:
W. BENDER ET AL.: "Techniques for data hiding", PROCEEDINGS OF THE SPIE: STORAGE AND RETRIEVAL FOR IMAGE AND VIDEO DATABASES III, vol. 2420, 9 February 1995 (1995-02-09) - 10 February 1995 (1995-02-10), SAN-JOSE, pages 164 - 173, XP000571877
K. HARA ET AL.: "An improved method of embedding data into pictures by modulo masking", I.E.E.E. TRANSACTIONS ON COMMUNICATIONS, vol. 36, no. 3, March 1988 (1988-03-01), NEW-YORK, pages 315 - 331, XP002005065
Download PDF:
Claims:
CLAIMS
1. A method of inserting coded information into an image, comprising analysing the image, identifying strongly featured regions , and inserting coded information into these regions.
2. A method according to claim 1 , wherein the strongly featured regions comprise edge regions between areas of different luminance and/ or chrominance.
3. A method according to claim 1 , wherein the strongly featured regions comprise textured regions having distributed therein localised areas of different luminance and/or chrominance values.
4. A method according to any preceding claim, wherein the coded information is inserted into the image by altering the structure of the image in a predictable or identifiable manner so that the coded information can subsequently be retrieved without reference to the original image.
5. A method according to claim 3, comprising analysing a textured region by a process of cluster analysis, identifying a cluster of foreground local areas with a certain quality, and modifying the chrominance and/or luminance values of the cluster with an insert function which decreases in intensity from its centre, for representing one of two binary values.
6. A method according to claim 5, wherein the insert function is circular in extent, and is centred on the geometric centre of the cluster.
7. A method according to claim 2, including inserting along a length of the edge region an insert function whose intensity varies in a nonlinear manner, for representing one of two binary values.
8. A method according to claim 7, wherein the insert function is elliptical in extent, with its intensity gradually decreasing in a direction along its major axis, and with its major axis extending along the length of the edge.
9. A method according to claim 8, wherein the elliptical function is centred on the centre of the edge.
10. A method according to any of claims 5 to 9, wherein the insert function varies in intensity in a concave manner.
11. A method according to any preceding claim, including identifying a masking parameter for the image, and limiting the intensity of the inserted code in accordance with the masking parameter.
12. 5 12. A method for inserting coded information into an image, comprising analysing the image, identifying strongly featured regions, determining for at least one such region a masking parameter, and inserting coded information into such region in a predictable or identifiable manner by an amount limited in accordance with said masking parameter.
13. 10 13. A method according to claim 12, wherein the masking parameter determination includes assessing the image as to the degree of strength or energy of the strongly featured region within the image, and determining the intensity of the code insert in dependence on such strength assessment.
14. 14 A method according to claim 12, including assessing the image as to whether 5 the image overall contains a function of the type of which the coded information is inserted, and assessing the degree of such function.
15. 15 A method according to claim 14, wherein the coded information is inserted employing a concave function.
16. 16 A method according to claim 14, wherein the sum of the coded information 0 function and the existing function is limited if the intensity of the existing function in the image is too great.
17. 17 A method according to any of claims 12 to 16, wherein the masking parameter determination includes determining whether the strongly featured region is sufficiently well defined to permit insertion of coded information. 5.
18. A method according to any preceding claim, comprising dividing the image up into blocks formed in N rows and M columns, and carrying out said analysing and inserting steps in each block.
19. A method according to claim 18, wherein a group of blocks, selected according to a predetermined rule, are encoded according to a pseudo random sequence so that the 0 blocks in the group represent one or more bits of information.
20. A method according to claim 18 or 19, including the following steps: a) dividing image data up into blocks each formed of a predetermined number of pixels, and b) calculating an insert function to be added to the luminance of each pixel within the block based on the distance of the pixel from the central point of the edge.
21. A method according to claim 18, wherein an assessment is made of the type of image within each block, whether it has a single strongly featured region, has several strongly featured regions, or is a block having low activity in terms of image information.
22. A method according to claim 21, wherein if a block is assessed to have a low activity, a code is inserted into the block defined by a geometric region wherein the pixels within the region have a luminance modulated according to a predetermined function.
23. A method of decoding information contained in an image, the method comprising analysing the image, identifying strongly featured regions, determining for at least one such region an anticipated insertion of coded information, and correlating such anticipated insertion with the image to determine whether there has been inserted into the strongly featured region coded information.
24. A method according to claim 23, including determining the intensity of an anticipated insert function in accordance with a masking parameter based on the strength or energy of the strongly featured region.
25. A method according to claim 24, including assessing the image as to whether the image overall contains a function of the type of which the coded information is inserted, assessing the degree of such function, and performing said correlation with such assessments.
26. A method according to claim 25, wherein the coded information has been inserted employing a concave function.
27. A method according to any of claims 23 to 26, wherein the strongly featured regions comprise edge regions between areas of different luminance and/ or chrominance.
28. A method according to any of claims 23 to 26, wherein the strongly featured regions comprise textured regions having distributed therein localised areas of different luminance and/or chrominance values. it .
29. A method according to claim 28, comprising analysing a textured region by a process of cluster analysis, identifying a cluster of foreground local areas with a certain quality, and determining whether there exists a modification of the chrominance and/or luminance values of the cluster with an insert function which decreases in intensity from its centre, for representing one of two binary values.
30. A method according to claim 29, wherein the insert function is circular in extent, and is centred on the geometric centre of the cluster.
31. A method according to claim 27, including determining whether there exists along a length of the edge region an insert function whose intensity varies in a non linear manner, for representing one of two binary values.
32. A method according to claim 31 , wherein the insert function is elliptical in extent, with its intensity gradually decreasing in a direction along its major axis, and with its major axis extending along the length of the edge.
33. A method according to claim 32, wherein the elliptical function is centred on the centre of the edge.
34. A method according to any of claims 29 to 33, wherein the insert function varies in intensity in a concave manner.
35. A method according to any of claims 23 to 34, comprising dividing the image up into blocks formed in N rows and M columns, and carrying out the aforesaid decoding steps in each block.
36. A method according to claim 35, wherein a group of blocks, selected according to a predetermined rule, are decoded according to a pseudo random sequence, the blocks in the group representing one or more bits of information.
37. A method according to claim 35 or 36, including the following steps: a) dividing image data up into blocks each formed of a predetermined number of pixels, and b) calculating an insertion function to be added to the luminance of each pixel within the block based on the distance of the pixel from the centre of the edge, the function being aligned with the block orientation.
38. A method according to claim 35 wherein an assessment is made of the type of image within each block, whether it has a single strongly featured region, has several such regions, or is a block having low activity in terms of image information.
39. A method according to claim 38 wherein if a block is assessed to have a low activity, the decoding operation detects a code inserted into the block defined by a geometric region wherein the pixels within the region have a luminance modulated according to a predetermined function.
40. Apparatus for inserting coded information into an image, comprising means for analysing the image and identifying strongly featured regions , and means for inserting coded information into at least one such region.
41. 10 41.
42. Apparatus according to claim 40, wherein the analysing means is operative to identify edge regions between areas of different luminance and/ or chrominance.
43. Apparatus according to claim 40, wherein the analysing means is operative to identify textured regions having distributed therein localised areas of different luminance and/or chrominance values.
44. 15 43.
45. Apparatus according to any of claims 40 to 42, wherein the inserting means is operative to alter the structure of the image in a predictable or identifiable manner so that the coded information can subsequently be retrieved without reference to the original image.
46. Apparatus according to claim 42, wherein said analysing means is operative to 20 analyse a textured region by a process of cluster analysis, by identifying a cluster of foreground local areas with a certain quality, and the inserting means is operative to modify the chrominance and/or luminance values of the cluster with an insert function which decreases in intensity from its centre, for representing one of two binary values.
47. Apparatus according to claim 41, wherein said inserting means is operative to 25 insert along a length of the edge region an insert function whose intensity varies in a nonlinear manner, for representing one of two binary values.
48. Apparatus according to claim 45, wherein the insert function is elliptical in extent, with its intensity gradually decreasing in a direction along its major axis, and with its major axis extending along the length of the edge.
49. 30 47.
50. Apparatus according to claim 46, wherein the elliptical function is centred on the centre of the edge.
51. Apparatus according to any of claims 44 to 47, wherein the insert function varies in intensity in a concave manner.
52. Apparatus according to claim 40, including means for identifying a masking parameter for the image, and limiting the intensity of the inserted code in accordance with the masking parameter so that the inserted code is invisible.
53. 5 50. Apparatus for inserting coded information into an image, comprising means for analysing the image, means for identifying strongly featured regions, means for determining for at least one such region a masking parameter, and means for inserting coded information into such region in a predictable or identifiable manner by an amount limited in accordance with said masking parameter.
54. 10 51. Apparatus according to claim 50, wherein the means for determining the masking parameter includes means for assessing the image as to the degree of activity or energy within the image.
55. 52 Apparatus according to claim 50 or 51, wherein the means for determining the masking parameter includes means for assessing the image as to whether the image 15 overall contains a function of the type of which the coded information is inserted, and assessing the degree of such function.
56. 53 Apparatus according to claim 52, wherein the insertion means is operative to insert coded information employing a concave function.
57. 54 Apparatus according to claim 52, including means for limiting the total 20 intensity if the sum of the intensity of the coded information and the intensity of the function in the overall image is too great.
58. 55 Apparatus according to any of claims 50 to 54, wherein the masking parameter means includes means for determining whether the strongly featured region is sufficiently well defined to permit insertion of coded information.
59. 25 56. Apparatus according to any of claims 40 to 55, including means for dividing the image up into blocks formed in N rows and M columns, and carrying out said analysing and inserting steps in each block.
60. 57 Apparatus according to claim 56, wherein the inserting means includes means for encoding a group of blocks, selected according to a predetermined rule, according 30 to a pseudo random sequence so that the blocks in the group represent one or more bits of information.
61. 58 Apparatus according to claim 56, including : a) means for dividing image data up into blocks each formed of a predetermined number of pixels, and b) means for calculating an insert function to be added to the luminance of each pixel within the block based on the distance of the pixel from the central point of the edge.
62. 59 Apparatus according to claim 56, wherein the analysing means is operative to 5 assess the type of image within each block, whether it has a single strongly featured region, has more than one strongly featured region, or is a block having low activity in terms of image information.
63. 60 Apparatus according to claim 59 wherein the insertion means is arranged to insert a code into a block assessed to have a low activity, the code defined by a 10 geometric region wherein the pixels within the region have a luminance modulated according to a predetermined function. 15 61. Apparatus for decoding information contained in an image, comprising means for analysing the image, means for identifying strongly featured regions, means for determining for at least one such region an anticipated insert of coded information, and means for correlating such anticipated insertion with the image to determine whether there has been inserted coded information into such coded region. 20 62. Apparatus according to claim 61, including means for determining the intensity of an anticipated insert function in accordance with a masking parameter based on the strength or energy of the strongly featured region.
64. 63 Apparatus according to claim 61 or 62, including means for assessing the image as to whether the image overall contains a function of the type of which the 25 coded information is inserted and for assessing the degree of such parameter, and said correlation means is operative to correlate such assessments with the anticipated insertion.
65. 64 Apparatus according to claim 63, wherein the assessing means is arranged to assess a concave parameter.
66. 30 65. Apparatus according to any of claims 61 to 64, wherein the masking parameter determination means includes means for determining whether the strongly featured region is sufficiently well defined to have permitted insertion of coded information.
67. 66 Apparatus according to claim 61, wherein the identifying means is arranged to identify edge regions between areas of different luminance and/ or chrominance.
68. 67 Apparatus according to claim 61, wherein the identifying means is arranged to identify textured regions having distributed therein localised areas of different luminance and/or chrominance values.
69. 68 Apparatus according to claim 67, wherein the identifying means is arranged to 5 analyse a textured region by a process of cluster analysis, identifying a cluster of foreground local areas with a certain quality, and the estimating means is arranged to determine whether there exists a modification of the chrominance and/or luminance values of the cluster with an insert function which decreases in intensity from its centre, for representing one of two binary values.
70. 10 69. Apparatus according to claim 68, wherein the estimating means is arranged to determine an insert function circular in extent, and centred on the geometric centre of the cluster.
71. 70 Apparatus according to claim 67, wherein the estimating means is arranged to determine along a length of the edge region an insert function whose intensity varies in 15 a nonlinear manner, for representing one of two binary values.
72. 71 Apparatus according to claim 70, wherein the estimating means is arranged to determine an insert function elliptical in extent, with its intensity gradually decreasing in a direction along its major axis, and with its major axis extending along the length of the edge.
73. 20 72. Apparatus according to claim 71 , wherein the elliptical function is centred on the centre of the edge.
74. 73 Apparatus according to any of claims 68 to 72, wherein the estimating means is arranged to determine an insert function which varies in intensity in a concave manner.
75. 74 Apparatus according to any of claims 71 to 73, wherein the analysing means is 25 arranged to divide the image up into blocks formed in N rows and M columns.
76. 75 Apparatus according to claim 74, wherein the determining and estimating means are arranged to select a group of blocks, according to a predetermined rule, for decoding according to a pseudo random sequence, the blocks in the group representing one or more bits of information.
77. 30 76. Apparatus according to claim 74, including the following steps: a) means for dividing image data up into blocks each formed of a predetermined number of pixels, and b) means for calculating an insertion function to be added to the luminance of each pixel within the block based on the distance of the pixel from the centre of the edge, the function being aligned with the block orientation.
78. 77Apparatus according to claim 74 including means for assessing the type of image within each block, whether it has a single strongly featured region, has several such regions, or is a block having low activity in terms of image information.
79. 78 Apparatus according to claim 77 wherein the estimating means is arranged to detect a code inserted into a low activity block defined by a geometric region wherein the pixels within the region have a luminance modulated according to a predetermined function.
80. 79 Apparatus according to claim 78, wherein the estimating means is arranged to detect a circular insert with a concave variation of intensity in a radial direction.
81. 80 A method of coding information into an image, comprising dividing the image in MxN blocks in N rows and M columns, and inserting into selected blocks code information of one of a plurality of types, the type of code inserted depending on an assessment of the image features in the respective block.
82. 81 A method according to claim 80, wherein for each block, strongly featured regions for the eye are sought, and if identified, an appropriate code is inserted into strongly featured region.
83. 82 A method according to claim 81, wherein the strongly featured regions comprise edge regions between areas of different luminance and/or chrominance, or textured regions having distributed therein localised areas of different luminance and or chrominance.
84. A method according to any of claims 80 to 82, wherein in each block, weakly featured or background regions are sought, and if identified, an appropriate insert function is inserted into such region.
85. A method according to claim 83, wherein the insert function is a relatively large region having a constant or slowly varying luminance over its area.
86. Apparatus for coding information into an image, comprising means for dividing the image in MxN blocks in N rows and M columns, and means for inserting into selected blocks code information of one of a plurality of types, the type of code inserted depending on an assessment of the image features in the respective block.
87. Apparatus according to claim 85, including analysing means for determining, for each block, strongly featured regions for the eye, and if identified, said inserting 5 means is operative to insert an appropriate code into the strongly featured region.
88. Apparatus according to claim 86, wherein the strongly featured regions comprise edge regions between areas of different luminance and/or chrominance, or textured regions having distributed therein localised areas of different luminance and/or chrominance.
89. 10 88.
90. Apparatus according to any of claims 85 to 87, including analysing means for determining in each block, weakly featured or background regions, and if identified, said inserting means is arranged to insert an appropriate insert function into such region.
91. Apparatus according to claim 88, wherein the insert function is a relatively large region having a constant or slowly varying luminance over its area.*& 15.
92. A method of decoding information from an image, comprising dividing the image in MxN blocks in N rows and M columns, and detecting in selected blocks code information of one of a plurality of types, the type of code detected depending on an assessment of the image features in the respective block.
93. 20 91.
94. A method according to claim 90, wherein for each block, strongly featured regions for the eye are sought, and if identified, an appropriate code is detected in a strongly featured region.
95. A method according to claim 91, wherein the strongly featured regions comprise edge regions between areas of different luminance and/or chrominance, or textured 25 regions having distributed therein localised areas of different luminance and or chrominance.
96. A method according to any of claims 90 to 92, wherein in each block, weakly featured or background regions are sought, and if identified, an appropriate insert function is detected in such region.
97. 30 94.
98. A method according to claim 93, wherein the insert function is a relatively large region having a constant or slowly varying luminance over its area.
99. Apparatus for decoding information from an image, comprising means for dividing the image in MxN blocks in N rows and M columns, and means for detecting in selected blocks code information of one of a plurality of types, the type of code detected depending on an assessment of the image features in the respective block.
100. 5 96.
101. Apparatus according to claim 95, including analysing means for determining, for each block, strongly featured regions for the eye, and if identified, said detecting means is operative to detect an appropriate code in the strongly featured region.
102. Apparatus according to claim 96, wherein the strongly featured regions comprise edge regions between areas of different luminance and/or chrominance, or 10 textured regions having distributed therein localised areas of different luminance and/or chrominance.
103. Apparatus according to any of claims 95 to 97, including analysing means for determining in each block, weakly featured or background regions, and if identified, said detecting means is arranged to detect an appropriate insert function in such region.
104. 15 99.
105. Apparatus according to claim 98, wherein the insert function is a relatively large region having a constant or slowly varying luminance over its area.
106. Apparatus according to claim 99, wherein the insert function is circular having an intensity which varies in a concave manner in a radial direction.
Description:
METHOD AND APPARATUS FOR CODING INFORMATION

The present invention relates to a method and apparatus for the insertion, and subsequent decoding, of coded information into images. It is known to insert codes into images for example video transmissions or video clips or stills transmitted across a telecommunication link, for the purpose of identifying the owner of the images. There are a number of known schemes for inserting identification codes into the sync periods, and more recently it has been proposed to insert identification codes into the image itself, but in such a manner that the code cannot be detected by the eye.

All of the schemes heretofore proposed suffer from the disadvantage that low pass filtering and other processes such as data compression, which may occur in image compression algorithms or transmission across a telecommunication link may remove the code or degrade it to an extent where it cannot be recognised. EP-A-0581317 discloses a scheme wherein relative extrema of intensity within the image are detected and the intensity values are modified by a certain amount. This provides a large number of sites within the image and a signature is created by modifying the value of intensity at selected sites. Although it is claimed that this method is resistant to certain types of image processing, nevertheless a more robust scheme for transmission e.g. broadcasting is desirable. Further, a disadvantage with this method is that in order to recover the encoded digital signature, it is necessary to have to hand the original image; this severely limits the application of the method.

WO 95/14289, published on 26 May 1995, discloses the embedding of an identification code throughout an image by modulating a digitised version of the image with a small noise signal. The specific system described suffers from the disadvantage of requiring to hand the original image for code identification. Further improvements in code robustness for transmission over telecommunications links or broadcasting are also desirable.

WO 95/20291, published on 27 July 1995, discloses a method of hiding copyright related messages with a digital data work, which relies on commonly occurring patterns or sequences of data in the work acting as signposts to target data elements which are modified according to certain rules. The disclosed method suffers from a lack of robustness to signal degradation.

All of the above references suffer from a disadvantage that they are concerned wholly or principally with the digital domain, and the disclosed techniques are not suited to the analog domain, in particular where digital to analog conversion and analog to digital conversion may easily lose or degrade individual pixel values. WO 89/08915 discloses an invasive coding technique in which insignificant digits of recorded material are replaced with a code digit from an independent sequence. WO 90/09663 discloses a non-invasive technique wherein data words are identified according to predetermined criteria and a unique identifying pattern is created therefrom. Neither of these techniques is well suited to practical use such as in broadcasting applications.

An object of the present invention is to provide a highly robust method of encoding information into an image, which is highly resistant to image manipulation and degradation and is effective equally in analog and digital domains.

The present invention is based in one aspect on the realisation that coded information may be inserted into an image in strongly featured regions of the image in such a way that the code is resistant to image compression and/or low pass filtering such as may occur in transmission over telecommunication links, but is not visible to the eye, and furthermore the code does not require for decoding the presence of the original image. In a first specific aspect, the invention provides a method for inserting coded information into an image, comprising analysing the image, identifying strongly featured regions and inserting coded information into these regions.

By strongly featured regions is meant regions of primary strength to which the eye responds in viewing an image for example textured regions or lines or boundaries between two regions of different luminance. In such regions, it is possible to insert a relatively large amount of information without significantly altering the image in its appearance to the eye. It is possible in some applications in order for an adequate prospect of decoding the code, that the code is inserted at an intensity which may risk some visible artefact ; nevertheless the present invention always permits the possibility of completely invisible code insertion.

Because the method of the invention relies on an analysis of the entire image and code being inserted in strongly featured regions rather than in individual pixels as in the prior art, the code is better able to survive analog to digital conversions and digital to analog conversions, where there will inevitably be pixel misalignments

between the original and processed images, and hence the pixel values will be apparently altered.

As preferred, edge regions between areas of different luminance are employed since these are very distinctive and will permit code insertion without visibly degrading the image. However, edge regions between areas of different chrominance may alternatively or in addition be employed . In a further preferred form, textured regions may be employed as will hereinafter be described.

As preferred, the coded information is inserted into strongly featured regions by altering the structure of the image in such region in a predictable or identifiable manner. The structure of the image may be altered by inserting a distinct subimage, for example a rectangular bar or ellipse along the length of an edge region. Alternatively and as preferred, the image is altered by applying an insert function to an area including the strongly featured region, which insert function gradually decreases in intensity from the centre of its region of application, so as to blend with the surrounding region. A main advantage of applying the coded information by altering the image in a predictable or identifiable manner is that the information can be recovered in a decoding process at a remote location without having the original to hand. Thus, upon decoding, the image is analysed and a search is made for any feature resembling the structural alteration. If one or more features can provisionally be identified, then such features can be regarded as coded information; as preferred a "confidence factor" may be attached to each detected feature denoting the degree of reliability of the identification.

Prior to inserting the information, the image is preferably analysed to determine at least one masking threshold or masking parameter which provides a measure of by how much the structure of the image may be altered without risking the coded information becoming visible. If such threshold or parameter is sufficient to enable insertion of coded information of sufficient intensity to permit decoding, the coded information is inserted at a level determined by the masking parameter or threshold.

Thus as preferred an assessment is made of the strength or energy of the strongly featured regions within the image, in order to determine the permissible strength of insertion. This assessment value is employed to appropriately scale the insert function as it is inserted into the image. Further masking thresholds may be provided by assessing whether the image is suitable for code insertion, for example the degree of consistency of the edge, the definition of the edge centre, and the strength to prevent insertions when the image is determined to be unsuitable.

Thus in a more specific aspect, the invention provides a method for inserting coded information into an image, comprising analysing the image, identifying strongly featured regions, determining for at least one such region a masking parameter, and inserting coded information into such region in a predictable or identifiable manner by an amount determined by said masking parameter.

As preferred the coded information is inserted into an edge region by varying the luminance intensity in a local area, along the length of an edge, for example, by applying over the area a luminance intensity having a non-linear contour, for example a concave or convex function, which is aligned with the edge. A function applied in this way may represent a digital "V or "0" according to whether it is concave or convex (for the purposes of this specification, where the terms concave, concavity and concaveness are used, they are to be understood as including convexity, which may be regarded as concavity with a negative radius of curvature). Such a method is very resistant to signal degradation arising for example from image compression. Other methods of encoding may be envisaged for example applying a non-linear function which varies perpendicular to the edge.

For textured regions, an example of which is a carpet with several colours distributed at random in the carpet fibre, a code would be inserted by first performing a statistical cluster analysis on the image to identify regions which consist of two or more basic values which are randomly distributed. The most intense "foreground" value is determined, and a circular function, appropriately scaled by a suitable masking parameter, centred in the textured region and gradually diminishing to zero radially, is applied to the foreground pixels to modulate their intensities. In the decoding process, a similar analysis is applied to the image, and the set of foreground pixels in each textured region is analysed to assess whether such circular function has been applied.

As preferred, the image, e.g. a video frame or field, is divided up into a number MxN of blocks in M rows and N columns, each block comprising nxn pixel elements (e.g. 8x8). Strongly featured regions are searched in each block for insertion of code. In order to encode a significant amount of information into an image, it is necessary to apply the insertion function to a number of featured regions, say edges, in the image. If for example, one edge is chosen in each block into which the image is divided, then in order to increase confidence when attempting to recognise the code, the edges in one row may be encoded according to two separate pseudo-random codes, representing a

"\" or "0". Thus when an image is scanned for a code, the insertion function in each block may or may not be located with a degree of confidence. The identified functions for a row of blocks, with appropriate confidence weighting factors, are compared with the pseudo-random codes to derive a "1" or "0" with a much higher confidence factor. However as preferred and as an alternative to relying on rows of blocks, each row representing one bit, the various blocks which together represent one bit, may be distributed throughout the image according to a further predetermined code. Thus, when decoding an image, knowledge of the predetermined code enables the blocks to be grouped together for analysis with the pseudo random codes to determine the value of the bit. This method has the advantage that areas where no edges occur, for example background regions of sky, can be avoided.

Although two pseudo random codes of ones and zeros are preferred for decoding data, other pseudo random codes may be employed. For example, a single pseudo random code may be employed where a watermarking type code is required indicating ownership of the image. Other types of code may also be employed.

A problem with dividing an image into a number of blocks occurs in decoding the image, since if there are only a few edges in the image and many blocks without an edge, it is difficult to maintain synchronisation with the encoding process.

As a means of overcoming this problem, the video synchronisation process may be altered so that synchronisation occurs at the start of each row or line of blocks; if then as described above, each row represents a single bit of information, a secure method of coding is provided. One means of providing synchronisation information, for example, where video images are employed, is to use the line sync pulses as a method of synchronisation. As an alternative, a very low frequency modulation may be applied to an entire image (for example a digitised still photograph image) so that there is provided a part of a cycle or one or two cycles of a low frequency modulation across the width and/or height of the image to enable centering of the decoding process onto this low frequency modulation, in a manner somewhat analogous to an archer's target. In any event, the decoding stage will automatically align to the reception of digital information to permit the required synchronisation.

In regions of the image where there does not occur strong features, for example background regions, it may be desired to insert a code in the form of a region of fixed or variable luminance, for example a circle which although relatively large in size, is not visible to the eye. This ensures that coding information is present in all parts of an image for robustness of transmission and decoding.

In a further specific object therefore the present invention provides a method of coding information into an image, dividing the image in MxN blocks in N rows and M columns, and inserting into selected blocks code information of one of a plurality of types, the type of code inserted depending on an assessment of the image features in the respective block.

The present invention also extends to a method of decoding, and according the invention provides in a further aspect a method of decoding information contained in an image, the method comprising analysing the image, identifying strongly featured regions, determining for at least one such region an anticipated insertion of coded information, and correlating such anticipated insertion with the image to determine whether there has been inserted into the strongly featured region coded information.

As preferred in the decoding method, similar processing steps are carried out as in the encoding method, involving analysing the image and defining an anticipated insertion function, as will become clear in the description of the preferred embodiment.

The present invention also extends to apparatus for carrying out any of the aforesaid aspects of the invention.

The information encoded into an image may be used for a variety of purposes, for example as follows: - to insert copyright or identification information in video clips or films; to insert copyright or identification information into stills; to log when adverts or films are played in broadcasts, for monitoring purposes; to identify the master copy from which pirated videos are copied. The information encoded may represent coding information which initiates operations within the decoding apparatus, or provides specific identification information, for example copyright information with the name of the copyright owner etc. Alternatively, the information may be merely analogous to a watermark, serving merely to identify the source of the image information but not being specific to the particular image.

Brief Description of the Drawings

A preferred embodiment of the invention will now be described with reference to the accompanying drawing wherein:

Figures 1 to 7 are diagrams for explaining the preferred method of the present invention; and

Figures 8 and 9 are block diagrams of a preferred embodiment of apparatus of the present invention. Description of the Preferred Embodiment

In accordance with a preferred embodiment of the invention, there is hidden local insertions in edges within the image. Edge regions are known to have masking properties because of the way the human visual system works. In particular the local orientation of the edges are important, and there are specific structures in the primary visual cortex for detecting both the presence of an edge and its local orientation (to a resolution of about 32 different orientations).

The insertions are such that they do not alter the local orientation. They must also survive low pass filtering since this is a common process for images. The insertions are made along the length of a local section of edge, and make the grey level gradient along the direction of the edge either a concave or a convex function over the distance of the insertion i.e. travelling from the start to the end point of the insertion along the direction of the edge of the grey level is either greater or less than would be expected by a simple linear interpolation from the start and end points. One important point of this is that at the start and end points the grey level must return to the value of the original image in order to prevent any propagation of the distortion, or any discontinuities.

The insertions are made as a 2D function, by using an ellipse which is aligned to the local orientation. The ellipse has a cross sectional function which is a 1/2 cycle of a cos function and is used to weight the extent of the insertion, i.e. outside the ellipse no insertion is made, within it the insertion is weighted by the cos function. This gives the concave or convex function by adding or subtracting it to the image. The magnitude of the insertion can be varied according to the amount of activity in the block; for a high activity block a stronger insertion can be buried. It is not always possible to produce the required function; a block which already has a very strong convex function may require the insertion to be very large to convert it to a concave one, and this may produce a visible artefact. This is one of the reasons for using an accumulation of a number of blocks to produce a single bit (see below).

The size of the insertions and their positions are fixed by processing the image in a block by block manner, typically with a block size of 8 by 8 pixels.

There may be insufficient edge blocks within an image to generate a consistent result, particularly since the selection of these edge blocks may be inconsistent when the images are processed. This can lead to problems in keeping the decoding process in synchronisation with the encoding process, such that the correct blocks were used to determine each bit.

To overcome this, the synchronisation process is aligned to the start of a line of blocks. Thus all the blocks within the first line of blocks are used to encode/decode the first bit. Then all the blocks within the second line would encode/decode the second bit etc. This may be extended so that a number of lines are used to encode/decode each bit, which reduces the amount of information which could be inserted but improves the robustness to processing.

In a modification and as preferred, a group of blocks contributing to a single bit may be distributed throughout an image according to a predetermined code. This has the advantage as compared with a line of blocks contributing to a bit, that the blocks may be sited where strongly featured regions occur, and are not located for example in a background region with no strong features.

A Pseudo Random Sequence (PRS) is used to improve the accumulation of results from the individual blocks to determine a single bit. The PRS consists of a random but fixed sequence of +1 or -l's. +1 means that the insertions is added to produce a convex functions, -1 means that it is subtracted to produce a concave function. For the encoding process, each block is processed in turn, and the next element in the PRS determines whether the insertion is added or subtracted. There are two different sequences, one for the 1 data bit and one of the 0 data bit. Thus for the decoding process the concave/convex-ness of each block is correlated with both sequences, and the sequence which produces the highest correlation determines whether what the decoded data bit is.

A second method of coding is to code low-pass regions as well as edge regions. The low-pass regions are coded with circular insertions centred on the block. The insertions in this case are of a fixed strength, and not related to the activity in the block. This improves the robustness of the process.

The encoding and decoding algorithms are listed below as a sequence of steps:

Encoding Algorithm

1 ) split the frame into adjacent blocks each of n*n pixels

2) calculate the dominant orientation in each block

3) calculate the amount of activity in each block 4) calculate the consistency of the dominant orientation

5) to encode one data bit process each block within a predefined set of blocks distributed throughout the image, as follows:

5a) look up the next element of the PRS for the data bit

1) if it is a 1 set up add 2) if it is a -1 set to subtract

5b) segment the blocks in the following categories a) a single edge/line in a dominant orientation b) a low activity block c) several major lines or edges 5c) process the blocks as follows:- a) add/subtract an elliptical function al) centred on the edge a2) aligned to the block orientation a3) scaled by the activity in the block b) add/subtract a circular function bl) centred on the centre of the block b2) fixed strength c) no insertion

6) repeat step 5) until all the bits are encoded. Decoding Algorithm

1 ) split the frame into adjacent blocks each of n*n pixels

2) calculate the dominant orientation in each block

3) calculate the amount of activity in each block

4) calculate the consistency of the dominant orientation 5) to decode one data bit process each block within the predefined set of blocks distributed throughout the image, as follows :-

5a) segment the blocks in the following categories

a) a single edge/line in a dominant orientation b) a low activity block c) several major lines or edges

5b) process the blocks as follows :- a) calculate the concave/convex-ness of an elliptical function al) centred on the edge a2) aligned to the block orientation a3) scaled by the activity in the block b) calculate the concave/convex-ness of a circular function bl) centred on the centre of the block b2) fixed strength c) do nothing

5c) correlate the convex/concave-ness of the block with the next element of the data bit 1 PRS and accumulate the result. 5d) correlate the convex/concave-ness of the block with the next element of the data bit 0 PRS and accumulate the result.

6) compare the data bit 1 accumulated correlation with the data bit 0 accumulated correlation. The larger of the two is the decoded data bit. The size of the correlation is the confidence in the result. 7) repeat steps 5) and 6) for each data bit.

Referring now to Figures 1 to 8, the algorithmic steps listed above will now be explained in detail.

The frame axis conventions used to describe the mathematics are shown in Figure 1. A pixel is defined by its coordinates (x,y) and its luminance value is r(x,y). Note that the top left pixel of an image is the (0,0) pixel, and that the y axis has its positive direction down the frame.

As shown in Figure 2, the frame is segmented into non-overlapping blocks, each block being of size n by n pixels. Smaller values of n mean that it is more likely that only a single edge will be found in any given block. It also means that more individual blocks will be available for the correlation process. Larger values of n mean that larger, and therefore more robust, insertions can be made. In practice a

good choice of n is 8. The block axis conventions used in segmenting the frame are shown in Figure 2.

Referring to Figure 3, the local orientation for each point in the frame is calculated from four surrounding points by a process as described below. This gives a vector at each point in the frame, with the magnitude of the vector representing the strength of the feature, and the angle representing twice the local orientation. This is illustrated in Figure 3. Thus the local orientation gives a measure of gradient of luminance in a particular direction within the block. A very large value of orientation indicates the existence of an edge; In this double angle form the vectors can be vector averaged over a block to give the local orientation for the block. This provides a relatively fast estimating algorithm. e.g, as shown in Figure 3: θ - -45degrees : by convention, θ is associated with point a = (x,y)

-dx ~ 0J

-dy ~ -Q.l θ is estimated from a = r(x,y) b = r(x,y + l) c = r(x + l,y) d = r(x + l, y + l) e = d -a f = b - c Re = -2 * e * f Im = e 2 - f 2

2 Re J

θ is in single angle form

Re,Im are in double angle form orientations are averaged in the Re,Im double angle form.

The calculation of the average block vector is simply a matter of summing the local vectors for the block, as shown in Figure 4. A large average block vector indicates a strong edge running through the block. The average energy in the block

can be calculated by summing the magnitudes of the individual vectors. From these two figures, the local block orientation can be calculated by taking 1/2 the angle of the block vector, and a measure of block consistency calculated by taking the ratio of the magnitude of the block vector to the block energy. The local energy can be used to distinguish between blocks which have small activity (little variations in image) and blocks which have some activity. The consistency measure can be use to distinguish between blocks which have a consistent orientation and these which have a inconsistent orientation. This allows the blocks to be split into three categories as shown in Figure 5. For blocks with a consistent local orientation a centre of the edge needs to be calculated. The method is shown below with reference to Figure 4. Each individual vector is resolved into a component in the same orientation as the block orientation. Then the local centroid of these components is calculated in the orthogonal axis of the local orientation. This local centroid is the centre of the edge which is used to centre the insertion on the edge. During the calculation a variance is also calculated, which is used to determine if there are two edges in the same block, in which case the block is classified as inconsistent, and no insertion is made. This is to prevent the insertion being made half way between the two edges. An alternative strategy in this case would be to split the insertion and make two insertions, calculating the centres by a binary splitting local centroid algorithm.

Referring to Figure 4, the orientations are held as the real Re(x,y) and imaginary Im(x,y) components of the double angle form they are averaged in this form

Re_Λ = ∑ Re( c,y)

Im_ -4 = lm(;t,} χ ,y The energy of the block is calculated from

Strength = M_A(k,l) = ∑sqrt ( Re(x,y)Jm(x,y) ) χ .y

The local block orientation is calculated from

The block consistency is calculated from

- _ sqrt( Re_A*Re_A + Im_A*Im_A)

M_A(k,l)

To calculate the centre point c(x,y) translate coordinates to centre of block x ιl = x N

2

rotate axis to local orientation x2 = dx*x + dy*y y2 = -dy * x + dx * y calculate component of activity in block orientation r = dx * sin(θ(x,y)) + dy * (-cos(θ(x, y)) calculate local centroid of components

∑(r*y2) ley = x.y

*.y rotate and translate coordinates back

N ex = dx * lex - dy * ley H —

N cy = dy * lex + dx * ley + —

also calculate a variance figure

∑(r*y2*y2-lcy*lcy) var = ^-

*.y

Figure 5 illustrates how the different block types are processed. The oriented blocks are given an elliptical insertion aligned with the edge within the block. The

strength of the insertion is scaled by the energy within the block. The low energy blocks have a circular insertion, centred on the centre of the block, and with a fixed strength. They may also have a small dither to prevent contouring. Inconsistent blocks have no insertion. The elliptical insertion function is calculated for each point within the block based on its distance from the centre point, by rotating to align a local axis with the block orientation, and scaling the y-axis to produce an elliptical rather than circular function, as shown in Figure 6. The cross sectional function of the insertion is a 1/2 cycle of a cos function. This insertion function is then used to weight the concave/convex-ness of the individual points, so as to limit the extent of the insertion.

Referring to Figure 6, the insertion function is calculated as follows. Calculate the distance vector between point (x,y) and point (cx.cy) xl = x - ex yl = y -cy Rotate the local axis to align with the local block orientation x2 = dx * xl + dy * yl y2 = -dy * xl + dx * yl scale the y axis to produce an elliptical function in distance y3 = y2 * ratio calculate the radial distance of the point(x,y) ^ JJ MAX_d calculate the insertion function if(d > l)d = l i(x,y) = 0.5 *(cos(d * π) + l)

The insert function is appropriately scaled by the block energy factor, M_A. Whilst the scaling may be a simple proportionality factor, other scalings may be envisaged. In general, the scaled insert factor, i s> may be represented as: i s ( x,y) = f ( i(x,y), M_A )

The calculation of the convex/concave-ness of a block is illustrated in Figure 7. The edge points of the block are used to estimate the inside points using a linear interpolation in the direction of the block orientation. The difference between the

estimated value and the actual value then gives a +ve or -ve number. These numbers are then weighted by the insertion function and summed over the block to give a final +ve or -ve value for the block which indicated its concave or convex-ness.

Referring to Figure 7, predicted value at point (x,y) p(x,y) = Lin(p(xl,yl), p(x2,y2)) p(xl,yl) = Lin(r(xl l,yl),r(xl2,yl)) p(x2,y2) = Lin(r(x2, y21),r(x2,y22)) concave/convexness of point (x,y) c(x,y) = r(x, y) - p(x,y)

Overall concave/convexness of block(k,l), when scaled insert function is added:

C(kJ) = T c(x,y)* is(x,y)

*.y The measure C of block concavity is a significant factor which is computed during the encoding process, and is employed during the decoding process to derive the inserted code.

In the encoding process, the measure C is computed , and a further factor is computed from C, as will be explained in more detail below to determine by how much the scaled insert function should be further scaled to produce the desired value of C in the individual encoded pixels which are transmitted.

In the decoding process, the existing concavity C of the image block is assessed ( which will include the inserted scaled insert function ), and to this is added the concavity of the predicted insert function i s A correlation process is then employed to determine whether an insert function exists within the image.

By way of example, in a decoding process, measures of block concave/convex-ness are combined across a predefined set of blocks in order to produce a reliable result for each bit. For example the measures from 4 successive lines of blocks can be combined to produce each bit. The combination is done by correlating to one of two different pseudo-random sequences as shown below. The elements of the PRS are multiplied by the measures for successive blocks, and the results accumulated, this is done for both sequences. Then the largest of the two determines which bit is decoded, and ratio of the largest correlating value to the maximum possible one gives a measure of confidence that the correlation is correct. Note that a measure of confidence is only reliable for a large number of blocks.

EXAMPLE

- two correlation sequences e.g. Zero : +1, -1, +1,+1, -1, +1, +1, +1, -1, -1, +1

One: -1, -1, -1, +1, +1, -1, +1, -1 , +1, -1, +1

- correlated with C(x,y) e.g.

C(x,y): +0.2, -0.9, +0J, etc... zero gives: (+l)*(+0.2) +(-1)* (-0.9) + (+1)* (+0J) = +1.2 one gives: (-l)*(+0...2) + (-l)*(-0.9) + (-1)*(+0J) = +0.6 sum gives: (+0.2) + (+0.9) + (+0J) =+1.2 Maximum of zero or one determines 0 or 1 bit decision e.g. zero = +1.2 gives a 0 bit

- 100* (zero/sum) gives measure of confidence as a number up to a maximum of 100 e.g. 100* (zero/sum) = 100

Referring now to Figure 8 which shows an encoder for encoding video images, video data is input on line 10 to an 8 x 8 blocking device 12, which performs the operation shown in Figure 2 of dividing the input data into blocks each of 64 pixels. The block data DATA is then passed to two devices 14, 16 for estimating the local orientation of each point within the block and giving the real component Re_A of the orientation and the imaginary component Im_A of the orientation by a process of interpolation described with reference to Figure 3. The values are averaged in summing devices 18, 20 to give average values for the block and from these average values, the block orientation θ is calculated as at 22 by dividing the angle of the block vector by two as described with reference to Figure 4. Signals Im_A and Re_A are applied as inputs to an energy calculation unit 68 which generates a signal Strength, representing the energy or strength of the featured regions in the block, in the manner described with reference to Figure 4. A measure of the consistency of orientation in the block is obtained as at 24 by taking the ratio of the magnitude of the block vector to the block energy. This provides an output β which is applied to a logic unit 80 to be described.

The block orientation unit 22 output θ is applied to devices 26, 28 together with the individual values of each vector from units 14,16 in order to perform the calculation described with reference to Figure 4 of calculating for each vector the component of activity parallel to the axis of the block orientation. In addition, device 28 rotates the coordinates of the vectors to be parallel with the block orientation vector. The centroid of the components is computed as at 30, 32 and outputs lex, ley are applied to unit 34 which is operative to translate the components back to the original x, y axes and provide centroid components ex, cy. In addition device 36 calculates a variance figure var as described with reference to Figure 4. Devices 40, 42, 44 receive signals ex, cy, the block orientation θ, and the

Strength signal. Devices 40, 42, 44 are operative to calculate the elliptical insertion function i as described with reference to Figure 6. The Strength signal is employed to scale the insert function and produce a scaled insert function i s . The insertion function is employed to weight the amount of luminance applied to each pixel dependent on its radial position with reference to the edge centre.

Devices 46, 48, 50, 52, 54, 56 are employed to interpolate the pixel addresses of the edge and to estimate the existing concavity of the block. Firstly, a point within the block (x,y) is reconstituted from the orientation θ as at 46, 48. As described with reference to Figure 7, edge addresses xl l - x22 are estimated at 50 by a process of interpolation at the edge of the block, and the luminance values p(x,j, yi), p , Y2) are then estimated as at 52, 54 by a process of linear interpolation. The luminance of the point p(x, y) is then calculated by a further linear interpolation as at 56. The difference c(x, y) between the actual value r(x, y) and the estimated value p(x, y) is then found in subtractor 58. The value c(x, y) weighted by the insertion function i(x, y) and summed at 60 over the entire block gives a sum value C(k,l) representing the concavity of the entire block as described above.

As mentioned above, this value is employed directly in the decoding process. In the encoding process, this value is employed, as will now be described, to determine the weighting to be applied to the luminance of individual pixels. A value is derived representing the maximum strength of the insert which will not risk the insert becoming visible. This value is derived from a look up table 70 which is accessed by the Strength signal. The lookup table value is limited as at 72 and modulated as at 74 by the pseudo random code bit to be applied to the block. The result is then subtracted from the overall concavity figure C in subtractor 62. The result of the subtraction gives a multiplying factor representing by how much the insert function must be adjusted to give the appropriate luminance value for individual pixels. This value is limited at 78. If

the multiplying factor is too great, creating the risk of a visible artefact, then the limiter 78 will return a maximum value only.

The multiplying factor is subject to a logical function at 80 which receives a control input from a logic block 81 , which thresholds and combines inputs comprising the consistency angle β, the variance var, and the signal Strength from unit 68 to indicate whether the block is a suitable block for containing data. Effectively the units 80, 81 perform the function indicated in Figure 5 to assess whether the block is suitable for code insertion.

The scaled insert function i s is multiplied at unit 82 with the multiplying factor and summed at 84 on a pixel-by -pixel basis with the input data from unit 12 to provide a coded output signal as at 84.

In the case where the block is unsuitable for code insertion along an edge in that the Strength signal indicates that the block is of low activity as exemplified in Fig. 5b, then units 40-44 are adapted to compute a circular insert function. In the case where as indicated in Figure 5a, insertion along an edge is possible, then units 40 - 44 compute the elliptical insertion function i s defined above with reference to Figure 6.

Referring now to Figure 9, the decoding section which receives the coded output from the encoder operates in a very similar manner and similar units are indicated by the same reference numeral. The essential difference is that units 70 - 82 of the encoder are omitted and are replaced in the decoder by unit 100, which is operative to perform the correlation function outlined above (see EXAMPLE ) with the pseudo random codes in order to decode the data. Thus the decoder computes the overall concavity of the block as at 58, and the anticipated scaled insert function i s These values are summed as at 60 to give a value for the whole block, and a correlation is performed in unit 100 with the two pseudo random codes representing the two possible binary values.

Whilst the above has been described as a preferred embodiment, other embodiments may be implemented. For example an embodiment will now be described for encoding textured regions.

ENCODER 1. An image is to be encoded containing textured regions comprising a random mixture of small localised areas having different chrominance values. For each block of the MxN blocks of the image, the texture statistical parameters are calculated by a cluster analysis process which produces clusters of chrominance values and the variance values for each cluster. The number of clusters and cluster variances are used to identify blocks which consist of two (or more ) basic chrominance values (colours )

which are distributed in a random or relatively random pattern. The computed statistical parameters are used to identify the more intense "foreground" value. A threshold is set based on the statistical parameters, and used to identify pixels within each block which belong to the foreground value. 2. A circular function centered on the centre of the sub-bit block, with a maximum value at its centre and tapering to zero with an appropriate cross-sectional function, a 1/2 wave raised cosine function, is calculated. The magnitude of the circular function is set from the cluster statistics ( by means of empirical measurements), to maximise the insertion strength whilst limiting visibility. It is also limited by the existing concavity/ convexity of the sub-bit block, which is calculated as in the above described embodiment.

3. The insertion function thus calculated is applied to adjust the cluster value in a positive or negative manner depending on the existing concavity, according to the required sign. The adjustment is made only to those pixels which have been identified as part of the foreground value.

DECODER

1. As with step 1. of the Encoder, the statistical parameters of the sub-bit block are calculated to identify the "foreground" pixels. 2. For each identified foreground pixel , the distance from the centre of the sub- bit block (circular insert function) is calculated.

3. All different combinations of pairs Pi of foreground pixels are determined, and for each pair Pi the nearer pixel to the centre is calculated. The difference in the intensity values Vi of each pair is computed by subtracting the value of the pixel nearer the centre from the value of the pixel further from the center. The difference Di in the distances of the pixels of each pair from the centre is also calculated.

4. A factor C is now computed = ∑j Vi*Di

5. C is now the estimate of concavity and is used as in the main embodiment as described above.




 
Previous Patent: PICTURE READER

Next Patent: INTERACTIVE BROADCASTING SYSTEM