Title:
GENERALIZED TEXT LOCALIZATION IN IMAGES
Document Type and Number:
WIPO Patent Application WO2001069529
Kind Code:
A3
Abstract:
In some embodiments, the invention includes a method for locating text in digital images. The method includes scaling a digital image into images of multiple resolutions and classifying whether pixels in the multiple resolutions are part of a text region. The method also includes integrating scales to create a scale integration saliency map and using the saliency map to create initial text bounding boxes through expanding the boxes from rectangles of pixels including at least one pixel to include groups of at least one pixel adjacent to the rectangles, wherein the groups have a particular relationship to a first threshold. The initial text bounding boxes are consolidated. In other embodiments, a method includes classifying whether pixels are part of a text region, creating initial text bounding boxes, and consolidating the initial text bounding boxes, wherein the consolidating includes creating horizontal projection profiles having adaptive thresholds and vertical projection profiles having adaptive thresholds.
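The box-expansion and consolidation steps named in the abstract can be illustrated with a minimal sketch. It is not the claimed method: the abstract does not say what the "particular relationship" to the first threshold is or how the adaptive thresholds are computed, so the code assumes that a neighbouring row or column is absorbed when its mean saliency exceeds the threshold, and that the adaptive threshold sits at a fixed fraction between the profile's minimum and maximum. The function names (`grow_box`, `horizontal_profile`, `adaptive_threshold`, `split_runs`) and the toy saliency map are hypothetical.

```python
def grow_box(saliency, seed, threshold):
    """Grow a box from a single seed pixel.

    Assumed rule (not stated in the abstract): an adjacent row or column
    is added when its mean saliency is above `threshold`.
    Returns (top, left, bottom, right), all inclusive.
    """
    rows, cols = len(saliency), len(saliency[0])
    top = bottom = seed[0]
    left = right = seed[1]

    def mean(cells):
        return sum(cells) / len(cells)

    changed = True
    while changed:
        changed = False
        if top > 0 and mean(saliency[top - 1][left:right + 1]) > threshold:
            top -= 1
            changed = True
        if bottom < rows - 1 and mean(saliency[bottom + 1][left:right + 1]) > threshold:
            bottom += 1
            changed = True
        if left > 0 and mean([saliency[r][left - 1] for r in range(top, bottom + 1)]) > threshold:
            left -= 1
            changed = True
        if right < cols - 1 and mean([saliency[r][right + 1] for r in range(top, bottom + 1)]) > threshold:
            right += 1
            changed = True
    return top, left, bottom, right


def horizontal_profile(saliency, box):
    """Sum the saliency of each row inside the box (one value per row)."""
    top, left, bottom, right = box
    return [sum(saliency[r][left:right + 1]) for r in range(top, bottom + 1)]


def adaptive_threshold(profile, fraction=0.5):
    """Assumed form: a fixed fraction of the way from min to max of the profile."""
    lo, hi = min(profile), max(profile)
    return lo + fraction * (hi - lo)


def split_runs(profile, threshold):
    """Return (start, end) index pairs of consecutive entries above threshold."""
    runs, start = [], None
    for i, value in enumerate(profile):
        if value > threshold and start is None:
            start = i
        elif value <= threshold and start is not None:
            runs.append((start, i - 1))
            start = None
    if start is not None:
        runs.append((start, len(profile) - 1))
    return runs


# Toy saliency map: two "text lines" separated by an empty row.
saliency = [
    [0, 0, 0, 0, 0, 0],
    [0, 9, 9, 9, 9, 0],
    [0, 9, 9, 9, 9, 0],
    [0, 0, 0, 0, 0, 0],
    [0, 8, 8, 8, 8, 0],
    [0, 0, 0, 0, 0, 0],
]

box = grow_box(saliency, seed=(1, 2), threshold=4)
profile = horizontal_profile(saliency, (1, 1, 4, 4))
lines = split_runs(profile, adaptive_threshold(profile))
print(box, profile, lines)
```

In this toy case the seed grows to cover only the upper block, and the horizontal profile of a box spanning both blocks splits into two row runs at the empty row. A vertical profile would be computed the same way over columns.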

Inventors:
LIENHART RAINER W
WERNICKE AXEL
Application Number:
PCT/US2001/005757
Publication Date:
February 07, 2002
Filing Date:
February 23, 2001
Assignee:
INTEL CORP (US)
International Classes:
G06T5/00; G06V30/10; (IPC1-7): G06T5/00
Other References:
ETEMAD K ET AL: "PAGE SEGMENTATION USING DECISION INTEGRATION AND WAVELET PACKETS", PROCEEDINGS OF THE IAPR INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION. JERUSALEM, OCT. 9 - 13, 1994. CONFERENCE B: PATTERN RECOGNITION AND NEURAL NETWORKS, LOS ALAMITOS, IEEE COMP. SOC. PRESS, US, vol. 2 CONF. 12, 9 October 1994 (1994-10-09), pages 345 - 349, XP000509906, ISBN: 0-8186-6272-7
JAIN, A.K.: "Fundamentals of Digital Image Processing", 1989, PRENTICE HALL, ENGLEWOOD CLIFFS, XP002180043
SATO T ET AL: "Video OCR for digital news archive", PROCEEDINGS. 1998 IEEE INTERNATIONAL WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO DATABASE (CAT. NO.98EX125), PROCEEDINGS 1998 IEEE INTERNATIONAL WORKSHOP ON CONTENT-BASED ACCESS OF IMAGE AND VIDEO DATABASE, BOMBAY, INDIA, 3 JAN. 1998, 1998, Los Alamitos, CA, USA, IEEE Comput. Soc, USA, pages 52 - 60, XP002178985, ISBN: 0-8186-8329-5
CINQUE L ET AL: "A multiresolution approach for page segmentation", PATTERN RECOGNITION LETTERS, NORTH-HOLLAND PUBL. AMSTERDAM, NL, vol. 19, no. 2, 1 February 1998 (1998-02-01), pages 217 - 225, XP004123777, ISSN: 0167-8655
MUKHERJEE D P ET AL: "DOCUMENT PAGE SEGMENTATION USING MULTISCALE CLUSTERING", PROCEEDINGS 1999 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING. ICIP'99. KOBE, JAPAN, OCT. 24 - 28, 1999, INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, LOS ALAMITOS, CA: IEEE, US, vol. 1 OF 4, 24 October 1999 (1999-10-24), pages 234 - 238, XP000921753, ISBN: 0-7803-5468-0