Title:
TABLE DATA PARSING METHOD FOR PDF FILE
Document Type and Number:
WIPO Patent Application WO/2021/145541
Kind Code:
A1
Abstract:
The present invention relates to a method for parsing table data from a PDF file. The method comprises the steps of: generating a parse tree for the PDF file by extracting data from the file and analyzing its structure; using the generated parse tree to locate the page that contains the headword of the table being searched; setting a parsing range on the located page relative to the coordinates (x, y) assigned to that headword; and parsing the table data within the parsing range. The method enables target table data to be parsed accurately from a PDF file.
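
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation. It assumes the parse tree has already been reduced to a list of pages, each a list of positioned text items, and that y grows downward from the top of the page. The names `TextItem`, `find_headword`, and `parse_table`, and the parameters `width`, `height`, and `row_tolerance`, are hypothetical and do not come from the application.

```python
from dataclasses import dataclass


@dataclass
class TextItem:
    """One positioned text run taken from the (assumed) parse tree."""
    text: str
    x: float
    y: float  # assumption: y increases downward from the top of the page


def find_headword(pages, headword):
    """Return (page_index, item) for the first item containing the headword."""
    for index, items in enumerate(pages):
        for item in items:
            if headword in item.text:
                return index, item
    return None


def parse_table(pages, headword, width, height, row_tolerance=2.0):
    """Collect text inside a rectangle anchored at the headword, as rows."""
    found = find_headword(pages, headword)
    if found is None:
        return []
    index, anchor = found

    # Parsing range (assumed shape): a rectangle that starts at the
    # headword's x and extends `width` right and `height` below it.
    in_range = [
        item for item in pages[index]
        if anchor.x <= item.x <= anchor.x + width
        and anchor.y < item.y <= anchor.y + height
    ]

    # Group items into rows: items whose y lies within `row_tolerance`
    # of the first item of the current row share that row.
    in_range.sort(key=lambda item: (item.y, item.x))
    rows = []
    row_y = None
    for item in in_range:
        if row_y is None or item.y - row_y > row_tolerance:
            rows.append([])
            row_y = item.y
        rows[-1].append(item)

    # Order the cells of each row from left to right.
    return [[item.text for item in sorted(row, key=lambda i: i.x)] for row in rows]
```

Anchoring the range at the headword's coordinates keeps unrelated text on the same page, such as footers or other tables, out of the result, which appears to be the accuracy benefit the abstract describes.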

Inventors:
GU DA HAE (KR)
KIM DONG HOON (KR)
Application Number:
PCT/KR2020/015235
Publication Date:
July 22, 2021
Filing Date:
November 03, 2020
Assignee:
TITECHNOLOGY CO LTD (KR)
International Classes:
G06F16/22
Foreign References:
US20190294399A1 2019-09-26
KR20180080408A 2018-07-12
KR100912502B1 2009-08-17
JPH0765034A 1995-03-10
KR20090084161A 2009-08-05
KR102171325B1 2020-10-28
Attorney, Agent or Firm:
MAJOR PATENT AND LAW FIRM (KR)