Title:
A document conversion device, a document conversion method, and a document conversion program
Document Type and Number:
Japanese Patent JP6091093
Kind Code:
B2
Abstract:
PROBLEM TO BE SOLVED: To convert multiple documents having different formats into structuralized information. SOLUTION: A document conversion device 100 includes a format determination unit 150 that determines the format of an input document; a table format determination unit 160 that determines the format of each table included in the document of the determined format; structuralization units, one prepared for each table format, each of which converts the information in a table of the corresponding format into structuralized information; and an information conversion control unit 170 that converts the information in each table into structuralized information using the structuralization unit identified for that table on the basis of the table's format and conversion designation information.
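
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: every class, function, and rule below is hypothetical, the two table formats ("key_value" and "header_row") are invented for illustration, and "conversion designation information" is assumed here to be a simple mapping from table format to the name of the structuralization unit to use.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical data model; the abstract does not define one.
@dataclass
class Table:
    rows: List[List[str]]

@dataclass
class Document:
    name: str
    tables: List[Table]

def determine_document_format(doc: Document) -> str:
    """Stand-in for format determination unit 150 (assumed rule: file extension)."""
    return doc.name.rsplit(".", 1)[-1].lower() if "." in doc.name else "unknown"

def determine_table_format(doc_format: str, table: Table) -> str:
    """Stand-in for table format determination unit 160.

    Assumed rule: a table whose rows all have exactly two cells is a
    key/value table; anything else is treated as header row plus records.
    doc_format is accepted but unused in this sketch.
    """
    if table.rows and all(len(row) == 2 for row in table.rows):
        return "key_value"
    return "header_row"

def structure_key_value(table: Table) -> List[Dict[str, str]]:
    """Structuralization unit for key/value tables: one record for the whole table."""
    return [{key: value for key, value in table.rows}]

def structure_header_row(table: Table) -> List[Dict[str, str]]:
    """Structuralization unit for tables whose first row holds column names."""
    if not table.rows:
        return []
    header, *body = table.rows
    return [dict(zip(header, row)) for row in body]

# One structuralization unit per table format, looked up by (hypothetical) unit name.
UNITS: Dict[str, Callable[[Table], List[Dict[str, str]]]] = {
    "kv_unit": structure_key_value,
    "header_unit": structure_header_row,
}

def convert_document(doc: Document, designation: Dict[str, str]) -> dict:
    """Stand-in for information conversion control unit 170.

    designation is the assumed form of the conversion designation
    information: table format -> unit name. Tables whose format has no
    designated unit are skipped.
    """
    doc_format = determine_document_format(doc)
    converted = []
    for table in doc.tables:
        table_format = determine_table_format(doc_format, table)
        unit_name = designation.get(table_format)
        if unit_name is None:
            continue
        converted.append(UNITS[unit_name](table))
    return {"document_format": doc_format, "tables": converted}

doc = Document(
    name="report.xlsx",
    tables=[
        Table([["Name", "Age", "City"], ["Ann", "30", "Tokyo"]]),
        Table([["Title", "Report"], ["Year", "2012"]]),
    ],
)
result = convert_document(doc, {"key_value": "kv_unit", "header_row": "header_unit"})
```

Keeping the units in a lookup table mirrors the abstract's idea that the control unit only selects a unit per table; supporting a further table format in this sketch would mean adding one function and one designation entry.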

Inventors:
Hiroshi Kono
Hiroyuki Nakazaki
Toru Takagi
Tomochi Takayama
Miyadate Yasuo
Motoyama Hisao
Yosuke Kondo
Kayomi Yagi
Application Number:
JP2012135219A
Publication Date:
March 08, 2017
Filing Date:
June 14, 2012
Assignee:
NTT DATA Corporation
International Classes:
G06F17/21; G06F12/00; G06F17/24; G06T11/60
Domestic Patent References:
JP11250041A
JP2004178170A
JP10307816A
JP200699480A
JP2011170397A
JP2009151676A
JP20103155A
JP2013135219A
Other References:
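石谷 康人 and 2 others, "Table structure analysis based on cell classification and cell modification for XML document conversion," IEICE Technical Report, Japan, The Institute of Electronics, Information and Communication Engineers, March 11, 2005, Vol. 104, No. 742, pp. 157-162
山口 智由 and 1 other, "Extraction and integration of statistical data from multiple tables," Proceedings of the 19th IEICE Data Engineering Workshop [online], Japan, IEICE Technical Committee on Data Engineering, April 7, 2008, DEWS2008 A4-5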
Attorney, Agent or Firm:
Kimura Mitsuru