Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
STRUCTURED DOCUMENT PROCESSING DEVICE, STRUCTURED DOCUMENT PROCESSING METHOD, AND PROGRAM
Document Type and Number:
WIPO Patent Application WO/2021/019772
Kind Code:
A1
Abstract:
This structured document processing device includes: an analysis unit that analyzes the tree structure of a structured document; and a generation unit that, for each leaf node in the tree structure, identifies a path from the leaf node to a root node, and generates a post-conversion document including text data in which character strings are connected, the character strings relating to each node from the root node to the leaf node for each path. This configuration makes it easy to apply a neural network to the structured document.

Inventors:
NOMOTO NARICHIKA (JP)
ASANO HISAKO (JP)
TOMITA JUNJI (JP)
Application Number:
PCT/JP2019/030276
Publication Date:
February 04, 2021
Filing Date:
August 01, 2019
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NIPPON TELEGRAPH & TELEPHONE (JP)
International Classes:
G06F40/216
Foreign References:
JP2012027852A2012-02-09
Other References:
YUMI YONEI, MIZUHO IWAIHARA, MASATOSHI YOSHIKAWA: "Person Retrieval on XML Documents by Coreference that Uses Structural Features", JOURNAL OF THE DBSJ, vol. 7, no. 1, 27 June 2008 (2008-06-27), pages 151 - 156, XP009527977, ISSN: 1883-4205
Attorney, Agent or Firm:
ITOH, Tadashige et al. (JP)
Download PDF: