

Title:
INTER-DOCUMENT DISTANCE CALCULATOR AND TEXT RETRIEVER
Document Type and Number:
Japanese Patent JP2011175568
Kind Code:
A
Abstract:

To solve the following problems, which arise when the similarity between documents, or between a document and a retrieval key, is calculated from the co-occurrence of words in a text: a correct similarity cannot be obtained when a text covering a plurality of subjects contains co-occurrences of mutually unrelated words; the similarity between synonyms, or between texts that express the same content differently, cannot be calculated accurately; and similar texts cannot be classified according to the similarity of their text structure.

A syntax analyzing means performs morphological analysis and dependency analysis on the character string of a document supplied by a document inputting means. A tree-structure-with-syntax-information creating means creates a tree structure with syntax information from the syntax analysis result. A parallel node adding means adds a parallel node to that tree structure, with the nodes that stand in a parallel relation as its child nodes, and a parallel node ordering means orders the nodes under the added parallel node. A distance calculating means then calculates the edit distance incurred when the tree structure with syntax information is edited into the tree structure with syntax information of another document.
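The ordering and distance steps described above can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the dependency trees have already been built by hand, since no morphological or dependency analyzer is invoked. The `<PARALLEL>` label, the helper names (`node`, `order_parallel`, `forest_distance`, `tree_distance`), the sort-based ordering rule, and the unit edit costs are all assumptions made for illustration. The abstract says only that nodes under a parallel node are ordered and that an edit distance is calculated.

```python
from functools import lru_cache

PARALLEL = "<PARALLEL>"  # hypothetical label for an added parallel node


def node(label, *children):
    """Build an immutable, hashable tree node: (label, (child, ...))."""
    return (label, tuple(children))


def order_parallel(tree):
    """Recursively sort the children of every parallel node.

    Sorting by subtree value is an assumed ordering rule, chosen so that
    "A and B" and "B and A" normalize to the same tree.
    """
    label, children = tree
    children = tuple(order_parallel(c) for c in children)
    if label == PARALLEL:
        children = tuple(sorted(children))
    return (label, children)


def size(forest):
    """Count all nodes in an ordered forest (a tuple of trees)."""
    return sum(1 + size(children) for _, children in forest)


@lru_cache(maxsize=None)
def forest_distance(f1, f2):
    """Ordered forest edit distance with unit insert/delete/relabel costs."""
    if not f1:
        return size(f2)  # insert every remaining node of f2
    if not f2:
        return size(f1)  # delete every remaining node of f1
    (l1, c1), (l2, c2) = f1[-1], f2[-1]
    return min(
        # delete the rightmost root of f1; its children take its place
        forest_distance(f1[:-1] + c1, f2) + 1,
        # insert the rightmost root of f2
        forest_distance(f1, f2[:-1] + c2) + 1,
        # match the two rightmost roots, relabeling if the labels differ
        forest_distance(f1[:-1], f2[:-1])
        + forest_distance(c1, c2)
        + (l1 != l2),
    )


def tree_distance(t1, t2):
    """Edit distance between two trees after ordering parallel nodes."""
    return forest_distance((order_parallel(t1),), (order_parallel(t2),))


# "buy apples and oranges" versus "buy oranges and apples"
a = node("buy", node(PARALLEL, node("apples"), node("oranges")))
b = node("buy", node(PARALLEL, node("oranges"), node("apples")))
# "buy apples"
c = node("buy", node("apples"))

print(forest_distance((a,), (b,)))  # without ordering the parallel node
print(tree_distance(a, b))          # with ordering
print(tree_distance(a, c))
```

In this sketch the two coordinated phrasings are 2 edits apart when compared as raw ordered trees, and 0 edits apart once the children of the parallel node are ordered, which is the effect the abstract attributes to the parallel node ordering means. The memoized recursion is the textbook definition of ordered tree edit distance. A production system would more likely use a dedicated algorithm such as Zhang-Shasha for larger trees.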


Inventors:
MIKAMI TAKASHI
HIRANO TAKASHI
Application Number:
JP2010040578A
Publication Date:
September 08, 2011
Filing Date:
February 25, 2010
Assignee:
MITSUBISHI ELECTRIC CORP
International Classes:
G06F17/30; G06F17/21
Domestic Patent References:
JP2004110161A 2004-04-08
JPH06162098A 1994-06-10
JPH04102171A 1992-04-03
JPH03161865A 1991-07-11
JP2002032374A 2002-01-31
JP2000148793A 2000-05-30
Other References:
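JPN6013044815; 板尾 要祐 and 3 others: 'A Japanese-language processing method for tree-structure mining that extracts characteristic semantic content' (特徴的な意味内容を抽出する木構造マイニングのための日本語処理手法), Proceedings of the 11th Annual Meeting of the Association for Natural Language Processing, 2005-03-15, p.73-76, The Association for Natural Language Processing
CSNG201000460018; 板尾 要祐 and 3 others: 'A Japanese-language processing method for tree-structure mining that extracts characteristic semantic content' (特徴的な意味内容を抽出する木構造マイニングのための日本語処理手法), Proceedings of the 11th Annual Meeting of the Association for Natural Language Processing, 2005-03-15, p.73-76, The Association for Natural Language Processing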
Attorney, Agent or Firm:
Michiharu Soga
Hidetoshi Furukawa
Suzuki Kenchi
Jun Kajinami
Kazuhiro Oyaku
Shunichi Ueda
Junichiro Yoshida