Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SPECIFIED ELEMENT VECTOR GENERATING DEVICE, CHARACTER STRING VECTOR GENERATING DEVICE, SIMILARITY CALCULATION DEVICE, SPECIFIED ELEMENT VECTOR GENERATING PROGRAM, CHARACTER STRING VECTOR GENERATING PROGRAM, SIMILARITY CALCULATION PROGRAM, SPECIFIED ELEMENT VECTOR GENERATING METHOD, CHARACTER STRING VECTOR GENERATING METHOD, AND SIMILARITY CALCULATION METHOD
Document Type and Number:
Japanese Patent JP2003288362
Kind Code:
A
Abstract:

To provide a similarity calculation device suitable to effectively calculate the similarity of a word by uniformly reflecting the word in the calculation of similarity according to the frequency of appearance.

A document vector is generated based on a plurality of document data. The document vector has an element corresponding to each morpheme, and each element is calculated so as to be a value according to the appearance frequency of the corresponding morpheme. A word vector is then generated by the inversion matrix of a document word matrix that is a set of generated document vectors. Accordingly, the word vector has the element corresponding to each document data, and each element is a value proportional to the appearance frequency of each morpheme in the corresponding data of the plurality of document data and inversely proportional to the appearance frequency of each morpheme in the plurality of document data. The similarity of the word is calculated on the basis of the word vector.


Inventors:
KAYAHARA NAOKI
Application Number:
JP2002089812A
Publication Date:
October 10, 2003
Filing Date:
March 27, 2002
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SEIKO EPSON CORP
International Classes:
G06F17/30; (IPC1-7): G06F17/30
Domestic Patent References:
JP2000339342A2000-12-08
JP2001043236A2001-02-16
JP2002073681A2002-03-12
JP2000207404A2000-07-28
JP2000112974A2000-04-21
JP2000172717A2000-06-23
JPH11167581A1999-06-22
Attorney, Agent or Firm:
Masayanagi Ueyanagi (2 outside)