Title:
METHOD OF SEARCHING FOR DOCUMENT DATA FILES BASED ON KEYWORDS, AND COMPUTER SYSTEM AND COMPUTER PROGRAM THEREOF
Document Type and Number:
WIPO Patent Application WO/2011/070832
Kind Code:
A1
Abstract:
Disclosed is a method of searching for document data files based on keywords. The method comprises the steps of: calculating, as a first vector, the scores or probabilities that respective document data files are associated with clusters or classes used for clustering or classifying the document data files; calculating, as a second vector, in response to keywords entered in a search, the scores or probabilities that the entered keywords, or keywords related to them, are associated with the clusters or classes; calculating the scalar product of the first vector and the second vector, the resulting scalar product value being the score of each document data file with respect to the keywords; and obtaining a correlation value between the document data files containing each classification keyword set and the document data files whose calculated score is greater than or equal to a prescribed threshold or falls within a prescribed top proportion.
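
The scoring and selection steps of the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the function names, the three-cluster example vectors, the threshold of 0.2, and the top proportion of 0.5 are all assumptions chosen for illustration, and the correlation step is left out because the abstract does not say how the correlation value is computed.

```python
import math
from typing import Dict, List


def dot(a: List[float], b: List[float]) -> float:
    """Scalar (inner) product of two vectors defined over the same clusters."""
    if len(a) != len(b):
        raise ValueError("vectors must cover the same number of clusters")
    return sum(x * y for x, y in zip(a, b))


def score_documents(doc_vectors: Dict[str, List[float]],
                    keyword_vector: List[float]) -> Dict[str, float]:
    """Score each document as the scalar product of its cluster vector
    (the first vector) and the keyword cluster vector (the second vector)."""
    return {doc_id: dot(vec, keyword_vector)
            for doc_id, vec in doc_vectors.items()}


def select_by_threshold(scores: Dict[str, float], threshold: float) -> List[str]:
    """Keep documents whose score is greater than or equal to the threshold."""
    return [doc_id for doc_id, s in scores.items() if s >= threshold]


def select_top_proportion(scores: Dict[str, float], proportion: float) -> List[str]:
    """Keep the highest-scoring share of documents (at least one document)."""
    ranked = sorted(scores, key=lambda doc_id: scores[doc_id], reverse=True)
    k = max(1, math.ceil(proportion * len(ranked)))
    return ranked[:k]


# Hypothetical data: probabilities that each document belongs to 3 clusters.
doc_vectors = {
    "doc_a": [0.8, 0.1, 0.1],
    "doc_b": [0.2, 0.7, 0.1],
    "doc_c": [0.1, 0.2, 0.7],
}
# Hypothetical probabilities that the entered keyword belongs to each cluster.
keyword_vector = [0.9, 0.1, 0.0]

scores = score_documents(doc_vectors, keyword_vector)
above_threshold = select_by_threshold(scores, threshold=0.2)
top_half = select_top_proportion(scores, proportion=0.5)

print(scores)
print(above_threshold)
print(top_half)
```

With these example values, doc_a scores highest because both it and the keyword are concentrated in the first cluster, even though the sketch never checks whether the keyword literally occurs in the document.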

Inventors:
INAGAKI TAKESHI (JP)
Application Number:
PCT/JP2010/065631
Publication Date:
June 16, 2011
Filing Date:
September 10, 2010
Assignee:
IBM (US)
INAGAKI TAKESHI (JP)
International Classes:
G06F17/30
Foreign References:
JP2005050135A, 2005-02-24
JPH02235176A, 1990-09-18
JP2004287781A, 2004-10-14
Other References:
HIROMITSU NISHIZAKI ET AL.: "A Retrieval Method of Broadcast News Using Voice input Keywords", IEICE TECHNICAL REPORT, vol. 99, no. 523, 20 December 1999 (1999-12-20), pages 91 - 96
Attorney, Agent or Firm:
UENO Takeshi et al. (JP)
Tsuyoshi Ueno (JP)