Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
DOCUMENT CLUSTERING SYSTEM, DOCUMENT CLUSTERING METHOD, AND RECORDING MEDIUM
Document Type and Number:
WIPO Patent Application WO/2011/078186
Kind Code:
A1
Abstract:
In the provided document clustering system (100), a concept tree accumulation unit (11) stores a concept tree structure that represents a hierarchical relationship among concepts represented by each of a plurality of words. For any two words, a concept similarity computation unit (12) obtains a concept similarity, which is an index indicating how close the concepts represented by the two words are. Using concept similarities for words that appear in two documents in a document set, an inter-document similarity computation unit (13) obtains an inter-document similarity, which indicates how similar the two documents are semantically. A clustering unit (14) uses inter-document similarities to cluster the documents in the document set.

Inventors:
MIZUGUCHI HIRONORI (JP)
KUSUI DAI (JP)
Application Number:
PCT/JP2010/073042
Publication Date:
June 30, 2011
Filing Date:
December 21, 2010
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NEC CORP (JP)
MIZUGUCHI HIRONORI (JP)
KUSUI DAI (JP)
International Classes:
G06F17/30
Foreign References:
JP2006221478A2006-08-24
JPH11203319A1999-07-30
JP2008204374A2008-09-04
Attorney, Agent or Firm:
KIMURA MITSURU (JP)
Mitsuru Kimura (JP)
Download PDF: