Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
TEXT CLUSTERING METHOD
Document Type and Number:
WIPO Patent Application WO/2022/126810
Kind Code:
A1
Abstract:
A text clustering method, comprising: performing word segmentation, stop word removal, and keyword extraction processing on a document set to be clustered (S1); creating a text similarity matrix, an adjacency matrix, a degree matrix, and a Laplacian matrix; calculating eigenvalues and eigenvectors of the Laplacian matrix to obtain an eigenmatrix; using a clustering method to cluster the eigenmatrix to obtain a clustering result (S6); if the number of categories is known, then setting the clustering result as the final clustering result; if the number of categories is unknown, then obtaining multiple clustering results by executing the following operations multiple times and evaluating the multiple clustering results to select a final clustering result: adjusting the clustering parameters, and executing the operations of constructing the adjacency matrix, degree matrix, and Laplacian matrix, calculating eigenvalues and eigenvectors of the Laplacian matrix to obtain an eigenmatrix, and clustering the eigenmatrix to obtain a clustering result; combining the final clustering result and the extracted keywords, and extracting a category keyword on the basis of a TF-IDF algorithm; and outputting the final clustering result and the category keyword (S10).

Inventors:
ZHANG ARES (CN)
MA MAXWELL (CN)
Application Number:
PCT/CN2021/071166
Publication Date:
June 23, 2022
Filing Date:
January 12, 2021
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
AISHU TECH CORP (CN)
International Classes:
G06F40/194
Foreign References:
CN109960730A2019-07-02
CN104462253A2015-03-25
CN108132968A2018-06-08
CN107943856A2018-04-20
US20080243829A12008-10-02
Other References:
NIU HAIYAN: "Research on Fuzzy Spectral Clustering Segmentation Algorithm and Apply It to Text Clustering", MASTER THESIS, TIANJIN POLYTECHNIC UNIVERSITY, CN, 15 March 2017 (2017-03-15), CN , XP055942873, ISSN: 1674-0246
Attorney, Agent or Firm:
BEYOND ATTORNEYS AT LAW (CN)
Download PDF: