Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ONLINE INTERNET TOPIC MINING METHOD BASED ON IMPROVED LDA MODEL
Document Type and Number:
WIPO Patent Application WO/2017/035922
Kind Code:
A1
Abstract:
Disclosed is an online Internet topic mining method based on an improved LDA model. The method corresponds to a continuous and streaming type topic mining process conducted in a segmented mode, n web pages are processed each time, and these web pages are usually acquired by web crawlers from the Internet in an online and real-time mode, and mining results of the contents of these web pages generate k topics. After current n web pages are processed, newly acquired n web pages are continuously processed through the process. The process mainly comprises initialization of On-LDA model hyper-parameters, dynamic updating of the On-LDA model hyper-parameters, Internet topic mining based on the On-LDA model and the like. By means of the present invention, the assignment method and effect of use in respect to the hyper-parameters and of a traditional LDA model in the topic mining process are radically changed. Classified information to which the web page contents belong is fully utilized to assign initial values to the model hyper-parameters, so that the initial values of the hyper-parameters completely depend on the web page contents to be mined, and the computing process is simplified and rationality is achieved.

Inventors:
YANG PENG (CN)
LU YUNCHENG (CN)
DONG YONGQIANG (CN)
Application Number:
PCT/CN2015/092047
Publication Date:
March 09, 2017
Filing Date:
October 16, 2015
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
YANG PENG (CN)
International Classes:
G06F17/30
Foreign References:
CN105138665A2015-12-09
CN102439597A2012-05-02
CN101710333A2010-05-19
CN101587493A2009-11-25
US7853596B22010-12-14
Attorney, Agent or Firm:
JIANGSU AIXIN LAW FIRM (CN)
Download PDF: