Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD OF MINING AND CLEANING UP SIMILAR BOOKS IN BOOK DATABASE, AND DEVICE UTILIZING SAME
Document Type and Number:
WIPO Patent Application WO/2017/080320
Kind Code:
A1
Abstract:
The method provides a method of mining and cleaning up similar books in a book database, and a device utilizing the same. The method comprises: determining, according to book titles of all of electronic books in a book database, a group to which each of the electronic books belongs; with respect to each of groups, computing, according to a chapter list of each of electronic books in each of the groups, similarity between electronic books in each of the groups; if similarity between two electronic books in each of the groups exceeds a preconfigured threshold, determining that the two electronic books is a pair of similar books in the book database; constructing, by employing similar books in all of the groups in the book database, a graph model of the book database, wherein each pair of similar books are two mutually connected vertices in the graph model; with respect to each of connected components in the graph model, selecting one electronic book to be kept from the connected components, and deleting the other electronic book. Applying the invention can increase a speed of mining similar books from a book database, and reduce a size of the book database.

Inventors:
ZHANG CHAO (CN)
Application Number:
PCT/CN2016/099894
Publication Date:
May 18, 2017
Filing Date:
September 23, 2016
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
BEIJING QIHOO TECHNOLOGY CO (CN)
QIZHI SOFTWARE (BEIJING) COMPANY LTD (CN)
International Classes:
G06F17/30
Foreign References:
CN102024065A2011-04-20
CN105373604A2016-03-02
US20130332466A12013-12-12
Attorney, Agent or Firm:
BEIJING LONGAN LAW FIRM (CN)
Download PDF: