


Title:
TEXT DOCUMENT CATEGORIZATION USING RULES AND DOCUMENT FINGERPRINTS
Document Type and Number:
WIPO Patent Application WO/2021/121279
Kind Code:
A1
Abstract:
Methods, apparatuses, and storage media storing instructions for classifying text documents are provided. A plurality of text documents is obtained. The plurality of text documents is classified into one or more document categories based on a plurality of classification rules. Each of the one or more document categories includes one or more first text documents of the plurality of text documents. A second text document of the plurality of text documents is classified, based on the plurality of classification rules, as belonging to none of the one or more document categories. One or more document fingerprints are generated for respective first text documents in the one or more document categories. The second text document is then classified into one of the one or more document categories based on the one or more document fingerprints.
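
The two-stage flow in the abstract (rules first, fingerprints for documents the rules leave uncategorized) can be illustrated with a minimal Python sketch. The abstract does not say how rules or fingerprints are defined, so everything concrete here is an assumption made for illustration: the keyword-set rules in `RULES`, the fingerprint as a set of hashed 3-word shingles, the Jaccard similarity measure, the `threshold` value, and all function names. It is not the method claimed in the application.

```python
import hashlib
import re

# Hypothetical keyword rules (assumption): a document belongs to a
# category when every keyword of that category appears in it.
RULES = {
    "invoice": {"invoice", "amount"},
    "resume": {"experience", "education"},
}

def tokens(text):
    """Lowercase alphanumeric tokens of a document."""
    return re.findall(r"[a-z0-9]+", text.lower())

def classify_by_rules(text):
    """Return the first category whose keywords all occur, else None."""
    words = set(tokens(text))
    for category, keywords in RULES.items():
        if keywords <= words:
            return category
    return None

def fingerprint(text, k=3):
    """Assumed fingerprint: the set of hashed k-word shingles."""
    toks = tokens(text)
    shingles = (" ".join(toks[i:i + k]) for i in range(max(len(toks) - k + 1, 0)))
    return {hashlib.sha1(s.encode("utf-8")).hexdigest()[:16] for s in shingles}

def jaccard(a, b):
    """Jaccard similarity of two fingerprint sets (0.0 if either is empty)."""
    if not a or not b:
        return 0.0
    return len(a & b) / len(a | b)

def categorize(documents, threshold=0.3):
    """Stage 1: rules. Stage 2: fingerprint match for rule-unmatched documents."""
    labels, unmatched = {}, []
    category_fingerprints = {}  # category -> fingerprints of rule-matched documents
    for doc_id, text in documents.items():
        category = classify_by_rules(text)
        if category is None:
            unmatched.append(doc_id)
        else:
            labels[doc_id] = category
            category_fingerprints.setdefault(category, []).append(fingerprint(text))
    for doc_id in unmatched:
        fp = fingerprint(documents[doc_id])
        best_category, best_score = None, 0.0
        for category, fps in category_fingerprints.items():
            for other in fps:
                score = jaccard(fp, other)
                if score > best_score:
                    best_category, best_score = category, score
        # Assumption: documents below the threshold stay uncategorized (None).
        labels[doc_id] = best_category if best_score >= threshold else None
    return labels

docs = {
    "a": "Invoice number 1001: total amount due within thirty days of receipt",
    "b": "Work experience and education history of the candidate",
    "c": "Reminder: total amount due within thirty days of receipt for order 1001",
    "d": "Meeting notes about the quarterly roadmap",
}
labels = categorize(docs)
```

In this sample, document `c` matches no rule because it lacks the word "invoice", but it shares most of its shingles with document `a` and so inherits the `invoice` category; document `d` resembles nothing and stays uncategorized under the assumed threshold.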

Inventors:
REN LIWEI (US)
WANG QIAOYUE (US)
Application Number:
PCT/CN2020/136877
Publication Date:
June 24, 2021
Filing Date:
December 16, 2020
Assignee:
BEIJING DIDI INFINITY TECHNOLOGY & DEV CO LTD (CN)
International Classes:
G06F16/35
Foreign References:
CN109800308A 2019-05-24
CN103744964A 2014-04-23
US20070005589A1 2007-01-04
CN104982011A 2015-10-14
US20130013603A1 2013-01-10
CN109582783A 2019-04-05
Attorney, Agent or Firm:
WITTPAT INTELLECTUAL PROPERTY LAW FIRM (CN)