Title:
TEXT DOCUMENT CATEGORIZATION USING RULES AND DOCUMENT FINGERPRINTS
Document Type and Number:
WIPO Patent Application WO/2021/121279
Kind Code:
A1
Abstract:
Methods, apparatuses, and storage media storing instructions for classifying text documents are provided. A plurality of text documents is obtained. The plurality of text documents is classified into one or more document categories based on a plurality of classification rules. Each of the one or more document categories include one or more first text documents of the plurality of text documents. A second text document of the plurality of text documents is classified based on the plurality of classification rules as belonging to none of the one or more document categories. One or more document fingerprints are generated for respective first text documents in the one or more document categories. The second text document is classified into one of the one or more document categories based on the one or more document fingerprints.
More Like This:
Inventors:
REN LIWEI (US)
WANG QIAOYUE (US)
WANG QIAOYUE (US)
Application Number:
PCT/CN2020/136877
Publication Date:
June 24, 2021
Filing Date:
December 16, 2020
Export Citation:
Assignee:
BEIJING DIDI INFINITY TECHNOLOGY & DEV CO LTD (CN)
International Classes:
G06F16/35
Foreign References:
CN109800308A | 2019-05-24 | |||
CN103744964A | 2014-04-23 | |||
US20070005589A1 | 2007-01-04 | |||
CN104982011A | 2015-10-14 | |||
US20130013603A1 | 2013-01-10 | |||
CN109582783A | 2019-04-05 |
Attorney, Agent or Firm:
WITTPAT INTELLECTUAL PROPERTY LAW FIRM (CN)
Download PDF:
Previous Patent: LIQUID METAL CONDUCTIVE SLURRY AND ELECTRONIC DEVICE
Next Patent: MULTI-PURPOSE AGENT FOR ENDPOINT SCANNING
Next Patent: MULTI-PURPOSE AGENT FOR ENDPOINT SCANNING