Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
DATA CLASSIFICATION METHOD, APPARATUS, DEVICE AND COMPUTER READABLE STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2019/169704
Kind Code:
A1
Abstract:
Provided in an embodiment of the present application are a data classification method, apparatus, device, and computer readable storage medium. In a circumstance in which two classes of samples are unbalanced, for a large number of samples, several same class sample sets are generated by means of downsampling; and for a few classes of samples, new samples are generated by means of upsampling; the new samples are used for mixing with the few classes of samples to form a relatively large number of samples, such that the number of samples of a sample set in which the original numbers are few is balanced with the number of samples of a sample set in which the original numbers are many; and the few classes of samples and many classes of samples predict data by means of multiple modeling, and finally a prediction result having a quantitative advantage is taken as a classification result. The accuracy of the data prediction is improved by means of a means of upsampling, downsampling and multiple-modeling multiple-predictions.

Inventors:
WU WENYUE (CN)
Application Number:
PCT/CN2018/084047
Publication Date:
September 12, 2019
Filing Date:
April 23, 2018
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
PING AN TECH SHENZHEN CO LTD (CN)
International Classes:
G06F17/30
Foreign References:
CN106973057A2017-07-21
CN105487526A2016-04-13
EP3336739A12018-06-20
US20170132516A12017-05-11
Other References:
DU HONGLE ET AL.: "A classification algorithm based on mixed sampling for imbalanced dataset", JOURNAL OF YANSHAN UNIVERSITY, vol. 39, no. 2, 31 March 2015 (2015-03-31), XP055636317
Attorney, Agent or Firm:
SHENZHEN TALENT PATENT SERVICE (CN)
Download PDF: