Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
TRAINING SAMPLE SET GENERATION FROM IMBALANCED DATA IN VIEW OF USER GOALS
Document Type and Number:
WIPO Patent Application WO/2023/078240
Kind Code:
A1
Abstract:
A method is provided, the method including: receiving a sample set for training a machine-learning model, wherein the sample set includes a plurality of classes, wherein classes within the plurality of classes have an imbalance in a number of samples; creating an enlarged minority class by generating new samples from the samples within the minority class and adding the new samples to the minority class; selecting subset samples from both the samples within the enlarged minority class and the majority class; weighting each of the subset samples based upon user input defining goals for attributes of a training sample set to be used in training the machine-learning model; and generating, using the neural network, the training sample set by re-running the selecting in view of the weighting.

Inventors:
SHARMA MITTAL RUHI (IN)
NAGALAPATTI LOKESH (IN)
PATEL HIMA (IN)
GUPTA NITIN (IN)
Application Number:
PCT/CN2022/128957
Publication Date:
May 11, 2023
Filing Date:
November 01, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
IBM (US)
IBM CHINA CO LTD (CN)
International Classes:
G06F17/14
Domestic Patent References:
WO2019033636A12019-02-21
Foreign References:
CN113194094A2021-07-30
US20210073671A12021-03-11
US20200045063A12020-02-06
US20170206457A12017-07-20
US20210133518A12021-05-06
Attorney, Agent or Firm:
ZHONGZI LAW OFFICE (CN)
Download PDF: