Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
MODEL TRAINING METHOD AND RELATED DEVICE
Document Type and Number:
WIPO Patent Application WO/2024/093294
Kind Code:
A1
Abstract:
Disclosed in embodiments of the present application are a model training method and a related device, for use in performing acceleration training on a target model. In the present application, since sampling is performed in the target model, to obtain a sub-model of the target model, and the number of feature transformation layers of the sub-model is less than the number of feature transformation layers of the target model, and/or, and the size of the weight matrix of at least one of the feature transformation layers in the sub-model is smaller than the size of the weight matrix of the corresponding feature transformation layer in the target model, compared with training of the target model, training of the sub-model can improve the training efficiency. The sub-model is augmented, and the augmented model is trained to obtain a trained augmented model, so that the performance of the trained model can be guaranteed. Therefore, the technical solution of the present application improves the training efficiency while guaranteeing the performance of the model.

Inventors:
TANG YEHUI (CN)
DING NING (CN)
HAN KAI (CN)
XU CHAO (CN)
WANG YUNHE (CN)
Application Number:
PCT/CN2023/103347
Publication Date:
May 10, 2024
Filing Date:
June 28, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
HUAWEI TECH CO LTD (CN)
International Classes:
G06N3/0464
Foreign References:
CN113570029A2021-10-29
US20210073646A12021-03-11
CN109242028A2019-01-18
CN112070207A2020-12-11
US20220027434A12022-01-27
Attorney, Agent or Firm:
SHENPAT INTELLECTUAL PROPERTY AGENCY (CN)
Download PDF: