Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
TRAINING MACHINE LEARNING MODELS ON A LARGE-SCALE DISTRIBUTED SYSTEM USING A JOB SERVER
Document Type and Number:
WIPO Patent Application WO/2018/196631
Kind Code:
A1
Abstract:
A computer system for training machine learning models includes a job server and a plurality of compute nodes. The job server receives jobs for training machine learning models and allocates these training jobs to groups of one or more compute nodes. The allocation is based on the current requirements of the training jobs and the current status of the compute nodes. The training jobs include updating values for the parameters (e.g., weights and biases) of the machine learning models. Preferably, the compute nodes in the training group communicate the updated values of the parameters among themselves in order to complete the training job.

Inventors:
CHEN XIN (US)
ZHOU HUA (US)
WANG DONGYAN (CN)
Application Number:
PCT/CN2018/082970
Publication Date:
November 01, 2018
Filing Date:
April 13, 2018
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
MIDEA GROUP CO LTD (CN)
International Classes:
G06F9/50; G06N20/20
Foreign References:
CN105575119A2016-05-11
CN102073546A2011-05-25
CN102523249A2012-06-27
CN105069703A2015-11-18
US7596788B12009-09-29
US20130290223A12013-10-31
Other References:
ABADI M.: "Arxiv.org", CORNELL UNIVERSITY LIBRARY, article "TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems"
SHOKRI R.: "Privacy-Preserving Deep Learning", PROCEEDINGS OF THE 22ND ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, CCS, January 2015 (2015-01-01)
See also references of EP 3593247A4
Attorney, Agent or Firm:
CHINA PAT INTELLECTUAL PROPERTY OFFICE (CN)
Download PDF: