Title:
ACCELERATION OF GPUS IN CLOUD COMPUTING
Document Type and Number:
WIPO Patent Application WO/2021/092634
Kind Code:
A3
Abstract:
The disclosure relates to technology for acceleration of GPUs in cloud computing. Instructions for a computational task are accessed. An allocation of data and instructions is calculated based on the data, the instructions, and dynamic GPU resources. The data and the instructions are provided to the GPUs in accordance with the allocation, which includes scheduling a set of instructions for parallel computation of an operation of the computational task on multiple sub-matrices of a data matrix. Separate portions of information are stored into corresponding different regions of non-transitory memory of a processor core to provide the processor core with concurrent access to the multiple sub-matrices. Each sub-matrix corresponds to a portion of the data matrix on which an operation of the computational task is to be performed. Each sub-matrix shares at least one element of the data matrix with another sub-matrix.
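
The overlapping sub-matrices described in the abstract can be pictured with a small sketch. The following is an illustration only, not the method claimed in the application: it cuts a data matrix into square windows whose stride is smaller than their size, so neighbouring windows share elements, and then applies one operation to every window through a thread pool that stands in for parallel GPU execution. The names `overlapping_submatrices`, `tile_sum`, `size`, and `stride` are hypothetical and do not come from the source.

```python
from concurrent.futures import ThreadPoolExecutor


def overlapping_submatrices(matrix, size, stride):
    """Return (row, col, tile) for every size x size window taken at the given stride.

    When stride < size, adjacent windows share elements of the data matrix.
    """
    rows, cols = len(matrix), len(matrix[0])
    tiles = []
    for r in range(0, rows - size + 1, stride):
        for c in range(0, cols - size + 1, stride):
            tile = [row[c:c + size] for row in matrix[r:r + size]]
            tiles.append((r, c, tile))
    return tiles


def tile_sum(tile):
    """Hypothetical per-sub-matrix operation: sum of all elements in the tile."""
    return sum(sum(row) for row in tile)


data = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
    [13, 14, 15, 16],
]

# 3x3 windows at stride 1: four sub-matrices, each overlapping its neighbours.
tiles = overlapping_submatrices(data, size=3, stride=1)

# The thread pool is only a CPU stand-in for scheduling the same operation
# on several sub-matrices at once; map() keeps the results in tile order.
with ThreadPoolExecutor() as pool:
    results = list(pool.map(tile_sum, (tile for _, _, tile in tiles)))

print(results)  # [54, 63, 90, 99]
```

In this toy case the element `6` at row 1, column 1 appears in all four windows, which is the kind of shared element the last sentence of the abstract refers to.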

Inventors:
WANG YONG (US)
ZHU YINGXUAN (US)
GKOUNTOUVAS THEODOROS (US)
SU HAN (US)
LEI HUI (US)
Application Number:
PCT/US2021/021113
Publication Date:
December 23, 2021
Filing Date:
March 05, 2021
Assignee:
HUAWEI TECH CO LTD (CN)
International Classes:
G06F9/50; G06N3/063
Foreign References:
US20190188570A1, 2019-06-20
Other References:
SANGKUG LYM ET AL: "DeLTA: GPU Performance Model for Deep Learning Applications with In-depth Memory System Traffic Analysis", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 3 April 2019 (2019-04-03), XP081164147
XUHAO CHEN: "Escort: Efficient Sparse Convolutional Neural Networks on GPUs", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 28 February 2018 (2018-02-28), XP081213571
WU XINXIN ET AL: "Accelerating Sparse Convolutional Neural Networks Based on Dataflow Architecture", 29 September 2020, COMPUTER VISION - ECCV 2020 : 16TH EUROPEAN CONFERENCE, GLASGOW, UK, AUGUST 23-28, 2020 : PROCEEDINGS; PART OF THE LECTURE NOTES IN COMPUTER SCIENCE ; ISSN 0302-9743; [LECTURE NOTES IN COMPUTER SCIENCE; LECT.NOTES COMPUTER], SPRINGER INTERNATIONAL PU, ISBN: 978-3-030-58594-5, XP047563404
Attorney, Agent or Firm:
POMERENKE, Ronald M. (US)