Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
COMPUTE-INTENSIVE KERNEL GENERATOR, MICRO-KERNEL CODE CACHE, FUSED KERNEL GENERATOR AND CYCLIC DEPENDENCE FREE GRAPH PARTITIONING FOR DEEP LEARNING WORKLOADS
Document Type and Number:
WIPO Patent Application WO/2023/108894
Kind Code:
A1
Abstract:
Systems, apparatuses and methods may provide for technology that identifies a data layout associated with input tensors and output tensors, generates a micro-kernel based at least in part on the data layout, and generates a nested outer loop for a kernel, wherein the micro-kernel performs one or more subtasks associated with a task represented by the kernel. The technology also includes micro-kernel code caches, fused kernel generators and cyclic dependence free graph partitioning for deep learning workloads.

Inventors:
LI JIANHUI (US)
QIN ZHENNAN (CN)
GONG JIONG (CN)
CUI JINGZE (CN)
MEI YIJIE (CN)
SONG YUNFEI (CN)
Application Number:
PCT/CN2022/077751
Publication Date:
June 22, 2023
Filing Date:
February 24, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
INTEL CORP (US)
LI JIANHUI (US)
QIN ZHENNAN (CN)
GONG JIONG (CN)
CUI JINGZE (CN)
MEI YIJIE (CN)
SONG YUNFEI (CN)
International Classes:
G06F9/22; G06F8/41; G06F9/38; G06F17/16
Domestic Patent References:
WO2016054303A12016-04-07
Foreign References:
US20200410318A12020-12-31
US20210049231A12021-02-18
US20210117806A12021-04-22
Attorney, Agent or Firm:
BEIJING EAST IP LTD. (CN)
Download PDF: