Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
NETWORK TRAINING METHOD AND APPARATUS, ROBOT CONTROL METHOD AND APPARATUS, DEVICE, STORAGE MEDIUM, AND PROGRAM
Document Type and Number:
WIPO Patent Application WO/2023/123838
Kind Code:
A1
Abstract:
A network training method and apparatus, a robot control method and apparatus, an electronic device, a computer storage medium, and a computer program. The training method comprises: acquiring environmental state information in a target application scenario (S101); according to the environmental state information and a pre-trained reinforcement learning network, obtaining action sequence information, and determining a total reward return value corresponding to the action sequence information, the action sequence information being used for indicating at least two continuous execution actions within a future preset duration (S102); and on the basis of the total reward return value, adjusting a network parameter value of the reinforcement learning network so as to obtain a trained reinforcement learning network (S103).

Inventors:
LI CHUMING (CN)
LIU YU (CN)
WANG XIAOGANG (CN)
Application Number:
PCT/CN2022/094863
Publication Date:
July 06, 2023
Filing Date:
May 25, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SHANGHAI SENSETIME INTELLIGENT TECH CO LTD (CN)
International Classes:
G05B13/04; G06N3/04; G06N3/08
Foreign References:
CN114397817A2022-04-26
CN111612126A2020-09-01
CN113156892A2021-07-23
CN113326872A2021-08-31
US20210397961A12021-12-23
CN112882469A2021-06-01
US20210247744A12021-08-12
CN111602144A2020-08-28
CN113077052A2021-07-06
Attorney, Agent or Firm:
CHINA PAT INTELLECTUAL PROPERTY OFFICE (CN)
Download PDF: