Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
知能客体強化学習方法、装置、デバイス、及び媒体
Document Type and Number:
Japanese Patent JP7163477
Kind Code:
B2
Abstract:
Embodiments of the present disclosure disclose an intelligent agent reinforcement learning method and apparatus, a device, and a medium. The method includes: acquiring key visual information on which an intelligent agent makes a policy for a current environment image; acquiring actual key visual information of the current environment image; determining attention variation reward information based on the key visual information and the actual key visual information; and adjusting reward feedback of reinforcement learning of the intelligent agent based on the attention variation reward information.

Inventors:
▲劉▼ 春▲曉▼
Hiroshi Xue
▲張▼ ▲偉▼
林 ▲チン▼
Application Number:
JP2021500797A
Publication Date:
October 31, 2022
Filing Date:
July 16, 2019
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SHENZHEN SENSETIME TECHNOLOGY CO., LTD.
International Classes:
G06N20/00
Domestic Patent References:
JP2010287131A
Foreign References:
WO2018083672A1
Attorney, Agent or Firm:
Patent Business Corporation Unias International Patent Office