


Title:
METHOD AND DEVICE FOR PUSHING OBJECT TO USER BASED ON REINFORCEMENT LEARNING MODEL
Document Type and Number:
WIPO Patent Application WO/2020/220757
Kind Code:
A1
Abstract:
A method and device for determining a push object list for a user based on a reinforcement learning model. The method comprises: for each group of object lists, obtaining the i-th state feature vector (S202); inputting the i-th state feature vector into the reinforcement learning model so that the model outputs a weight vector corresponding to the i-th state feature vector (S204); obtaining sorting feature vectors of the objects in a candidate object set corresponding to the group of object lists (S206); calculating a score for each object in the candidate object set as the dot product of that object's sorting feature vector and the weight vector (S208); and, for the M groups of object lists, determining updated M groups of object lists on the basis of the scores of the objects in the M candidate object sets corresponding to the M groups of object lists (S210), wherein each group of object lists in the updated M groups of object lists comprises i objects.
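The steps S202 to S210 can be illustrated with a minimal Python sketch. It is not the applicants' implementation: the names `score_candidates`, `extend_lists`, `policy`, `state_of` and `candidates_of` are hypothetical, the reinforcement learning model is replaced by an arbitrary callable, and the rule used in S210 (keep the M highest-scoring extended lists) is an assumption, since the abstract only says the update is based on the scores.

```python
from typing import Callable, Dict, List, Sequence

Vector = Sequence[float]


def dot(a: Vector, b: Vector) -> float:
    """Dot product of two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    return sum(x * y for x, y in zip(a, b))


def score_candidates(weight: Vector, features: Dict[str, Vector]) -> Dict[str, float]:
    """S208: score each candidate as feature vector . weight vector."""
    return {obj: dot(vec, weight) for obj, vec in features.items()}


def extend_lists(
    lists: List[List[str]],
    policy: Callable[[Vector], Vector],                   # stand-in for the RL model
    state_of: Callable[[List[str]], Vector],              # hypothetical state builder
    candidates_of: Callable[[List[str]], Dict[str, Vector]],  # hypothetical candidate lookup
    m: int,
) -> List[List[str]]:
    """Grow each list by one object and keep m lists (assumed top-m rule)."""
    extended = []
    for objs in lists:
        weight = policy(state_of(objs))               # S202, S204
        features = candidates_of(objs)                # S206
        scores = score_candidates(weight, features)   # S208
        for obj, score in scores.items():
            extended.append((score, objs + [obj]))
    # S210 (assumption): rank all extended lists by the score of the added object.
    extended.sort(key=lambda pair: pair[0], reverse=True)
    return [objs for _, objs in extended[:m]]
```

Calling `extend_lists` repeatedly, starting from M short lists, yields lists of length i after each round, which mirrors the abstract's statement that every updated list comprises i objects.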

Inventors:
CHEN CEN (CN)
HU XU (CN)
FU CHILIN (CN)
ZHANG XIAOLU (CN)
Application Number:
PCT/CN2020/071699
Publication Date:
November 05, 2020
Filing Date:
January 13, 2020
Assignee:
ALIBABA GROUP HOLDING LTD (CN)
International Classes:
G06F16/9535
Foreign References:
CN110263245A 2019-09-20
CN108304440A 2018-07-20
CN104869464A 2015-08-26
CN108805594A 2018-11-13
US20140297476A1 2014-10-02
Attorney, Agent or Firm:
BEIJING BESTIPR INTELLECTUAL PROPERTY LAW CORPORATION (CN)