Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
IMAGE DESCRIPTION METHOD AND APPARATUS, COMPUTER DEVICE, AND STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2023/178801
Kind Code:
A1
Abstract:
Embodiments of the present application relate to the technical field of artificial intelligence, and provide an image description method and apparatus, a computer device, and a storage medium. The method comprises: acquiring an original image, and performing feature extraction on the original image to obtain an image feature; performing region detection on the original image according to the image feature to obtain a target region image; performing feature extraction on the target region image to obtain a region feature vector; performing extraction on the region feature vector by means of a subject generation model to obtain subject data, the subject data comprising a subject word vector and moment state information; performing word prediction on the subject data by means of a word generation model to obtain description words; and splicing the description words according to the moment state information to obtain a target description text for describing the original image. By means of multiple feature extractions, a target description text can contain more image details, and a description text having coherent semantics is generated hierarchically by utilizing a subject generation model and a word generation model.

Inventors:
SHU CHANG (CN)
CHEN YOUXIN (CN)
Application Number:
PCT/CN2022/090723
Publication Date:
September 28, 2023
Filing Date:
April 29, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
PING AN TECH SHENZHEN CO LTD (CN)
International Classes:
G06V10/40
Foreign References:
CN111753078A2020-10-09
CN113468357A2021-10-01
CN111444968A2020-07-24
CN113035311A2021-06-25
Attorney, Agent or Firm:
JIAQUAN IP LAW (CN)
Download PDF: