Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
IMAGE SEGMENTATION METHOD AND APPARATUS, AND DEVICE, AND STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2022/089115
Kind Code:
A1
Abstract:
Disclosed are an image segmentation method and apparatus, and a device, and a storage medium. The image segmentation method comprises: fusing a visual feature corresponding to an original image and a text feature corresponding to a description language to obtain a multi-modal feature, the description language being used for designating a target object to be segmented in the original image; determining a visual region of the target object according to an image corresponding to the multi-modal feature, and recording an image corresponding to the visual region as a response heat map; and determining a segmentation result of the target object according to the image corresponding to the multi-modal feature and the response heat map.

Inventors:
KONG TAO (CN)
JING YA (CN)
LI LEI (CN)
Application Number:
PCT/CN2021/120815
Publication Date:
May 05, 2022
Filing Date:
September 27, 2021
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
BEIJING YOUZHUJU NETWORK TECH CO LTD (CN)
International Classes:
G06T7/11
Foreign References:
CN112184738A2021-01-05
CN110555337A2019-12-10
CN110390289A2019-10-29
US20180268548A12018-09-20
Other References:
YA JING; TAO KONG; WEI WANG; LIANG WANG; LEI LI; TIENIU TAN: "Locate then Segment: A Strong Pipeline for Referring Image Segmentation", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 30 March 2021 (2021-03-30), 201 Olin Library Cornell University Ithaca, NY 14853 , XP081919270
RONGHANG HU; MARCUS ROHRBACH; TREVOR DARRELL: "Segmentation from Natural Language Expressions", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 20 March 2016 (2016-03-20), 201 Olin Library Cornell University Ithaca, NY 14853 , XP080690843
GEN LUO; YIYI ZHOU; XIAOSHUAI SUN; LIUJUAN CAO; CHENGLIN WU; CHENG DENG; RONGRONG JI: "Multi-task Collaborative Network for Joint Referring Expression Comprehension and Segmentation", ARXIV.ORG, CORNELL UNIVERSITY LIBRARY, 201 OLIN LIBRARY CORNELL UNIVERSITY ITHACA, NY 14853, 19 March 2020 (2020-03-19), 201 Olin Library Cornell University Ithaca, NY 14853 , XP081624822
WANG FEI; JIANG MENGQING; QIAN CHEN; YANG SHUO; LI CHENG; ZHANG HONGGANG; WANG XIAOGANG; TANG XIAOOU: "Residual Attention Network for Image Classification", 2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), IEEE COMPUTER SOCIETY, US, 21 July 2017 (2017-07-21), US , pages 6450 - 6458, XP033250009, ISSN: 1063-6919, DOI: 10.1109/CVPR.2017.683
Attorney, Agent or Firm:
BEYOND ATTORNEYS AT LAW (CN)
Download PDF: