Title:
SPEAKING STATE RECOGNITION METHOD AND APPARATUS, MODEL TRAINING METHOD AND APPARATUS, VEHICLE, MEDIUM, COMPUTER PROGRAM AND COMPUTER PROGRAM PRODUCT
Document Type and Number:
WIPO Patent Application WO/2024/001539
Kind Code:
A1
Abstract:
Disclosed are a speaking state recognition method and apparatus, a model training method and apparatus, a vehicle, a medium, a computer program and a computer program product. The speaking state recognition method comprises: acquiring a facial image frame sequence of a target object (S101); acquiring mouth key point information of each image frame in the facial image frame sequence (S102); determining, on the basis of the mouth key point information, a displacement feature of a mouth key point corresponding to the facial image frame sequence, the displacement feature representing a position change of the mouth key point between a plurality of image frames in the facial image frame sequence (S103); and determining a recognition result of the speaking state of the target object according to the displacement feature (S104).
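Steps S103 and S104 can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not say how the displacement feature is encoded or how the recognition result is derived from it, so the per-key-point Euclidean distance between consecutive frames, the mean-displacement threshold, and the names `displacement_feature`, `recognize_speaking` and `threshold` are all assumptions made for illustration. Face detection and key point extraction (S101, S102) are taken as already done.

```python
from math import hypot
from typing import List, Sequence, Tuple

Point = Tuple[float, float]   # (x, y) of one mouth key point
Frame = Sequence[Point]       # mouth key points of one image frame (output of S102)


def displacement_feature(frames: Sequence[Frame]) -> List[List[float]]:
    """S103 (assumed encoding): distance each key point moves between consecutive frames."""
    if len(frames) < 2:
        raise ValueError("at least two frames are needed to measure displacement")
    feature = []
    for prev, curr in zip(frames, frames[1:]):
        if len(prev) != len(curr):
            raise ValueError("every frame must contain the same key points")
        feature.append(
            [hypot(cx - px, cy - py) for (px, py), (cx, cy) in zip(prev, curr)]
        )
    return feature


def recognize_speaking(frames: Sequence[Frame], threshold: float = 1.0) -> bool:
    """S104 (stand-in rule): report 'speaking' when the mean displacement exceeds a threshold."""
    values = [d for row in displacement_feature(frames) for d in row]
    if not values:
        raise ValueError("frames contain no key points")
    return sum(values) / len(values) > threshold


# Two key points (e.g. upper and lower lip) over three frames.
still = [[(0.0, 0.0), (10.0, 0.0)]] * 3
moving = [
    [(0.0, 0.0), (10.0, 0.0)],
    [(0.0, 3.0), (10.0, 4.0)],
    [(0.0, 0.0), (10.0, 0.0)],
]
print(recognize_speaking(still), recognize_speaking(moving))
```

The fixed threshold only stands in for whatever recognition step is actually used. The International Classes listed for this application (G06N3/04, G06N3/08) suggest a trained neural network model, which would consume the displacement feature in place of the threshold rule.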
Inventors:
FAN DONGYI (CN)
LI XIAOJIE (CN)
WANG FEI (CN)
QIAN CHEN (CN)
Application Number:
PCT/CN2023/093495
Publication Date:
January 04, 2024
Filing Date:
May 11, 2023
Assignee:
SHANGHAI SENSETIME INTELLIGENT TECH CO LTD (CN)
International Classes:
G06V40/16; G06N3/04; G06N3/08; G06V10/774; G06V10/82
Domestic Patent References:
WO2020140723A1 | 2020-07-09
WO2020253051A1 | 2020-12-24
Foreign References:
CN115063867A | 2022-09-16
CN113873195A | 2021-12-31
CN111428672A | 2020-07-17
CN112633208A | 2021-04-09
CN111666820A | 2020-09-15
CN113486760A | 2021-10-08
Attorney, Agent or Firm:
CHINA PAT INTELLECTUAL PROPERTY OFFICE (CN)