Title:
VOICE OR SPEECH RECOGNITION USING CONTEXTUAL INFORMATION AND USER EMOTION
Document Type and Number:
WIPO Patent Application WO/2023/004561
Kind Code:
A1
Abstract:
A method of voice or speech recognition in varied environments and/or user emotional states, executed by a processor of a computing device, is provided. The method comprises: determining a voice or speech recognition threshold based on information obtained from contextual information detected in an environment from which a received audio input was captured by the computing device and from an emotional classification of a user's voice in the received audio input (310); determining a confidence score for one or more key words identified in the received audio input (312); and outputting results of a voice or speech recognition analysis of the received audio input in response to the determined confidence score exceeding the determined voice or speech recognition threshold (314).
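The three steps in the abstract (310, 312, 314) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the application: the base threshold, the context and emotion labels, the adjustment values and their direction, and the function names. Step 312 is not implemented; the sketch assumes a recognizer has already produced a confidence score per key word.

```python
from typing import Dict, List

# Hypothetical values: the abstract gives no numbers, labels, or direction
# of adjustment. These tables exist only to make the sketch runnable.
BASE_THRESHOLD = 0.80
CONTEXT_ADJUST = {"quiet": 0.0, "noisy": -0.10}
EMOTION_ADJUST = {"neutral": 0.0, "angry": -0.05}


def determine_threshold(context: str, emotion: str) -> float:
    """Step 310 (sketch): derive a recognition threshold from the detected
    context and the emotional classification of the user's voice."""
    threshold = (
        BASE_THRESHOLD
        + CONTEXT_ADJUST.get(context, 0.0)
        + EMOTION_ADJUST.get(emotion, 0.0)
    )
    # Keep the threshold inside the valid confidence range [0, 1].
    return min(1.0, max(0.0, threshold))


def recognize(keyword_scores: Dict[str, float], context: str, emotion: str) -> List[str]:
    """Steps 312 and 314 (sketch): given per-key-word confidence scores
    (assumed to come from an upstream recognizer), output only the key
    words whose score exceeds the determined threshold."""
    threshold = determine_threshold(context, emotion)
    return [kw for kw, score in keyword_scores.items() if score > threshold]
```

With these assumed values, a key word scored at 0.75 would be rejected in a quiet, neutral setting (threshold 0.80) but accepted in a noisy one (threshold about 0.70). Whether the threshold should move down or up in a given context is a design choice the abstract leaves open.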
Inventors:
WEI JUN (US)
DONG XIAOXIA (US)
PAN QIMENG (US)
JIN KWIHYUK (US)
TANG TONG (US)
Application Number:
PCT/CN2021/108563
Publication Date:
February 02, 2023
Filing Date:
July 27, 2021
Assignee:
QUALCOMM INC (US)
WEI JUN (CN)
DONG XIAOXIA (CN)
PAN QIMENG (CN)
JIN KWIHYUK (US)
TANG TONG (US)
International Classes:
G10L17/20
Foreign References:
US20130304478A1 | 2013-11-14
US20200175993A1 | 2020-06-04
CN112735437A | 2021-04-30
CN108305633A | 2018-07-20
CN105556920A | 2016-05-04
CN102254551A | 2011-11-23 |
Attorney, Agent or Firm:
NTD PATENT & TRADEMARK AGENCY LTD. (CN)