Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
音声認識方法、装置及びコンピュータプログラム
Document Type and Number:
Japanese Patent JP7282442
Kind Code:
B2
Abstract:
Provided are a voice recognition method, device, and computer-readable storage medium, said method comprising: obtaining a first loss function of a voice separation enhancement model and a second loss function of a voice recognition model (S202); performing back-propagation on the basis of the second loss function to train an intermediate model bridged between the voice separation enhancement model and the voice recognition model, to obtain a robust representation model (S204); combining the first loss function and the second loss function to obtain a target loss function (S206); performing joint training of the voice separation enhancement model, the robust representation model, and the voice recognition model on the basis of the target loss function, and ending training when a preset convergence condition is satisfied (S208).

Inventors:
王 ▲ジュン▼
林 永▲業▼
Application Number:
JP2022520112A
Publication Date:
May 29, 2023
Filing Date:
November 12, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
TENCENT TECHNOLOGY(SHENZHEN)COMPANY LIMITED
International Classes:
G10L15/06; G10L15/065; G10L15/16; G10L15/20
Domestic Patent References:
JP2019078857A
Foreign References:
WO2019198265A1
US20180053087
US20190043516
Other References:
Max W.Y.Lam et. al.,Extract, Adapt and Recognize: an End-to-end Neural Network for Corrupted Monaural Speech Recognition,INTERSPEECH 2019,2019年09月15日,pp.2778-2782
Attorney, Agent or Firm:
Shinya Mihiro
Naoki Matsuo