音声認識方法、装置及びコンピュータプログラム - TENCENT TECHNOLOGY(SHENZHEN)COMPANY LIMITED

Title:

音声認識方法、装置及びコンピュータプログラム

Document Type and Number:

Japanese Patent JP7282442

Kind Code:

B2

Abstract:

Provided are a voice recognition method, device, and computer-readable storage medium, said method comprising: obtaining a first loss function of a voice separation enhancement model and a second loss function of a voice recognition model (S202); performing back-propagation on the basis of the second loss function to train an intermediate model bridged between the voice separation enhancement model and the voice recognition model, to obtain a robust representation model (S204); combining the first loss function and the second loss function to obtain a target loss function (S206); performing joint training of the voice separation enhancement model, the robust representation model, and the voice recognition model on the basis of the target loss function, and ending training when a preset convergence condition is satisfied (S208).

Inventors:

王 ▲ジュン▼
林永▲業▼

Application Number:

JP2022520112A

Publication Date:

May 29, 2023

Filing Date:

November 12, 2020

Export Citation:

Click for automatic bibliography generation Help

Assignee:

TENCENT TECHNOLOGY(SHENZHEN)COMPANY LIMITED

International Classes:

G10L15/06; G10L15/065; G10L15/16; G10L15/20

Domestic Patent References:

JP2019078857A

Foreign References:

WO2019198265A1
US20180053087
US20190043516

Other References:

Max W.Y.Lam et. al.,Extract, Adapt and Recognize: an End-to-end Neural Network for Corrupted Monaural Speech Recognition,INTERSPEECH 2019,2019年09月15日,pp.2778-2782

Attorney, Agent or Firm:

Shinya Mihiro
Naoki Matsuo

Previous Patent: Irreversible additive contained in positive electrode material for secondary battery, positive elect...

Next Patent: Assembly method of brushless motor generator