Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
COMBINED LEARNING METHOD AND DEVICE USING TRANSFORMED LOSS FUNCTION AND FEATURE ENHANCEMENT BASED ON DEEP NEURAL NETWORK FOR SPEAKER RECOGNITION THAT IS ROBUST TO NOISY ENVIRONMENT
Document Type and Number:
WIPO Patent Application WO/2020/256257
Kind Code:
A2
Abstract:
Presented are a combined learning method and device using a transformed loss function and feature enhancement based on a deep neural network for speaker recognition that is robust to a noisy environment. The combined learning method using the transformed loss function and the feature enhancement based on the deep neural network for speaker recognition that is robust to the noisy environment, according to an embodiment, may comprise: a preprocessing step for learning to receive, as an input, a speech signal and remove a noise or reverberation component by using at least one of a beamforming algorithm and a dereverberation algorithm using the deep neural network; a speaker embedding step for learning to classify an utterer from the speech signal, from which the noise or reverberation component has been removed, by using a speaker embedding model based on the deep neural network; and a step for, after connecting a deep neural network model included in at least one of the beamforming algorithm and the dereverberation algorithm and the speaker embedding model, for speaker embedding, based on the deep neural network, performing combined learning by using a loss function.

Inventors:
CHANG JOON-HYUK (KR)
YANG JOONYOUNG (KR)
Application Number:
PCT/KR2020/004281
Publication Date:
December 24, 2020
Filing Date:
March 30, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
UNIV HANYANG IND UNIV COOP FOUND (KR)
International Classes:
G10L15/20
Attorney, Agent or Firm:
YANG, Sungbo (KR)
Download PDF: