Title:
RESIDUAL DELAY NETWORK-BASED SPEAKER CONFIRMATION METHOD AND APPARATUS, DEVICE AND MEDIUM
Document Type and Number:
WIPO Patent Application WO/2020/224114
Kind Code:
A1
Abstract:
A speaker confirmation method and apparatus based on a residual delay network, together with a device and a medium. The method comprises: constructing a residual delay network and training it on a preset training sample set (S101); acquiring an audio information set of a test user, the set comprising registered audio and test audio (S102); pre-processing the audio information set of the test user (S103); extracting features from the pre-processed audio information set to obtain Mel-frequency cepstral coefficients (MFCCs) of the registered audio and of the test audio, respectively (S104); feeding the MFCCs of the registered audio as an input vector to the trained residual delay network, and taking the feature vector that the network outputs at the session slice level as the registered feature vector of the test user (S105); feeding the MFCCs of the test audio as an input vector to the trained residual delay network, and taking the feature vector that the network outputs at the session slice level as the feature vector to be tested of the test user (S106); inputting the registered feature vector and the feature vector to be tested into a preset probabilistic linear discriminant analysis (PLDA) model, and acquiring the score that the model outputs (S107); and outputting a speaker confirmation result according to the score (S108). The method addresses the poor accuracy of existing text-independent speaker confirmation methods on short audio.
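The flow of steps S105 to S108 can be illustrated with a minimal Python sketch. It is not the implementation claimed in the application: the names `mean_pool`, `cosine_score`, `verify` and the threshold value are hypothetical, mean pooling merely stands in for the trained residual delay network, and cosine similarity stands in for the discriminant analysis scoring model of S107. Only the overall shape (embed the registered audio, embed the test audio, score the pair, compare against a threshold) follows the abstract.

```python
import math
from typing import Callable, List, Sequence

Frame = Sequence[float]  # one frame of acoustic features, e.g. MFCCs (S104)


def mean_pool(frames: Sequence[Frame]) -> List[float]:
    """Hypothetical stand-in for the trained network (S105/S106):
    average frame-level features into one fixed-length vector."""
    if not frames:
        raise ValueError("need at least one frame")
    dim = len(frames[0])
    return [sum(f[i] for f in frames) / len(frames) for i in range(dim)]


def cosine_score(a: Sequence[float], b: Sequence[float]) -> float:
    """Hypothetical stand-in for the scoring model (S107):
    cosine similarity between two feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector carries no direction to compare
    return dot / (norm_a * norm_b)


def verify(
    registered_frames: Sequence[Frame],
    test_frames: Sequence[Frame],
    threshold: float = 0.7,  # assumed value, chosen only for illustration
    embed: Callable[[Sequence[Frame]], List[float]] = mean_pool,
    score: Callable[[Sequence[float], Sequence[float]], float] = cosine_score,
) -> bool:
    """Accept the test audio if its score against the registered audio
    reaches the threshold (S108)."""
    registered_vector = embed(registered_frames)  # S105
    test_vector = embed(test_frames)              # S106
    return score(registered_vector, test_vector) >= threshold  # S107, S108


if __name__ == "__main__":
    registered = [[1.0, 0.0], [0.8, 0.2]]
    print(verify(registered, [[0.9, 0.1]]))
    print(verify(registered, [[0.0, 1.0]]))
```

Passing `embed` and `score` as parameters keeps the decision logic separate from the two models, so a real network and a real scoring model could replace the placeholders without changing `verify`.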

Inventors:
PENG JUNQING (CN)
WANG JIANZONG (CN)
Application Number:
PCT/CN2019/103155
Publication Date:
November 12, 2020
Filing Date:
August 29, 2019
Assignee:
PING AN TECH SHENZHEN CO LTD (CN)
International Classes:
G10L25/84
Foreign References:
CN106683680A    2017-05-17
CN108694949A    2018-10-23
CN108109613A    2018-06-01
CN102034472A    2011-04-27
CN109166586A    2019-01-08
US20180350351A1    2018-12-06
Attorney, Agent or Firm:
SHENZHEN ZHONGDING INTELLECTUAL PROPERTY AGENCY (CN)