Title:
RESIDUAL DELAY NETWORK-BASED SPEAKER CONFIRMATION METHOD AND APPARATUS, DEVICE AND MEDIUM
Document Type and Number:
WIPO Patent Application WO/2020/224114
Kind Code:
A1
Abstract:
A residual delay network-based speaker confirmation method and apparatus, a device and a medium. Said method comprises: constructing a residual delay network, and training the residual delay network by using a preset training sample set (S101); acquiring an audio information set of a test user, the audio information set comprising registered audio and test audio (S102); performing pre-processing on the audio information set of the test user (S103); performing feature extraction on the pre-processed audio information set to obtain Mel frequency cepstrum coefficients of the registered audio and the test audio, respectively (S104); transmitting the Mel frequency cepstrum coefficient of the registered audio as an input vector to the trained residual delay network, and acquiring a feature vector outputted by the residual delay network at a session slice level as a registered feature vector of the test user (S105); transmitting the Mel frequency cepstrum coefficient of the test audio as an input vector to the trained residual delay network, and acquiring a feature vector outputted by the residual delay network at a session slice level as a feature vector to be tested of the test user (S106); inputting, into a preset probability linear discriminant analysis model, the registered feature vector and the feature vector to be tested, and acquiring a score outputted by the probability linear discrimination analysis model (S107); and outputting a speaker confirmation result according to the score (S108). Said method solves the problem of the poor accuracy of the existing text-independent speaker confirmation method in terms of short audio.
More Like This:
Inventors:
PENG JUNQING (CN)
WANG JIANZONG (CN)
WANG JIANZONG (CN)
Application Number:
PCT/CN2019/103155
Publication Date:
November 12, 2020
Filing Date:
August 29, 2019
Export Citation:
Assignee:
PING AN TECH SHENZHEN CO LTD (CN)
International Classes:
G10L25/84
Foreign References:
CN106683680A | 2017-05-17 | |||
CN108694949A | 2018-10-23 | |||
CN108109613A | 2018-06-01 | |||
CN102034472A | 2011-04-27 | |||
CN109166586A | 2019-01-08 | |||
US20180350351A1 | 2018-12-06 |
Attorney, Agent or Firm:
SHENZHEN ZHONGDING INTELLECTUAL PROPERTY AGENCY (CN)
Download PDF:
Previous Patent: MAGNETIC TAPPING GUIDING DEVICE
Next Patent: PICTURE PROCESSING METHOD AND APPARATUS, COMPUTER DEVICE AND STORAGE MEDIUM
Next Patent: PICTURE PROCESSING METHOD AND APPARATUS, COMPUTER DEVICE AND STORAGE MEDIUM