Title:
SPEECH EVENT DETECTION METHOD AND APPARATUS, ELECTRONIC DEVICE, AND COMPUTER STORAGE MEDIUM
Document Type and Number:
WIPO Patent Application WO/2022/116420
Kind Code:
A1
Abstract:
A speech event detection method, a speech event detection apparatus (100), an electronic device (1), and a computer readable storage medium, relating to the artificial intelligence technology. The method comprises: obtaining an audio under detection, and performing acoustic feature extraction on the audio to obtain a speech frame feature sequence (S1); performing feature analysis on the speech frame feature sequence by using a self-attention mechanism-based classification model to obtain a hidden state sequence to be identified (S2); performing event identification on the hidden state sequence by using the classification model to obtain an event label sequence (S3); and performing smoothing processing on the event label sequence to obtain a speech event detection result corresponding to a speech under detection (S4). The present invention relates to the blockchain technology, and an audio under detection is stored in a blockchain node. Stability and accuracy of speech event detection are improved.
More Like This:
Inventors:
LUO JIAN (CN)
WANG JIANZONG (CN)
CHENG NING (CN)
WANG JIANZONG (CN)
CHENG NING (CN)
Application Number:
PCT/CN2021/082872
Publication Date:
June 09, 2022
Filing Date:
March 25, 2021
Export Citation:
Assignee:
PING AN TECH SHENZHEN CO LTD (CN)
International Classes:
G10L25/24; G10L25/51; G10L25/30; G10L25/45
Foreign References:
CN112447189A | 2021-03-05 | |||
CN110827804A | 2020-02-21 | |||
CN110929092A | 2020-03-27 | |||
CN111753549A | 2020-10-09 | |||
US10783434B1 | 2020-09-22 |
Attorney, Agent or Firm:
SHENZHEN WORLD INTELLECTUAL PROPERTY AGENCY (GENERAL PARTNERSHIP ) (CN)
Download PDF: