Title:
METHOD AND APPARATUS FOR PROCESSING SPEECH SIGNAL ACCORDING TO FREQUENCY DOMAIN ENERGY
Document Type and Number:
WIPO Patent Application WO/2015/139452
Kind Code:
A1
Abstract:
Provided are a method and apparatus for processing speech signal according to frequency domain energy. The method for processing speech signal according to frequency domain energy comprises: receiving an original speech signal comprising a first speech frame and a second speech frame that are adjacent (101); performing Fourier transform on the first speech frame and the second speech frame to obtain a first frequency domain signal and a second frequency domain signal respectively (102); obtaining frequency domain energy distributions of the first speech frame and the second speech frame (103); obtaining a frequency domain energy relevance coefficient of the first speech frame and the second speech frame (104); and segmenting the original speech signal according to the frequency domain energy relevance coefficient (105). A problem of insufficiently high accuracy of a segmentation result of a speech signal caused by the influence of phonemic features of the speech signal or relatively strong noise during fine segmentation of the speech signal can be solved.
Inventors:
XU LIJING (CN)
Application Number:
PCT/CN2014/088654
Publication Date:
September 24, 2015
Filing Date:
October 15, 2014
Export Citation:
Assignee:
HUAWEI TECH CO LTD (CN)
International Classes:
G10L15/04
Domestic Patent References:
WO2013107602A1 | 2013-07-25 |
Foreign References:
CN103594083A | 2014-02-19 | |||
US20060053003A1 | 2006-03-09 | |||
CN103021408A | 2013-04-03 | |||
CN101521009A | 2009-09-02 | |||
CN103458323A | 2013-12-18 |
Other References:
See also references of EP 3091534A4
Attorney, Agent or Firm:
LEADER PATENT & TRADEMARK FIRM (CN)
北京同立钧成知识产权代理有限公司 (CN)
北京同立钧成知识产权代理有限公司 (CN)
Download PDF: