METHOD AND APPARATUS FOR PROCESSING SPEECH SIGNAL ACCORDING TO FREQUENCY DOMAIN ENERGY

Document Type and Number:

WIPO Patent Application WO/2015/139452

Kind Code:

A1

Abstract:

Provided are a method and an apparatus for processing a speech signal according to frequency domain energy. The method comprises: receiving an original speech signal comprising a first speech frame and a second speech frame that are adjacent (101); performing a Fourier transform on the first speech frame and the second speech frame to obtain a first frequency domain signal and a second frequency domain signal, respectively (102); obtaining frequency domain energy distributions of the first speech frame and the second speech frame (103); obtaining a frequency domain energy relevance coefficient of the first speech frame and the second speech frame (104); and segmenting the original speech signal according to the frequency domain energy relevance coefficient (105). The method addresses the problem that, during fine segmentation of a speech signal, the segmentation result is insufficiently accurate because of the influence of phonemic features of the speech signal or relatively strong noise.
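
The five steps (101) to (105) can be illustrated with a minimal sketch. The abstract does not state how the energy distribution or the relevance coefficient is computed, so the definitions below are assumptions made for illustration only: the distribution is taken as each frequency bin's share of the frame's total energy, the relevance coefficient as the Pearson correlation of two such distributions, and a boundary is placed wherever that coefficient falls below a hypothetical threshold. The function names, the non-overlapping framing, and the threshold value are likewise not from the application.

```python
import cmath
import math


def dft(frame):
    """Naive O(N^2) discrete Fourier transform of a real-valued frame (step 102)."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * cmath.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n)
    ]


def energy_distribution(frame):
    """Share of total energy per non-negative frequency bin (step 103).

    Assumed definition: the abstract does not specify the exact form.
    """
    spectrum = dft(frame)
    energies = [abs(c) ** 2 for c in spectrum[: len(frame) // 2 + 1]]
    total = sum(energies)
    if total == 0:
        return [0.0] * len(energies)  # silent frame: no energy to distribute
    return [e / total for e in energies]


def relevance_coefficient(a, b):
    """Pearson correlation of two distributions (step 104, assumed definition).

    Returns 0.0 when either input has zero variance.
    """
    mean_a = sum(a) / len(a)
    mean_b = sum(b) / len(b)
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def segment_boundaries(signal, frame_len, threshold=0.5):
    """Sample indices where adjacent frames decorrelate (steps 101 and 105).

    The signal is cut into non-overlapping frames of frame_len samples;
    a trailing partial frame is ignored. The threshold is hypothetical.
    """
    frames = [
        signal[i:i + frame_len]
        for i in range(0, len(signal) - frame_len + 1, frame_len)
    ]
    dists = [energy_distribution(f) for f in frames]
    return [
        (i + 1) * frame_len
        for i in range(len(dists) - 1)
        if relevance_coefficient(dists[i], dists[i + 1]) < threshold
    ]
```

Under these assumptions, a signal made of two consecutive pure tones at different frequencies, framed so that each frame holds a whole number of periods, yields one boundary at the sample where the tone changes, while a single unchanging tone yields none. A practical implementation would more likely use an FFT, overlapping windowed frames, and a threshold tuned on real speech.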

Inventors:

XU LIJING (CN)

Application Number:

PCT/CN2014/088654

Publication Date:

September 24, 2015

Filing Date:

October 15, 2014

Assignee:

HUAWEI TECH CO LTD (CN)

International Classes:

G10L15/04

Domestic Patent References:

WO2013107602A1	2013-07-25

Foreign References:

CN103594083A	2014-02-19
US20060053003A1	2006-03-09
CN103021408A	2013-04-03
CN101521009A	2009-09-02
CN103458323A	2013-12-18

Other References: