多音源有音区間判定装置、方法、プログラム及びその記録媒体 - Nippon Telegraph and Telephone Corporation

Title:

多音源有音区間判定装置、方法、プログラム及びその記録媒体

Document Type and Number:

Japanese Patent JP4746533

Kind Code:

B2

Abstract:

To provide technology for determining a speech interval of each speaker, from speech signals of multiple speakers in the same sound interval, which is collected by a plurality of microphones.

A noise power estimation section 2 estimates noise power in a voiceless-sound interval, for each combination of the microphone and frequency, from each observation signal for each time frequency, which is respectively input by the plurality of microphones and converted to a frequency domain. An observation signal classification section 3 classifies an observation signal vector for each time frequency, in which each observation signal is a component, by using the estimated noise power and each observation signal, and its classification results are output. A signal separation section 4 separates each observation signal into a signal for each sound source by using the classification results. A voiced sound interval determination section 5 determines the voiced sound interval or the voiceless-sound interval of each sound source from the separated signal for each sound source.

Inventors:

Hiroshi Sawada
Akiko Araki
Kazuhiro Otsuka
Masamoto Fujimoto
Kentaro Ishizuka

Application Number:

JP2006344045A

Publication Date:

August 10, 2011

Filing Date:

December 21, 2006

Export Citation:

Click for automatic bibliography generation Help

Assignee:

Nippon Telegraph and Telephone Corporation

International Classes:

G10L21/0208; G10L21/0224; G10L21/0232; G10L21/028; G10L25/78

Domestic Patent References:

JP64081997A
JP2002236494A
JP2004170552A
JP2006208482A

Foreign References:

WO2005024788A1

Other References:

荒木章子他,"観測信号ベクトル正規化とクラスタリングによる音源分離手法とその評価",日本音響学会2005年秋季研究発表会講演論文集,2005年 9月20日,p.591-592

Attorney, Agent or Firm:

Naoki Nakao
Taku Kusano
Yukio Nakamura

Previous Patent: 線形予測係数算出方法、及びその装置とそのプログラムと、その記憶媒体

Next Patent: 銀ナノ粒子の製造方法