To provide clustering technology for accurately estimating the number of speakers and a parameter for characterizing each speaker, in dialization.
A clustering calculation device 3 includes: an observation amount creation section 26 which reads a power vector extracted from recorded conversation data, to convert it to an observation amount of the vector corresponding to dynamic Hierarchical Dirichlet Process (dHDP) approximation model; a storage means 10 for accumulating and storing a collection data of the converted observation amount; a variation post-distribution inference section 30 in which a value of post-distribution of a plurality of parameters when the plurality of clusters are created from the collection data of the observation amount by dHDP approximation model, is respectively estimated by an expectation-maximization (EM) algorithm, and sequentially stored and updated in the storage means 10; and an output control section 28 for outputting a latest estimation value of the post-distribution of the plurality of parameters, which are stored in the storage means 10, when a finishing condition set beforehand is satisfied.
COPYRIGHT: (C)2010,JPO&INPIT
Samurai Yamada
Akiko Araki
Tomohiro Nakatani
JP2008203474A |
S.Araki, M.Fujimoto, K.Ishizuka, H.Sawada, S.Makino,A DOA Based Speaker Diarization System for Real Meetings,Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008,IEEE,2008年 5月
荒木章子,澤田宏,向井良,牧野昭二,観測信号ベクトル正規化とクラスタリングによる音源分離手法とその評価,日本音響学会 2005年 秋季研究発表会講演論文集CD-ROM,2005年 9月27日,pp.591-592
L. Ren, D. B. Dunson and L. Carin,,“The Dynamic Hierarchical Dirichlet process”,,Proceedings of International Conference on Machine Learning,,2008年,pp.824-831
石黒 勝彦,ノンパラメトリックベイズを用いた会議音声話者識別のための話者クラスタリング法,日本音響学会 2009年 春季研究発表会講演論文集CD-ROM [CD-ROM],2009年 3月
J. M. Pardo, X. Anguera, C. Wooters,Speaker Diarization for Multi-Microphone Meetings Using Only Between-Channel Differences,Proceedings of the Third Joint Workshop on Multimodal Interaction and Related machine Learning Algorithms,2008年
Megumi Oishi
Shinji Nakamura