トピック境界決定方法及び装置及びトピック境界決定プログラム - Nippon Telegraph and Telephone Corporation

Title:

トピック境界決定方法及び装置及びトピック境界決定プログラム

Document Type and Number:

Japanese Patent JP4175093

Kind Code:

B2

Abstract:

To detect a meaning boundary of a phrase etc., consisting of a word groups of a speech recognition result.

For a word string generated by employing NBEST candidates for respective speech segments from inputted speech recognition result data, merging word sets included in the respective NBEST candidates by the respective speech segments and sorting words in the increasing order of start time information on the words, and removing unnecessary words from word strings, and connecting word strings of all speech segments, windows as ranges of words in word strings each constituting of a fixed number of words are specified before and after each word border, vectors representing meanings of windows are calculated by the windows, and the similarity, beginning with a cosine measure, between vectors corresponding to two successive windows is calculated as the degree of bundling, thereby approving a speech segment boundary right nearby a minimum point as a top boundary.

Inventors:

Katsuto Bessho

Application Number:

JP2002323090A

Publication Date:

November 05, 2008

Filing Date:

November 06, 2002

Export Citation:

Click for automatic bibliography generation Help

Assignee:

Nippon Telegraph and Telephone Corporation

International Classes:

G06F17/28; G10L15/183; G10L15/04; G10L15/18; G10L15/193; G10L25/00

Domestic Patent References:

JP2000099089A
JP2001154936A
JP1276266A

Foreign References:

WO2002027546A1

Other References:

緒方淳他,”講義データを対象とした音声認識と構造化の検討”,情報処理学会研究報告,SLP37-14(2001-07),p.79-84
西澤信一郎他,”名詞の文書内頻度を利用したテキストセグメンテーション”,情報処理学会研究報告,NL117-20(1997-01),p.145-152

Attorney, Agent or Firm:

Tadahiko Ito

Previous Patent: 光ヘッド装置および光学式情報記録再生装置

Next Patent: METHOD FOR PROJECTING STEREOSCOPIC IMAGE IN SPACE WITHOUT USING SCREEN