Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
トピック境界決定方法及び装置及びトピック境界決定プログラム
Document Type and Number:
Japanese Patent JP4175093
Kind Code:
B2
Abstract:

To detect a meaning boundary of a phrase etc., consisting of a word groups of a speech recognition result.

For a word string generated by employing NBEST candidates for respective speech segments from inputted speech recognition result data, merging word sets included in the respective NBEST candidates by the respective speech segments and sorting words in the increasing order of start time information on the words, and removing unnecessary words from word strings, and connecting word strings of all speech segments, windows as ranges of words in word strings each constituting of a fixed number of words are specified before and after each word border, vectors representing meanings of windows are calculated by the windows, and the similarity, beginning with a cosine measure, between vectors corresponding to two successive windows is calculated as the degree of bundling, thereby approving a speech segment boundary right nearby a minimum point as a top boundary.

COPYRIGHT: (C)2004,JPO


Inventors:
Katsuto Bessho
Application Number:
JP2002323090A
Publication Date:
November 05, 2008
Filing Date:
November 06, 2002
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
Nippon Telegraph and Telephone Corporation
International Classes:
G06F17/28; G10L15/183; G10L15/04; G10L15/18; G10L15/193; G10L25/00
Domestic Patent References:
JP2000099089A
JP2001154936A
JP1276266A
Foreign References:
WO2002027546A1
Other References:
緒方淳他,”講義データを対象とした音声認識と構造化の検討”,情報処理学会研究報告,SLP37-14(2001-07),p.79-84
西澤信一郎他,”名詞の文書内頻度を利用したテキストセグメンテーション”,情報処理学会研究報告,NL117-20(1997-01),p.145-152
Attorney, Agent or Firm:
Tadahiko Ito