Title:
Speech Segment Detection Method
Document Type and Number:
Japanese Patent JP4795919
Kind Code:
B2
Abstract:
Provided are an apparatus and method for speech segment detection, and a system for speech recognition. The apparatus is equipped with a sound receiver and an image receiver and includes: a lip motion signal detector that detects a motion region in the image frames output from the image receiver, applies lip motion image feature information to the detected motion region, and detects a lip motion signal; and a speech segment detector that detects a speech segment using the sound frames output from the sound receiver and the lip motion signal detected by the lip motion signal detector. Because lip motion image information is checked during speech segment detection, dynamic noise can be prevented from being misrecognized as speech.
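The combination described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the function name `detect_speech_segments`, the per-frame energy input, the boolean lip motion flags, and the fixed energy threshold are all assumptions made for illustration. The sketch marks a frame as speech only when the sound frame is energetic and lip motion is present in the same frame, so a loud noise without lip motion is not reported as speech.

```python
from typing import List, Sequence, Tuple


def detect_speech_segments(
    frame_energies: Sequence[float],
    lip_motion: Sequence[bool],
    energy_threshold: float,
) -> List[Tuple[int, int]]:
    """Return (start, end) frame index pairs, end exclusive.

    Hypothetical helper: a frame counts as speech only if its energy
    exceeds the threshold AND lip motion was detected for that frame.
    """
    if len(frame_energies) != len(lip_motion):
        raise ValueError("sound and lip motion sequences must have equal length")

    segments: List[Tuple[int, int]] = []
    start = None  # index where the current speech segment began, if any
    for i, (energy, moving) in enumerate(zip(frame_energies, lip_motion)):
        is_speech = energy > energy_threshold and moving
        if is_speech and start is None:
            start = i  # a segment opens at this frame
        elif not is_speech and start is not None:
            segments.append((start, i))  # the segment closes before this frame
            start = None
    if start is not None:
        # The input ended while a segment was still open.
        segments.append((start, len(frame_energies)))
    return segments


# Frames 3 and 5 are loud but show no lip motion, so they are rejected as noise.
energies = [0.1, 0.9, 0.8, 0.9, 0.1, 0.9, 0.9]
lips = [False, True, True, False, False, False, True]
segments = detect_speech_segments(energies, lips, energy_threshold=0.5)
```

A real system would derive the lip motion flags from image frames and would likely smooth both signals over time; the sketch assumes both are already aligned frame by frame.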
Inventors:
Lee, Sue, John
Kim, Sun, Hun
Lee, Yang, Jiku
Kim, Yun, Kyu
Application Number:
JP2006329871A
Publication Date:
October 19, 2011
Filing Date:
December 06, 2006
Assignee:
Electronics and Telecommunications Research Institute
International Classes:
G10L15/04; G10L15/28
Domestic Patent References:
JP2004310047A
JP2004271620A
JP9198082A
JP2002091466A
Other References:
Yoshihide BAN, "End Point Detection Using Driver's Facial Image Sequence Taken in Vehicles," IEICE Technical Report, Vol. 102, No. 708, Japan, The Institute of Electronics, Information and Communication Engineers, April 16, 2003, pp. 111-116
Attorney, Agent or Firm:
Kenji Yoshitake
Hidetoshi Tachibana
Yasukazu Sato
Hiroshi Yoshimoto
Yasushi Kawasaki
Tomoya Deguchi