Speech Segment Detection Method - Electronics and Telecommunications Research Institute

Title:

Speech Segment Detection Method

Document Type and Number:

Japanese Patent JP4795919

Kind Code:

B2

Abstract:

Provided are an apparatus and method for speech segment detection, and a system for speech recognition. The apparatus is equipped with a sound receiver and an image receiver and includes: a lip motion signal detector that detects a motion region in the image frames output from the image receiver, applies lip motion image feature information to the detected motion region, and detects a lip motion signal; and a speech segment detector that detects a speech segment using the sound frames output from the sound receiver and the lip motion signal detected by the lip motion signal detector. Because lip motion image information is checked during speech segment detection, dynamic noise can be prevented from being misrecognized as speech.
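
The idea of gating acoustic detection with a lip motion signal can be illustrated with a minimal sketch. This is not the patented method: it assumes a simple per-frame energy threshold for the acoustic decision (the abstract does not say how sound frames are evaluated), and it assumes the lip motion signal is already available as one boolean per sound frame. The names `detect_speech_segments`, `frame_energies`, `lip_motion`, and `energy_threshold` are hypothetical.

```python
from typing import List, Sequence, Tuple


def detect_speech_segments(
    frame_energies: Sequence[float],
    lip_motion: Sequence[bool],
    energy_threshold: float,
) -> List[Tuple[int, int]]:
    """Return (start, end) frame index pairs, with end exclusive.

    A frame counts as speech only if its energy exceeds the threshold
    AND lip motion was detected for that frame. The energy threshold is
    an illustrative assumption, not taken from the patent.
    """
    if len(frame_energies) != len(lip_motion):
        raise ValueError("sound frames and lip motion flags must align")

    segments: List[Tuple[int, int]] = []
    start = None  # index of the first frame of the segment currently open
    for i, (energy, moving) in enumerate(zip(frame_energies, lip_motion)):
        is_speech = energy > energy_threshold and moving
        if is_speech and start is None:
            start = i  # a segment begins
        elif not is_speech and start is not None:
            segments.append((start, i))  # the segment ends before frame i
            start = None
    if start is not None:
        # The input ended while a segment was still open.
        segments.append((start, len(frame_energies)))
    return segments


# Frames 4 and 5 are loud (e.g. dynamic noise) but show no lip motion,
# so they are not reported as speech.
energies = [0.1, 0.9, 0.8, 0.1, 0.9, 0.9, 0.1]
lips = [False, True, True, False, False, False, False]
segments = detect_speech_segments(energies, lips, energy_threshold=0.5)
```

In this sketch the lip motion flag acts as a veto on the acoustic decision, which is one simple way to keep loud non-speech frames out of the detected segments.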

Inventors:

Lee, Sue, John
Kim, Sun, Hun
Lee, Yang, Jiku
Kim, Yun, Kyu

Application Number:

JP2006329871A

Publication Date:

October 19, 2011

Filing Date:

December 06, 2006

Assignee:

Electronics and Telecommunications Research Institute

International Classes:

G10L15/04; G10L15/28

Domestic Patent References:

JP2004310047A
JP2004271620A
JP9198082A
JP2002091466A

Other References:

Yoshihide Ban, "End Point Detection Using Driver's Facial Image Sequence Taken in Vehicles," IEICE Technical Report, Vol. 102, No. 708, The Institute of Electronics, Information and Communication Engineers, Japan, April 16, 2003, pp. 111-116

Attorney, Agent or Firm:

Kenji Yoshitake
Hidetoshi Tachibana
Yasukazu Sato
Hiroshi Yoshimoto
Yasushi Kawasaki
Tomoya Deguchi
