SPEECH SECTION DETECTING DEVICE AND SPEECH RECOGNITION DEVICE, PROGRAM AND RECORDING MEDIUM

Title:

SPEECH SECTION DETECTING DEVICE AND SPEECH RECOGNITION DEVICE, PROGRAM AND RECORDING MEDIUM

Document Type and Number:

Japanese Patent JP2011059186

Kind Code:

A

Abstract:

To provide a speech section detecting device capable of suppressing the influence of acoustic noise in detecting speech section by a multi-modal speech section detection which comprehensively uses voice information and image information.

The speech section detecting device 100 includes a first multi-modal VAD section 131, which creates a sound and image feature amount combining a sound feature amount and an image feature amount, and which determines a speech section based on the sound and image feature amount; a speech uni-modal VAD section 132 for determining the speech section by using only the sound feature amount; an image uni-modal VAD section 133 for determining the speech section by using only the image feature amount; a second multi-modal VAD section 134 for determining the speech section, by combining the determination of the speech uni-modal VAD section 132 and the image uni-modal section 133; and a third multi-modal section 135 for determining the speech section, by combining the first multi-modal VAD section 131 and the second multi-modal VAD section 134 by a majority decision rule.

Inventors:

TAMURA TETSUTSUGU
TAKEUCHI SHINICHI
HAYAMIZU SATORU

Application Number:

JP2009205990A

Publication Date:

March 24, 2011

Filing Date:

September 07, 2009

Export Citation:

Click for automatic bibliography generation Help

Assignee:

UNIV GIFU

International Classes:

G10L15/04; G10L15/24; G10L25/78

Attorney, Agent or Firm:

Hironobu Onda
Makoto Onda

Previous Patent: BASE MATERIAL FOR MICROLENS ARRAY, METHOD OF MANUFACTURING THE SAME, FORMING DIE FOR BASE MATERIAL F...

Next Patent: IMAGING APPARATUS, CONTROL METHOD THEREOF AND PROGRAM