To detect a voice section of an input signal regardless of noise level.
A voice determination device 100 includes: a framing unit 120 which segments an input signal per frame to generate a framed input signal; a spectrum generation unit 122 which converts the framed input signal to generate a spectrum pattern obtained by collecting spectra per frequency; a peak detection unit 132 which determines whether an energy ratio of energy of each spectrum of the spectrum pattern and per-band energy in a divided frequency band including the spectrum out of divided frequency bands exceeds a first threshold or not; a voice determination unit 134 which determines whether the framed input signal is voice or not on the basis of the determination result; a frequency averaging unit 126 which derives average energy in a frequency direction of spectra in each divided frequency band of the spectrum pattern; and a time averaging unit 130 which derives per-band energy being an average in the time direction of average energy with respect to each divided frequency band.
JP2001265367A | 2001-09-28 | |||
JPH0431898A | 1992-02-04 |
Next Patent: IMAGE FORMING APPARATUS