To generate a synthetic sound with different pitch from that of existing element data to be a natural tone.
A storage device 14 stores element data V of a voice element for each pitch P. The element data V includes a shape parameter R indicating characteristics of a spectral shape for each frame in a segment including a voiced sound, and includes spectral data Q for each frame in a segment including a voiceless sound. An element interpolation unit 24 carries out interpolation for element data V1 and V2 to generate element data V with target pitch Pt. Specifically, for a frame in which both of the element data V1 and V2 indicate a voiced sound, a shape parameter R is interpolated at an interpolation rate α corresponding to the target pitch Pt. For a frame in which both of the element data V1 and V2 or either of them indicates a voiceless sound, sound volume E is interpolated at the interpolation rate α, and spectral data Q of the element data V1 is interpolated in accordance with sound volume E after interpolation. A voice synthesis unit 26 generates a voice signal VOUT using element data V after interpolation.
MELRAIN BRAU
TACHIBANA MAKOTO
JPH0962297A | 1997-03-07 | |||
JP2002268659A | 2002-09-20 | |||
JPH0962297A | 1997-03-07 | |||
JP2002268658A | 2002-09-20 | |||
JP2002202790A | 2002-07-19 | |||
JPH11259093A | 1999-09-24 | |||
JP2006276522A | 2006-10-12 |
US20020184032A1 | 2002-12-05 | |||
EP1239463A2 | 2002-09-11 | |||
US20030009336A1 | 2003-01-09 |
JPN6016021790; Hui YE, et al.: 'High Quality Voice Morphing' Proc. ICASSP 2004 Vol.1, 20040517, pp.9-12, IEEE
JPN6016034908; 水谷竜也,外1名: '複数素片選択融合方式による音声合成' 日本音響学会2004年春季研究発表会講演論文集-I- , 20040317, pp.217-218, 社団法人日本音響学会
Yashiro Hitoshi
Taro Takahashi