To create a library of mouth shapes only with a small amount of mouse shape data.
The library of mouth shapes is created by separating speaker- dependent and speaker independent variability. Preferably, speaker dependent variability is modeled by a speaker space 42 while the speaker independent variability (i.e., context dependency), is modeled by a set 44 of normalized mouth shapes that need be built only once. Given a small amount of data from a new speaker, it is possible to construct a corresponding mouth shape library by estimating a point in speaker space that maximizes the likelihood of adaptation data and by combining speaker dependent and speaker independent variability. To build the speaker space 42, a context independent mouth shape parametric representation is obtained. Then a supervector containing the set of context-independent mouth shapes is formed for each speaker included in the speaker space 42, Dimensionality reduction 38 is used to find the areas of the speaker space 42.
COPYRIGHT: (C)2004,JPO
JP2002304194A | ||||
JP5153581A | ||||
JP2002156989A | ||||
JP2000122677A | ||||
JP11219421A | ||||
JP10312195A |
Hiroshi Koyama
Hiroshi Takeuchi
Takahisa Shimada
Yuji Takeuchi
Katsumi Imae
Atsushi Fujita
Kazunari Ninomiya
Tomoo Harada
Takashi Goto
Iseki Katsumori