Title:
DEVICE, METHOD, AND PROGRAM FOR ANALYZING SPEECH SIGNAL
Document Type and Number:
WIPO Patent Application WO/2019/163753
Kind Code:
A1
Abstract:
The present invention makes it possible to accurately estimate, from the fundamental frequency (F0) pattern of a speech fragment, the parameters underlying that pattern, and to reconstruct the fundamental frequency pattern of the speech fragment from those parameters. A learning unit 30 trains a deep generative model on parallel data consisting of fundamental frequency patterns of speech signals and the parameters underlying those patterns. The parameters underlying the fundamental frequency pattern of the speech signal are treated as the latent variable of the deep generative model, and the deep generative model includes an encoder that estimates the latent variable from the fundamental frequency pattern of the speech signal and a decoder that reconstructs the fundamental frequency pattern of the speech signal from the latent variable.
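
The encoder/decoder arrangement described in the abstract can be illustrated with a toy forward pass. This is only a minimal sketch under stated assumptions, not the method claimed in the application: the abstract does not specify layer types, dimensions, or the training objective, so the sizes `T`, `D`, `H`, the single hidden layer, the Gaussian sampling step, and the loss (reconstruction error plus a supervised term tying the encoder output to the known parameters from the parallel data) are all hypothetical choices made for illustration. The input stands for the basic (fundamental) frequency pattern, F0, and the data are random placeholders.

```python
import numpy as np

rng = np.random.default_rng(0)

# All sizes below are hypothetical; the abstract does not give them.
T = 50   # frames in one F0 pattern
D = 4    # size of the parameter vector treated as the latent variable
H = 16   # hidden units

def init_layer(n_in, n_out):
    """Small random weights and zero biases for one dense layer."""
    return rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out)

enc_w1, enc_b1 = init_layer(T, H)
enc_mu_w, enc_mu_b = init_layer(H, D)
enc_lv_w, enc_lv_b = init_layer(H, D)
dec_w1, dec_b1 = init_layer(D, H)
dec_w2, dec_b2 = init_layer(H, T)

def encode(f0):
    """Encoder: F0 pattern -> mean and log-variance of the latent variable."""
    h = np.tanh(f0 @ enc_w1 + enc_b1)
    return h @ enc_mu_w + enc_mu_b, h @ enc_lv_w + enc_lv_b

def decode(z):
    """Decoder: latent variable -> reconstructed F0 pattern."""
    h = np.tanh(z @ dec_w1 + dec_b1)
    return h @ dec_w2 + dec_b2

def loss(f0, params):
    """Assumed objective: reconstruction error plus a supervised latent term."""
    mu, log_var = encode(f0)
    # Sample the latent variable around the encoder mean.
    z = mu + np.exp(0.5 * log_var) * rng.standard_normal(mu.shape)
    recon = np.mean((decode(z) - f0) ** 2)
    # Parallel data supplies the true parameters for each F0 pattern.
    latent = np.mean((mu - params) ** 2)
    return recon + latent

# Placeholder "parallel data": random F0 patterns and random parameters.
f0_batch = rng.normal(5.0, 0.3, size=(8, T))
param_batch = rng.normal(0.0, 1.0, size=(8, D))
print(loss(f0_batch, param_batch))
```

Training would adjust the weights to reduce this loss over the parallel data; after training, `encode` alone would serve as the parameter estimator and `decode` alone as the F0 reconstructor.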

Inventors:
TANAKA, Ko (9-11 Midori-cho 3-chome, Musashino-sh, Tokyo 85, 〒1808585, JP)
KAMEOKA, Hirokazu (9-11 Midori-cho 3-chome, Musashino-sh, Tokyo 85, 〒1808585, JP)
Application Number:
JP2019/006047
Publication Date:
August 29, 2019
Filing Date:
February 19, 2019
Assignee:
NIPPON TELEGRAPH AND TELEPHONE CORPORATION (5-1 Otemachi 1-chome, Chiyoda-ku Tokyo, 16, 〒1008116, JP)
International Classes:
G10L25/90; G10L25/30
Attorney, Agent or Firm:
TAIYO, NAKAJIMA & KATO (3-17, Shinjuku 4-chome Shinjuku-k, Tokyo 22, 〒1600022, JP)