Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
METHOD AND SYSTEM FOR GENERATING SYNTHESIZED SPEECH OF NEW SPEAKER
Document Type and Number:
WIPO Patent Application WO/2022/164207
Kind Code:
A1
Abstract:
The present invention relates to a method, performed by at least one processor, for generating synthesized speech of a new speaker. The method may comprise the steps of: receiving target text; acquiring speaker features of a reference speaker; acquiring information about changes in utterance features; determining speaker features of a new speaker by using the acquired speaker features of the reference speaker and the acquired information about changes in utterance features; and generating output speech for the target text by inputting the target text and the determined speaker features of the new speaker to an artificial neural network text-speech synthesis model, wherein the output speech reflects the determined speaker features of the new speaker. Here, the artificial neural network text-speech synthesis model can be trained on the basis of a plurality of training text items and speaker features of a plurality of training speakers to output speech for the plurality of training text items, wherein the output speech reflects the speaker features of the plurality of training speakers.

Inventors:
KIM, Taesu (KR)
LEE, Younggun (KR)
HWANG, Yeongtae (KR)
Application Number:
PCT/KR2022/001414
Publication Date:
August 04, 2022
Filing Date:
January 26, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
NEOSAPIENCE, INC. (KR)
International Classes:
G10L13/02; G10L13/08; G10L17/02
Attorney, Agent or Firm:
KIM, Han Sol et al. (KR)
Download PDF: