Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
DEVICE AND METHOD FOR GENERATING SYNTHESIZED SPEECH IMAGE
Document Type and Number:
WIPO Patent Application WO/2023/146019
Kind Code:
A1
Abstract:
A device and a method for generating a synthesized speech image are disclosed. The device for generating a synthesized speech image according to an embodiment is a device for generating a synthesized speech image on the basis of machine learning, the device comprising: a first global geometric transformation prediction unit which receives each of a source image and a target image including the same person, and is trained to predict a global geometric transformation for global movement of the person between the source image and the target image on the basis of the source image and the target image; a local geometric transformation prediction unit which is trained to predict a local geometric transformation for local movement of the person between the source image and the target image on the basis of preconfigured input data; a geometric transformation combination unit which combines the global geometric transformation and the local geometric transformation so as to calculate an overall movement geometric transformation for overall movement of the person; an optical flow prediction unit which is trained to calculate an optical flow between the source image and the target image on the basis of the source image and the overall movement geometric transformation; and an image generation unit which is trained to reconstruct the target image on the basis of the source image and the optical flow.

Inventors:
CHAE GYEONG SU (KR)
HWANG GUEM BUEL (KR)
Application Number:
PCT/KR2022/003604
Publication Date:
August 03, 2023
Filing Date:
March 15, 2022
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
DEEPBRAIN AI INC (KR)
International Classes:
G10L21/10; G06T5/00; G06T7/269; G06T13/20; G10L15/04; G10L21/055
Domestic Patent References:
WO2002031772A22002-04-18
Foreign References:
KR20200145700A2020-12-30
US20200393943A12020-12-17
KR20180057564A2018-05-30
Other References:
MARDANI MORTEZA; GIANNAKIS GEORGIOS B.: "Robust network traffic estimation via sparsity and low rank", ICASSP, IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING - PROCEEDINGS 1999 IEEE, IEEE, 26 May 2013 (2013-05-26), pages 4529 - 4533, XP032507772, ISSN: 1520-6149, ISBN: 978-0-7803-5041-0, DOI: 10.1109/ICASSP.2013.6638517
Attorney, Agent or Firm:
T&C IP LAW FIRM (KR)
Download PDF: