Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
SPEECH ENHANCEMENT METHOD AND DEVICE USING FAST FOURIER CONVOLUTION
Document Type and Number:
WIPO Patent Application WO/2023/182765
Kind Code:
A1
Abstract:
Methods for processing and analyzing audio recordings and, in particular, for speech denoising are provided. A method for speech denoising using a fast Fourier convolution operator comprises: splitting channels of input tensor into local and global branches; using conventional convolutions for local updates of feature maps at the local branch; performing Fourier transform across frequency dimension of the global branch feature map; updating the global branch feature map in spectral domain by point-wise convolutions; applying an inverse Fourier transform to the updated global branch feature map; and summing local and global branches activations. The technical result consists in improving the quality of speech denoising and/or enhancement of speech component in a speech audio signal.

Inventors:
SHCHEKOTOV IVAN SERGEEVICH (RU)
ANDREEV PAVEL KONSTANTINOVICH (RU)
ALANOV AIBEK ARSTANBEKOVICH (RU)
IVANOV OLEG YURIEVICH (RU)
VETROV DMITRY PETROVICH (RU)
Application Number:
PCT/KR2023/003711
Publication Date:
September 28, 2023
Filing Date:
March 21, 2023
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
SAMSUNG ELECTRONICS CO LTD (KR)
International Classes:
G10L21/0208; G06N3/0455; G06N3/0464; G10L25/18; G10L25/24; G10L25/30
Domestic Patent References:
WO2021251627A12021-12-16
Foreign References:
CN113314140A2021-08-27
CN113655986A2021-11-16
CN108768542A2018-11-06
Other References:
CHI LU, JIANG BORUI, MU YADONG: "Fast Fourier Convolution", NIPS`20: PROCEEDINGS OF THE 34TH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING SYSTEMS, 1 January 2020 (2020-01-01), XP093095623
Attorney, Agent or Firm:
KIM, Tae-hun et al. (KR)
Download PDF: