Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
DEVICE FOR CLASSIFYING SOUND SOURCE USING DEEP LEARNING, AND METHOD THEREFOR
Document Type and Number:
WIPO Patent Application WO/2022/163982
Kind Code:
A1
Abstract:
The present invention relates to a device for automatically classifying an inputted sound source according to preset criteria, and more particularly, to a device for automatically classifying a sound source according to preset criteria using deep learning, and a method therefor. According to one embodiment of the present invention, disclosed is a device for classifying a sound source, comprising: a processor; and a memory that is connected to the processor and stores a deep learning algorithm and original sound data, wherein the memory stores program commands, which are executable by the processor, for: generating n pieces of image data corresponding to the original sound data, according to a preset method; generating training image data corresponding to the original sound data, using the n pieces of image data; training the deep learning algorithm using the training image data; and classifying target sound data according to preset criteria using the trained deep learning algorithm, wherein n is a natural number equal to or greater than 2.

Inventors:
JEON JIN YONG (KR)
PARK JUN HONG (KR)
KIM SANG HEON (KR)
LEE HYUN (KR)
JO HYUN IN (KR)
ZHAO HONG PING (KR)
KIM HYUN MIN (KR)
Application Number:
PCT/KR2021/017019
Publication Date:
August 04, 2022
Filing Date:
November 18, 2021
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
HANYANG S&A CO LTD (KR)
International Classes:
G10L25/18; G06N20/00; G06T11/00; G10L21/0272; G10L25/51
Foreign References:
KR20190113390A2019-10-08
KR20170096083A2017-08-23
KR20200002147A2020-01-08
Other References:
BODDAPATI VENKATESH: "Classifying Environmental Sounds with Image Networks", MASTER OF SCIENCE IN COMPUTER SCIENCE, KARLSKRONA SWEDEN, 1 February 2017 (2017-02-01), Karlskrona Sweden, pages 1 - 37, XP055954958
MCLOUGHLIN IAN; ZHANG HAOMIN; XIE ZHIPENG; SONG YAN; XIAO WEI: "Robust Sound Event Classification Using Deep Neural Networks", IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, vol. 23, no. 3, 1 March 2015 (2015-03-01), USA, pages 540 - 552, XP011573973, ISSN: 2329-9290, DOI: 10.1109/TASLP.2015.2389618
HONGPYEONG CHO, SANGHEON KIM, HYUN LEE, JINYONG JEON: "Pneumonia diagnosis algorithm with room acoustic consideration", THE KOREAN SOCIETY FOR NOISE AND VIBRATION ENGINEERING 30TH ANNIVERSARY AUTUMN CONFERENCE 2020; NOVEMBER 17-20, 2020, 19 November 2020 (2020-11-19), JP, pages 160, XP009538869
Attorney, Agent or Firm:
THEWAVE IP LAW FIRM (KR)
Download PDF: