Title:
DEVICE FOR CLASSIFYING SOUND SOURCE USING DEEP LEARNING, AND METHOD THEREFOR
Document Type and Number:
WIPO Patent Application WO/2022/163982
Kind Code:
A1
Abstract:
The present invention relates to a device for automatically classifying an inputted sound source according to preset criteria, and more particularly, to a device for automatically classifying a sound source according to preset criteria using deep learning, and a method therefor. According to one embodiment of the present invention, disclosed is a device for classifying a sound source, comprising: a processor; and a memory that is connected to the processor and stores a deep learning algorithm and original sound data, wherein the memory stores program commands, which are executable by the processor, for: generating n pieces of image data corresponding to the original sound data, according to a preset method; generating training image data corresponding to the original sound data, using the n pieces of image data; training the deep learning algorithm using the training image data; and classifying target sound data according to preset criteria using the trained deep learning algorithm, wherein n is a natural number equal to or greater than 2.
Inventors:
JEON JIN YONG (KR)
PARK JUN HONG (KR)
KIM SANG HEON (KR)
LEE HYUN (KR)
JO HYUN IN (KR)
ZHAO HONG PING (KR)
KIM HYUN MIN (KR)
PARK JUN HONG (KR)
KIM SANG HEON (KR)
LEE HYUN (KR)
JO HYUN IN (KR)
ZHAO HONG PING (KR)
KIM HYUN MIN (KR)
Application Number:
PCT/KR2021/017019
Publication Date:
August 04, 2022
Filing Date:
November 18, 2021
Export Citation:
Assignee:
HANYANG S&A CO LTD (KR)
International Classes:
G10L25/18; G06N20/00; G06T11/00; G10L21/0272; G10L25/51
Foreign References:
KR20190113390A | 2019-10-08 | |||
KR20170096083A | 2017-08-23 | |||
KR20200002147A | 2020-01-08 |
Other References:
BODDAPATI VENKATESH: "Classifying Environmental Sounds with Image Networks", MASTER OF SCIENCE IN COMPUTER SCIENCE, KARLSKRONA SWEDEN, 1 February 2017 (2017-02-01), Karlskrona Sweden, pages 1 - 37, XP055954958
MCLOUGHLIN IAN; ZHANG HAOMIN; XIE ZHIPENG; SONG YAN; XIAO WEI: "Robust Sound Event Classification Using Deep Neural Networks", IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, vol. 23, no. 3, 1 March 2015 (2015-03-01), USA, pages 540 - 552, XP011573973, ISSN: 2329-9290, DOI: 10.1109/TASLP.2015.2389618
HONGPYEONG CHO, SANGHEON KIM, HYUN LEE, JINYONG JEON: "Pneumonia diagnosis algorithm with room acoustic consideration", THE KOREAN SOCIETY FOR NOISE AND VIBRATION ENGINEERING 30TH ANNIVERSARY AUTUMN CONFERENCE 2020; NOVEMBER 17-20, 2020, 19 November 2020 (2020-11-19), JP, pages 160, XP009538869
MCLOUGHLIN IAN; ZHANG HAOMIN; XIE ZHIPENG; SONG YAN; XIAO WEI: "Robust Sound Event Classification Using Deep Neural Networks", IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, vol. 23, no. 3, 1 March 2015 (2015-03-01), USA, pages 540 - 552, XP011573973, ISSN: 2329-9290, DOI: 10.1109/TASLP.2015.2389618
HONGPYEONG CHO, SANGHEON KIM, HYUN LEE, JINYONG JEON: "Pneumonia diagnosis algorithm with room acoustic consideration", THE KOREAN SOCIETY FOR NOISE AND VIBRATION ENGINEERING 30TH ANNIVERSARY AUTUMN CONFERENCE 2020; NOVEMBER 17-20, 2020, 19 November 2020 (2020-11-19), JP, pages 160, XP009538869
Attorney, Agent or Firm:
THEWAVE IP LAW FIRM (KR)
Download PDF: