DEVICE FOR CLASSIFYING SOUND SOURCE USING DEEP LEARNING, AND METHOD THEREFOR

Document Type and Number:

WIPO Patent Application WO/2022/163982

Kind Code:

A1

Abstract:

The present invention relates to a device for automatically classifying an inputted sound source according to preset criteria, and more particularly, to a device for automatically classifying a sound source according to preset criteria using deep learning, and a method therefor. According to one embodiment of the present invention, disclosed is a device for classifying a sound source, comprising: a processor; and a memory that is connected to the processor and stores a deep learning algorithm and original sound data, wherein the memory stores program commands, which are executable by the processor, for: generating n pieces of image data corresponding to the original sound data, according to a preset method; generating training image data corresponding to the original sound data, using the n pieces of image data; training the deep learning algorithm using the training image data; and classifying target sound data according to preset criteria using the trained deep learning algorithm, wherein n is a natural number equal to or greater than 2.
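
The first two program commands in the abstract (generating n pieces of image data from one sound recording, then building training image data from them) could be sketched roughly as follows. The abstract does not say what the "preset method" is, so this sketch makes assumptions: each image is a magnitude spectrogram computed with a different frame length, and the n images are resized to a common shape and stacked as channels. The function names, frame lengths, image size, and test tone are all hypothetical. Training the deep learning algorithm and classifying target sound data are left out.

```python
import cmath
import math


def spectrogram(signal, frame_len, hop):
    """Magnitude spectrogram via a naive DFT: rows are frequency bins, columns are frames."""
    frames = [signal[s:s + frame_len]
              for s in range(0, len(signal) - frame_len + 1, hop)]
    bins = frame_len // 2 + 1
    image = [[0.0] * len(frames) for _ in range(bins)]
    for t, frame in enumerate(frames):
        for k in range(bins):
            acc = sum(x * cmath.exp(-2j * math.pi * k * i / frame_len)
                      for i, x in enumerate(frame))
            image[k][t] = abs(acc)
    return image


def resize_nearest(image, rows, cols):
    """Nearest-neighbour resize so that all n images share one shape."""
    src_rows, src_cols = len(image), len(image[0])
    return [[image[r * src_rows // rows][c * src_cols // cols]
             for c in range(cols)]
            for r in range(rows)]


def make_training_image(signal, frame_lens=(16, 32), size=(8, 8)):
    """Generate n = len(frame_lens) images and stack them as channels (assumed scheme)."""
    if len(frame_lens) < 2:
        # The abstract requires n to be a natural number of at least 2.
        raise ValueError("n must be at least 2")
    channels = []
    for frame_len in frame_lens:
        img = spectrogram(signal, frame_len, hop=frame_len // 2)
        channels.append(resize_nearest(img, *size))
    return channels  # nested lists shaped n x rows x cols


# Hypothetical input: a 440 Hz tone sampled at 8 kHz, 128 samples long.
tone = [math.sin(2 * math.pi * 440 * i / 8000) for i in range(128)]
training_image = make_training_image(tone)
print(len(training_image), len(training_image[0]), len(training_image[0][0]))
```

The resulting n-channel array is the kind of input an image classification network could be trained on. Stacking is only one plausible way to combine the n images; the abstract leaves the actual combination method open.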

Inventors:

JEON JIN YONG (KR)
PARK JUN HONG (KR)
KIM SANG HEON (KR)
LEE HYUN (KR)
JO HYUN IN (KR)
ZHAO HONG PING (KR)
KIM HYUN MIN (KR)

Application Number:

PCT/KR2021/017019

Publication Date:

August 04, 2022

Filing Date:

November 18, 2021

Assignee:

HANYANG S&A CO LTD (KR)

International Classes:

G10L25/18; G06N20/00; G06T11/00; G10L21/0272; G10L25/51

Foreign References:

KR20190113390A	2019-10-08
KR20170096083A	2017-08-23
KR20200002147A	2020-01-08

Other References:

BODDAPATI VENKATESH: "Classifying Environmental Sounds with Image Networks", MASTER OF SCIENCE IN COMPUTER SCIENCE, KARLSKRONA SWEDEN, 1 February 2017 (2017-02-01), Karlskrona Sweden, pages 1 - 37, XP055954958
MCLOUGHLIN IAN; ZHANG HAOMIN; XIE ZHIPENG; SONG YAN; XIAO WEI: "Robust Sound Event Classification Using Deep Neural Networks", IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, vol. 23, no. 3, 1 March 2015 (2015-03-01), USA, pages 540 - 552, XP011573973, ISSN: 2329-9290, DOI: 10.1109/TASLP.2015.2389618
HONGPYEONG CHO, SANGHEON KIM, HYUN LEE, JINYONG JEON: "Pneumonia diagnosis algorithm with room acoustic consideration", THE KOREAN SOCIETY FOR NOISE AND VIBRATION ENGINEERING 30TH ANNIVERSARY AUTUMN CONFERENCE 2020; NOVEMBER 17-20, 2020, 19 November 2020 (2020-11-19), JP, pages 160, XP009538869

Attorney, Agent or Firm:

THEWAVE IP LAW FIRM (KR)
