SPEECH ENHANCEMENT FOR SPEECH RECOGNITION APPLICATIONS IN BROADCASTING ENVIRONMENTS

Document Type and Number:

WIPO Patent Application WO/2018/211983

Kind Code:

A1

Abstract:

A system that acquires first audio data including a voice command captured by a microphone; identifies second audio data included in broadcast content corresponding to a timing at which the first audio data is captured by the microphone; extracts the second audio data from the first audio data to generate third audio data; converts the third audio data to text data corresponding to the voice command; and outputs the text data.
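The abstract does not say how the broadcast audio is aligned with or removed from the microphone signal. As a rough illustration only, the following sketch assumes a clean copy of the broadcast audio is available as a reference, that it reaches the microphone with a single delay and gain, and that cross-correlation plus a least-squares gain is good enough to cancel it. The function names `estimate_delay` and `remove_broadcast` are hypothetical and are not taken from the application; the speech-to-text step is left out.

```python
import numpy as np


def estimate_delay(mic: np.ndarray, ref: np.ndarray) -> int:
    """Return the lag (in samples) at which `ref` best matches `mic`.

    A positive lag means the reference appears later in the microphone signal.
    Assumption: a single dominant delay, found by plain cross-correlation.
    """
    corr = np.correlate(mic, ref, mode="full")
    # Index 0 of the 'full' output corresponds to a lag of -(len(ref) - 1).
    return int(np.argmax(corr)) - (len(ref) - 1)


def remove_broadcast(mic: np.ndarray, ref: np.ndarray) -> np.ndarray:
    """Subtract a delayed, scaled copy of `ref` (second audio data) from
    `mic` (first audio data) and return the residual (third audio data)."""
    mic = np.asarray(mic, dtype=float)
    ref = np.asarray(ref, dtype=float)

    lag = estimate_delay(mic, ref)

    # Place the reference on the microphone's time axis, shifted by `lag`.
    aligned = np.zeros_like(mic)
    src_start = max(0, -lag)
    dst_start = max(0, lag)
    n = min(len(ref) - src_start, len(mic) - dst_start)
    if n > 0:
        aligned[dst_start:dst_start + n] = ref[src_start:src_start + n]

    # Least-squares gain that best explains the microphone signal
    # as a scaled copy of the aligned reference.
    energy = float(np.dot(aligned, aligned))
    gain = float(np.dot(mic, aligned)) / energy if energy > 0.0 else 0.0

    return mic - gain * aligned
```

In this sketch the residual returned by `remove_broadcast` would be handed to whatever speech recognizer converts the voice command to text. A real room adds reverberation and loudspeaker distortion, so a practical system would more likely need an adaptive filter than a single delay and gain.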

Inventors:

IGARASHI TATSUYA (JP)

Application Number:

PCT/JP2018/017484

Publication Date:

November 22, 2018

Filing Date:

May 02, 2018

Assignee:

SONY CORP (JP)

International Classes:

G10L21/02; G10L15/20; G10L15/30; G10L21/0208; G10L21/0272

Foreign References:

US20160240210A1	2016-08-18
EP2965496A1	2016-01-13
EP2685449A1	2014-01-15
JP2013187781A	2013-09-19
JP2014153663A	2014-08-25

Attorney, Agent or Firm:

NISHIKAWA Takashi et al. (JP)
