Title:
SPEECH ENHANCEMENT FOR SPEECH RECOGNITION APPLICATIONS IN BROADCASTING ENVIRONMENTS
Document Type and Number:
WIPO Patent Application WO/2018/211983
Kind Code:
A1
Abstract:
A system that acquires first audio data including a voice command captured by a microphone; identifies second audio data included in broadcast content corresponding to a timing at which the first audio data is captured by the microphone; extracts the second audio data from the first audio data to generate third audio data; converts the third audio data to text data corresponding to the voice command; and outputs the text data.
Inventors:
IGARASHI TATSUYA (JP)
Application Number:
PCT/JP2018/017484
Publication Date:
November 22, 2018
Filing Date:
May 02, 2018
Export Citation:
Assignee:
SONY CORP (JP)
International Classes:
G10L21/02; G10L15/20; G10L15/30; G10L21/0208; G10L21/0272
Foreign References:
US20160240210A1 | 2016-08-18 | |||
EP2965496A1 | 2016-01-13 | |||
EP2685449A1 | 2014-01-15 | |||
JP2013187781A | 2013-09-19 | |||
JP2014153663A | 2014-08-25 |
Attorney, Agent or Firm:
NISHIKAWA Takashi et al. (JP)
Download PDF:
Previous Patent: IMAGE PROCESSING DEVICE AND METHOD, AND IMAGE PROCESSING SYSTEM
Next Patent: SPEAKER ARRAY AND SIGNAL PROCESSOR
Next Patent: SPEAKER ARRAY AND SIGNAL PROCESSOR