Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
音声認識方法、装置、機器及びコンピュータ読み取り可能な記憶媒体
Document Type and Number:
Japanese Patent JP7434137
Kind Code:
B2
Abstract:
The disclosure provides a speech recognition method, a device and a computer-readable storage medium. The method includes obtaining a first voice signal collected from a first microphone in a microphone array and a second voice signal collected from a second microphone in the microphone array, the microphone array including at least two microphones, such as two, three or six microphones. The method further includes extracting enhanced features associated with the first voice signal and the second voice signal through a neural network, and obtaining a speech recognition result based on the enhanced features extracted.

Inventors:
Chang, Se
Fan, bin
Lee, Shin
Bai Jinfeng
Chen, Shu
Jia, Ray
Application Number:
JP2020187686A
Publication Date:
February 20, 2024
Filing Date:
November 11, 2020
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
Baidu Online Network Technology (Beijing) Company Limited
International Classes:
G10L15/20; G10L15/06; G10L15/10; G10L15/16; G10L15/28
Domestic Patent References:
JP201920598A
JP1169494A
JP2019508730A
JP2017520803A
Foreign References:
US20190259409
WO2018037643A1
US20190355375
Other References:
Xiaofei Wang et al.,”Stream attention-based multi-array end-to-end speech recognition”,2019 IEEE International Conference on Acoustics,Speech and Signal processing(ICAPPS 2019),2019年4月17日,p.7105-7109
“A Breakthrough in Speech Technology: Baidu Launched SMLTA, the First Streaming Multi-layer Truncated Attention Model for Large-scale Online Speech Recognition”,http://research.baidu.com/Blog/index-view?id=109,Baidu Research,2019年1月21日
Attorney, Agent or Firm:
Kunio Ueda