Speech recognition method, apparatus, device, and computer-readable storage medium - Baidu Online Network Technology (Beijing) Company Limited

Title:

Speech recognition method, apparatus, device, and computer-readable storage medium

Document Type and Number:

Japanese Patent JP7434137

Kind Code:

B2

Abstract:

The disclosure provides a speech recognition method, apparatus, device, and computer-readable storage medium. The method includes obtaining a first voice signal collected by a first microphone in a microphone array and a second voice signal collected by a second microphone in the same array, where the array includes at least two microphones (for example two, three, or six). The method further includes extracting, through a neural network, enhanced features associated with the first voice signal and the second voice signal, and obtaining a speech recognition result based on the extracted enhanced features.
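
The data flow in the abstract (two microphone signals in, enhanced features out of a neural network) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the framing parameters, the use of complex DFT values as the network input, the single dense layer with random, untrained weights, and all function names. The final recognition step is omitted.

```python
import cmath
import random


def frame_signal(signal, frame_len, hop):
    """Split a 1-D signal into overlapping frames of frame_len samples."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def dft(frame):
    """Naive DFT; returns the non-redundant bins 0..N/2 as complex numbers."""
    n = len(frame)
    return [sum(x * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, x in enumerate(frame))
            for k in range(n // 2 + 1)]


def multichannel_features(sig1, sig2, frame_len=8, hop=4):
    """Per frame, concatenate real and imaginary DFT parts of both channels."""
    rows = []
    for f1, f2 in zip(frame_signal(sig1, frame_len, hop),
                      frame_signal(sig2, frame_len, hop)):
        row = []
        for spectrum in (dft(f1), dft(f2)):
            for c in spectrum:
                row.extend((c.real, c.imag))
        rows.append(row)
    return rows


def dense_relu(x, weights, bias):
    """One fully connected layer with ReLU; a stand-in for a trained network."""
    return [max(0.0, sum(w * v for w, v in zip(w_row, x)) + b)
            for w_row, b in zip(weights, bias)]


# Hypothetical input: two 16-sample signals, the second a delayed copy of the first.
rng = random.Random(0)
mic1 = [rng.uniform(-1.0, 1.0) for _ in range(16)]
mic2 = [0.0] + mic1[:-1]

features = multichannel_features(mic1, mic2)

# Random (untrained) weights, used only to show the shape of the data flow.
out_dim = 4
weights = [[rng.uniform(-0.1, 0.1) for _ in features[0]] for _ in range(out_dim)]
bias = [0.0] * out_dim
enhanced = [dense_relu(row, weights, bias) for row in features]

print(len(features), len(features[0]), len(enhanced[0]))
```

With these toy settings, each of the three frames carries 20 input values (two channels, five DFT bins each, real and imaginary parts) and is mapped to a 4-dimensional vector. A real system would feed trained network outputs to a recognizer; that part is outside this sketch.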

Inventors:

Chang, Se
Fan, Bin
Lee, Shin
Bai, Jinfeng
Chen, Shu
Jia, Ray

Application Number:

JP2020187686A

Publication Date:

February 20, 2024

Filing Date:

November 11, 2020

Assignee:

Baidu Online Network Technology (Beijing) Company Limited

International Classes:

G10L15/20; G10L15/06; G10L15/10; G10L15/16; G10L15/28

Domestic Patent References:

JP201920598A
JP1169494A
JP2019508730A
JP2017520803A

Foreign References:

US20190259409
WO2018037643A1
US20190355375

Other References:

Xiaofei Wang et al., "Stream attention-based multi-array end-to-end speech recognition", 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2019), April 17, 2019, pp. 7105-7109
"A Breakthrough in Speech Technology: Baidu Launched SMLTA, the First Streaming Multi-layer Truncated Attention Model for Large-scale Online Speech Recognition", http://research.baidu.com/Blog/index-view?id=109, Baidu Research, January 21, 2019

Attorney, Agent or Firm:

Kunio Ueda
