SOUND CAPTURING - HARMAN BECKER AUTOMOTIVE SYSTEMS GMBH

Title:

SOUND CAPTURING

Document Type and Number:

WIPO Patent Application WO/2018/219582

Kind Code:

Abstract:

Sound capturing which includes applying a far-field microphone functionality to a multiplicity of first microphone signals to provide a first output signal, and applying a less directional microphone functionality to one or more second microphone signals to provide a second output signal.

Inventors:

CHRISTOPH MARKUS (DE)
PFAFFINGER GERHARD (DE)
KRONLACHNER MATTHIAS (DE)

Application Number:

PCT/EP2018/061303

Publication Date:

December 06, 2018

Filing Date:

May 03, 2018

Export Citation:

Click for automatic bibliography generation Help

Assignee:

HARMAN BECKER AUTOMOTIVE SYSTEMS GMBH (DE)

International Classes:

H04R3/00; G10L21/00

Domestic Patent References:

WO2010019192A1

2010-02-18

Foreign References:

US20160241955A1	2016-08-18
EP1538867A1	2005-06-08
EP2437517A1	2012-04-04
US20140350935A1	2014-11-27
US20070053455A1	2007-03-08
JP2007147732A	2007-06-14
EP0869697A2	1998-10-07

Other References:

GÓMEZ P ET AL: "Multiple source separation in the frequency domain using Negative Beamforming", EURSPEECH 2001 - SCANDINAVIA, vol. 4, 31 December 2001 (2001-12-31), pages 2619, XP007004932

Attorney, Agent or Firm:

WESTPHAL, MUSSGNUG & PARTNER (DE)

Download PDF:

View/Download PDF PDF Help

Claims:

CLAIMS:

1. A sound capturing system comprising:

a first signal processing path configured to apply a far-field microphone functionality based on a multiplicity of first microphone signals and to provide a first output signal to a speech processing arrangement; and

a second signal processing path configured to apply a less directional microphone functionality than the far-field microphone functionality based on one or more second microphone signals and to provide a second output signal to the speech processing arrangement.

2. The system of claim 1, further comprising a multi-channel high-pass filter block, the high-pass filter block comprising a multiplicity of high-pass filters operatively connected upstream of at least one of the first signal processing path and the second signal processing path.

3. The system of claim 1 or 2, further comprising a microphone array, the microphone array comprising a multiplicity of microphones that provides at least one of the multiplicity of first microphone signals and the multiplicity of second microphone signals.

4. The system of any of claims 1 to 3, wherein the first signal processing path comprises: a multi-channel acoustic echo canceling block comprising a multiplicity of acoustic echo cancelers and configured to receive the filtered or unfiltered multiplicity of first microphone signals;

a multi-channel fix beamforming block comprising a multiplicity of fix beamformers and operatively connected downstream of the multi-channel acoustic echo canceling block;

a beam steering block operatively connected downstream of the multi-channel fix beamforming block and configured to provide at least one fix-beam signal; and

an adaptive beamforming block operatively connected downstream of the beam steering block and configured to provide a directional beam signal steered towards a target position.

5. The system of claim 4, wherein the first signal processing path further comprises at least one of:

a first noise reduction block operatively connected downstream of the adaptive beamforming block and configured to remove noise from the beam signal provided by the adaptive beamforming block;

a first automatic gain control block operatively connected downstream of the adaptive beamforming block and configured to provide a first automatic gain control output signal with a controlled signal amplitude; and

a first limiter block operatively connected downstream of the adaptive beamforming block and configured to provide a first limiter output signal with a signal amplitude that is under predetermined value.

6. The system of claim 4 or 5, wherein the beam steering block is further configured to provide a positive fix-beam signal and a negative fix-beam signal, the positive fix-beam signal representing a beam pointing in a direction in a room with currently the highest signal-to-noise ratio and negative fix-beam signal representing a beam pointing in a direction in a room with currently the lowest signal-to-noise ratio.

7. The system of claim 4 or 5, wherein the beam steering block is further configured to provide a positive fix-beam signal and a negative fix-beam signal, the positive fix-beam signal representing a beam pointing in a direction in a room with currently the highest signal-to-noise ratio and negative fix-beam signal representing a beam pointing into an opposite direction.

8. The system of any of claims 1 to 7, wherein the second signal processing path comprises:

a multi-channel delay block comprising a multiplicity of delays and connected to microphone array or the high-pass filter block;

a first summing block operatively connected downstream of the multi-channel delay block and configured to sum up the delayed filtered or unfiltered multiplicity of second microphone signals to provide a sum signal; and a first single-channel acoustic echo canceling block comprising an acoustic echo canceler, and configured to receive the sum signal and to provide the less directional signal.

9. The system of claim 8, the system further comprising a delay calculation block, wherein:

the beam steering block is further configured to provide a delay steering signal; the multi-channel delay block is further configured to provide a multiplicity of controllable delays; and

the multi-channel delay calculation block is configured to control the multiplicity of controllable delays based on the delay steering signal from the beam steering block.

10. The system of claim 9, wherein the multiplicity of delays comprises fractional delays.

11. The system of any of claims 1 to 7, wherein the second signal processing path comprises:

a first multi-channel allpass filter block comprising a multiplicity of allpass filters and operatively connected to microphone array or the high-pass filter block;

a second summing block operatively connected downstream of the multi-channel delay block and configured to sum up the delayed filtered or unfiltered multiplicity of second microphone signals to provide a sum signal; and

a second single-channel acoustic echo canceling block comprising an acoustic echo canceler, and configured to receive the sum signal and to provide the less directional signal.

12. The system of any of claims 4 to 7, wherein the second signal processing path comprises:

a second multi-channel allpass filter block comprising a multiplicity of allpass filters and operatively connected to the multi-channel acoustic echo canceling block; a second summing block operatively connected downstream of the multi-channel delay block and configured to sum up the delayed filtered or unfiltered multiplicity of second microphone signals to provide a sum signal.

13. The system of claim 11 or 12, wherein at least one of the first multi-channel allpass filter block and the second multi-channel allpass filter block comprises allpass filters with randomly distributed cut-off frequencies that are arranged around a notch in the resulting magnitude frequency response.

14. The system of any of claims 8 to 13, wherein the second signal processing path further comprises at least one of:

a second noise reduction block operatively connected downstream of the summing block and configured to remove noise from the sum signal provided by the summing block;

a second automatic gain control block operatively connected downstream of the summing block and configured to provide a second automatic gain control output signal with a controlled signal amplitude; and

a second limiter block operatively connected downstream of the summing block and configured to provide a second limiter output signal with a signal amplitude that is equal to or below a predetermined value.

15. The system of any of claims 1 to 14, wherein the speech processing arrangement comprises a speech recognition block operatively connected downstream of at least one of the first signal processing path and second signal path.

16. The system of any of claims 1 to 15, wherein the speech processing arrangement comprises a key word search processing block or a hands-free-processing block operatively connected downstream of the at least one of the second signal processing path and first signal processing path.

17. The system of claim 4 or 5, wherein the second signal processing path further comprises a second summing block operatively connected downstream of the multi-channel fix beamforming block and configured to sum up the output signals thereof to provide a sum signal; and at least one of:

a second noise reduction block operatively connected downstream of the summing block and configured to remove noise from the sum signal provided by the summing block;

18. The system of claim 4 or 5, wherein the second signal processing path further comprises

a second summing block operatively connected downstream of the multi-channel fix beamforming block and configured to sum up the output signals thereof that are related to the more negative beams to provide a sum signal; and at least one of:

a second noise reduction block operatively connected downstream of the summing block and configured to remove noise from the sum signal provided by the summing block;

19. The system of claim 4 or 5, wherein the second signal processing path further comprises

a second summing block operatively connected downstream of the multi-channel fix beamforming block and configured to sum up the output signals of the most negative beam and at least one neighboring beam at each side thereof to provide a sum signal; and at least one of:

a second noise reduction block operatively connected downstream of the summing block and configured to remove noise from the sum signal provided by the summing block;

20. The system of claim 4 or 5, wherein the second signal processing path is operatively connected downstream of the beam steering block and further comprises at least one of: a second noise reduction block operatively connected downstream of the summing block and configured to remove noise from the sum signal provided by the summing block;

21. A sound capturing method comprising:

applying a far-field microphone functionality to a multiplicity of first microphone signals to provide a first output signal for speech processing; and

applying a less directional microphone functionality than the far-field microphone functionality to one or more second microphone signals to provide a second output signal for speech processing.

22. The method of claim21, further comprising multi-channel high-pass filtering of at least one of the multiplicity of first microphone signals and the one or more second microphone signals before at least one of applying the far-field microphone functionality and applying the less directional microphone functionality.

23. The method of claim 21 or 22, further comprising providing at least one of the multiplicity of first microphone signals and the multiplicity of second microphone signals with a microphone array, the microphone array comprising a multiplicity of microphones.

24. The method of any of claims 21 to 23, wherein applying a far-field microphone functionality comprises:

multi-channel acoustic echo canceling with a multiplicity of acoustic echo cancelers based on the filtered or unfiltered multiplicity of first microphone signals; multi-channel fix beamforming with a multiplicity of fix beamformers downstream of the multi-channel acoustic echo canceling;

beam steering downstream of the multi-channel fix beamforming to provide at least one fix-beam signal; and

adaptive beamforming downstream of the beam steering to provide a directional beam signal steered to a target position.

25. The method of claim 24, wherein applying a far-field microphone functionality further comprises at least one of:

first noise reduction downstream of the adaptive beamforming to remove noise from the beam signal provided by the adaptive beamforming;

first automatic gain control downstream of the adaptive beamforming to provide a first automatic gain control output signal with a controlled signal amplitude; and

first limiting downstream of the adaptive beamforming to provide a first limited output signal with a signal amplitude that is equal or below a predetermined value.

26. The method of claim 24 or 25, wherein the beam steering is further configured to provide a positive fix-beam signal and a negative fix-beam signal, the positive fix-beam signal representing a beam pointing in a direction in a room with currently the highest signal-to-noise ratio and negative fix-beam signal representing a beam pointing in a direction in a room with currently the lowest signal-to-noise ratio.

27. The method of claim 24 or 25, wherein the beam steering is further configured to provide a positive fix-beam signal and a negative fix-beam signal, the positive fix-beam signal representing a beam pointing in a direction in a room with currently the highest signal-to-noise ratio and negative fix-beam signal representing a beam pointing into an opposite direction.

28. The method of any of claims 21 to 27, wherein applying the less-directional microphone functionality comprises:

multi-channel delaying with a multiplicity of delays the filtered or unfiltered second microphone signals;

first summing downstream of the multi-channel delaying configured to sum up the delayed filtered or unfiltered multiplicity of second microphone signals to provide a sum signal; and

first single-channel acoustic echo canceling with an acoustic echo canceler based on the sum signal to provide the less directional signal.

29. The method of claim 28, wherein the multiplicity of delays comprises fractional delays.

30. The method of claim 28 or 29, the method further comprises delay calculation, wherein:

the beam steering is further configured to provide a delay steering signal;

the multi-channel delaying is further configured to provide a multiplicity of controllable delays; and

the delay calculation is configured to control the multiplicity of controllable delays based on the delay steering signal from the beam steering.

31. The method of any of claims 21 to 30, wherein applying the less-directional microphone functionality comprises: first multi-channel allpass filtering with a multiplicity of allpass filters of the filtered or unfiltered second microphone signals;

second summing operatively downstream of the multi-channel delaying to sum up the delayed filtered or unfiltered multiplicity of second microphone signals to provide a sum signal; and

second single-channel acoustic echo canceling with an acoustic echo canceler based on the sum signal to provide the less-directional signal.

32. The method of any of claims 24 to 27, wherein applying the less-directional microphone functionality comprises:

second multi-channel allpass filtering with a multiplicity of allpass filters downstream of the multi-channel acoustic echo canceling; and

second summing of the delayed filtered or unfiltered multiplicity of second microphone signals downstream of the multi-channel delaying to provide a sum signal.

33. The method of claim 31 or 32, wherein at least one of the first multi-channel allpass filtering and the second multi-channel allpass filtering comprises allpass filtering with randomly distributed cut-off frequencies that are arranged around a notch in the resulting magnitude frequency response.

34. The method of any of claims 28 to 32, wherein applying the less-directional microphone functionality further comprises at least one of:

second noise reduction downstream of the first or second summing to remove noise from the sum signal provided by the first or second summing;