Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ADAPTIVE REVERBERATION CANCELLATION SYSTEM
Document Type and Number:
WIPO Patent Application WO/2017/063693
Kind Code:
A1
Abstract:
A signal processor for determining a plurality of drive signals for driving a plurality of loud- speakers to cancel a reverberation effect in a listening area, wherein the signal processor is configured to determine from one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero, determine a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients, estimate a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error, and update the plurality of drive signals based on the estimated transfer function, wherein the signal processor is configured to repeatedly carry out the above steps.

Inventors:
JIN WENYU (DE)
GROSCHE PETER (DE)
Application Number:
PCT/EP2015/073818
Publication Date:
April 20, 2017
Filing Date:
October 14, 2015
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
HUAWEI TECH CO LTD (CN)
JIN WENYU (DE)
GROSCHE PETER (DE)
International Classes:
H04S7/00; G10L21/0208; H04R3/02; H04R3/04
Domestic Patent References:
WO2015062658A12015-05-07
Other References:
DELCROIX M ET AL: "Dereverberation and Denoising Using Multichannel Linear Prediction", IEEE TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, IEEE SERVICE CENTER, NEW YORK, NY, USA, vol. 15, no. 6, 1 August 2007 (2007-08-01), pages 1791 - 1801, XP011187707, ISSN: 1558-7916, DOI: 10.1109/TASL.2007.899286
SPORS SASCHA ET AL: "Active listening room compensation for massive multichannel sound reproduction systems using wave-domain adaptive filtering", THE JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, AMERICAN INSTITUTE OF PHYSICS FOR THE ACOUSTICAL SOCIETY OF AMERICA, NEW YORK, NY, US, vol. 122, no. 1, 1 July 2007 (2007-07-01), pages 354 - 369, XP012102317, ISSN: 0001-4966, DOI: 10.1121/1.2737669
LARS-JOHAN BRANNMARK ET AL: "Improved loudspeaker-room equalization using multiple loudspeakers and MIMO feedforward control", 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2012) : KYOTO, JAPAN, 25 - 30 MARCH 2012 ; [PROCEEDINGS], IEEE, PISCATAWAY, NJ, 25 March 2012 (2012-03-25), pages 237 - 240, XP032227105, ISBN: 978-1-4673-0045-2, DOI: 10.1109/ICASSP.2012.6287861
Attorney, Agent or Firm:
KREUZ, Georg (DE)
Download PDF:
Claims:
CLAIMS 1. A signal processor (100) for determining a plurality of drive signals for driving a plurality of loudspeakers (230; 410; 510) to cancel a reverberation effect in a listening area (430, 432, 435), wherein the signal processor (100) is configured to:

determine (330; 604) from one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero, determine (340; 604) a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients, estimate (350; 606) a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error, and

update (360; 608) the plurality of drive signals based on the estimated transfer function,

wherein the signal processor is configured to repeatedly carry out the above steps. 2. The signal processor (100) of claim 1, wherein the signal processor is further configured to, when determining (330) the plurality of measured physical coefficients, minimize an error measure between the measured audio signals and a linear transformation of the measured physical coefficients, and minimize a number of non-zero entries of the plurality of measured physical coefficients. 3. The signal processor (100) of claim 2, wherein the signal processor is further configured to, when minimizing the error measure and minimizing the number of non-zero entries of the plurality of measured physical coefficients, determine a vector b of the plurality of measured physical coefficients according to:

wherein ||y||p is a p-norm of a vector y, Φ is a M x N sensing matrix comprising columns with the physical sound functions, N » M, v is an M x 1 observation vector which comprises the one or more measured audio signals corresponding to M locations within the listening area (430, 432, 435), wherein in particular signal processor is configured to randomly chose the M locations. 4. The signal processor (100) of one of the previous claims, wherein the basis of physical sound functions is orthogonal with regard to an inner product that for a first vector bi and a second vector bj is representable as:

wherein R is a reproduction region (435) of the plurality of loudspeakers (230; 410;

is a weighting function and otherwise.

5. The signal processor (100) of one of the previous claims, wherein the basis of physical sound functions comprises an orthonormal set of physical sound functions obtained from a modified Gram-Schmidt process on plane wave functions corresponding to a plurality of angles. 6. The signal processor (100) of one of the previous claims, wherein the transfer function assigns a zero-coupling between a first and a second coefficient of the basis of physical sound functions, in particular wherein the transfer function is representable as a diagonal matrix U(k). 7. The signal processor (100) of one of the previous claims, wherein the signal processor is further configured to, when estimating (360; 606) the transfer function, estimate the diagonal matrix U(k) using a Least Mean Squares filter and/or using a Recursive Least Squares filter. 8. The signal processor (100) of one of claims 6 and 7, wherein the signal processor is further configured to, when estimating the diagonal matrix U(k), compute an n-th element of the diagonal matrix U(k) according to

where is a gain factor, preferably defined as

a forgetting factor, is an n-th diagonal element of a τ-th iteration of the diagonal matrix, is an n-th element of the plurality of desired physical coefficients,

and is an n-th element of a τ-th iteration of the plurality of measured physical coefficients. 9. The signal processor (100) of one of the previous claims, wherein the signal processor is further configured to, when updating the drive signal, compute a drive signal update σ* such that an energy level of the drive signal update σ* is limited with an upper bound, wherein in particular the energy level of the drive signal update σ* is computed as a square value of the drive signal update σ* . 10. The signal processor (100) of claim 9, wherein the signal processor is further configured to, when updating the drive signal, compute the drive signal update σ* as

wherein represents a pre-determined sound field coefficient matrix of Green's

functions for the plurality of loudspeakers assuming a free-field propagation, / is an identity matrix, is an estimate of the diagonal matrix, and Nt is a predetermined

parameter, in particular wherein is a reflection coeffi

cient and Νω is a number of walls of the listening area (430, 432, 435). 11 . The signal processor (100) of one of the previous claims, wherein the signal processor is further configured to perform an initial step of preconditioning the drive signal update σ* to 0 and/or preconditioning the diagonal matrix U(k) to an identity matrix. 12. A sound device (200) for generating a plurality of drive signals for driving a plurality of loudspeakers (230; 410; 510) to cancel a reverberation effect in a listening area (430, 432, 435), the sound device comprising:

an output (210) for driving the plurality of loudspeakers with the plurality of drive signals,

an input (220) for receiving one or more measured audio signals, and a signal processor (100) according to one of the previous claims, configured to update the plurality of drive signals.

13. A method (300) for generating a plurality of drive signals for driving a plurality of loudspeakers (230; 410; 510) to cancel a reverberation effect in a listening area (430, 432, 435), the method comprising:

driving (310) the plurality of loudspeakers with an initial plurality of drive signals,

measuring (320) one or more audio signals at one or more measurement locations,

determining (330; 604) from the one or more measured audio signals a plurality of measured physical coefficients of in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero,

determining (340; 604) a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients, estimating (350; 606) a transfer function from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error, and

updating (360; 608) the initial plurality of drive signals based on the estimated transfer function,

wherein the above steps are carried out repeatedly.

14. The method (300) of claim 13, wherein minimizing the error measure and minimizing the number of non-zero entries of the plurality of measured physical coefficients comprises a step of determining a vector b of the plurality of measured physical coefficients according to:

wherein ||y||p is a p-norm of a vector y, Φ is a M x N sensing matrix comprising columns with the physical sound functions, N » M, v is an M x 1 observation vector which comprises the one or more measured audio signals corresponding to M locations within the listening area, wherein in particular signal processor is configured to randomly chose the M locations. 15. A computer-readable storage medium storing program code, the program code comprising instructions for carrying out the method of one of claims 13 and 14.

Description:
ADAPTIVE REVERBERATION CANCELLATION SYSTEM

TECHNICAL FIELD

The present invention relates to a signal processor, a sound device, and a method for generating a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area. The present invention also relates to a computer-readable storage medium.

BACKGROUND

Reproduction of a desired multi zone sound field over a region of interest has drawn the attention of researchers in recent years. However, the majority of existing works in this area do not take into account the reverberant environments that practical multi zone sound reproduction systems will encounter. The reverberation compensation process is difficult to handle due to the unknown reverberant room channel and the large number of loudspeakers and microphones required by existing sound field reproduction systems. Reverberation is the collection of reflected sounds from the surfaces in an enclosure. It is created when a sound or signal is reflected in an enclosed environment that leads to a large number of reflections and then gradually decay as the sound is absorbed by walls, scatterers and air. This is most noticeable when the sound source stops but the reflections continue to exist till they reach zero amplitude. The majority of the sound field reproduction techniques are de- signed with free-field assumption, but this is not the case in most real implementations.

Room reverberation poses a major challenge in sound field reproduction and the unwanted reverberation generally leads to poor sound field reproduction and localization confusion for the listeners. Therefore, reverberation cancelation techniques are indispensable for a reproduction system with real-world settings. The most natural approaches are the passive techniques. For example, the room can be equipped with acoustic absorption materials, so that a modest attenuation of sound reflection is provided. However, the related costs pose a major challenge for this method and it is difficult to realize in many real-world application scenarios (e.g., sound field reproduction in an office or home environment). More technically advanced passive approaches may use fixed or variable directivity higher order loudspeakers in order to minimize the sound radiation directing towards the walls of a room. However, it requires some specific sound reproduction apparatus, which is difficult to achieve in practice.

To equalize the room reverberation, the inverse of the room response is generally applied to loudspeaker driving signals. Techniques have been suggested that are based on mode matching to reproduce a single-zone sound field accurately over the entire control region in reverberant rooms. An approach of reproducing a multi zone sound field within a desired region using sparse methods was introduced. This allowed a reduced number of randomly placed measurements to sparsely estimate the room transfer functions from the loudspeakers over the desired region in the domain of plane wave decomposition. The estimates were then used to derive the optimal least-squares solution for the loudspeaker filter gains. For these approaches, a prior measurement of the room transfer function for all the employed loudspeak- ers was needed. This is time-consuming to implement in practice and its performance is vulnerable to any changes in the ambient environment conditions during the measurement process.

Wave Domain Adaptive Filtering (WDAF) is a more practical approach to the application of reverberation cancelation in sound field reproduction. It has been introduced to active listening room compensation in Wave Field Synthesis systems. The wave-domain representation of the sound field was described using transformations on the microphone array input and the loudspeaker output respectively. These techniques suffer from practical issues, e.g. a large number of microphones is required for the room channel estimation. Additionally, the adap- tive processes in these techniques are shown to diverge in some reverberant environments that feature low direct-to-reverberant-path power ratios. The iterative calculation of the pseudoinverse in each iteration is needed, which may lead to ill-conditioning problems and channel estimation errors. SUMMARY OF THE INVENTION

The objective of the present invention is to provide a signal processor, a sound device, a method for generating a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area, wherein the signal processor, the sound device, and the method for generating a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening areas overcome one or more of the above-mentioned problems of the prior art.

A first aspect of the invention provides a signal processor for determining a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area, wherein the signal processor is configured to:

determine from one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero,

determine a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients,

estimate a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error, and

- update the plurality of drive signals based on the estimated transfer function, wherein the signal processor is configured to carry out the above steps once, or two or more times, e.g. to repeatedly carry out the above steps.

The necessity of a large number of loudspeaker-microphone channels for existing sound rendering systems complicates the application of multi zone sound field reproduction in reverber- ant environments. The signal processor of the first aspect provides an adaptive reverberation cancelation for multi zone sound field reproduction using sparse methods. The use of sparse methods results in a significantly reduced number of required microphones for the estimation of the reproduced sound field. The signal processor also facilitates the system convergence over a wide frequency range in reverberant environments.

In embodiments of the invention, updating the plurality of drive signals comprises a step of computing an update filter, i.e., a set of update filter elements that reflect the reverberation cancellation. Preferably, the signal processor is configured to carry out the above-mentioned steps repeatedly until the residual error is sufficiently small, e.g. smaller than a predetermined threshold.

Mathematically speaking, the signal processor of the first aspect can be configured to find a sparse vector b such that Ob approximates the measured signal v, wherein Φ is a matrix with columns which comprise physical sound functions.

The signal processor of the first aspect can be used in a multi zone sound field reproduction system which comprises a circular array of Q loudspeakers and M microphones. The loud- speakers are placed outside the desired reproduction region and the microphones can be randomly placed within the selected zones of interest. The proposed system can be, for example, applied to teleconference systems and car audio systems, in which a circular or linear loudspeaker array is employed and the microphones are freely distributed around the listeners. The adaptive reverberation cancelation system aims to rectify the reverberation effects based on iterative feedback from sparse microphone measurements and to actively play back the input signals via the loudspeaker array with updated FIR gain filters.

Let l q (t) be the driving signal for the q-th loudspeaker and v m (t) be the recorded signal of the m-th microphone measurement. Taking the Fourier transform, the received measurements at the microphones can be expressed in matrix form as

where are the loudspeaker driving signals,

are the microphone measurements, and C(k) represents the channel between the (m, q)-th microphone-loudspeaker pair at the frequency k. Note that we can separate the channel effects C(k) into the direct and reverberant path

sent the direct and reverberant channels between the (m,q)-th microphone - loudspeaker pair.

In a preferred embodiment, an orthonormal set of basis functions {G n } is used, which describes any physically feasible sound field by implementing a modified Gram-Schmidt pro- cess on plane wave functions arriving from various angles. Therefore, we express the measurements in (1) as: where b n (k) are the coefficients for the reproduced sound field and x m represents the m-th microphone location. Note that N is set to be sufficiently large.

The plurality of measured physical coefficients can be seen as a sparse approximation, i.e., a sparse vector y that approximately solves an under-determined linear system of equations. The measurements in v are the products of rows of the sensing matrix Φ and the sparse signal y. To provide an accurate and stable estimate of y from the insufficient observation v, when y is sufficiently sparse, it is advantageous if the observation value is the linear projection of the sparse signal onto an incoherent basis. A proposed formulation is consistent with this require- ment that the random samplings of the sound pressure field in v are incoherent with the original basis of y.

In a first implementation of the signal processor according to the first aspect, the processor is further configured to, when determining the plurality of measured physical coefficients, mini- mize an error measure between the measured audio signals and a linear transformation of the measured physical coefficients, and minimize a number of non-zero entries of the plurality of measured physical coefficients.

The linear transformation can be a sensing matrix, i.e., it can comprise in its columns the basis function vectors of the basis of physical sound functions.

By simultaneously minimizing the error measure and minimizing the number of non-zero entries of the plurality of measured physical coefficients, it is ensured that the measurements are processed as accurately as possible, while still obtaining a sparse vector b of the plurality of measured physical coefficients, which can easily be processed.

In a second implementation of the signal processor according to the first aspect, the signal processor is further configured to, when minimizing the error measure and minimizing the number of non-zero entries of the plurality of measured physical coefficients, determining a vector b of the plurality of measured physical coefficients according to:

wherein ||y|| p is a p-norm of a vector y, Φ is a M x N sensing matrix comprising columns with the physical sound functions, N » M, v is an M x 1 observation vector which comprises the one or more measured audio signals corresponding to M locations within the listening area, wherein in particular the M locations are chosen randomly.

The sensing matrix Φ in an embodiment is an M x N sensing matrix whose columns preferably contain the values of the basis functions G n (x; k) at M microphone locations.

The signal processor may comprise an input for obtaining information on the M locations, i.e. the locations can be random, but known or approximately known to the signal processor.

This represents a particular efficient way of computing the plurality of measured physical coefficients.

In a third implementation of the signal processor according to the first aspect, the basis of physical sound functions is orthogonal with regard to an inner product that for a first vector bi and a second vector bj is representable as:

wherein R is a reproduction region of the plurality of loudspeakers, w(x) is a weighting function and is 1 for i=j and 0 otherwise.

In other words, the basis of physical sound functions can be chosen to be orthogonal with regard to an inner product that is defined as an integral over the reproduction region, e.g. an area between the plurality of loudspeakers.

In a fourth implementation of the signal processor according to the first aspect, the basis of physical sound functions comprises an orthonormal set of physical sound functions obtained from a modified Gram-Schmidt process on plane wave functions corresponding to a plurality of angles.

This has the advantage that the basis of physical sound functions can be used to describe any feasible sound field and match the desired sound f field in a weighted least-square sense.

In a fifth implementation of the signal processor according to the first aspect the transfer function assigns a zero-coupling between a first and a second coefficient of the basis of physical sound functions, in particular wherein the transfer function is representable as a diagonal matrix U(k).

Assuming a zero-coupling of the transfer function between different coefficients of the basis of physical sound functions has the advantage that the computation is simplified. In particular, a diagonal representation of the transfer function as a diagonal matrix U(k) leads to a significant simplification of the computation.

In a sixth implementation of the signal processor according to the first aspect, the signal pro- cessor is further configured to, when estimating the transfer function, estimating the diagonal matrix U(k) using a Least Mean Squares filter and/or using a Recursive Least Squares filter.

These represent efficient ways of computing the diagonal matrix. In a seventh implementation of the signal processor according to the first aspect, the signal processor is further configured to, when estimating the diagonal matrix U(k), computing an n- th element of the diagonal matrix U(k) according to

where is a gain factor, preferably defined a a for.

getting factor, is an n-th diagonal element of a τ-th iteration of the diagonal matrix,

is an n-th element of the plurality of desired physical coefficients, and is an n-

th element of a τ-th iteration of the plurality of measured physical coefficients. This represents a particularly efficient way of iteratively computing the diagonal matrix U(k).

In an eighth implementation of the signal processor according to the first aspect, the signal processor is further configured to, when updating the drive signal, computing a drive signal update σ * such that an energy level of the drive signal update σ * is limited with an upper bound, wherein in particular the energy level of the drive signal update σ * is computed as a square value of σ * . Limiting an energy level of the drive signal update has the advantage that the process of updating the drive signal towards the desired optimal drive signal proceeds in small steps. Thus, undesired sound effects during the updating of the drive signal are avoided. In a ninth implementation of the signal processor according to the first aspect the signal processor is further configured to, when updating the drive signal, computing the drive signal up

wherein represents a pre-determined sound field coefficient matrix of Green's func

tions for the plurality of loudspeakers assuming a free-field propagation, / is an identity matrix, is an estimate of the diagonal matrix, and is a predetermined parameter, in par

ticula is a reflection coefficient and is a number of

walls of the listening area.

This represents an efficient way of implementing the updates of the drive signal. In particular, the above-defined iterative process makes use of the diagonal structure of the matrix U(k) and limits an energy level of the update of the drive signal.

In a tenth implementation of the signal processor according to the first aspect, the signal processor is further configured to perform an initial step of preconditioning the drive signal update σ * to 0 and/or preconditioning the diagonal matrix U(k) to an identity matrix. The initial preconditioning steps have the advantage that the plurality of drive signals are initialized with a sensible starting point and the method implementation by the signal processor can thus converge faster towards the desired optimal solution.

In embodiments of the invention, the signal processor is configured to determine the drive signal update by determining an update filter. In this case, the update filter can be preconditioned to 0, i.e., the update filter is preconditioned as a zero update.

A second aspect of the invention refers to a sound device for generating a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area, the sound device comprising: an output for driving the plurality of loudspeakers with the plurality of drive signals, an input for receiving one or more measured audio signals, and

a signal processor according to the first aspect or one of its implementations, wherein the signal processor is configured to update the plurality of drive signals.

A third aspect of the invention refers to a method for generating a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area, the method comprising:

driving the plurality of loudspeakers with an initial plurality of drive signals,

- measuring one or more audio signals at one or more measurement locations,

determining from the one or more measured audio signals a plurality of measured physical coefficients of in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero,

determining a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients,

estimating a transfer function from the plurality of measured physical coefficients and the plurality of desired physical coefficients, based on the determined residual error, and

updating the initial plurality of drive signals based on the estimated transfer function, wherein the above steps are carried out once, or two or more times, e.g. repeatedly.

The methods according to the third aspect of the invention can be performed by the signal processor according to the first aspect of the invention. Further features or implementations of the method according to the third aspect of the invention can perform the functionality of the signal processor according to the first aspect of the invention and its different implementation forms. In a first implementation of the method of the third aspect, minimizing the error measure and minimizing the number of non-zero entries of the plurality of measured physical coefficients comprises a step of determining a vector b of the plurality of measured physical coefficients according to: wherein ||y|| p is a p-norm of a vector y, Φ is a M x N sensing matrix comprising columns with the physical sound functions, N » M, v is an M x 1 observation vector which comprises the one or more measured audio signals corresponding to M locations within the listening area, wherein in particular signal processor is configured to randomly chose the M locations.

A fourth aspect of the invention refers to a computer-readable storage medium storing program code, the program code comprising instructions for carrying out the method of the third aspect or one of its implementations.

BRIEF DESCRIPTION OF THE DRAWINGS

To illustrate the technical features of embodiments of the present invention more clearly, the accompanying drawings provided for describing the embodiments are introduced briefly in the following. The accompanying drawings in the following description are merely some embodiments of the present invention, but modifications on these embodiments are possible without departing from the scope of the present invention as defined in the claims.

FIG. 1 shows a signal processor in accordance with an embodiment of the present invention,

FIG. 2 shows a sound device in accordance with a further embodiment of the present invention,

FIG. 3 shows a flowchart of a method for reverberation cancellation in accordance with a further embodiment of the present invention,

FIG. 4 shows a structure of a multi zone sound field reproduction system in accordance with a further embodiment of the present invention,

FIG. 5 shows an overview of the operation of the adaptive reverberation cancelation system in accordance with a further embodiment of the present invention, and

FIG. 6 shows a simplified flow chart of a method for reverberation cancellation in accordance with a further embodiment of the present invention. Detailed Description of the Embodiments

FIG. 1 shows a signal processor 100 for determining a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area.

The signal processor 100 comprises a coefficient unit 110 which is configured to determine from one or more measured audio signals a plurality of measured physical coefficients in a basis of physical sound functions, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero. The basis of physical sound functions can be fixed or there can be several bases of physical sound functions, wherein a specific basis can be chosen, e.g. by setting a basis selection parameter.

The signal processor 100 further comprises a residual error unit 120 which is configured to determine a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients. The signal processor 100 further comprises a transfer unit 130, which is configured to estimate a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients, based on the determined residual error. The signal processor 100 further comprises an update unit 140 which is configured to update the plurality of drive signals based on the estimated transfer function. The update unit 140 can be configured to generate an initial update as zero, i.e., to initially generate a drive signal that corresponds to an input signal. The input signal can be provided to the signal processor 100 from an external unit or the input signal can be determined in the signal processor 100.

The signal processor 100 is configured to control its units such that they repeatedly compute updates to the plurality of drive signals. The coefficient unit 110, residual error unit 120, transfer unit 130 and the update unit 140 can be realized in the same physical hardware, for example they can be realized as different parts of a programming of the signal processor 100. FIG. 2 shows a sound device 200 for generating a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area. The sound device 200 comprises an output 210 for driving the plurality of loudspeakers with the plurality of drive signals 212, an input 220 for receiving one or more measured audio signals, and a signal processor 230, e.g. the signal processor of FIG. 1, configured to update the plurality of drive signals.

FIG. 3 shows a flow chart of a method 300 for generating a plurality of drive signals for driving a plurality of loudspeakers to cancel a reverberation effect in a listening area. The method comprises a first step of driving 310 the plurality of loudspeakers with an initial plurality of drive signals.

The method comprises a second step of measuring 320 one or more audio signals at one or more measurement locations. For example, the one or more audio signals can be measured using microphones that are placed at random locations in the listening area. The method can comprise a further step of determining positions of the randomly placed microphones, such that measured audio signals can be correlated with positions of the corresponding microphones.

In a third step 330 from the one or more measured audio signals a plurality of measured phys- ical coefficients in a basis of physical sound functions is determined, such that a sum of the physical sound functions, weighted with the plurality of measured physical coefficients approximates the one or more measured audio signals, wherein at least half of the plurality of measured physical coefficients are zero. In particular, at least ¾ or preferably at least 90% of the plurality of measured physical coefficients can be required to be zero.

In a fourth step 340 a residual error between the plurality of measured physical coefficients and a plurality of desired physical coefficients is determined. In a fifth step 350 a transfer function describing a transformation from the plurality of desired physical coefficients to the plurality of measured physical coefficients is determined based on the determined residual error. In a sixth step 360, an updated version of the initial plurality of drive signals is determined based on the estimated transfer function. The updated version of the initial plurality of drive signal is output to a plurality of loudspeakers, and the method can continue in step 320.

In a further step (not shown in FIG. 3), it can be determined whether the residual error is smaller than a predetermined threshold error. If it is smaller, the updated drive signal can be output and no further iterations of the method are performed. If the residual error is larger than the predetermined threshold, execution of the method continues with the first step, wherein the plurality of loudspeakers is now driven with the updated plurality of drive signals instead of the initial plurality of drive signals.

FIG. 4 shows a structure of a multi zone sound field reproduction system 400 in accordance with a further embodiment of the present invention. The multi zone sound field reproduction system 400 comprises an adaptive room reverberation cancelation system 420, an array of loudspeakers 410, a first microphone array 440 that is located in a first listening zone 430 and a second microphone array 442 that is located in a second listening zone 432. The array of loudspeakers defines a listening area 435 that comprises the first and second listening zone 430, 432.

The adaptive room reverberation cancelation system 420 comprises a sound device, e.g. the sound device of FIG. 2, with an input, output and a signal processor. The input is configured to receive audio signals 441 from the first and second microphone array 440, 442. The output is configured to drive the array of loudspeakers 410 with drive signals 421.

FIG. 5 shows an overview of the operation of a multi zone sound field reproduction system 500 in accordance with a further embodiment of the present invention. The multi zone sound field reproduction system 500 comprises an adaptive reverberation cancelation system 520 and a loudspeaker array 510 that is located in a reverberant room 512. The multi zone sound field reproduction system 500 further comprises a summing unit 522. In FIG. 5, the summing unit 522 is shown as a unit that is external to the adaptive reverberation cancelation system 520. However, in other embodiments, the summing unit 522 could be part of the adaptive reverberation cancelation system.

In a τ-th iteration, the adaptive reverberation cancelation system 520 generates an updated drive signal which drives the plurality of loudspeakers 510. The walls of the re

verberant room 512 reflect the generated sound waves.

Microphones 540 measure a plurality of audio signals 541 in the reproduction region and from these measured audio signals a plurality of measured physical coefficients is de-

termined. A difference between the measured physical coefficients and a plurality of desired physical coefficients is formed in the summing unit 522 and fed back to the adaptive reverberation cancelation system 520. Based on the difference, which represents a residual error 523, the adaptive reverberation cancelation system updates the drive signal, which begins a next iteration of the iterative reverberation cancellation process.

FIG. 6 shows a flowchart of the adaptive reverberation method in accordance with a further embodiment of the present invention.

In a first step 602, the loudspeaker drive signals are preconditioned to i.e., the initial up-

date is 0.

In a second step 604, a plurality of measured physical coefficients is determined in a basis of physical sound functions, such that a sum of the physical sound functions of the basis, wherein the sum is weighted with the plurality of measured physical coefficients, approxi- mates the one or more measured audio signals.

Based on a difference between the plurality of measured physical coefficients and a plurality of desired physical coefficients, a new residual error is determined. In a third step 606, diagonal entries of a diagonal matrix are determined using RLS

adaptive filtering methods.

In a fourth step 608, the array of loudspeakers is driven with the updated plurality of drive signals. If the residual error is sufficiently small, the method can output the sum of a predefined driving signal (e.g. an input signal times a predefined filter in the frequency domain) and the

update signal In embodiments of the invention, the update signal can be deter- mined based on an update filter, e.g. by applying the update filter to the predefined driving signal.

In further step 610, an Inverse Fourier Transform is applied to the updated plurality of drive signals and in further step 612, the Fourier-transformed signals 611 are plaid

back with the plurality of speakers. The method then continues in step 604, with an incremented iteration index τ.

In the following, it is described in more detail how a sparse approximation method can be used to calculate from the randomly-placed measurements within the selected

zones of interest.

A basic principle of the method is to assume that the reproduced sound field results

from only a small number of basis Helmholtz solutions. Based on this assumption, we consider the following lp norm (where nonconvex optimization problem

where y is the basis function coefficient set, the dictionary Φ is an M x N sensing matrix (N » M) whose columns contain the values of G n (x; k) at M locations and v is an M x 1 observation vector which contains the values of the actual reproduced sound field S(x; k) at M randomly chosen locations within the desired region. The error is related to the he additive com- plex Gaussian noise level. Let y be a sparse signal, i.e., y has a limited number of non-zero entries at unknown locations. Therefore, we can apply the regularized Iteratively Reweighted Least Squares (IRLS) algorithm to solve equation (3) and derive the optimal estimator y that characterizes the reproduced sound field in reverberant environments:

where y has only non-zero components and can be used as an estimate of the basis function coefficients Overall, we formulate the calculation of the sound field coefficients b n (k) based on the sound field measurements in (1) in the following matrix form

where is a transformation matrix (N x M) expressing the rela-

tionship of b(k) and v(k), which can be seen as the projection from the sparse measurements onto the subspace spanned by the orthonormal set

The desired multi zone sound field S d (x; k) and the actual reproduced sound field in a reverberant room S(x; k) can be characterized that represents the respective coef-

ficient sets of the orthonormal basis function Note that the coefficients for can

be derived offline.

Consider the reverberant room channel as a transformation between the reproduced sound field and the desired sound field, which can be further expressed by a linear transformation of the basis function coefficients:

where represents the reverberant room effects at the wave-

number k. Note that we parameterize U(k) with a diagonal structure following the assumption that the couplings between the sound field coefficients with different indices can be neglected in the defined basis function domain.

The room channel transformation U can be estimated in an iterative fashion. We define

as the measured sound field coefficients at the microphones after updating the loud¬

speaker signals. An accurate estimate of the room channel transformation can be

achieved if the squared norm of the residual error j s minimized, which also

leads to an accurate matching between the actual reproduced sound field and the desired multi zone sound field over the desired reproduction region. This can be treated as an adaptive filtering problem and U(k) can be estimated actively by using algorithms such as Least Mean Squares (LMS) filter and Recursive Least Squares (RLS) filter.

Due to the diagonal structure of U(k), calculating the unknown diagonal entries U n (k) can be further simplified as a single-tap adaptive filtering problem. Let be the estimate of

U(k) at the τ th adaption step, we have:

where 1S the gain factor is the forgetting factor. We

choose the RLS algorithm as it provides a fast convergence rate. Therefore, equation (7) can be applied to obtain an iterative estimate of the diagonal elements based on the residual

error at the τ th adaption step.

The optimal filter updating signal on the loudspeaker array can be derived based on the active estimate of the room channel transformation. It is designed to minimize the residual error and ensure the estimation convergence. We precondition the initial loudspeaker array signals to reproduce the desired multi zone sound field under free-field assumption. Therefore, the coefficients for the desired sound field can be expressed by replacing with the direct

channel in equation (5): represent the pre-determined sound field coefficient matrix of the Green' functions for all loudspeakers assuming free-field propagation. Incorporating the room chan- nel model in 6) and the estimator we have

Following (9), the measured sound field coefficients after adding updating signals

ίο the loudspeakers can be given by

We can write the difference between the measured and desired sound field coefficients using

(8) and (10):

where I is an identity matrix.

An efficient reverberation compensation and accurate sound field reproduction

achieved by finding the optimal loudspeaker filter updating signals CT W that minimize Therefore, a multi-constraint convex optimization is formulated with the ob

jective of minimizing the error between the measured and desired sound field coefficients, while also guaranteeing the convergence:

can be calculated offline. The value is adjustable and it depends how reverberant

the room environment is. It can be set to be less or equal to where is me reflection coefficients and N w is the number of walls. Note that the additional constraints on the energy of each of the loudspeaker filter updating signals are applied so that the reverberation effects of are insignificant and can be consistently mitigate the adaptive process,

thereby avoiding the active calculation of pseudo-inverse of the reverberation channel matrix. These formulations guarantee the system convergence and lead to less computational complexity and faster convergence than prior art.

To summarize, in embodiments of the invention, the reproduced sound field is described as a weighted series of orthonormal basis functions over the desired reproduction region, which is then used to adaptively equalize the desired multi zone sound field in terms of the basis function coefficients. An adaptive reverberation cancelation system for multi zone sound field re- production using sparse microphone measurements is proposed. The proposed approach expresses the sound field as a space-frequency orthonormal basis function expansion the desired reproduction region. We consider the reproduced sound field as a linear transformation of the desired sound field. We then introduce the adaptive channel estimation process using sparse methods to identify these transformations directly in the orthogonal basis function domain and derive the required loudspeaker updating signals that compensate the room reverberation and guarantee the convergence of the adaptive estimation in reverberant environments.

Advantages of embodiments of the invention include: - The presented signal processor, sound device and method do not require a prior measurement of the transfer functions of the employed loudspeaker. They can adapt to the alteration of ambient environment condition during the measurement process. The presented signal processor, sound device and method provide an accurate reproduction of the desired sound field under the same hardware provision and environment settings by employing the sparse methods, i.e. the same performance can be achieved using a smaller number of microphone measurements.

The presented signal processor, sound device and method show a better convergence behavior to a good reproduction performance, especially in the reverberant rooms that feature low direct-to-reverberant-path power ratios. This is achieved by formulating a novel multi-constraint convex optimization and avoiding the active calculation of pseudo-inverse of the reverberation channel matrix, which guarantee the system convergence.

The adaptive reverberation cancelation system rectifies the unwanted reverberation effects based on iterative feedbacks from a small number of microphone measurements, so that the listeners can still enjoy an accurate sound field reproduction even in extreme complex environments (e.g. car chamber).

Less computational complexity and faster convergence. Applications of embodiments of the invention include any sound reproduction system or surround sound system using multiple loudspeakers.

In particular, embodiments of the presented invention can be applied to

TV speaker systems,

- car entertaining systems,

teleconference systems, and/or

home cinema system,

where personal listening environments for one or multiple listeners is desirable. The foregoing descriptions are only implementation manners of the present invention, the protection of the scope of the present invention is not limited to this. Any variations or replacements can be easily made by a person skilled in the art. Therefore, the protection scope of the present invention should be subject to the protection scope of the attached claims.