
Title:
MITIGATING ACOUSTIC FEEDBACK IN HEARING AIDS WITH FREQUENCY WARPING BY ALL-PASS NETWORKS
Document Type and Number:
WIPO Patent Application WO/2021/055513
Kind Code:
A1
Abstract:
A method and a system or device, such as a hearing aid, are provided for processing audio signals. In accordance with the method, an audio signal is received and divided into a plurality of frequency sub-bands. For each of the frequency sub-band signals, the signal is further divided into overlapping temporal frames. Each of the temporal frames is windowed. Frequency warping is performed on each of the windowed frames. Overlap-and-add is performed on the frequency warped frames. The frequency warped sub-bands are combined into a full band to provide a frequency warped signal.

Inventors:
GARUDADRI HARINATH (US)
LEE CHING-HUA (US)
CHEN KUAN-LIN (US)
HARRIS FRED (US)
RAO BHASKAR (US)
Application Number:
PCT/US2020/051124
Publication Date:
March 25, 2021
Filing Date:
September 16, 2020
Assignee:
UNIV CALIFORNIA (US)
International Classes:
H04R25/00; H04B15/00
Foreign References:
US20100177917A1 2010-07-15
US20090147966A1 2009-06-11
US20130170660A1 2013-07-04
Other References:
KATES ET AL.: "Principles of Digital Dynamic-Range Compression", TRENDS AMPLIF, vol. 9, no. 2, 2005, pages 45 - 76, XP055167747, Retrieved from the Internet [retrieved on 20201125], DOI: 10.1177/108471380500900202
PARFIENIUK ET AL.: "Near-Perfect Reconstruction oversampled Nonuniform Cosine-Modulated Filter Banks Based on Frequency Warping and Subband Merging.", INTL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, vol. 58, no. 2, 2012, pages 177 - 192, XP055808017, Retrieved from the Internet [retrieved on 20201125]
Attorney, Agent or Firm:
MAYER, Stuart H. et al. (US)
Claims:

1. A method for processing audio signals, comprising: receiving an audio signal; dividing the audio signal into a plurality of frequency sub-bands; for each of the frequency sub-band signals, further dividing the signal into overlapping temporal frames; windowing each temporal frame; performing frequency warping on each of the windowed temporal frames; and performing overlap-and-add on the frequency warped temporal frames; combining frequency warped sub-bands into a full band to provide a frequency warped signal.

2. The method of claim 1, wherein the frequency warping is performed using a chain of all-pass filters.

3. The method of claim 1, wherein performing frequency warping includes performing frequency warping with a different warping parameter in at least two of the sub-bands.

4. The method of claim 1, wherein performing frequency warping includes performing frequency warping with negative values of a warping parameter in each of the sub-bands.

5. The method of claim 1, wherein performing frequency warping includes performing frequency warping in at least one of the sub-bands with a warping parameter that is a function of gain provided in the sub-band.

6. The method of claim 1, wherein performing frequency warping includes performing frequency warping in each of the sub-bands with a warping parameter that is a function of gain provided in the respective sub-band.

7. The method of claim 1, further comprising processing the audio signals to mitigate acoustic feedback in a system having a microphone and a speaker, wherein the frequency warped signal is provided in an acoustic feedback path of the system.

8. The method of claim 7, further comprising performing adaptive feedback cancellation (AFC) in the acoustic feedback path using the frequency warped signal.

9. The method of claim 8, wherein the AFC is a least mean square AFC process.

10. The method of claim 1, further comprising performing dynamic range compression on the audio signal prior to performing frequency warping.

11. A method for decomposing and recombining an audio signal used in an acoustic application, comprising: receiving an audio signal generated; dividing the audio signal into a plurality of frequency sub-bands; for each of the frequency sub-bands, further dividing the sub-bands into multiple, overlapping temporal frames and windowing each of the overlapping temporal frames to provide windowed segments; directing each of the windowed frames for each of the sub-bands through a different all-pass filter with a corresponding warping parameter to provide frequency warped windowed frames; for each of the frequency sub-bands, aligning different ones of the frequency warped windowed frames and performing overlap-and-add on the aligned frequency warped windowed frames to obtain frequency warped sub-bands; and combining the frequency warped sub-bands into a full band.

12. The method of claim 11, wherein the acoustic application is mitigation of acoustic feedback in an audio system.

13. The method of claim 11, wherein the acoustic application is acoustic echo cancellation.

14. The method of claim 11, wherein the acoustic application is a hearing aid processing application.

15. A hearing aid device, comprising: a microphone configured to receive an audible input signal from an environment and convert the audible input signal to an electrical audio input signal; a multi-band hearing aid processing circuit configured for processing the electrical audio input signal; a multi-band frequency warping circuit configured to receive an electrical audio signal from the multi-band hearing aid processing circuit, the multi-band frequency warping circuit being configured to: divide the electrical audio signal into a plurality of frequency sub-bands; for each of the frequency sub-band signals, further divide the electrical audio signal into overlapping frames; window each frame; perform frequency warping on each of the windowed frames; and perform overlap-and-add on the frequency warped frames; combine frequency warped sub-bands into a full band to provide a frequency warped signal to the speaker; a speaker configured to receive the frequency warped signal from the multi-band frequency warping circuit and emit an audible output signal into an ear of a user; and an adaptive feedback cancellation circuit located in an acoustic feedback path between an output of the microphone and an input to the speaker, the adaptive feedback cancellation circuit being configured to receive as inputs a portion of the electrical audio input signal from the microphone and the electrical audio signal from the multi-band frequency warping circuit and provide an output as an input to the multi-band hearing aid processing circuit.

16. The hearing aid device of claim 15, wherein the multi-band frequency warping circuit includes a chain of all-pass filters for performing the frequency warping.

17. The hearing aid device of claim 15, wherein the multi-band frequency warping circuit performs frequency warping with a different warping parameter in at least two of the sub-bands.

18. The hearing aid device of claim 15, wherein the multi-band frequency warping circuit performs frequency warping with negative values of a warping parameter in each of the sub-bands.

18. The hearing aid device of claim 15, wherein the multi-band frequency warping circuit performs frequency warping in at least one of the sub-bands with a warping parameter that is a function of gain provided in the sub-band.

19. The hearing aid device of claim 15, wherein the multi-band frequency warping circuit performs frequency warping in each of the sub-bands with a warping parameter that is a function of gain provided in the respective sub-band.

20. The hearing aid device of claim 15, wherein the multi-band frequency warping circuit performs frequency warping in at least one of the sub-bands with a warping parameter that is a function of one or more hearing aid parameters.

Description:
MITIGATING ACOUSTIC FEEDBACK IN HEARING AIDS WITH FREQUENCY WARPING BY ALL-PASS NETWORKS

Government Funding

[0001] This invention was made with government support under DC015436 awarded by the National Institutes of Health. The government has certain rights in the invention.

Cross Reference to Related Applications

[0002] This application claims the benefit of U.S. Provisional Application No. 62/901,013, filed September 16, 2019 and U.S. Provisional Application No. 62/901,287, filed September 17, 2019, the contents of which are incorporated herein by reference.

Background

[0003] Improving acoustic feedback reduction in hearing aids (HAs), such as those that employ behind the ear, receiver in the canal (BTE-RIC) transducers, is an ongoing area of research. An example of such an HA may be found in L. Pisha, S. Hamilton, D. Sengupta, C.-H. Lee, K. C. Vastare, T. Zubatiy, S. Luna, C. Yalcin, A. Grant, R. Gupta, G. Chockalingam, B. D. Rao, and H. Garudadri, “A wearable platform for research in augmented hearing,” in Proc. Asilomar Conf. Signals, Syst., Comput. (ACSSC), 2018, pp. 223-227.

[0004] In order to compensate for mild to moderate hearing loss, commercial HAs and the Open Speech Platform (OSP) provide an average gain of 35-38 dB. In the emerging form factors for advanced HAs and hearables, including conventional BTE-RICs, there is significant acoustic coupling between the microphones and loudspeakers (called receivers in the telephony and HA communities). This acoustic coupling varies significantly with the surroundings (e.g., hats, scarves, hands, and walls that come in close proximity to the transducers) and can cause the system to become unstable when the audio content includes characteristic frequencies of the system. This instability results in brief “howling” artifacts, which can be a considerable annoyance to HA users.

[0005] Howling artifacts manifest when multiple factors combine to fulfill the magnitude and phase conditions of the Nyquist stability criterion (NSC). Adaptive feedback cancellation (AFC) has been the workhorse for breaking the NSC to avoid instabilities in many audio applications, including HAs. Typically, AFC deploys least mean square (LMS) based approaches to mitigate the magnitude condition of the NSC. On the other hand, frequency shifting (FS) and other ad hoc methods mainly deal with the phase condition.

Summary

[0006] In one aspect, a system and method are provided for processing audio signals. In accordance with the method, an audio signal is received and divided into a plurality of frequency sub-bands. For each of the frequency sub-band signals, the signal is further divided into overlapping temporal frames. Each of the temporal frames is windowed. Frequency warping is performed on each of the windowed frames. Overlap-and-add is performed on the frequency warped frames. The frequency warped sub-bands are combined into a full band to provide a frequency warped signal.

[0007] In one particular embodiment, all-pass filters may be employed to perform the frequency warping. Frequency warping helps in breaking the Nyquist stability criterion and can be used to improve adaptive feedback cancellation (AFC). In more detail, traditional AFC methods rely on breaking the Nyquist stability criterion in the amplitude domain, often using least mean square (LMS) approaches. Existing methods for breaking the Nyquist stability criterion in the phase domain include frequency shifting (FS), phase modulation, time-varying all-pass filters to introduce phase shifts, and linear predictive coding vocoders.

[0008] Frequency warping helps break the Nyquist stability criterion in both the amplitude and phase domains. A combination of LMS based AFC and frequency warping can provide additional stable gains, without resulting in howling side effects due to feedback.

[0009] In one particular embodiment, frequency warping is performed after performing dynamic range compression and before AFC in the hearing aid signal processing chain. In another embodiment, frequency warping is performed after noise cancellation and before dynamic range compression in the hearing aid signal processing chain.

[0010] Based on informal subjective assessments, distortions due to frequency warping are fairly benign. While common objective metrics like the perceptual evaluation of speech quality (PESQ) and the hearing-aid speech quality index (HASQI) may not adequately capture distortions due to frequency warping and acoustic feedback artifacts from a perceptual perspective, they are still instructive in assessing the proposed method.

[0011] Quality improvements with frequency warping have been demonstrated for a basic AFC (PESQ: 2.6 to 3.5 and HASQI: 0.65 to 0.78) at a gain setting of 20; and an advanced AFC (PESQ: 2.8 to 3.2 and HASQI: 0.66 to 0.73) for a gain of 30. From investigations, frequency warping provides larger improvements for basic AFC, but still improves overall system performance for many AFC approaches.

[0012] This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used as an aid in determining the scope of the claimed subject matter. Furthermore, the claimed subject matter is not limited to implementations that solve any or all disadvantages noted in any part of this disclosure.

Brief Description of the Drawings

[0013] FIG. 1 shows one example of an all-pass network that may be employed for frequency warping.

[0014] FIG. 2 shows one example of a real-time frequency warping arrangement that employs an all-pass network.

[0015] FIG. 3 shows one example of a multichannel real-time frequency warping arrangement that uses band-pass filters (BPFs).

[0016] FIG. 4 shows a functional block diagram of one example of an adaptive feedback cancellation (AFC) arrangement placed in parallel with a hearing aid (HA).

[0017] FIG. 5 is a graph showing the average perceptual evaluation of speech quality (PESQ) of the HA as a function of the warping parameter α.

[0018] FIG. 6A is a graph showing the average PESQ of the HA output as a function of the warping parameter α for AFC using LMS; and FIG. 6B is a graph showing the average PESQ of the HA output as a function of the warping parameter α for AFC using SLMS.

[0019] FIG. 7A shows a spectrogram of a feedback-compensated signal with no freping for LMS with an HA gain at 20 and an HASQI score of 0.81; FIG. 7B shows a spectrogram of a feedback-compensated signal with freping enabled (α = -0.02) for LMS with an HA gain at 20 and an HASQI score of 0.84; FIG. 7C shows a spectrogram of a feedback-compensated signal with no freping for SLMS with an HA gain at 30 and an HASQI score of 0.79; and FIG. 7D shows a spectrogram of a feedback-compensated signal with freping enabled (α = -0.02) for SLMS with an HA gain at 20 and an HASQI score of 0.82.

[0020] FIGs. 8A and 8B show the HASQI score of the feedback-compensated signal for AFC using LMS (FIG. 8A) and SLMS (FIG. 8B) at different HA gains.

[0021] FIG. 9 shows a block diagram of one example of a signal processing device 100 that may employ the techniques described herein.

Detailed Description

[0022] The discrete representation of continuous signals and systems is described in A. V. Oppenheim and D. H. Johnson, “Discrete representation of signals,” Proc. IEEE, vol. 60, no. 6, pp. 681-691, 1972 (referred to hereinafter as “Oppenheim and Johnson”), which is incorporated herein by reference in its entirety. That reference also includes detailed recipes to “transform the frequency axis in a nonlinear manner.” This frequency warping is accomplished using an all-pass network.

[0023] The techniques shown in Oppenheim and Johnson are employed here for hearing aids (HAs) and are referred to as “freping,” a portmanteau of frequency warping. A common type of hearing loss is sloping hearing loss, where the impaired user has a limited ability to perceive high-frequency content. Typically, the intervention is to boost the high-frequency components or to move the content to lower frequencies. The former introduces challenges for acoustic feedback control, while the latter facilitates better feedback reduction. Another type of hearing loss, less common but more challenging for providing meaningful interventions, is “cookie bite” hearing loss, in which the impaired person has more difficulty perceiving mid-frequency content than low- and high-frequency content. As demonstrated below, freping can provide an additional tool to the audiologist for managing individual hearing loss profiles. In particular, freping is shown to help break the Nyquist stability criterion (NSC) in conjunction with LMS based AFC approaches.

All-Pass Networks

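[0024] The all-pass networks described in Oppenheim and Johnson realize a nonlinear mapping of the frequency axis that is controlled by a single warping parameter α. Let $\omega = 2\pi(f/f_s)$ be the normalized angular frequency, where $f$ is the original frequency and $f_s$ is the sampling rate. The mapping $\theta(\cdot)$ is:

$$\tilde{\omega} = \theta(\omega) = \omega + 2\arctan\!\left(\frac{\alpha\sin\omega}{1-\alpha\cos\omega}\right) \qquad (1)$$

where $\tilde{\omega} = 2\pi(\tilde{f}/f_s)$ and $\tilde{f}$ is the warped frequency.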
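As a quick numerical illustration of the mapping, the sketch below converts an original frequency to its warped frequency for a given α. It assumes the first-order all-pass phase relation θ(ω) = ω + 2·arctan(α·sin ω / (1 − α·cos ω)). The function name `warp_frequency` is illustrative, and the example values (a 16 kHz sampling rate and α = -0.02) are taken from the evaluation settings and the FIG. 7 description elsewhere in this document.

```python
import math


def warp_frequency(f_hz, alpha, fs_hz):
    """Return the warped frequency in Hz for an original frequency f_hz.

    Assumes theta(w) = w + 2*atan(alpha*sin(w) / (1 - alpha*cos(w))),
    the phase mapping of a first-order all-pass section with parameter alpha.
    """
    omega = 2.0 * math.pi * f_hz / fs_hz
    theta = omega + 2.0 * math.atan2(alpha * math.sin(omega),
                                     1.0 - alpha * math.cos(omega))
    return theta * fs_hz / (2.0 * math.pi)


if __name__ == "__main__":
    for f in (4000.0, 6000.0):
        print(f"{f:.0f} Hz -> {warp_frequency(f, -0.02, 16000.0):.0f} Hz")
    # prints "4000 Hz -> 3898 Hz" and "6000 Hz -> 5927 Hz"
```

A negative α moves content to lower frequencies and a positive α moves it higher; α = 0 leaves the frequency axis unchanged.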

[0025] It can be shown that the nonlinear frequency mapping (1) between the original signal $v(n)$ and the frequency-warped signal $q(k)$ can be achieved by passing the time-reversed signal $v(-n)$ through a linear time-invariant system $H_k(z)$ given as:

$$H_k(z) = \begin{cases} \dfrac{1}{1-\alpha z^{-1}}, & k = 0 \\[2ex] \dfrac{(1-\alpha^2)\,z^{-1}}{(1-\alpha z^{-1})^2}\left(\dfrac{z^{-1}-\alpha}{1-\alpha z^{-1}}\right)^{k-1}, & k \ge 1 \end{cases} \qquad (2)$$

and taking the output of $H_k(z)$ at $n=0$ as $q(k)$. It can thus be implemented as the network shown in FIG. 1. As shown, the first two stages act as (i) low-pass filters when α is positive and the network warps frequencies higher and (ii) high-pass filters when α is negative and the network warps frequencies lower. The remaining stages realize the actual frequency warping based on the bilinear transformation. Note that when α = 0, the network simply passes the input through without any spectral modification.

[0026] The frequency-warped output is obtained by sampling $q_k(n)$, the output signal at the $k$-th stage of the cascade, at $n=0$, i.e., $q(k) = q_k(0)$. In other words, the input sequence is first flipped and then passed through the network; the last sample of the output sequence at the k-th stage is taken as the k-th sample of the final frequency-warped sequence.

[0027] It is worth noting that in practice we need to truncate the signal for the all-pass network to be realizable. Therefore, the warping performance will depend on other factors such as the length and the type of the window function used.
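The cascade described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the disclosed implementation: it assumes the stage transfer functions 1/(1 − αz⁻¹) for the first stage, (1 − α²)z⁻¹/(1 − αz⁻¹) for the second, and the all-pass section (z⁻¹ − α)/(1 − αz⁻¹) for every later stage. The names `first_order` and `allpass_warp` and the choice to truncate the output to `n_out` samples are hypothetical.

```python
def first_order(b0, b1, a1, x):
    """Filter x with H(z) = (b0 + b1 z^-1) / (1 + a1 z^-1), zero initial state."""
    y, x_prev, y_prev = [], 0.0, 0.0
    for xn in x:
        yn = b0 * xn + b1 * x_prev - a1 * y_prev
        y.append(yn)
        x_prev, y_prev = xn, yn
    return y


def allpass_warp(frame, alpha, n_out=None):
    """Frequency-warp one finite frame with a chain of first-order sections.

    The frame is time-reversed, so its last element sits at n = 0. After each
    stage, the sample at n = 0 (the last one) becomes one output sample q(k).
    """
    n_out = len(frame) if n_out is None else n_out
    y = list(reversed(frame))
    q = []
    for k in range(n_out):
        if k == 0:
            y = first_order(1.0, 0.0, -alpha, y)               # 1 / (1 - a z^-1)
        elif k == 1:
            y = first_order(0.0, 1.0 - alpha ** 2, -alpha, y)  # (1 - a^2) z^-1 / (1 - a z^-1)
        else:
            y = first_order(-alpha, 1.0, -alpha, y)            # (z^-1 - a) / (1 - a z^-1)
        q.append(y[-1])
    return q


if __name__ == "__main__":
    print(allpass_warp([1.0, 2.0, 3.0, 4.0], 0.0))  # [1.0, 2.0, 3.0, 4.0]
```

With α = 0 every stage reduces to a pure delay and the frame is returned unchanged. For nonzero α the warped sequence is in general longer than the input, so the output length is a design choice that interacts with the window length and type.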

Freping: Real-Time Frequency Warping

[0028] The all-pass networks described above are adopted for real-time frequency manipulation as illustrated in FIG. 2. As shown, the input signal is first divided into overlapping frames and windowed using a suitable window function. Each windowed segment then goes through the all-pass network to perform frequency warping with a specified warping parameter α. Finally, the overlap-and-add method (described, for instance, in J. B. Allen, “Short term spectral analysis, synthesis, and modification by discrete Fourier transform,” IEEE Trans. Acoust., Speech, Signal Process., vol. 25, no. 3, pp. 235-238, 1977, which is hereby incorporated by reference in its entirety) is applied to produce the frequency-warped signal.
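The frame, window, warp, and overlap-and-add steps can be sketched as follows. This is an illustrative single-channel sketch, not the disclosed implementation. It assumes a periodic Hann window with 50% overlap and 128-sample frames (the settings reported in the evaluation), truncates each warped frame to the frame length, and uses the same assumed all-pass stage structure as above. The names `allpass_warp` and `freping` are hypothetical.

```python
import math


def allpass_warp(frame, alpha):
    """Warp one frame with the all-pass chain; the output keeps the frame length."""
    y = list(reversed(frame))              # time-reversed input v(-n)
    a2 = 1.0 - alpha * alpha
    out = []
    for k in range(len(frame)):
        # Numerator (b0 + b1 z^-1) of stage k; every stage has its pole at z = alpha.
        b0, b1 = (1.0, 0.0) if k == 0 else (0.0, a2) if k == 1 else (-alpha, 1.0)
        x_prev = y_prev = 0.0
        for n, xn in enumerate(y):
            y_prev = b0 * xn + b1 * x_prev + alpha * y_prev
            y[n], x_prev = y_prev, xn
        out.append(y[-1])                  # sample at n = 0 gives q(k)
    return out


def freping(signal, alpha, frame_len=128):
    """Frame, window, warp, and overlap-add a signal (single channel)."""
    hop = frame_len // 2                   # 50% overlap
    window = [0.5 - 0.5 * math.cos(2.0 * math.pi * n / frame_len)
              for n in range(frame_len)]   # periodic Hann, sums to 1 at 50% overlap
    out = [0.0] * (len(signal) + frame_len)
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        for n, v in enumerate(allpass_warp(frame, alpha)):
            out[start + n] += v            # overlap-and-add
    return out[:len(signal)]
```

With α = 0 the overlapping Hann windows sum to one, so the interior of the signal is reconstructed exactly; this is a convenient sanity check before trying a nonzero α such as -0.02.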

[0029] To allow a more flexible way of manipulating spectral characteristics, multichannel freping as illustrated in FIG. 3 may be employed. The system utilizes a set of band-pass filters (BPFs), which divide the input signal into $M$ frequency bands, and a set of warping parameters $\boldsymbol{\alpha} = [\alpha_1, \ldots, \alpha_M]^T$. Each band goes through an independent all-pass network with the corresponding warping parameter. The output signals of all the frequency bands are summed to produce the frequency-warped signal. In many practical situations, it is convenient to reuse the multichannel compression in HA processing for freping. For specific types of hearing loss (e.g., sloping, cookie-bite, etc.), increasing the gain in higher frequency bands helps fulfill the magnitude condition of the NSC, while freping hinders the phase condition from being met. Thus, freping provides a way of simultaneously optimizing the parameters of multichannel compression and frequency lowering in HAs for individual hearing loss. In some embodiments, discussed below, we limit ourselves to negative values of α so that freping always shifts spectral content lower.

[0030] In another embodiment, the values of $\boldsymbol{\alpha} = [\alpha_1, \ldots, \alpha_M]^T$ can depend on the values of gain and/or other hearing aid parameters in the respective band. For example, the value of $\alpha_i$ can be made a function of the gain in the $i$-th band.

Freping for Acoustic Feedback Reduction

[0031] The benefits of freping for mitigating acoustic feedback along with LMS based AFC are demonstrated below, with the motivation of improving feedback control in hearing aids such as the behind the ear, receiver in the canal (BTE-RIC) systems referenced above.

[0032] In some embodiments, the AFC framework used in C.-H. Lee, B. D. Rao, and H. Garudadri, “Sparsity promoting LMS for adaptive feedback cancellation,” in Proc. Europ. Signal Process. Conf. (EUSIPCO), 2017, pp. 226-230, which is depicted in FIG. 4 and is incorporated herein by reference in its entirety, may be employed. Of course, more generally, a wide variety of different AFC arrangements may be employed. As shown, the AFC filter $W(z,n)$, placed in parallel with the HA processing $G(z,n)$, is the transfer function of an $L$-tap adaptive filter $\mathbf{w}(n) = [w_0(n), w_1(n), \ldots, w_{L-1}(n)]^T$ that continuously adjusts its coefficients to capture the time-varying nature of the acoustic feedback path $F(z,n)$. $d(n)$ is the microphone input, which contains the clean signal $x(n)$ and the feedback signal $y(n)$ caused by the HA output $o(n)$ passing through the feedback path. $\hat{y}(n)$ is the feedback estimate, and $e(n) = d(n) - \hat{y}(n)$ is the feedback-compensated signal. $A(z,n)$ is a time-varying pre-filter that decorrelates the input and output signals based on the prediction error method (PEM) shown in A. Spriet, S. Doclo, M. Moonen, and J. Wouters, “Feedback control in hearing aids,” Springer Handbook of Speech Process., pp. 979-1000, 2008. $B(z)$ is a band-limited filter that concentrates on the frequency region where oscillation is more likely to occur.

[0033] Typically, LMS-type algorithms are carried out for coefficient adaptation using the pre-filtered signals $u_f(n)$ and $e_f(n)$ to update the AFC filter $\mathbf{w}(n)$ as:

$$\mathbf{w}(n+1) = \mathbf{w}(n) + \frac{\mu}{L\hat{\sigma}^2(n) + \delta}\,\mathbf{u}_f(n)\,e_f(n) \qquad (3)$$

where $\mathbf{u}_f(n) = [u_f(n), u_f(n-1), \ldots, u_f(n-L+1)]^T$, $\mu$ is the step size parameter, $\delta > 0$ is a small constant to prevent division by zero, and $\hat{\sigma}^2(n) = \rho\,\hat{\sigma}^2(n-1) + (1-\rho)\left(u_f^2(n) + e_f^2(n)\right)$ is the power estimate with a forgetting factor $0 < \rho < 1$. The update rule (3) is the “modified” LMS using the sum method described in J. E. Greenberg, “Modified LMS algorithms for speech processing with an adaptive noise canceller,” IEEE Trans. Speech Audio Process., vol. 6, no. 4, pp. 338-351, 1998, and has been widely used in AFC work.
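One coefficient update of this kind can be sketched as below. The sketch assumes the normalized form w(n+1) = w(n) + μ / (L·σ̂²(n) + δ) · u_f(n)·e_f(n), with the recursive power estimate σ̂²(n) = ρ·σ̂²(n−1) + (1 − ρ)(u_f²(n) + e_f²(n)). The function name `afc_lms_step` and the convention that `u_f` holds the most recent L pre-filtered output samples, newest first, are illustrative assumptions. The default values of `mu`, `rho`, and `delta` are the ones reported in the evaluation.

```python
def afc_lms_step(w, u_f, e_f, sigma2, mu=0.005, rho=0.985, delta=1e-6):
    """Perform one sum-method LMS update of the AFC filter.

    w      : current filter coefficients (length L)
    u_f    : last L pre-filtered output samples, newest first
    e_f    : current pre-filtered error sample
    sigma2 : previous power estimate
    Returns the updated coefficients and the updated power estimate.
    """
    L = len(w)
    # Recursive estimate of the combined input and error power.
    sigma2 = rho * sigma2 + (1.0 - rho) * (u_f[0] ** 2 + e_f ** 2)
    # Normalized step; delta guards against division by zero.
    step = mu / (L * sigma2 + delta)
    w_new = [wi + step * ui * e_f for wi, ui in zip(w, u_f)]
    return w_new, sigma2
```

In an open-loop setting with a white input, repeated calls drive `w` toward the impulse response of the path being identified; in a closed-loop HA the pre-filter and any added decorrelation determine how biased that estimate is.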

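[0034] An advanced AFC algorithm based on the LMS is the sparsity promoting LMS (SLMS) proposed in C.-H. Lee, B. D. Rao, and H. Garudadri, “Sparsity promoting LMS for adaptive feedback cancellation,” in Proc. Europ. Signal Process. Conf. (EUSIPCO), 2017, pp. 226-230, which leverages the sparsity of the feedback path impulse response to achieve faster convergence. The SLMS update rule includes an additional sparsity promoting term $\mathbf{S}(n)$:

$$\mathbf{w}(n+1) = \mathbf{w}(n) + \frac{\mu}{L\hat{\sigma}^2(n) + \delta}\,\mathbf{S}(n)\,\mathbf{u}_f(n)\,e_f(n) \qquad (4)$$

where $\mathbf{S}(n) = \mathrm{diag}\{s_0(n), \ldots, s_{L-1}(n)\}$ is an $L$-by-$L$ diagonal matrix whose diagonal elements are updated according to

$$s_l(n) = \left[\frac{1}{L}\sum_{i=0}^{L-1}\left(|w_i(n)| + c\right)^{2-p}\right]^{-1}\left(|w_l(n)| + c\right)^{2-p} \qquad (5)$$

where $p \in (0, 2]$ is the sparsity control parameter and $c > 0$ is a small positive constant to avoid stagnation of the algorithm.

[0035] Without any feedback control mechanism, the HA processing, with frequency response $G(e^{j\omega}, n)$, and the feedback path, with frequency response $F(e^{j\omega}, n)$, form a closed-loop system that can become unstable, which leads to howling. The NSC states that the closed-loop system becomes unstable whenever the following magnitude and phase conditions are both fulfilled:

$$\left|G(e^{j\omega}, n)\,F(e^{j\omega}, n)\right| \ge 1, \qquad \angle\, G(e^{j\omega}, n)\,F(e^{j\omega}, n) = 2\pi k,\ k \in \mathbb{Z} \qquad (6)$$

When AFC is employed, the conditions become:

$$\left|G(e^{j\omega}, n)\left[F(e^{j\omega}, n) - \hat{F}(e^{j\omega}, n)\right]\right| \ge 1, \qquad \angle\, G(e^{j\omega}, n)\left[F(e^{j\omega}, n) - \hat{F}(e^{j\omega}, n)\right] = 2\pi k,\ k \in \mathbb{Z} \qquad (7)$$

where $\hat{F}(e^{j\omega}, n)$ is the estimated feedback path frequency response. The AFC aims at minimizing $|F(e^{j\omega}, n) - \hat{F}(e^{j\omega}, n)|$ to mitigate the magnitude condition.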
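To make the two conditions concrete, the sketch below scans a frequency grid for points where the loop response g·z^(−Δ)·(F − F̂) has magnitude of at least one while its phase crosses a multiple of 2π. It is a coarse illustration under stated assumptions: the HA processing is modeled as a linear gain with a pure delay, the feedback path and its estimate are short FIR responses of equal length, and the phase condition is detected from a sign change of the imaginary part on the grid. The names `freq_response` and `nsc_violated` and the example path are hypothetical.

```python
import cmath
import math


def freq_response(h, omega):
    """Frequency response of an FIR impulse response h at angular frequency omega."""
    return sum(hk * cmath.exp(-1j * omega * k) for k, hk in enumerate(h))


def nsc_violated(gain, delay, f_path, f_est=None, n_grid=2048):
    """Return True if both NSC conditions are met somewhere on the grid.

    gain   : linear HA gain g (not in dB)
    delay  : HA processing delay in samples
    f_path : feedback path impulse response
    f_est  : AFC estimate of the feedback path (same length), or None for no AFC
    """
    f_est = [0.0] * len(f_path) if f_est is None else f_est
    residual = [a - b for a, b in zip(f_path, f_est)]
    loop = []
    for i in range(n_grid + 1):
        omega = math.pi * i / n_grid
        loop.append(gain * cmath.exp(-1j * omega * delay)
                    * freq_response(residual, omega))
    for a, b in zip(loop, loop[1:]):
        # Phase passes through 0 (mod 2*pi): positive real part, imaginary part changes sign.
        phase_crosses_zero = a.real > 0 and b.real > 0 and a.imag * b.imag <= 0
        if phase_crosses_zero and max(abs(a), abs(b)) >= 1.0:
            return True
    return False


if __name__ == "__main__":
    path = [0.0, 0.0, 0.05]                          # hypothetical feedback path
    print(nsc_violated(10.0, 8, path))               # False: loop gain 0.5
    print(nsc_violated(30.0, 8, path))               # True: loop gain 1.5
    print(nsc_violated(30.0, 8, path, f_est=path))   # False: perfect estimate
```

The third call shows why a good feedback estimate raises the usable gain: the residual F − F̂ shrinks, so the magnitude condition is no longer met.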

[0036] It is well known that the LMS-type algorithms widely used in AFC suffer from biased estimation due to signal correlation. Consequently, the feedback path estimate can be erroneous if decorrelation is not carefully considered. Although the PEM-based pre-filter provides a certain amount of decorrelation, further improvement is achievable by inserting additional signal processing into the forward path of the HA, usually placed at the location denoted * in FIG. 4. Existing methods include frequency shifting (FS), phase modulation, time-varying all-pass filters to introduce phase shifts, and linear predictive coding vocoders, to name just a few. In general, these decorrelation methods may introduce quality degradation, and thus there is a trade-off between sound quality and decorrelation ability for AFC improvement.

[0037] Freping may play a role in decorrelation similar to that of FS. Freping introduces nonlinear frequency shifts, and the distortions appear to be perceptually benign based on informal subjective assessments. As instability is most likely to occur in the high-frequency region, it is reasonable to manipulate the high-frequency content while keeping the low-frequency region intact to avoid degradation in quality. By providing additional decorrelation, freping can reduce the AFC bias so that a better feedback path estimate can be obtained, which addresses the magnitude condition of the NSC. On the other hand, freping also helps keep the microphone and receiver signals from remaining continuously in phase with each other. This prevents the phase condition of the NSC from holding at the same frequency at two consecutive instants. Consequently, the input and output sounds cannot build up in amplitude as effectively, and the likelihood of instability is reduced.

[0038] Note that the approach shown in C. Boukis, D. P. Mandic, and A. G. Constantinides, “Toward bias minimization in acoustic feedback cancellation systems,” J. Acoust. Soc. Am., vol. 121, no. 3, pp. 1529-1537, 2007, also utilizes all-pass filters to achieve decorrelation, with time-varying poles used to introduce phase shifts. This is different from freping, which manipulates the spectral magnitude as well. Since freping has similarities to FS, we compare them in the following section.

Evaluation

[0039] We evaluate the freping system described herein using computer simulations in MATLAB at a sampling rate of 16 kHz. We implemented a 6-band system using a set of BPFs with non-uniform bandwidths whose center frequencies are 250, 500, 1000, 2000, 4000, and 6000 Hz, respectively. Frames of 128 samples with 50% overlap were utilized. The Hann function was applied for windowing. 25 male and 25 female speech signals from the TIMIT database were used for simulations.

[0040] In this evaluation we directly performed freping on the speech signal and measured the frequency distortion at the output using the MATLAB implementation of the (wide-band) perceptual evaluation of speech quality (PESQ). The PESQ score gives a good prediction of the mean opinion score and has been suggested for quantifying spectral distortion brought by FS. FIG. 5 shows the average PESQ score of the freping output over the 50 speech files as a function of the warping parameter α, for the cases of operating on the full band ($\boldsymbol{\alpha} = \alpha[1, 1, 1, 1, 1, 1]^T$) and on the last two (high) frequency bands ($\boldsymbol{\alpha} = \alpha[0, 0, 0, 0, 1, 1]^T$). We can see that quality degradation is minor in the latter case.

[0041] Now we consider the practical scenario of an HA as in FIG. 4. We examine freping with $\boldsymbol{\alpha} = \alpha[0, 0, 0, 0, 1, 1]^T$ on top of the LMS and the SLMS. The experimental setup was as follows. The HA processing was $G(z, n) = g z^{-\Delta}$, where $g$ is the HA gain and $\Delta$ is the sample delay, chosen to give a total HA latency under 10 msec (from $d(n)$ to $o(n)$). The feedback path impulse response was measured using a BTE-RIC device with open fitting on a dummy head with a handset placed on the ear, the most challenging scenario for breaking the NSC. For the AFC, we used $L = 100$, $\mu = 0.005$, $\rho = 0.985$, and $\delta = 10^{-6}$ for both LMS and SLMS. For the SLMS we used $p = 1.5$ and $c = 10^{-6}$ as suggested in C.-H. Lee, B. D. Rao, and H. Garudadri, “Sparsity promoting LMS for adaptive feedback cancellation,” in Proc. Europ. Signal Process. Conf. (EUSIPCO), 2017, pp. 226-230. In all simulations, the AFC filter coefficients were initialized as all zeros.

[0042] FIGs. 6A-6D show the average PESQ score of the HA output over the 50 speech files for several values of the warping parameter α. From the results we can see that as α increases in magnitude from 0, acoustic feedback is better controlled, resulting in improved quality. However, further increasing α in magnitude leads to higher spectral distortion, and thus the quality drops. This indicates a trade-off between the reduction of feedback artifacts and frequency distortion, which is more evident at a more aggressive gain setting.

[0043] We now focus on quantifying the improvement brought by freping in reducing feedback artifacts. In the remaining evaluation, α = -0.02 was used, as suggested by the results in FIGs. 6A-6D. Also from FIG. 5, this choice of α corresponds to an average PESQ of 4.55, which indicates good quality. Furthermore, based on informal subjective assessments, distortions introduced with α in the vicinity of -0.02 are fairly benign. According to equation (1), for α = -0.02 the center frequencies of the fifth and sixth frequency bands move from 4000 and 6000 Hz to 3898 and 5927 Hz, respectively.

[0044] We compare the performance with an existing FS method based on the analytic representation of the signal using the Hilbert transform. The amount of shift was set to 12 Hz and applied only to the frequency region above 1.5 kHz. When performed directly on the speech signal, this arrangement gives an average PESQ score of 4.47 for the FS output over the 50 speech files, which is comparable to but slightly lower than that of the freping result.

[0045] For evaluation, we compare the feedback-compensated signal e(n) with the clean signal x(n), using the hearing-aid speech quality index (HASQI), which has been adopted in prior AFC work. The HASQI score ranges from 0 to 1, where a higher value indicates better quality. FIGs. 7A-7D present examples of spectrograms of the feedback-compensated signal for several cases. We can see that freping effectively reduces the howling components, resulting in improved quality.

[0046] FIGs. 8A-8B demonstrate the advantages of using freping by showing the average HASQI score over the 50 speech files for various gain settings. From the results we see that both the basic (LMS) and advanced (SLMS) AFC algorithms can benefit from freping. This indicates the ability of the proposed frequency warping method to further improve feedback reduction on top of many AFC approaches. Moreover, compared to FS, freping demonstrates better performance under all the gain settings.

[0047] Finally, we compare the added stable gain (ASG), which is the additional gain that the feedback control mechanism allows while the HA still operates in a stable state, for the cases of AFC, AFC with FS, and AFC with freping. We used the ASG estimation approach proposed in C.-H. Lee, J. M. Kates, B. D. Rao, and H. Garudadri, “Speech quality and stable gain trade-offs in adaptive feedback cancellation for hearing aids,” J. Acoust. Soc. Am., vol. 142, no. 4, pp. EL388-EL394, 2017, where a HASQI below 0.8 was considered of unacceptable quality. The results are shown in Table 1, obtained from the average of 5 male and 5 female speech files. We can see that freping can improve the ASG on top of both the basic and advanced AFC algorithms. Compared to FS, a higher ASG can be achieved by using freping.

Table 1: ASG (in dB) comparison.

[0048] In summary, all-pass networks are employed for frequency warping, which is referred to herein as “freping.” We described a real-time realization of multichannel freping for use in HAs and its use for breaking the NSC in acoustic feedback control. Experimental results demonstrate quality improvements with freping for basic and advanced AFC approaches. For a desired quality lower bound (e.g., HASQI = 0.8), we found ASG improvements of 2.5 and 1.4 dB for LMS and SLMS with freping, respectively.

Example Signal Processing Device

[0049] FIG. 9 shows a block diagram of one example of a signal processing device 100 that may employ the techniques described herein. In one particular example, illustrated in the figure, the signal processing device 100 is a hearing aid, although more generally it may be a signal processing device that is employed in a wide variety of different applications. Signal processing device 100 may comprise at least one input transducer 105 and an output transducer 110. The input transducer 105 may be configured to convert an input 101 to an input signal 102. In one embodiment the input transducer 105 may be a microphone that converts an audible input signal to an electrical audio input signal and the output transducer 110 may be a speaker that converts an electrical audio signal to an audible output signal.

[0050] The input to the input transducer 105 may include the audible input signal 101 and feedback 195. The feedback 195 may comprise at least a modified or unmodified portion of an output 111 (desired output 111' is also shown) from the output transducer 110. The output 111 may propagate wirelessly through a feedback path 190. Propagation of the output 111 through the feedback path 190 may cause modification (e.g. attenuation, interference, and/or phase shifting) of at least a portion of the output 111.

[0051] The electrical audio input signal 102 from the input transducer 105 is directed to a signal processing circuit, which in the case of a hearing aid is a multi-band hearing aid processing circuit 140. The multi-band hearing aid processing circuit 140 may be configured to at least amplify at least a portion of the electrical audio input signal 102. The output signals 112 from the multi-band hearing aid processing circuit 140 are directed to a multi-band frequency warping circuit 150 such as shown in FIG. 3. The frequency warped signal 115 output from the multi-band frequency warping circuit 150 is directed as input to the output transducer 110. An AFC circuit 170 receives as inputs a portion of the electrical audio input signal 102 from the input transducer 105 and a portion 175 of the frequency warped signal 115 from the multi-band frequency warping circuit 150. The AFC circuit 170 generates an output signal 180 that is provided to the input of the multi-band hearing aid processing circuit 140.
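The signal routing described above can be sketched as a sample-by-sample loop. This is a simplified illustration under stated assumptions, not the disclosed implementation: the hearing aid processing circuit 140 is reduced to a fixed gain with a processing delay, the AFC circuit 170 is modeled as a normalized LMS filter whose output is subtracted from the microphone signal, and the frequency warping circuit 150 is represented by a caller-supplied function `warp` that defaults to a pass-through, since the multi-band warping itself operates on frames. All names and parameter values are hypothetical.

```python
def simulate_loop(x, feedback_path, gain=4.0, delay=8, taps=8, mu=0.005,
                  warp=lambda s: s):
    """Closed-loop sketch: mic -> AFC subtraction -> gain -> warp -> output.

    x             : clean input samples (audible input 101)
    feedback_path : taps applied to y(n-1), y(n-2), ... (feedback path 190)
    warp          : placeholder for the warping stage (identity by default)
    Returns the feedback-compensated signal e(n) and the final AFC weights.
    """
    hist_len = max(len(feedback_path), taps)
    y_hist = [0.0] * hist_len        # past outputs, most recent first
    e_delay = [0.0] * delay          # models the processing delay
    w = [0.0] * taps                 # AFC estimate of the feedback path
    e_out = []
    for xn in x:
        # Microphone picks up the input plus feedback from past outputs.
        mic = xn + sum(h * y for h, y in zip(feedback_path, y_hist))
        # AFC subtracts its estimate of the feedback component.
        e = mic - sum(wi * y for wi, y in zip(w, y_hist))
        # Normalized LMS update of the AFC weights.
        power = sum(y * y for y in y_hist[:taps]) + 1e-8
        w = [wi + mu * e * y / power for wi, y in zip(w, y_hist)]
        # Forward path: delayed gain stage followed by the warping stage.
        e_delay.append(e)
        y = warp(gain * e_delay.pop(0))
        y_hist = [y] + y_hist[:-1]
        e_out.append(e)
    return e_out, w
```

Setting `mu=0.0` disables adaptation, which gives a reference run without feedback cancellation. A frame-based warping stage would need its own buffering and latency, which this sketch does not model.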

[0052] While the techniques described herein have been illustrated for use in a hearing aid processing application, more generally the techniques described herein may be employed in a wide variety of different applications including, without limitation, acoustic echo cancellation, active noise cancellation and acoustic feedback in various audio systems.

Illustrative Computing Environment

[0053] Aspects of the subject matter described herein may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, program modules include routines, programs, objects, components, data structures, and so forth, which perform particular tasks or implement particular abstract data types. Aspects of the subject matter described herein may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote computer storage media including memory storage devices.

[0054] Also, it is noted that some embodiments have been described as a process which is depicted as a flow diagram or block diagram. Although each may describe the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be rearranged. A process may have additional steps not included in the figure.

[0055] The claimed subject matter may be implemented as a method, apparatus, or article of manufacture using standard programming and/or engineering techniques to produce software, firmware, hardware, or any combination thereof to control a computer to implement the disclosed subject matter. For instance, the claimed subject matter may be implemented as a computer-readable storage medium embedded with a computer executable program, which encompasses a computer program accessible from any computer-readable storage device or storage media. For example, computer readable storage media can include but are not limited to magnetic storage devices (e.g., hard disk, floppy disk, magnetic strips . . . ), optical disks (e.g., compact disk (CD), digital versatile disk (DVD) . . . ), smart cards, and flash memory devices (e.g., card, stick, key drive . . . ). However, computer readable storage media do not include transitory forms of storage such as propagating signals, for example. Of course, those skilled in the art will recognize many modifications may be made to this configuration without departing from the scope or spirit of the claimed subject matter.

[0056] Moreover, as used in this application, the terms "component," "module," “engine,” "system," “apparatus,” "interface," or the like are generally intended to refer to a computer-related entity, either hardware, a combination of hardware and software, software, or software in execution. For example, a component or module may be, but is not limited to being, a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer. By way of illustration, both an application running on a controller and the controller can be a component. One or more components may reside within a process and/or thread of execution and a component may be localized on one computer and/or distributed between two or more computers.

[0057] The foregoing described embodiments depict different components contained within, or connected with, different other components. It is to be understood that such depicted architectures are merely exemplary, and that in fact many other architectures can be implemented which achieve the same functionality. In a conceptual sense, any arrangement of components to achieve the same functionality is effectively "associated" such that the desired functionality is achieved. Hence, any two components herein combined to achieve a particular functionality can be seen as "associated with" each other such that the desired functionality is achieved, irrespective of architectures or intermediary components. Likewise, any two components so associated can also be viewed as being "operably connected", or "operably coupled", to each other to achieve the desired functionality.

[0058] While various embodiments have been described above, it should be understood that they have been presented by way of example, and not limitation. It will be apparent to persons skilled in the relevant art(s) that various changes in form and detail can be made therein without departing from the spirit and scope. In fact, after reading the above description, it will be apparent to one skilled in the relevant art(s) how to implement alternative embodiments. Thus, the present embodiments should not be limited by any of the above described exemplary embodiments.




 