Sorry, you need to enable JavaScript to visit this website.

OBJECTIVE COMPARISON OF SPEECH ENHANCEMENT ALGORITHMS WITH HEARING LOSS SIMULATION

Citation Author(s):
Zhuohuang Zhang, Yi Shen, Donald S. Williamson
Submitted by:
Zhuohuang Zhang
Last updated:
7 May 2019 - 1:03pm
Document Type:
Poster
Document Year:
2019
Event:
Presenters:
Zhuohuang Zhang
Paper Code:
3662
 

Many speech enhancement algorithms have been proposed over the years and it has been shown that deep neural networks can lead to significant improvements. These algorithms, however, have not been validated for hearing-impaired listeners. Additionally, these algorithms are often evaluated under a limited range of signal-to-noise ratios (SNR). Here, we construct a diverse speech dataset with a broad range of SNRs and noises. Several enhancement algorithms are compared under both normal-hearing and simulated hearing-impaired conditions, where the perceptual evaluation of speech quality (PESQ) and hearing-aid speech quality index (HASQI) are used as objective metrics. The impact of the data’s frequency scale (Mel versus linear) on performance is also evaluated. Results show that a long short-term memory (LSTM) network with data in the Mel-frequency domain yields the best performance for PESQ, and a Bidirectional LSTM network with data in the linear frequency scale performs the best in hearing-impaired settings. The Mel-frequency scale results in improved PESQ scores, but reduced HASQI scores.
https://ieeexplore.ieee.org/document/8683040

up
0 users have voted: