Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition

Citation Author(s):
Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng
Submitted by:
Yuchen Hu
Last updated:
5 May 2022 - 3:17am
Document Type:
Poster
Event:
Presenters:
Yuchen Hu
Paper Code:
SPE-10.4
 

Speech enhancement (SE) aims to suppress additive noise in noisy speech signals to improve the perceptual quality and intelligibility of the speech. However, over-suppression in the enhanced speech can discard latent information and thereby degrade the performance of the downstream automatic speech recognition (ASR) task. To alleviate this problem, we propose an interactive feature fusion network (IFF-Net) for noise-robust speech recognition that learns complementary information from the enhanced feature and the original noisy feature. Experimental results show that the proposed method achieves an absolute word error rate (WER) reduction of 4.1% over the best baseline on the RATS Channel-A corpus. Our further analysis indicates that the proposed IFF-Net can restore some of the information missing from the over-suppressed enhanced feature.
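To make the "absolute WER reduction" metric concrete, the following is a minimal sketch of how word error rate is commonly computed: word-level edit distance divided by the number of reference words. It is not the authors' evaluation code. The function names `word_error_rate` and `absolute_wer_reduction` and the toy transcripts are hypothetical and chosen only for illustration; they are not results from the RATS Channel-A corpus.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (reference word missing)
                curr[j - 1] + 1,     # insertion (extra hypothesis word)
                prev[j - 1] + cost,  # substitution or exact match
            ))
        prev = curr
    return prev[-1] / len(ref)


def absolute_wer_reduction(baseline_wer: float, proposed_wer: float) -> float:
    """Difference in WER, in the same units as the inputs (not a ratio)."""
    return baseline_wer - proposed_wer


# Hypothetical toy transcripts, for illustration only.
reference = "turn the radio down now"
baseline_hypothesis = "turn radio down"      # two reference words dropped
proposed_hypothesis = "turn the radio down"  # one reference word dropped

baseline_wer = word_error_rate(reference, baseline_hypothesis)
proposed_wer = word_error_rate(reference, proposed_hypothesis)
reduction = absolute_wer_reduction(baseline_wer, proposed_wer)
```

An absolute reduction is a difference between two WER values, so a drop from a baseline WER of x% to (x - 4.1)% counts as 4.1% absolute regardless of x. A relative reduction would instead divide that difference by the baseline WER.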
