
This paper addresses the task of Automatic Speech Recognition (ASR) with music in the background, where the accuracy of recognition may deteriorate significantly.
To improve the robustness of ASR in this task, e.g., for broadcast news transcription or subtitle creation, we adopt two approaches:
1) multi-condition training of the acoustic models and 2) denoising autoencoders followed by acoustic model training on the preprocessed data.
In the latter case, two types of autoencoders are considered: a fully connected network and a convolutional network.
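As a rough illustration of the denoising-autoencoder approach, the sketch below runs a single forward pass of a tiny fully connected autoencoder over one simulated feature frame. All dimensions, the additive "music" corruption, and the random weights are assumptions for illustration, not the paper's actual configuration (in practice the network would be trained to minimise the reconstruction error, and the ASR acoustic model would then be trained on the denoised features):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical dimensions: 40-dim log-mel features, 16-dim bottleneck.
n_feat, n_hidden = 40, 16

# Randomly initialised fully connected encoder/decoder weights
# (a trained autoencoder would learn these from noisy/clean pairs).
W_enc = rng.standard_normal((n_feat, n_hidden)) * 0.1
W_dec = rng.standard_normal((n_hidden, n_feat)) * 0.1

def denoise(x_noisy):
    """One forward pass: encode the noisy frame, decode a clean estimate."""
    h = np.tanh(x_noisy @ W_enc)   # fully connected encoder
    return h @ W_dec               # linear decoder

# Simulate a clean speech frame corrupted by additive background "music".
x_clean = rng.standard_normal(n_feat)
x_noisy = x_clean + 0.3 * rng.standard_normal(n_feat)

x_hat = denoise(x_noisy)
mse = float(np.mean((x_hat - x_clean) ** 2))
print(x_hat.shape, mse >= 0.0)
```

The convolutional variant mentioned above would replace the dense encoder/decoder with convolutions over time-frequency patches; the overall pipeline (corrupt, reconstruct, then train the acoustic model on the reconstructed features) stays the same.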
