Machine Learning Techniques in Speech and Language Processing

Speech Enhancement with Convolutional-Recurrent Networks

Read more about Speech Enhancement with Convolutional-Recurrent Networks
Log in to post comments

We propose an end-to-end model based on convolutional and recurrent neural networks for speech enhancement. Our model is purely data-driven and does not make any assumptions about the type or the stationarity of the noise. In contrast to existing methods that use multilayer perceptrons (MLPs), we employ both convolutional and recurrent neural network architectures. Thus, our approach allows us to exploit local structures in both the frequency and temporal domains.

keynote_slides.pdf

icassp2018-3744 (672)

Categories:: Source Separation and Signal Enhancement

44 Views

Mismatched Training Data Enhancement for Automatic Recognition of Children’s Speech using DNN-HMM

The increasing profusion of commercial automatic speech recognition technology applications has been driven by big-data techniques, making use of high quality labelled speech datasets. Children’s speech displays greater time and frequency domain variability than typical adult speech, lacks the depth and breadth of training material, and presents difficulties relating to capture quality. All of these factors act to reduce the achievable performance of systems that recognise children’s speech.

ISCSLP_poster(MengjieQian) .pdf

ISCSLP_poster(MengjieQian) .pdf (48)

Categories:: Audio and Acoustic Signal Processing

7 Views