Sorry, you need to enable JavaScript to visit this website.

Analyzing Uncertainties in Speech Recognition Using Dropout

Citation Author(s):
Hervé Bourlard
Submitted by:
Apoorv Vyas
Last updated:
11 May 2019 - 8:55am
Document Type:
Poster
Document Year:
2019
Event:
Presenters:
Apoorv Vyas
Paper Code:
4576
 

The performance of Automatic Speech Recognition (ASR) systems is often measured using Word Error Rates (WER) which requires time-consuming and expensive manually transcribed data. In this paper, we use state-of-the-art ASR systems based on Deep Neural Networks (DNN) and propose a novel framework which uses ``Dropout'' at the test time to model uncertainty in prediction hypotheses. We systematically exploit this uncertainty to estimate WER without the need for explicit transcriptions. In addition, we show that the predictive uncertainty can also be used to accurately localize the errors made by the ASR system. We study the performance of our approach on Switchboard database where it predicts WER accurately within a range of 2.6% and 5.0% for HMM-DNN and Connectionist Temporal Classification (CTC) ASR systems, respectively.

up
0 users have voted: