Sorry, you need to enable JavaScript to visit this website.

SEMI-SUPERVISED TRAINING OF ACOUSTIC MODELS USING LATTICE-FREE MMI

Citation Author(s):
Vimal Manohar, Hossein Hadian, Daniel Povey, Sanjeev Khudanpur
Submitted by:
Vimal Manohar
Last updated:
18 April 2018 - 12:19pm
Document Type:
Presentation Slides
Document Year:
2018
Event:
Presenters:
Vimal Manohar
Paper Code:
SP-L4.2
 

The lattice-free MMI objective (LF-MMI) has been used in supervised training of
state-of-the-art neural network acoustic models for automatic speech
recognition (ASR). With large amounts of unsupervised data available,
extending this approach to the semi-supervised scenario is of significance.
Finite-state transducer (FST) based supervision used with LF-MMI provides a
natural way to incorporate uncertainties when dealing with unsupervised data.
In this paper,
we describe various extensions to standard LF-MMI training to allow the use
as supervision of lattices obtained via decoding of unsupervised data. The
lattices are rescored with a strong LM. We
investigate different methods for splitting the lattices and incorporating
frame tolerances into the supervision FST. We report results on
different subsets of Fisher English, where we achieve WER recovery of
59-64\% using lattice supervision, which is significantly better than
using just the best path transcription.

up
0 users have voted: