Documents
Poster
Poster
Word Characters and Phone Pronunciation Embedding for ASR Confidence Classifier

- Citation Author(s):
- Submitted by:
- Kshitiz Kumar
- Last updated:
- 7 May 2019 - 3:33pm
- Document Type:
- Poster
- Document Year:
- 2019
- Event:
- Presenters:
- Kshitiz Kumar
- Paper Code:
- ICASSP19005
- Categories:
- Keywords:
- Log in to post comments
Confidences are integral to ASR systems, and applied to data selection, adaptation, ranking hypotheses, arbitration etc.Hybrid ASR system is inherently a match between pronunciations and AM+LM evidence but current confidence features lack pronunciation information. We develop pronunciation embeddings to represent and factorize acoustic score in relevant bases, and demonstrate 8-10% relative reduction in false alarm (FA) on large scale tasks. We generalize to standard NLP embeddings like Glove, and show 16% relative reduction in FA in combination with Glove.