Language Understanding I: End-to-end Framework

Integration of Pre-trained Networks with Continuous Token Interface For End-to-End Spoken Language Understanding

Read more about Integration of Pre-trained Networks with Continuous Token Interface For End-to-End Spoken Language Understanding
Log in to post comments

Most End-to-End (E2E) Spoken Language Understanding (SLU) networks leverage the pre-trained Automatic Speech Recognition (ASR) networks but still lack the capability to understand the semantics of utterances, crucial for the SLU task. To solve this, recently proposed studies use pre-trained Natural Language Understanding (NLU) networks. However, it is not trivial to fully utilize both pre-trained networks; many solutions were proposed, such as Knowledge Distillation (KD), cross-modal shared embedding, and network integration with Interface.

icassp_slu_seo_kwak_lee_ver3.pdf

icassp_slu_seo_kwak_lee_ver3.pdf (183)

Categories:: Other

5 Views