Spoken Language Understanding (SLP-UNDE)

QUESTION ANSWERING FOR SPOKEN LECTURE PROCESSING

This paper presents a question answering (QA) system developed for spoken lecture processing. Questions are presented to the system in written form, and answers are returned from lecture videos. In contrast to the widely studied reading-comprehension style of QA, in which the machine reads a passage of text and answers questions about that passage, our task introduces the challenge of searching for answers in longer texts, namely the transcripts of the lecture videos, which contain errors.

ICASSP2019merve_poster.pdf (499)

Categories: Spoken Language Understanding (SLP-UNDE)

88 Views

QUESTION ANSWERING FOR SPOKEN LECTURE PROCESSING

ICASSP2019merve_poster.pdf (390)

Categories: Spoken Language Understanding (SLP-UNDE)

12 Views

REVISITING HIDDEN MARKOV MODELS FOR SPEECH EMOTION RECOGNITION

ICASSP2019_Poster_symao.pdf (381)

Categories: Spoken Language Understanding (SLP-UNDE)

33 Views

USING DEEP-Q NETWORK TO SELECT CANDIDATES FROM N-BEST SPEECH RECOGNITION HYPOTHESES FOR ENHANCING DIALOGUE STATE TRACKING

ICASSP 2019#2338.pdf (418)

Categories: Spoken Language Understanding (SLP-UNDE)

79 Views

Adversarial Advantage Actor-Critic Model for Task-Completion Dialogue Policy Learning

This paper presents a new method, adversarial advantage actor-critic (Adversarial A2C), which significantly improves the efficiency of dialogue policy learning in task-completion dialogue systems. Inspired by generative adversarial networks (GANs), we train a discriminator to differentiate responses/actions generated by dialogue agents from those produced by experts.

poster_icassp2018_v2.pptx (561)

Categories: Spoken Language Understanding (SLP-UNDE)

20 Views

AN END-TO-END APPROACH TO JOINT SOCIAL SIGNAL DETECTION AND AUTOMATIC SPEECH RECOGNITION

Social signals such as laughter and fillers are often observed in natural conversation, and they play various roles in human-to-human communication. Detecting these events is useful for transcription systems to generate rich transcriptions and for dialogue systems to behave as humans do, for example by laughing in synchrony or listening attentively. We have studied an end-to-end approach that detects social signals directly from speech using connectionist temporal classification (CTC), an end-to-end sequence labelling model.
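
As a rough illustration of how a CTC output could carry social signals alongside words, the sketch below applies the standard CTC collapsing rule (merge repeated frame labels, then drop blanks) to a toy frame-level label sequence. The token names `<blank>`, `<laugh>` and `<filler>`, the helper names, and the toy input are assumptions made for this example; they are not taken from the poster and do not represent the authors' system.

```python
from itertools import groupby

BLANK = "<blank>"                          # assumed name for the CTC blank symbol
SOCIAL_SIGNALS = {"<laugh>", "<filler>"}   # hypothetical social-signal tokens


def ctc_greedy_decode(frame_labels):
    """Collapse runs of identical frame labels, then remove blank symbols."""
    collapsed = [label for label, _ in groupby(frame_labels)]
    return [label for label in collapsed if label != BLANK]


def split_words_and_events(tokens):
    """Separate ordinary words from social-signal tokens in a decoded sequence."""
    words = [t for t in tokens if t not in SOCIAL_SIGNALS]
    events = [t for t in tokens if t in SOCIAL_SIGNALS]
    return words, events


# Toy per-frame best labels, as a frame-wise argmax over CTC outputs might give.
frames = ["<blank>", "so", "so", "<blank>", "<filler>", "<filler>",
          "<blank>", "yes", "<blank>", "<laugh>", "<laugh>"]

tokens = ctc_greedy_decode(frames)
words, events = split_words_and_events(tokens)
print(tokens)   # ['so', '<filler>', 'yes', '<laugh>']
print(words)    # ['so', 'yes']
print(events)   # ['<filler>', '<laugh>']
```

Because the event tokens sit in the same output sequence as the words, a single decoding pass yields both the transcript and the positions of the detected events relative to it.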

201804_ICASSP2018_poster.pdf (516)

Categories: Spoken Language Understanding (SLP-UNDE)

9 Views

Incorporating ASR Errors with Attention-based, Jointly Trained RNN for Intent Detection and Slot Filling

schumann_icassp_presentation.pdf (479)

schumann_icassp_presentation.pdf (501)

Categories: Spoken Language Understanding (SLP-UNDE)

56 Views

ATTENTION-BASED LSTM FOR PSYCHOLOGICAL STRESS DETECTION FROM SPOKEN LANGUAGE USING DISTANT SUPERVISION

attention-based-lstm-poster.pdf (493)

Categories: Spoken Language Understanding (SLP-UNDE)

55 Views

Lexico-acoustic Neural-based Models for Dialog Act Classification

Recent works have proposed neural models for dialog act classification in spoken dialogs.
However, they have not explored the role and the usefulness of acoustic information.
We propose a neural model that processes both lexical and acoustic features for classification.
Our results on two benchmark datasets reveal that acoustic features are helpful in improving the overall accuracy.

icassp-2018-poster.pdf (537)

Categories: Spoken Language Understanding (SLP-UNDE)

14 Views

Joint Verification-Identification in End-to-End Multi-Scale CNN Framework for Topic Identification

We present an end-to-end multi-scale Convolutional Neural
Network (CNN) framework for topic identification (topic ID).
In this work, we examined multi-scale CNN for classification
using raw text input. Topical word embeddings are learnt at
multiple scales using parallel convolutional layers. A technique
to integrate verification and identification objectives is
examined to improve topic ID performance. With this approach,
we achieved significant improvement in identification
task. We evaluated our framework on two contrasting

Final.pdf (491)

Categories: Spoken Language Understanding (SLP-UNDE), Neural network learning (MLR-NNLR)

44 Views
