Other

Entrainment Analysis for Assessment of Autistic Speech Prosody Using Bottleneck Features　of Deep Neural Network

In the present study, we quantify entrainment characteristics of conversation with the aim of automatic assessment of the severity of autism spectrum disorder (ASD). We focus on pairs of utterances immediate before and after turn-takings, which have prosodic/acoustic similarities.

Ochi2022_ICASSP_poster_v2.pdf

Ochi2022_ICASSP_poster_v2.pdf (174)

Categories:: Other

4 Views

Massive Unsourced Random Access Based on Bilinear Vector Approximate Message Passing Poster

Massive Unsourced Random Access Based on Bilinear Vector Approximate Message Passing Poster.pdf

Massive Unsourced Random Access Based on Bilinear Vector Approximate Message Passing Poster.pdf (175)

Categories:: Other

10 Views

MASSIVE UNSOURCED RANDOM ACCESS BASED ON BILINEAR VECTOR APPROXIMATE MESSAGE PASSING presentation

Massive Unsourced Random Access Based on Bilinear Vector Approximate Message Passing presentation.pdf

Massive Unsourced Random Access Based on Bilinear Vector Approximate Message Passing presentation.pdf (175)

Categories:: Other

21 Views

A KNOWLEDGE/DATA ENHANCED METHOD FOR JOINT EVENT AND TEMP RELATION EXTRACTIONORAL

Read more about A KNOWLEDGE/DATA ENHANCED METHOD FOR JOINT EVENT AND TEMP RELATION EXTRACTIONORAL
Log in to post comments

Understanding temporal relations (TempRels) between events is an important task that could benefit many downstream NLP applications. This task inevitably faces the challenges of both a limited amount of high-quality training data and a very biased distribution of TempRels. These problems will substantially hurt the performance of extraction systems because they are inclined to predict dominant TempRels when training with a limited amount of data.

KJETE.pdf

KJETE.pdf (150)

Categories:: Other

17 Views

Provable Sample Complexity Guarantees for Learning of Continuous-Action Graphical Games with Nonparametric Utilities

icassp_nonpara_presentation.pdf

Provable Sample Complexity Guarantees for Learning of Continuous-Action Graphical Games with Nonparametric Utilities (165)

Categories:: Other

15 Views

Information Theoretic Limits for Standard and One-bit Compressed Sensing with Graph-structured Sparsity

icassp_cs_presentation.pdf

Information Theoretic Limits for Standard and One-bit Compressed Sensing with Graph-structured Sparsity (166)

Categories:: Other

13 Views

Domain Generalized Few-Shot Image Classification Via Meta Regularization Network

Read more about Domain Generalized Few-Shot Image Classification Via Meta Regularization Network
Log in to post comments

1435_poster.pdf

1435 Poster (131)

1435_presentation.pdf

1435 Slide (140)

Categories:: Other

15 Views

Integration of Pre-trained Networks with Continuous Token Interface For End-to-End Spoken Language Understanding

Most End-to-End (E2E) Spoken Language Understanding (SLU) networks leverage the pre-trained Automatic Speech Recognition (ASR) networks but still lack the capability to understand the semantics of utterances, crucial for the SLU task. To solve this, recently proposed studies use pre-trained Natural Language Understanding (NLU) networks. However, it is not trivial to fully utilize both pre-trained networks; many solutions were proposed, such as Knowledge Distillation (KD), cross-modal shared embedding, and network integration with Interface.

icassp_slu_seo_kwak_lee_ver3.pdf

icassp_slu_seo_kwak_lee_ver3.pdf (163)

Categories:: Other

5 Views

OPENFEAT: Improving Speaker Identification by Open-set Few-shot Embedding Adaptation with Transformer

Household speaker identification with few enrollment utterances is an important yet challenging problem, especially when household members share similar voice characteristics and room acoustics. A common embedding space learned from a large number of speakers is not universally applicable for the optimal identification of every speaker in a household.

OPENFEAT_ICASSP.pdf

OPENFEAT_ICASSP.pdf (167)

Categories:: Other