Audio and Acoustic Signal Processing

ICASSP 2022 SPE-L1.5_EXPLOITING ANNOTATORS’ TYPED DESCRIPTION OF EMOTION PERCEPTION TO MAXIMIZE UTILIZATION OF RATINGS FOR SPEECH EMOTION RECOGNITION

Read more about ICASSP 2022 SPE-L1.5_EXPLOITING ANNOTATORS’ TYPED DESCRIPTION OF EMOTION PERCEPTION TO MAXIMIZE UTILIZATION OF RATINGS FOR SPEECH EMOTION RECOGNITION
Log in to post comments

SPE-L1.5_EXPLOITING ANNOTATORS’ TYPED DESCRIPTION OF EMOTION PERCEPTION TO MAXIMIZE UTILIZATION OF RATINGS FOR SPEECH EMOTION RECOGNITION.pdf

SPE-L1.5_EXPLOITING ANNOTATORS’ TYPED DESCRIPTION OF EMOTION PERCEPTION TO MAXIMIZE UTILIZATION OF RATINGS FOR SPEECH EMOTION RECOGNITION.pdf (471)

Categories:: Audio and Acoustic Signal Processing

88 Views

HYBRID ATTENTION-BASED PROTOTYPICAL NETWORKS FOR FEW-SHOT SOUND CLASSIFICATION

Read more about HYBRID ATTENTION-BASED PROTOTYPICAL NETWORKS FOR FEW-SHOT SOUND CLASSIFICATION
Log in to post comments

In recent years, prototypical networks have been widely used
in many few-shot learning scenarios. However, as a metric-
based learning method, their performance often degrades in
the presence of bad or noisy embedded features, and outliers
in support instances. In this paper, we introduce a hybrid at-
tention module and combine it with prototypical networks for
few-shot sound classification. This hybrid attention module
consists of two blocks: a feature-level attention block, and

My poster ICASSP 2022.pdf

My poster ICASSP 2022.pdf (309)

Categories:: Applications in Music and Audio Processing (MLR-MUSI)
Audio and Acoustic Signal Processing

64 Views

ATTACHMENT RECOGNITION IN SCHOOL-AGE CHILDREN: A MULTIMODAL APPROACH BASED ON LANGUAGE AND PARALANGUAGE ANALYSIS

Icassp2022_poster_Attachment.pdf

Icassp2022_poster_Attachment.pdf (277)

Categories:: Audio and Acoustic Signal Processing

15 Views

Attachment Recognition

Read more about Attachment Recognition
Log in to post comments

Icassp2022_poster.pdf

Icassp2022_poster.pdf (367)

Categories:: Audio and Acoustic Signal Processing

14 Views

Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks

Read more about Conformer-Based Self-Supervised Learning for Non-Speech Audio Tasks
Log in to post comments

Representation learning from unlabeled data has been of major interest in artificial intelligence research. While self-supervised speech representation learning has been popular in the speech research community, very few works have comprehensively analyzed audio representation learning for non-speech audio tasks. In this paper, we propose a self-supervised audio representation learning method and apply it to a variety of downstream non-speech audio tasks.

poster_id_3268.pdf

poster_id_3268.pdf (289)

Categories:: Audio and Acoustic Signal Processing

18 Views

TOWARDS FASTER CONTINUOUS MULTI-CHANNEL HRTF MEASUREMENTS BASED ON LEARNING SYSTEM MODELS

Measuring personal head-related transfer functions (HRTFs) is essential in binaural audio. Personal HRTFs are not only required for binaural rendering and for loudspeaker-based binaural reproduction using crosstalk cancellation, but they also serve as a basis for data-driven HRTF individualization techniques and psychoacoustic experiments. Although many attempts have been made to expedite HRTF measurements, the rotational velocities in today’s measurement systems remain lower than those in natural head movements.

poster_2886.pdf

poster_2886.pdf (253)

Categories:: Audio and Acoustic Signal Processing

23 Views

AECMOS: A speech quality assessment metric for echo impairment

Read more about AECMOS: A speech quality assessment metric for echo impairment
Log in to post comments

Traditionally, the quality of acoustic echo cancellers is evaluated using intrusive speech quality assessment measures such as ERLE \cite{g168} and PESQ \cite{p862}, or by carrying out subjective laboratory tests. Unfortunately, the former are not well correlated with human subjective measures, while the latter are time and resource consuming to carry out. We provide a new tool for speech quality assessment for echo impairment which can be used to evaluate the performance of acoustic echo cancellers.