ICASSP 2022

ICASSP 2022, the IEEE International Conference on Acoustics, Speech and Signal Processing, is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The conference will feature presentations by internationally renowned speakers and cutting-edge session topics, and it will provide an opportunity to network with like-minded professionals from around the world.

SleepGAN: Towards Personalized Sleep Therapy Music


8940_Yang.pdf (262)

talk.pdf (212)

Categories: Other

39 Views

poster of the paper 'End-to-End Speech Recognition from Federated Acoustic Models'


ICASSP22_poster_YanGao.pdf (320)

Categories: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

24 Views

slides for the paper 'End-to-End Speech Recognition from Federated Acoustic Models'


FL_ASR ICASSP22.pptx (298)

FL_ASR ICASSP22.pptx (284)

Categories: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

31 Views

NVC-Net: End-to-End Adversarial Voice Conversion


NVCNet_slides.pdf

NVC-Net slides (281)

Categories: Other applications of machine learning (MLR-APPL)

14 Views

ATTENTIVE MAX FEATURE MAP AND JOINT TRAINING FOR ACOUSTIC SCENE CLASSIFICATION


Various attention mechanisms are being widely applied to acoustic scene classification. However, we empirically found that the attention mechanism can excessively discard potentially valuable information, despite improving performance. We propose the attentive max feature map that combines two effective techniques, attention and a max feature map, to further elaborate the attention mechanism and mitigate the above-mentioned phenomenon. We also explore various joint training methods, including multi-task learning, that allocate additional abstract labels for each audio recording.
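
As a rough illustration of one of the two ingredients named above, the sketch below shows a plain max feature map as it is commonly defined: the channel axis is split into two halves and the element-wise maximum is kept. This is an assumption about the general technique only; the abstract does not say how the authors combine it with attention, so that part is not shown. The function name `max_feature_map` and the toy values are hypothetical.

```python
from typing import List


def max_feature_map(channels: List[List[float]]) -> List[List[float]]:
    """Halve the channel count by taking the element-wise max of the two halves.

    `channels` is a list of equally long feature vectors, one per channel.
    This is the generic max feature map, not the paper's attentive variant.
    """
    if len(channels) % 2 != 0:
        raise ValueError("max feature map needs an even number of channels")
    half = len(channels) // 2
    first, second = channels[:half], channels[half:]
    # Channel i competes with channel i + half at every position.
    return [
        [max(a, b) for a, b in zip(c1, c2)]
        for c1, c2 in zip(first, second)
    ]


# Toy input: 4 channels with 3 feature values each (illustrative numbers).
features = [
    [0.2, 0.9, -0.1],
    [0.5, 0.1, 0.3],
    [0.7, 0.4, -0.6],
    [0.0, 0.8, 0.9],
]
reduced = max_feature_map(features)
print(reduced)  # [[0.7, 0.9, -0.1], [0.5, 0.8, 0.9]]
```

Unlike a gate that scales activations toward zero, this operation always passes one of the two competing values through unchanged, which may be why it is paired with attention here.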

ICASSP2022_AMFM_poster_final.pdf (253)

Categories: Audio Processing Systems

17 Views

Entrainment Analysis for Assessment of Autistic Speech Prosody Using Bottleneck Features of Deep Neural Network

In the present study, we quantify entrainment characteristics of conversation with the aim of automatically assessing the severity of autism spectrum disorder (ASD). We focus on pairs of utterances immediately before and after turn-taking, which have prosodic/acoustic similarities.

Ochi2022_ICASSP_poster_v2.pdf (231)

Categories: Other

6 Views

A Minimally Supervised Approach for Medical Image Quality Assessment in Domain Shift Settings

Accurate disease diagnosis requires objective assessment of clinical image quality. Automated image quality assessment (IQA) could enhance screening and diagnosis workflows. However, development of generalizable quality assessment tools requires large labeled clinical image datasets from different sites. Obtaining these datasets is often infeasible, and quality indicators may vary with acquisition settings due to domain shift. We introduce a minimally-supervised

8754-2.pdf (307)

Categories: Medical imaging

33 Views

Massive Unsourced Random Access Based on Bilinear Vector Approximate Message Passing Poster

Massive Unsourced Random Access Based on Bilinear Vector Approximate Message Passing Poster.pdf (230)

Categories: Other

14 Views

MASSIVE UNSOURCED RANDOM ACCESS BASED ON BILINEAR VECTOR APPROXIMATE MESSAGE PASSING presentation

Massive Unsourced Random Access Based on Bilinear Vector Approximate Message Passing presentation.pdf (228)

Categories: Other

26 Views

End-To-End Deep Learning-Based Adaptation Control for Frequency-Domain Adaptive System Identification

We present a novel end-to-end deep learning-based adaptation control algorithm for frequency-domain adaptive system identification. The proposed method exploits a deep neural network to map observed signal features to corresponding step-sizes which control the filter adaptation. The parameters of the network are optimized in an end-to-end fashion by minimizing the average normalized system distance of the adaptive filter.
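
To make the roles of the step-size and the normalized system distance concrete, here is a minimal sketch under simplifying assumptions. It uses a time-domain NLMS filter rather than the frequency-domain filter of the paper, and a fixed constant stands in for the step-size that the proposed network would predict from signal features. The names `nlms_step` and `normalized_system_distance`, the 3-tap system, and the noise-free setup are all hypothetical choices for illustration.

```python
import random
from typing import List, Sequence


def normalized_system_distance(h_true: Sequence[float],
                               h_est: Sequence[float]) -> float:
    """Squared error between true and estimated filter, relative to the true filter's energy."""
    err = sum((a - b) ** 2 for a, b in zip(h_true, h_est))
    ref = sum(a * a for a in h_true)
    return err / ref


def nlms_step(h_est: Sequence[float], x_buf: Sequence[float],
              d: float, mu: float, eps: float = 1e-8) -> List[float]:
    """One NLMS update; `mu` is the step-size that controls the adaptation."""
    y = sum(h * x for h, x in zip(h_est, x_buf))   # filter output
    e = d - y                                      # a-priori error
    power = sum(x * x for x in x_buf) + eps        # input power (regularized)
    return [h + mu * e * x / power for h, x in zip(h_est, x_buf)]


random.seed(0)
h_true = [0.5, -0.3, 0.1]      # unknown system (toy values)
h_est = [0.0, 0.0, 0.0]        # adaptive filter starts at zero
x_buf = [0.0, 0.0, 0.0]        # most recent input samples, newest first

initial = normalized_system_distance(h_true, h_est)
for _ in range(500):
    x_buf = [random.gauss(0.0, 1.0)] + x_buf[:-1]
    d = sum(h * x for h, x in zip(h_true, x_buf))  # noise-free observation
    mu = 0.5  # placeholder: the paper's network would supply this value
    h_est = nlms_step(h_est, x_buf, d, mu)
final = normalized_system_distance(h_true, h_est)

print(f"system distance before: {initial:.3f}, after: {final:.3e}")
```

In this noise-free toy case any step-size between 0 and 2 converges. Choosing the step-size matters far more once the observation contains noise or interfering signals, which is the regime a learned controller is aimed at.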

deepAdControl_haubner_ICASSP_2022_presentation_upload.pdf

Presentation Slides (275)

Categories: Echo Cancellation

13 Views
