ICASSP 2022

ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2022 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

Spatio-Temporal PRRS Epidemic Forecasting via Factorized Deep Generative Modeling

Read more about Spatio-Temporal PRRS Epidemic Forecasting via Factorized Deep Generative Modeling
Log in to post comments

Abstract:

Poster_Shams_ICASSP_2022 (2).pdf

Poster_Shams_ICASSP_2022 (2).pdf (253)

Categories:: Machine Learning for Signal Processing

33 Views

Provable Sample Complexity Guarantees for Learning of Continuous-Action Graphical Games with Nonparametric Utilities

icassp_nonpara_presentation.pdf

Provable Sample Complexity Guarantees for Learning of Continuous-Action Graphical Games with Nonparametric Utilities (226)

Categories:: Other

16 Views

Information Theoretic Limits for Standard and One-bit Compressed Sensing with Graph-structured Sparsity

icassp_cs_presentation.pdf

Information Theoretic Limits for Standard and One-bit Compressed Sensing with Graph-structured Sparsity (227)

Categories:: Other

13 Views

A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It

End-to-end Automatic Speech Recognition (ASR) models are commonly trained over spoken utterances using optimization methods like Stochastic Gradient Descent (SGD). In distributed settings like Federated Learning, model training requires transmission of gradients over a network. In this work, we design the first method for revealing the identity of the speaker of a training utterance with access only to a gradient.

ICASSP22 - A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It.pdf

slides (284)

Categories:: Distributed and Cooperative Learning (MLR-DIST)

24 Views

Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training

Transformer-based architectures have been the subject of research aimed at understanding their overparameterization and the non-uniform importance of their layers. Applying these approaches to Automatic Speech Recognition, we demonstrate that the state-of-the-art Conformer models generally have multiple ambient layers. We study the stability of these layers across runs and model sizes, propose that group normalization may be used without disrupting their formation, and examine their correlation with model weight updates in each layer.

Slides - Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training.pdf

Slides - Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training (240)

Categories:: Neural network learning (MLR-NNLR)

15 Views

Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training

Poster - Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training (1).pdf

Poster - Exploring Heterogeneous Characteristics of Layers in ASR Models for More Efficient Training (496)

Categories:: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)
Neural network learning (MLR-NNLR)

14 Views

CRAMÉR-RAO BOUND AND ANTENNA SELECTION OPTIMIZATION FOR DUAL RADAR-COMMUNICATION DESIGN

ICASSP2022_slides.pdf

ICASSP2022_slides.pdf (653)

Categories:: Sensor Array Processing

15 Views

Adaptive Variational Nonlinear Chirp Mode Decomposition

Read more about Adaptive Variational Nonlinear Chirp Mode Decomposition
Log in to post comments

Variational nonlinear chirp mode decomposition (VNCMD) is a recently introduced method for nonlinear chirp signal decomposition that has aroused notable attention in various fields. One limiting aspect of the method is that its performance relies heavily on the setting of the bandwidth parameter.

#5736 Poster.pdf

Poster (519)

#5736 Sildes.pptx

Presentation Slides (376)

#5736 Transcript of Presentation.docx

Transcript of Presentation (336)

Categories:: Signal and System Modeling, Representation and Estimation

266 Views

SKIPPING MEMORY LSTM FOR LOW-LATENCY REAL-TIME CONTINUOUS SPEECH SEPARATION

Read more about SKIPPING MEMORY LSTM FOR LOW-LATENCY REAL-TIME CONTINUOUS SPEECH SEPARATION
Log in to post comments

Continuous speech separation for meeting pre-processing has recently become a focused research topic. Compared to the data in utterance-level speech separation, the meeting-style audio stream lasts longer, has an uncertain number of speakers. We adopt the time-domain speech separation method and the recently proposed Graph-PIT to build a super low-latency online speech separation model, which is very important for the real application. The low-latency time-domain encoder with a small stride leads to an extremely long feature sequence.