
ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2022 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

- Read more about HIRL: Hybrid Image Restoration based on Hierarchical Deep Reinforcement Learning via Two-Step Analysis
- Log in to post comments
- Categories:

- Read more about JE2Net: Joint Exploitation and Exploration in Reinforcement Learning Based Image Restoration
- Log in to post comments
- Categories:

- Categories:

- Read more about SCREEN & RELAX: ACCELERATING THE RESOLUTION OF ELASTIC-NET BY SAFE IDENTIFICATION OF THE SOLUTION SUPPORT
- Log in to post comments
In this paper, we propose a procedure to accelerate the resolution of the well-known ``Elastic-Net'' problem. Our procedure is based on the (partial) identification of the solution support and the reformulation of the original problem into a problem of reduced dimension. The identification of the support leverages the novel concept of ``safe relaxing'' where one aims to identify non-zero coefficients of the solution.
poster.pdf

- Categories:

- Read more about SCREEN & RELAX: ACCELERATING THE RESOLUTION OF ELASTIC-NET BY SAFE IDENTIFICATION OF THE SOLUTION SUPPORT
- Log in to post comments
In this paper, we propose a procedure to accelerate the resolution of the well-known ``Elastic-Net'' problem. Our procedure is based on the (partial) identification of the solution support and the reformulation of the original problem into a problem of reduced dimension. The identification of the support leverages the novel concept of ``safe relaxing'' where one aims to identify non-zero coefficients of the solution.
slides.pdf

- Categories:

- Read more about Multimodal Transformer With Learnable Frontend and Self Attention for Emotion Recognition
- Log in to post comments
In this work, we propose a novel approach for multi-modal emotion recognition from conversations using speech and text. The audio representations are learned jointly with a learnable audio front-end (LEAF) model feeding to a CNN based classifier. The text representations are derived from pre-trained bidirectional encoder representations from transformer (BERT) along with a gated recurrent network (GRU). Both the textual and audio representations are separately processed using a bidirectional GRU network with self-attention.
- Categories:

- Read more about NODE-SCREENING TESTS FOR THE L0-PENALIZED LEAST-SQUARES PROBLEM
- Log in to post comments
We present a novel screening methodology to safely discard irrelevant nodes within a generic branch-and-bound (BnB) algorithm solving the l0-penalized least-squares problem. Our contribution is a set of two simple tests to detect sets of feasible vectors that cannot yield optimal solutions. This allows to prune nodes of the BnB search tree, thus reducing the overall optimization time. One cornerstone of our contribution is a nesting property between tests at different nodes that allows to implement them with a low computational cost.
poster.pdf

- Categories:

- Read more about NODE-SCREENING TESTS FOR THE L0-PENALIZED LEAST-SQUARES PROBLEM
- Log in to post comments
We present a novel screening methodology to safely discard irrelevant nodes within a generic branch-and-bound (BnB) algorithm solving the l0-penalized least-squares problem. Our contribution is a set of two simple tests to detect sets of feasible vectors that cannot yield optimal solutions. This allows to prune nodes of the BnB search tree, thus reducing the overall optimization time. One cornerstone of our contribution is a nesting property between tests at different nodes that allows to implement them with a low computational cost.
slides.pdf

- Categories:

- Read more about Improving Inference for Spatial Signals by Contextual False Discovery Rates
- Log in to post comments
A spatial signal is monitored by a large-scale sensor network. We propose a novel method to identify areas where the signal behaves interestingly, anomalously, or simply differently from what is expected. The sensors pre-process their measurements locally and transmit a local summary statistic to a fusion center or a cloud. This saves bandwidth and energy. The fusion center or cloud computes a spatially varying empirical Bayes prior on the signal’s spatial behavior. The spatial domain is modeled as a fine discrete grid.
- Categories:

- Read more about LATTICEBART: LATTICE-TO-LATTICE PRE-TRAINING FOR SPEECH RECOGNITION
- Log in to post comments
poster.pdf

- Categories: