ICASSP 2021

ICASSP 2021 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2021 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

Mixed Monotonic Programming for Fast Global Optimization

Read more about Mixed Monotonic Programming for Fast Global Optimization
Log in to post comments

While globally optimal solutions to many convex programs can be computed efficiently in polynomial time, this is, in general, not possible for nonconvex optimization problems. Therefore, locally optimal approaches or other efficient suboptimal heuristics are usually applied for practical implementations. However, there is also a strong interest in computing globally optimal solutions of nonconvex problems in offline simulations in order to benchmark faster suboptimal algorithms. Global solutions often rely on monotonicity properties.

Matthiesen-MMP-poster.pdf

Poster (350)

Matthiesen-MMP-slides.pdf

Slides (340)

Categories:: Communication Systems and Applications

26 Views

Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation

The end-to-end approaches for single-channel target speech extraction have attracted widespread attention. However, the studies for end-to-end multi-channel target speech extraction are still relatively limited. In this work, we propose two methods for exploiting the multi-channel spatial information to extract the target speech. The first one is using a target speech adaptation layer in a parallel encoder architecture. The second one is designing a channel decorrelation mechanism to extract the inter-channel differential information to enhance the multi-channel encoder representation.

poster-revised.pdf

Poster of presentation (317)

Categories:: Speech Enhancement (SPE-ENHA)

21 Views

Plug-And-Play Learned Gaussian-mixture Approximate Message Passing

Read more about Plug-And-Play Learned Gaussian-mixture Approximate Message Passing
Log in to post comments

icassp2021-poster.pdf

icassp2021-poster.pdf (430)

Categories:: Signal Transmission and Reception

10 Views

Globally Optimal Beamforming for Rate Splitting Multiple Access

Read more about Globally Optimal Beamforming for Rate Splitting Multiple Access
Log in to post comments

We consider globally optimal precoder design for rate splitting multiple access in Gaussian multiple-input single-output downlink channels with respect to weighted sum rate and energy efficiency maximization. The proposed algorithm solves an instance of the joint multicast and unicast beamforming problem and includes multicast- and unicast-only beamforming as special cases. Numerical results show that it outperforms state-of-the-art algorithms in terms of numerical stability and converges almost twice as fast.

Matthiesen-RSMA-Poster.pdf

Poster (393)

Matthiesen-RSMA-Slides.pdf

Slides (327)

Categories:: Communication Systems and Applications

108 Views

Deep Residual Echo Suppression with a Tunable Tradeoff Between Signal Distortion and Echo Suppression

In this paper, we propose a residual echo suppression method using a UNet neural network that directly maps the outputs of a linear acoustic echo canceler to the desired signal in the spectral domain. This system embeds a design parameter that allows a tunable tradeoff between the desired-signal distortion and residual echo suppression in double-talk scenarios. The system employs 136 thousand parameters, and requires 1.6 Giga floating-point operations per second and 10 Mega-bytes of memory.

Amir Ivry - Poster.pdf

Amir Ivry - Poster.pdf (401)

Amir Ivry - Slides.pdf

Amir Ivry - Slides.pdf (409)

Categories:: Echo Cancellation

19 Views

TIME-DOMAIN CONCENTRATION AND APPROXIMATION OF COMPUTABLE BANDLIMITED SIGNALS

Read more about TIME-DOMAIN CONCENTRATION AND APPROXIMATION OF COMPUTABLE BANDLIMITED SIGNALS
Log in to post comments

Turing computability deals with the question of what is theoretically computable on a digital computer, and hence is relevant whenever digital hardware is used. In this paper we study different possibilities to define computable bandlimited signals and systems. We consider a definition that uses finite Shannon sampling series as approximating functions and another that employs computable continuous functions together with an effectively computable time concentration. We discuss the advantages and drawbacks of both definitions and analyze the connections and differences.

icassp2021_computability_time_concentration.pdf

Poster of the ICASSP 2021 presentation (465)

Categories:: Signal and System Modeling, Representation and Estimation
Sampling and Reconstruction
Signal Transmission and Reception

29 Views

Social Learning Under Inferential Attacks

Read more about Social Learning Under Inferential Attacks
Log in to post comments

ICASSP_2021_Ntemos_poster.pdf

ICASSP_2021_Ntemos_poster.pdf (306)

Categories:: Adaptive Signal Processing

13 Views

Deep Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement -- Slides

Read more about Deep Multi-Frame MVDR Filtering for Single-Microphone Speech Enhancement -- Slides
Log in to post comments

Multi-frame algorithms for single-microphone speech enhancement, e.g., the multi-frame minimum variance distortionless response (MFMVDR) filter, are able to exploit speech correlation across adjacent time frames in the short-time Fourier transform (STFT) domain. Provided that accurate estimates of the required speech interframe correlation vector and the noise correlation matrix are available, it has been shown that the MFMVDR filter yields a substantial noise reduction while hardly introducing any speech distortion.