ICASSP 2022

ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2022 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

Trade-offs in decentralized multi-antenna architectures: The WAX decomposition

Read more about Trade-offs in decentralized multi-antenna architectures: The WAX decomposition
Log in to post comments

Current research on multi-antenna architectures is trending towards increasing the amount of antennas in the base stations (BSs) so as to increase the spectral efficiency. As a result, the interconnection bandwidth and computational complexity required to process the data using centralized architectures is becoming prohibitively high. Decentralized architectures can reduce these requirements by pre-processing the data before it arrives at a central processing unit (CPU).

Trade-offs in decentralized multi-antenna architectures. The WAX decomposition_(no_tran).pdf

Trade-offs in decentralized multi-antenna architectures. The WAX decomposition_(no_tran).pdf (556)

Categories:: Signal Processing for Communications and Networking

29 Views

Cell-free massive MIMO: Exploiting the WAX decomposition

Read more about Cell-free massive MIMO: Exploiting the WAX decomposition
Log in to post comments

Cell-free massive multiple-input multiple-output (MIMO) consists of a large set of distributed access points (APs) serving a number of users. The APs can be far from each other, and they can also have a big number of antennas. Thus, decentralized architectures have to be considered so as to reduce the interconnection bandwidth to a central processing unit (CPU) and make the system scalable. On the other hand, the APs in a heterogeneous network might have limited processing capabilities and fully-decentralized processing may not be available.

Cell-free massive MIMO. Exploiting the WAX decomposition_(no_tran).pdf

Cell-free massive MIMO. Exploiting the WAX decomposition_(no_tran).pdf (515)

Categories:: Signal Processing for Communications and Networking

27 Views

BLIND UNMIXING USING A DOUBLE DEEP IMAGE PRIOR

Read more about BLIND UNMIXING USING A DOUBLE DEEP IMAGE PRIOR
Log in to post comments

ICASSP2022.pdf

slides (553)

Categories:: Source separation (MLR-SSEP)

12 Views

BLIND UNMIXING USING A DOUBLE DEEP IMAGE PRIOR

Read more about BLIND UNMIXING USING A DOUBLE DEEP IMAGE PRIOR
Log in to post comments

Poster.pdf

poster (316)

Categories:: Source separation (MLR-SSEP)

8 Views

Multitask Gaussian Process with Hierarchical Latent Interactions

Read more about Multitask Gaussian Process with Hierarchical Latent Interactions
Log in to post comments

Presentation-2022.pdf

Presentation-2022.pdf (209)

Categories:: Pattern recognition and classification (MLR-PATT)

4 Views

A Method for Estimating the Grouping of Participants in Classroom Group Work Using Only Audio Information

ic_grouping_poster_1.pdf

ic_grouping_poster_1.pdf (252)

Categories:: Audio Processing Systems

11 Views

Generalization Ability of MOS Prediction Networks

Read more about Generalization Ability of MOS Prediction Networks
Log in to post comments

Automatic methods to predict listener opinions of synthesized speech remain elusive since listeners, systems being evaluated, characteristics of the speech, and even the instructions given and the rating scale all vary from test to test. While automatic predictors for metrics such as mean opinion score (MOS) can achieve high prediction accuracy on samples from the same test, they typically fail to generalize well to new listening test contexts.

ecooper-slides.pdf

ecooper-slides.pdf (376)

Categories:: Audio and Acoustic Signal Processing

21 Views

Automatic Assessment of the Degree of Clinical Depression from Speech Using X-Vectors

Read more about Automatic Assessment of the Degree of Clinical Depression from Speech Using X-Vectors
Log in to post comments

Depression is a frequent and curable psychiatric disorder, detrimentally affecting daily activities, harming both work-place productivity and personal relationships. Among many other symptoms, depression is associated with disordered
speech production, which might permit its automatic screening by means of the speech of the subject. However, the choice of actual features extracted from the recordings is not trivial. In this study, we employ x-vectors, a DNN-based

icassp2022-José-Egas.pptx

icassp2022-José-Egas.pptx (263)

Categories:: Speech Analysis (SPE-ANLS)

26 Views

ICASSP 2022 Presentation Ronchini

Read more about ICASSP 2022 Presentation Ronchini
Log in to post comments

This paper proposes a benchmark of submissions to Detection and Classification Acoustic Scene and Events 2021 Challenge (DCASE) Task 4 representing a sampling of the state-of-the-art in Sound Event Detection task. The submissions are evaluated according to the two polyphonic sound detection score scenarios proposed for the DCASE 2021 Challenge Task 4, which allow to make an analysis on whether submissions are designed to perform fine-grained temporal segmentation, coarse-grained temporal segmentation, or have been designed to be polyvalent on the scenarios proposed.