IEEE ICASSP 2024

IEEE ICASSP 2024 - IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The IEEE ICASSP 2024 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

CRYPTO-MINE: Cryptanalysis via Mutual Information Neural Estimation

Read more about CRYPTO-MINE: Cryptanalysis via Mutual Information Neural Estimation
Log in to post comments

The use of Mutual Information (MI) as a measure to evaluate the efficiency of cryptosystems has an extensive history. However, estimating MI between unknown random variables in a high-dimensional space is challenging. Recent advances in machine learning have enabled progress in estimating MI using neural networks. This work presents a novel application of MI estimation in the field of cryptography. We propose applying this methodology directly to estimate the MI between plaintext and ciphertext in a chosen plaintext attack.

CryptoMine.pdf

CryptoMine.pdf (198)

Categories:: Communications and Network Security
Information-theoretic learning (MLR-INFO)

22 Views

Open-Set deepfake detection to fight the unknown

Read more about Open-Set deepfake detection to fight the unknown
Log in to post comments

In this paper, we design a new open-set method to detect deepfakes that does not assume information about the techniques behind the deepfakes generation. Contrary to existing methods, which build upon known telltales left by the deepfake creation process, we assume no prior knowledge about the sample generation, thus presenting a method for blind deepfake detection, a necessary step toward true generalization.

ICASSP2024_PAPER4642_apresentation.pdf

ICASSP2024_PAPER4642_apresentation.pdf (258)

Categories:: Information Forensics and Security

17 Views

DROPFL: Client Dropout Attacks Against Federated Learning Under Communication Constraints

Federated learning (FL) has emerged as a promising paradigm for decentralized machine learning while preserving data privacy. However, under communication constraints, the standard FL protocol faces the risk of client dropout. Although some research has focused on the risk from the perspectives of communication optimization and privacy protection, it is still challenging to deal with the client dropout issue in dynamic networks, where clients may join or drop the training process at any time.

ICASSP_DropFL_Poster.pdf

ICASSP_DropFL_Poster.pdf (250)

Categories:: Information Forensics and Security

27 Views

AEGIS-Net: Attention-Guided Multi-Level Feature Aggregation for Indoor Place Recognition

We present AEGIS-Net, a novel indoor place recognition model that takes in RGB point clouds and generates global place descriptors by aggregating lower-level color, geometry features and higher-level implicit semantic features. However, rather than simple feature concatenation, self-attention modules are employed to select the most important local features that best describe an indoor place. Our AEGIS-Net is made of a semantic encoder, a semantic decoder and an attention-guided feature embedding.

Poster.pdf

Poster.pdf (483)

Categories:: Image/Video Storage, Retrieval

24 Views

DATA-SCARCE CONDITION MODELING REQUIRES MODEL-BASED PRIOR REGULARIZATION

Read more about DATA-SCARCE CONDITION MODELING REQUIRES MODEL-BASED PRIOR REGULARIZATION
Log in to post comments

In the metallurgical industry, taking measurements during production can be infeasible or undesired, and only the terminated process can be measured. This poses problems for regression models, as the intermediate target values for a time series are hidden in the accumulated end-of-process measurement. The lack of data quality and quantity also often limits the modeling to linear estimators, as neural networks struggle to converge and/or overfit on scarce noisy data.

ICASSP_2024_Poster.pdf

ICASSP_2024_Poster.pdf (159)

Categories:: Machine Learning for Signal Processing

21 Views

Distributed Decision-Making for Community Structured Networks

Read more about Distributed Decision-Making for Community Structured Networks
Log in to post comments

Traditional social learning frameworks consider environments with a homogeneous state where each agent receives observations conditioned on the same hypothesis. In this work, we study the distributed hypothesis testing problem for graphs with a community structure, assuming that each cluster receives data conditioned on some different true state. This situation arises in many scenarios, such as when sensors are spatially distributed, or when individuals in a social network have differing views or opinions.

ICASSP2024_1589.pdf

ICASSP2024_1589.pdf (262)

Categories:: Signal Processing for Communications and Networking

18 Views

Poster for the paper "Dynamic Bandwidth Variational Mode Decomposition"

Read more about Poster for the paper "Dynamic Bandwidth Variational Mode Decomposition"
Log in to post comments

Signal decomposition techniques aim to break down nonstationary signals into their oscillatory components, serving as a preliminary step in various practical signal processing applications. This has motivated researchers to explore different strategies, yielding several distinct approaches. A wellknown optimization-based method, the Variational Mode Decomposition (VMD), relies on the formulation of an optimization problem utilizing constant-bandwidthWiener filters. However, this poses limitations in constant bandwidth and the need for constituent count.

DBVMD_Poster.pdf

DBVMD_Poster.pdf (230)

Categories:: Signal Processing Theory and Methods

28 Views

Poster - Controllable Prosody Generation With Partial Inputs

Read more about Poster - Controllable Prosody Generation With Partial Inputs
Log in to post comments

Appropriate prosodic choices depend on the context. One approach is for a human-in-the- loop (HitL) to pick the best prosody.
Often there are specific nuanced prosodic choices that convey the intended meaning in a given context.
We propose a system where HitL users can provide any number of prosodic controls. This allows for flexibility and removes the need for redundant (inefficient) work defining the entire prosodic specification.

014_ICASSP 2024_A0 Poster_V3_PRINT.pdf

014_ICASSP 2024_A0 Poster_V3_PRINT.pdf (183)

Categories:: Speech Synthesis and Generation, including TTS (SPE-SYNT)

21 Views

Unsupervised Accent Adaptation Through Masked Language Model Correction Of Discrete Self-Supervised Speech Units

Self-supervised pre-trained speech models have strongly improved speech recognition, yet they are still sensitive to domain shifts and accented or atypical speech. Many of these models rely on quantisation or clustering to learn discrete acoustic units. We propose to correct the discovered discrete units for accented speech back to a standard pronunciation in an unsupervised manner. A masked language model is trained on discrete units from a standard accent and iteratively corrects an accented token sequence by masking unexpected cluster sequences and predicting their common variant.

Poster_FINAL.pdf

Poster for ICASSP 2024 (959)

Categories:: Speech Adaptation/Normalization (SPE-ADAP)
Robust Speech Recognition (SPE-ROBU)

22 Views

WFTNet: Exploiting Global and Local Periodicity in Long-term Time Series Forecasting

Read more about WFTNet: Exploiting Global and Local Periodicity in Long-term Time Series Forecasting
Log in to post comments

Recent CNN and Transformer-based models tried to utilize frequency and periodicity information for long-term time series forecasting. However, most existing work is based on Fourier transform, which cannot capture fine-grained and local frequency structure. In this paper, we propose a Wavelet-Fourier Transform Network (WFTNet) for long-term time series forecasting.

WFTNet.pdf

WFTNet.pdf (205)

Categories:: Neural network learning (MLR-NNLR)

39 Views

IEEE ICASSP 2024

Pages