ICASSP 2023

IEEE ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2023 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

Delay-aware Backpressure Routing Using Graph Neural Networks

Read more about Delay-aware Backpressure Routing Using Graph Neural Networks
Log in to post comments

We propose a throughput-optimal biased backpressure (BP) algorithm for routing, where the bias is learned through a graph neural network that seeks to minimize end-to-end delay. Classical BP routing provides a simple yet powerful distributed solution for resource allocation in wireless multi-hop networks but has poor delay performance. A low-cost approach to improve this delay performance is to favor shorter paths by incorporating pre-defined biases in the BP computation, such as a bias based on the shortest path (hop) distance to the destination.

BiasBP-ICASSP2023-8min-poster.pdf

Poster (1093)

BiasBP-ICASSP2023-8min-upload.pdf

Slides (333)

2211.10748.pdf

Paper, preprint (362)

Categories:: Other
Communication and Sensing aspects of Sensor Networks, Wireless and Ad-Hoc Networks
Machine Learning for Signal Processing

54 Views

ModEFormer: Modality-preserving embedding for audio-video synchronization using transformers

Lack of audio-video synchronization is a common problem during television broadcasts and video conferencing, leading to an unsatisfactory viewing experience. A widely accepted paradigm is to create an error detection mechanism that identifies the cases when audio is leading or lagging. We propose ModEFormer, which independently extracts audio and video embeddings using modality-specific transformers.

2023_ICASSP_ModEFormer.pdf

Paper (255)

5444_ModEFormer_ICASSP_2023.pdf

Presentation slides (269)

5444_Akash_Gupta_ModeFormer.pdf

Poster (330)

Categories:: Neural network learning (MLR-NNLR)
Multimodal signal processing

172 Views

A Statistical Interpretation of the Maximum Subarray Problem

Read more about A Statistical Interpretation of the Maximum Subarray Problem
Log in to post comments

Maximum subarray is a classical problem in computer science that given an array of numbers aims to find a contiguous subarray with the largest sum. We focus on its use for a noisy statistical problem of localizing an interval with a mean different from background. While a naive application of maximum subarray fails at this task, both a penalized and a constrained version can succeed.

Max_Subarray_video.pdf

Slides (219)

Max_Subarray_poster.pdf

Poster (224)

Categories:: Statistical Signal Processing

73 Views

On Negative Sampling for Contrastive Audio-Text Retrieval

Read more about On Negative Sampling for Contrastive Audio-Text Retrieval
Log in to post comments

This paper investigates negative sampling for contrastive learning in the context of audio-text retrieval. The strategy for negative sampling refers to selecting negatives (either audio clips or textual descriptions) from a pool of candidates for a positive audio-text pair. We explore sampling strategies via model-estimated within-modality and cross-modality relevance scores for audio and text samples. With a constant training setting on the retrieval system from [1], we study eight sampling strategies, including hard and semi-hard negative sampling.

ICASSP2023_5376_slides.pdf

ICASSP2023 Presentation Slides (209)

Categories:: Other

18 Views

IMPROVING MUSIC GENRE CLASSIFICATION FROM MULTI-MODAL PROPERTIES OF MUSIC AND GENRE CORRELATIONS PERSPECTIVE

ICASSP poster.pdf

ICASSP poster.pdf (98)

Categories:: Music Signal Processing

36 Views

Joint Unmixing And Demosaicing Methods For Snapshot Spectral Images

Read more about Joint Unmixing And Demosaicing Methods For Snapshot Spectral Images
Log in to post comments

Recent technological advances in design and processing speed have successfully demonstrated a new snapshot mosaic imaging sensor architecture (SSI), allowing miniaturized platforms to efficiently acquire the spatio-spectral content of the dynamic scenes from a single exposure. However, SSI systems have a fundamental trade-off between spatial and spectral resolution because they associate each pixel with a specific spectral band.

Poster_ICASSP_2023_Kinan_Abbas.pdf

Poster ICASSP 2023 Kinan Abbas (306)

ICASSP_2023_Short_Video_Kinan_Abbas.pdf

Slides (262)

Categories:: Image/Video Processing
Source separation (MLR-SSEP)

58 Views

SD-PINN: Physics Informed Neural Networks for Spatially Dependent PDEs

Read more about SD-PINN: Physics Informed Neural Networks for Spatially Dependent PDEs
Log in to post comments

ICASSP23_Peter.pdf

ICASSP23_Peter.pdf (287)

Categories:: Machine Learning for Signal Processing

27 Views

Modeling the Wave Equation Using Physics-Informed Neural Networks Enhanced with Attention to Loss Weights

Presentation for ICASSP 2023: Modeling the Wave Equation Using Physics-Informed Neural Networks Enhanced with Attention to Loss Weights.

ICASSP_Slides.pdf

Presentation slides and video: ICASSP 2023 (646)

Categories:: Signal Processing Education

48 Views

A NOVEL STATE CONNECTION STRATEGY FOR QUANTUM COMPUTING TO REPRESENT AND COMPRESS DIGITAL IMAGES

Quantum image processing draws a lot of attention due to faster data computation and storage compared to classical data processing systems. Converting classical image data into the quantum domain and state label preparation complexity is still a challenging issue. The existing techniques normally connect the pixel values and the state position directly. Recently, the EFRQI (efficient flexible representation of the quantum image) approach uses an auxiliary qubit that connects the pixel-representing qubits to the state position qubits via Toffoli gates to reduce state connection.

Poster_ICASSP-2023_Ershadul_Final_For_Printing.pptx

icassp 2023 (230)

Categories:: Image/Video Coding

25 Views

Wireless location tracking via complex-domain Super MDS with time series self-localization information

We propose a wireless localization algorithm based on complex-domain super multidimensional scaling (CD-SMDS) augmented with a self-localization (SL) component, whereby each target tracks its own motion by incorporating bearing in- formation, obtained e.g., from integrated inertial sensors.

ICASSP2023_poster.pdf

Poster (284)

202306_ICASSP_NISHI (5).pdf

Manuscript (316)

Categories:: Communication and Sensing aspects of Sensor Networks, Wireless and Ad-Hoc Networks

29 Views

Pages