ICASSP 2023

IEEE ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2023 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

Untrained graph neural networks for denoising

Read more about Untrained graph neural networks for denoising
Log in to post comments

A fundamental problem in signal processing is to denoise a signal. While there are many well-performing methods for denoising signals defined on regular domains, including images defined on a two-dimensional pixel grid, many important classes of signals are defined over irregular domains that can be conveniently represented by a graph. This paper introduces two untrained graph neural network architectures for graph signal denoising, develops theoretical guarantees for their denoising capabilities in a simple setup, and provides empirical evidence in more general scenarios.

ICASSP23_Untrained_GNN_Signal_Denoising_poster.pdf

Poster that will be presented at ICASSP 2023 (286)

ICASSP23-Untrained_GNN_Signal_Denoising_slides.pdf

Slides of the video introducing the work (307)

Categories:: Neural network learning (MLR-NNLR)

76 Views

Feature Selection and Text Embedding For Detecting Dementia from Spontaneous Cantonese

Dementia is a severe cognitive impairment that affects the health of older adults and creates a burden on their families and caretakers. This paper analyzes diverse hand-crafted features extracted from spoken languages and selects the most discriminative ones for dementia detection. Recently, the performance of dementia detection has been significantly improved by utilizing Transformer-based models that automatically capture the structural and linguistic properties of spoken languages. We investigate Transformer-based features and propose an end-to-end system for dementia detection.

ICASSP2023.pdf

ICASSP2023.pdf (413)

Categories:: Spoken Language Processing

43 Views

MASKED MODELING DUO: LEARNING REPRESENTATIONS BY ENCOURAGING BOTH NETWORKS TO MODEL THE INPUT

Masked Autoencoders is a simple yet powerful self-supervised learning method. However, it learns representations indirectly by reconstructing masked input patches. Several methods learn representations directly by predicting representations of masked patches; however, we think using all patches to encode training signal representations is suboptimal. We propose a new method, Masked Modeling Duo (M2D), that learns representations directly while obtaining training signals using only masked patches.

20230607-ICASSP2023-M2D-Poster.pptx

20230607-ICASSP2023-M2D-Poster.pptx (332)

Categories:: Audio Processing Systems
Neural network learning (MLR-NNLR)

31 Views

Federated Intelligent Terminals Facilitate Stuttering Monitoring

Read more about Federated Intelligent Terminals Facilitate Stuttering Monitoring
Log in to post comments

Stuttering is a complicated language disorder. The most common form of stuttering is developmental stuttering, which begins in childhood. Early monitoring and intervention are essential for the treatment of children with stuttering. Automatic speech recognition technology has shown its great potential for non-fluent disorder identification, whereas the previous work has not considered the privacy of users' data. To this end, we propose federated intelligent terminals for automatic monitoring of stuttering speech in different contexts.

ICASSP_camera_slides.pdf

Presentation Slides(PDF) (296)

ICASSP_camerady.pptx

Presentation Slides(PPT) (295)

Categories:: Speech Analysis (SPE-ANLS)
Applications
Biomedical signal processing

89 Views

COLUMN-BASED MATRIX APPROXIMATION WITH QUASI-POLYNOMIAL STRUCTURE

Read more about COLUMN-BASED MATRIX APPROXIMATION WITH QUASI-POLYNOMIAL STRUCTURE
Log in to post comments

A novel matrix completion problem is considered herein: observations based on fully sampled columns and quasi-polynomial side information is exploited. The framework is motivated by quantum chemistry problems wherein full matrix computation is expensive, but partial computations only lead to column information. The proposed algorithm successfully estimates the row-space of a true matrix given a priori knowledge of the true matrix. A theoretical error bound is provided, which captures the possible inaccuracies of the side information.

ICASSP_2023-4.pdf

ICASSP_2023-4.pdf (247)

JeongminChae_ICASSP_v2.pdf

Poster (182)

Categories:: Statistical Signal Processing

26 Views

Recursive/Iterative Unique Projection-Aggregation Decoding of Reed-Muller Codes

Read more about Recursive/Iterative Unique Projection-Aggregation Decoding of Reed-Muller Codes
Log in to post comments

We describe recursive unique projection-aggregation (RUPA) decoding and iterative unique projection-aggregation (IUPA) decoding of Reed-Muller (RM) codes, which remove non-unique projections from the recursive projection-aggregation (RPA) and iterative projection-aggregation (IPA) algorithms respectively.
We show that these algorithms have competitive error-correcting performance while requiring up to 95% projections less than the baseline RPA algorithm.

Poster.pdf

RuPA/IuPA poster (presented in ICASSP2023) (696)

Categories:: Signal Transmission and Reception

33 Views

Active selection of source patients in transfer learning for epileptic seizure detection using Riemannian Manifold

Epileptic seizure detection from long recordings of scalp electroencephalography (EEG) is a challenging task owing to their unpredictability in nature with the inclusion of noise, artifacts and subject dependency. We hypothesize that selection of training EEG data plays important role in the model performance. Thus, we introduced an active learning based training data selection and modification method with a Riemannian geometry, centroid alignment, tangent space mapping and a support vector machine classifier.

ICASSP2023_hassan_Poster_Final.pdf

Active selection of source patients in transfer learning for epileptic seizure detection using Riemannian Manifold (215)

Categories:: Biomedical signal processing

18 Views

Restoration of Time-Varying Graph Signals using Deep Algorithm Unrolling

Read more about Restoration of Time-Varying Graph Signals using Deep Algorithm Unrolling
Log in to post comments

In this paper, we propose a restoration method of time-varying graph signals, i.e., signals on a graph whose signal values change over time, using deep algorithm unrolling. Deep algorithm unrolling is a method that learns parameters in an iterative optimization algorithm with deep learning techniques. It is expected to improve convergence speed and accuracy while the iterative steps are still interpretable. In the proposed method, the minimization problem is formulated so that the time-varying graph signal is smooth both in time and spatial domains.

slide.pdf

slide.pdf (275)

ICASSP2023_Kojima_poster.pdf

ICASSP2023_Kojima_poster.pdf (279)

Categories:: Signal Processing Theory and Methods

76 Views

Frame-Level Multi-Label Playing Technique Detection Using Multi-Scale Network and Self-Attention Mechanism

Instrument playing technique (IPT) is a key element of musical presentation. However, most of the existing works for IPT detection only concern monophonic music signals, yet little has been done to detect IPTs in polyphonic instrumental solo pieces with overlapping IPTs or mixed IPTs. In this paper, we formulate it as a framelevel multi-label classification problem and apply it to Guzheng, a Chinese plucked string instrument. We create a new dataset, Guzheng Tech99, containing Guzheng recordings and onset, offset, pitch, IPT annotations of each note.

FRAME(1).pdf

FRAME(1).pdf (304)

Categories:: Music Signal Processing

23 Views

Block-Based Color Constancy: The Deviation of Salient Pixels

Read more about Block-Based Color Constancy: The Deviation of Salient Pixels
Log in to post comments

We recently proposed a color constancy method based on the observations that the human visual system might be "discounting the illuminant" by using space-average color and the highest luminance patches. Based on these observations, our algorithm relies on two assumptions: (i) there are several bright pixels in the scene, and (ii) the world is gray, on average. The main idea of the algorithm is to estimate the illuminant by finding the deviation of the brightest pixels from the gray value. During experiments, we observed that some pixels decrease the performance of the method.

Slides_only.pdf

The presentation of "Block-Based Color Constancy: The Deviation of Salient Pixels" (222)

Categories:: Image/Video Processing

18 Views

Pages