ICASSP 2021

ICASSP 2021, the IEEE International Conference on Acoustics, Speech and Signal Processing, is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics, and will provide an opportunity to network with like-minded professionals from around the world.

Slides for ICASSP 2021 paper on structure-aware alignment

The identification of structural differences between a music performance and the score is a challenging yet integral step of audio-to-score alignment, an important subtask of music signal processing. We present a novel method to detect such differences between the score and performance for a given piece of music using progressively dilated convolutional neural networks. Our method incorporates varying dilation rates at different layers to capture both short-term and long-term context, and can be employed successfully in the presence of limited annotated data.
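To illustrate why varying the dilation rate across layers captures both short-term and long-term context, the following minimal sketch computes the receptive field of a stack of 1-D convolutions. The kernel size and the dilation schedules are assumptions chosen for illustration; they are not taken from the paper.

```python
def receptive_field(kernel_size, dilations):
    """Receptive field (in input frames) of stacked stride-1 1-D convolutions.

    Each layer with dilation d widens the receptive field by (kernel_size - 1) * d.
    """
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


# Hypothetical schedules: four layers with kernel size 3.
fixed = receptive_field(3, [1, 1, 1, 1])        # no dilation
progressive = receptive_field(3, [1, 2, 4, 8])  # dilation doubles per layer
print(fixed, progressive)  # 9 31
```

With the same number of layers and parameters, the progressively dilated stack in this example sees 31 frames instead of 9, while its first layer still operates on adjacent frames.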

Ruchit_slides_ICASSP21.pdf

Ruchit_slides_ICASSP21.pdf (532)

Categories: Music Signal Processing

188 Views

(Poster) Unified Gradient Reweighting for Model Biasing with Applications to Source Separation

Recent deep learning approaches have shown great improvement in audio source separation tasks. However, the vast majority of such work is focused on improving average separation performance, often neglecting to examine or control the distribution of the results. In this paper, we propose a simple, unified gradient reweighting scheme, with a lightweight modification to bias the learning process of a model and steer it towards a certain distribution of results. More specifically, we reweight the gradient updates of each batch, using a user-specified probability distribution.
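As a rough illustration of reweighting the gradient updates within a batch, the sketch below uses a one-parameter linear model with a squared-error loss. The model, the loss, the learning rate, and the function names are all assumptions made for this example; the abstract does not specify how the authors compute or apply the weights.

```python
def per_example_grads(w, xs, ys):
    """Gradient of (w * x - y) ** 2 with respect to w, for each example."""
    return [2.0 * (w * x - y) * x for x, y in zip(xs, ys)]


def reweighted_step(w, xs, ys, probs, lr=0.1):
    """One gradient step where example i contributes with weight probs[i].

    probs is a user-specified probability distribution over the batch;
    a uniform distribution recovers the ordinary mean-gradient update.
    """
    if len(probs) != len(xs) or abs(sum(probs) - 1.0) > 1e-9:
        raise ValueError("probs must be a distribution over the batch")
    grads = per_example_grads(w, xs, ys)
    update = sum(p * g for p, g in zip(probs, grads))
    return w - lr * update


xs, ys = [1.0, 2.0], [2.0, 0.0]
print(reweighted_step(0.0, xs, ys, [0.5, 0.5]))  # uniform weighting
print(reweighted_step(0.0, xs, ys, [1.0, 0.0]))  # all weight on the first example
```

Shifting probability mass toward particular examples steers the update toward fitting those examples, which is the sense in which a chosen distribution can bias learning.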

gradient_reweighting_poster_icassp21 (1).pdf

Poster (333)

Categories: Source Separation and Signal Enhancement
Neural network learning (MLR-NNLR)

22 Views

MULTI-OBJECT TRACKING USING POISSON MULTI-BERNOULLI MIXTURE FILTERING FOR AUTONOMOUS VEHICLES

The ability of an autonomous vehicle to perform 3D tracking is essential for safe planning and navigation in cluttered environments. The main challenges for multi-object tracking (MOT) in autonomous driving applications reside in the inherent uncertainties regarding the number of objects, when and where the objects may appear and disappear, and uncertainties regarding objects' states. Random finite set (RFS)-based approaches can naturally model these uncertainties accurately and elegantly, and they have been widely used in radar-based tracking applications.

ICASSP_2021_SP.pdf

Presentation Slides for ICASSP 2021 paper #2837 (331)

Categories: Automotive Applications
Signal and System Modeling, Representation and Estimation

32 Views

MULTI-OBJECT TRACKING USING POISSON MULTI-BERNOULLI MIXTURE FILTERING FOR AUTONOMOUS VEHICLES

ICASSP_Poster_SP.pdf

Poster for ICASSP 2021 paper #2837 (412)

Categories: Automotive Applications
Signal and System Modeling, Representation and Estimation

23 Views

Slides: Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision

Most music source separation systems require large collections of isolated sources for training, which can be difficult to obtain. In this work, we use musical scores, which are comparatively easy to obtain, as a weak label for training a source separation system. In contrast with previous score-informed separation approaches, our system does not require isolated sources, and the score is used only as a training target and is not required for inference.

icassp2021_presentation_amy_transcription.pdf

icassp2021_presentation_amy_transcription.pdf (391)

Categories: Source Separation and Signal Enhancement
Music Signal Processing

23 Views

Poster: Transcription Is All You Need: Learning To Separate Musical Mixtures With Score As Supervision

icassp2021_poster_amy_transcription.pdf

icassp2021_poster_amy_transcription.pdf (307)

Categories: Source Separation and Signal Enhancement
Music Signal Processing

12 Views

Overcoming Measurement Inconsistency in Deep Learning for Linear Inverse Problems: Applications in Medical Imaging

The remarkable performance of deep neural networks (DNNs) currently makes them the method of choice for solving linear inverse problems. They have been applied to super-resolve and restore images, as well as to reconstruct MR and CT images. In these applications, DNNs invert a forward operator by finding, via training data, a map between the measurements and the input images. It is then expected that the map is still valid for the test data. This framework, however, introduces measurement inconsistency during testing.

5170_slides.pdf

5170_slides.pdf (384)

5170_Poster.pdf

5170_Poster.pdf (297)

Categories: Medical image analysis

22 Views

MULTI-DIRECTIONAL CONVOLUTION NETWORKS WITH SPATIAL-TEMPORAL FEATURE PYRAMID MODULE FOR ACTION RECOGNITION

Recent attempts show that factorizing 3D convolutional filters into separate spatial and temporal components brings impressive improvement in action recognition. However, traditional temporal convolution operating along the temporal dimension will aggregate unrelated features, since the feature maps of fast-moving objects have shifted spatial positions. In this paper, we propose a novel and effective Multi-Directional convolution (MDConv), which extracts features along different spatial-temporal orientations.

poster-4.pdf

The poster (362)

Categories: Image/Video Processing

24 Views

Audio Dequantization Using (Co)Sparse (Non)Convex Methods

The paper deals with the hitherto neglected topic of audio dequantization. It reviews the state-of-the-art sparsity-based approaches and proposes several new methods. Convex as well as non-convex approaches are included, and all the presented formulations come in both the synthesis and analysis variants. In the experiments, the methods are evaluated using the signal-to-distortion ratio (SDR) and PEMO-Q, a perceptually motivated metric.

Audio Dequantiztion Using (Co)Sparse (Non)Convex Methods Poster.pdf

Poster (292)

Audio Dequantization Using (Co)Sparse (Non)Convex Methods Presentation Slides.pdf

Presentation Slides (296)

Categories: Source Separation and Signal Enhancement

17 Views

Continuous CNN for Nonuniform Time Series

CNNs for time series data implicitly assume that the data are uniformly sampled, whereas many event-based and multi-modal data are nonuniform or have heterogeneous sampling rates. Directly applying a regular CNN to nonuniform time series is ungrounded, because it is unable to recognize and extract common patterns from the nonuniform input signals. In this paper, we propose the Continuous CNN (CCNN), which estimates the inherent continuous inputs by interpolation, and performs continuous convolution on the continuous input.
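The idea of interpolating first and convolving second can be sketched in a few lines. This is a simplified stand-in, not the authors' method: it assumes piecewise-linear interpolation and approximates the continuous convolution by evaluating the interpolated signal at evenly spaced offsets around the query time. All function names and parameters here are hypothetical.

```python
from bisect import bisect_right


def interpolate(times, values, t):
    """Piecewise-linear estimate of the signal at time t.

    times must be strictly increasing; values outside the sampled
    range are clamped to the nearest endpoint.
    """
    if t <= times[0]:
        return values[0]
    if t >= times[-1]:
        return values[-1]
    i = bisect_right(times, t)  # times[i - 1] <= t < times[i]
    t0, t1 = times[i - 1], times[i]
    v0, v1 = values[i - 1], values[i]
    return v0 + (v1 - v0) * (t - t0) / (t1 - t0)


def conv_on_interpolated(times, values, kernel, step, t):
    """Apply the kernel to interpolated values centred on time t.

    Kernel tap j is paired with the signal at t + (j - half) * step,
    so the taps lie on a uniform grid even if the samples do not.
    """
    half = len(kernel) // 2
    return sum(
        k * interpolate(times, values, t + (j - half) * step)
        for j, k in enumerate(kernel)
    )


# Nonuniformly sampled ramp: the underlying signal is v(t) = t.
times, values = [0.0, 1.0, 4.0], [0.0, 1.0, 4.0]
smoothing = [1 / 3, 1 / 3, 1 / 3]
print(conv_on_interpolated(times, values, smoothing, step=0.5, t=2.0))
```

Because the kernel taps are placed in time rather than by sample index, the same kernel responds to the same temporal pattern regardless of where the original samples happened to fall.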

CCNN_video.pdf

Presentation slides (316)

Categories: Neural network learning (MLR-NNLR)

30 Views
