ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

Correlated Tensor Factorization for Audio Source Separation

Read more about Correlated Tensor Factorization for Audio Source Separation
Log in to post comments

This paper presents an ultimate extension of nonnegative matrix factorization (NMF) for audio source separation based on full covariance modeling over all the time-frequency (TF) bins of the complex spectrogram of an observed mixture signal. Although NMF has been widely used for decomposing an observed power spectrogram in a TF-wise manner, it has a critical limitation that the phase values of interdependent TF bins cannot be dealt with.

icassp-2018-yoshii-poster.pdf

icassp-2018-yoshii-poster.pdf (555)

Categories:: Source Separation and Signal Enhancement

45 Views

Time-Frequency Networks for Audio Super-Resolution

Read more about Time-Frequency Networks for Audio Super-Resolution
Log in to post comments

Audio super-resolution (a.k.a. bandwidth extension) is the challenging task of increasing the temporal resolution of audio signals. Recent deep networks approaches achieved promising results by modeling the task as a regression problem in either time or frequency domain. In this paper, we introduced Time-Frequency Network (TFNet), a deep network that utilizes supervision in both the time and frequency domain. We proposed a novel model architecture which allows the two domains to be jointly optimized.

audio_sr_poster.pdf

audio_sr_poster.pdf (652)

Categories:: Audio and Acoustic Signal Processing

343 Views

Image Restoration with Deep Generative Models

Read more about Image Restoration with Deep Generative Models
Log in to post comments

Many image restoration problems are ill-posed in nature, hence, beyond the input image, most existing methods rely on a carefully engineered image prior, which enforces some local image consistency in the recovered image. How tightly the prior assumptions are fulfilled has a big impact on the resulting task performance. To obtain more flexibility, in this work, we proposed to design the image prior in a data-driven manner. Instead of explicitly defining the prior, we learn it using deep generative models.

presentation.pdf

presentation.pdf (980)

Categories:: Image, Video, and Multidimensional Signal Processing

227 Views

A UNIFIED APPROACH TO GENERATING SOUND ZONES USING VARIABLE SPAN LINEAR FILTERS

Read more about A UNIFIED APPROACH TO GENERATING SOUND ZONES USING VARIABLE SPAN LINEAR FILTERS
Log in to post comments

Sound zones are typically created using Acoustic Contrast Control (ACC), Pressure Matching (PM), or variations of the two. ACC maximizes the acoustic potential energy contrast between a listening zone and a quiet zone. Although the contrast is maximized, the phase is not controlled. To control both the amplitude and the phase, PM instead minimizes the difference between the reproduced sound field and the desired sound field in all zones.

[Poster] VAST_ICASSP2018_final+.pdf

[Poster] VAST_ICASSP2018_final+.pdf (413)

Categories:: Spatial and Multichannel Audio

97 Views

Deep Clustering with Gated Convolutional Networks

Read more about Deep Clustering with Gated Convolutional Networks
Log in to post comments

ICASSP2018_ppt.pdf

ICASSP2018_ppt.pdf (746)

Categories:: Audio and Acoustic Signal Processing

87 Views

ICASSP 2018 Tutorial T11 Natual and Augmented Listening for VR/AR/MR

Read more about ICASSP 2018 Tutorial T11 Natual and Augmented Listening for VR/AR/MR
Log in to post comments

This tutorial aims to equip the participants with basic and advanced signal processing techniques that can be used in VR/AR applications to create a natural and augmented listening experience using headsets.
This tutorial is divided into 5 sections and cover following topics:
Introduction to spatial audio, fundamentals in natural listening, and emerging audio applications

ICASSP2018_Tutorial_T11_Natual_and_Augmented_Listening_for_VR_AR_MR.pdf

Tutorial slides (564)

Categories:: Spatial and Multichannel Audio

202 Views

AN IMPROVED INITIALIZATION FOR LOW-RANK MATRIX COMPLETION BASED ON RANK-1 UPDATES

Read more about AN IMPROVED INITIALIZATION FOR LOW-RANK MATRIX COMPLETION BASED ON RANK-1 UPDATES
Log in to post comments

Given a data matrix with partially observed entries, the low-rank matrix completion problem is one of finding a matrix with the lowest rank that perfectly fits the given observations. While there exist convex relaxations for the low-rank completion problem, the underlying problem is inherently non-convex, and most algorithms (alternating projection, Riemannian optimization, etc.) heavily depend on the initialization. This paper proposes an improved initialization that relies on successive rank-1 updates.