ICASSP 2019

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

Blind Denoising of Mixed Gaussian-Impulse Noise by Single CNN

Read more about Blind Denoising of Mixed Gaussian-Impulse Noise by Single CNN
Log in to post comments

icassp_denoise.pdf

icassp_denoise.pdf (420)

Categories:: Image/Video Processing

60 Views

A Recursive Least-Squares Algorithm Based on the Nearest Kronecker Product Decomposition

presentation_ICASSP_2019_3.pdf

presentation_ICASSP_2019_3.pdf (527)

Categories:: Echo Cancellation

20 Views

Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks

Read more about Wav2Pix: Speech-conditioned Face Generation using Generative Adversarial Networks
Log in to post comments

Speech is a rich biometric signal that contains information about the identity, gender and emotional state of the speaker. In this work, we explore its potential to generate face images of a speaker by conditioning a Generative Adversarial Network (GAN) with raw speech input. We propose a deep neural network that is trained from scratch in an end-to-end fashion, generating a face directly from the raw speech waveform without any additional identity information (e.g reference image or one-hot encoding).

icassp-2019-wav2pix.pdf

slides (884)

Categories:: Audio for Multimedia

101 Views

Adversarial variational Bayes methods for Tweedie compound Poisson mixed models

Read more about Adversarial variational Bayes methods for Tweedie compound Poisson mixed models
Log in to post comments

The Tweedie Compound Poisson-Gamma model is routinely used for modeling non-negative continuous data with a discrete probability mass at zero. Mixed models with random effects account for the covariance structure related to the grouping hierarchy in the data. An important application of Tweedie mixed models is pricing the insurance policies, e.g. car insurance. However, the intractable likelihood function, the unknown variance function, and the hierarchical structure of mixed effects have presented considerable challenges for drawing inferences on Tweedie.

poster.pdf

poster.pdf (498)

Categories:: Bayesian learning; Bayesian signal processing (MLR-BAYL)

13 Views

Efficient Multi-agent Cooperative Navigation in Unknown Environments with Interlaced Deep Reinforcement Learning

EMCN.pdf

EMCN.pdf (403)

Categories:: Machine Learning for Signal Processing

46 Views

Speech as a Biomarker for Obstructive Sleep Apnea Detection

Read more about Speech as a Biomarker for Obstructive Sleep Apnea Detection
Log in to post comments

Obstructive sleep apnea (OSA) is a prevalent sleep disorder, responsible for a decrease of people’s quality of life, and significant morbidity and mortality associated with hypertension and cardiovascular diseases. OSA is caused by anatomical and functional alterations in the upper airways, thus we hypothesize that the speech properties of OSA patients are altered, making it possible to detect OSA through voice analysis.

botelho_presentation.pdf

Speech as a Biomarker for Obstructive Sleep Apnea Detection - Presentation Slides (427)

Categories:: Speech Analysis (SPE-ANLS)

26 Views

ENHANCING MUSIC FEATURES BY KNOWLEDGE TRANSFER FROM USER-ITEM LOG DATA

Read more about ENHANCING MUSIC FEATURES BY KNOWLEDGE TRANSFER FROM USER-ITEM LOG DATA
Log in to post comments

1905_ICASSP_dmlee_compact.pdf

1905_ICASSP_dmlee_compact.pdf (462)

Categories:: Music Signal Processing

12 Views

UNSUPERVISED TRAINING OF A DEEP CLUSTERING MODEL FOR MULTICHANNEL BLIND SOURCE SEPARATION

We propose a training scheme to train neural network-based source separation algorithms from scratch when parallel clean data is unavailable. In particular, we demonstrate that an unsupervised spatial clustering algorithm is sufficient to guide the training of a deep clustering system. We argue that previous work on deep clustering requires strong supervision and elaborate on why this is a limitation.

2019-05-09_unsupervised_poster.pdf

2019-05-09_unsupervised_poster.pdf (511)

Categories:: Audio and Acoustic Signal Processing

45 Views

ALL-NEURAL ONLINE SOURCE SEPARATION, COUNTING, AND DIARIZATION FOR MEETING ANALYSIS

Read more about ALL-NEURAL ONLINE SOURCE SEPARATION, COUNTING, AND DIARIZATION FOR MEETING ANALYSIS
Log in to post comments

Automatic meeting analysis comprises the tasks of speaker counting, speaker diarization, and the separation of overlapped speech, followed by automatic speech recognition. This all has to be carried out on arbitrarily long sessions and, ideally, in an online or block-online manner. While significant progress has been made on individual tasks, this paper presents for the first time an all-neural approach to simultaneous speaker counting, diarization and source separation.

presentation.pdf

presentation.pdf (564)

Categories:: Source Separation and Signal Enhancement

15 Views

Prediction of multi-target dynamics using discrete descriptors: An interactive approach

We propose a probabilistic method to track and interpret interactions of moving objects. The proposed method is based on the analysis of location data from different moving objects that modify their dynamics according to rules of interactions, namely attractive and repulsive forces governing moving objects in a scene. Our method uses a Bayesian structure to identify key elements of the interplay rules and facilitates the prediction of objects' dynamics as the interacting system.

ICASSP2019Poster.pdf

ICASSP2019Poster.pdf (369)

Categories:: Machine Learning for Signal Processing

15 Views

Pages