ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

A compressive sensing-based active user and symbol detection for massive machine type communications

icassp18_r3.pdf

icassp18_r3.pdf (586)

Categories:: Communications and Networking

20 Views

ICASSP 2018 Tutorial T11 Natual and Augmented Listening for VR/AR/MR

Read more about ICASSP 2018 Tutorial T11 Natual and Augmented Listening for VR/AR/MR
Log in to post comments

This tutorial aims to equip the participants with basic and advanced signal processing techniques that can be used in VR/AR applications to create a natural and augmented listening experience using headsets.
This tutorial is divided into 5 sections and cover following topics:
Introduction to spatial audio, fundamentals in natural listening, and emerging audio applications

ICASSP2018_Tutorial_T11_Natual_and_Augmented_Listening_for_VR_AR_MR.pdf

ICASSP2018_Tutorial_T11_Natual_and_Augmented_Listening_for_VR_AR_MR.pdf (675)

Categories:: Spatial and Multichannel Audio

391 Views

APPROXIMATE BELIEF PROPAGATION DECODER FOR POLAR CODES

Read more about APPROXIMATE BELIEF PROPAGATION DECODER FOR POLAR CODES
Log in to post comments

icassp2018_poster.pdf

icassp2018_poster.pdf (718)

Categories:: Design and Implementation of Signal Processing Systems

8 Views

Restoration of ultrasound images using spatially-variant kernel deconvolution

Read more about Restoration of ultrasound images using spatially-variant kernel deconvolution
Log in to post comments

Most of the existing ultrasound image restoration methods consider a spatially-invariant point-spread function (PSF) model and circulant boundary conditions. While computationally efficient, this model is not realistic and severely limits the quality of reconstructed images. In this work, we address ultrasound image restoration under the hypothesis of vertical variation of the PSF. To regularize the solution, we use the classical elastic net constraint.

Florea2018ICASSP.pdf

Florea2018ICASSP.pdf (654)

Categories:: Medical imaging

53 Views

NATURAL TTS SYNTHESIS BY CONDITIONING WAVENET ON MEL SPECTROGRAM PREDICTIONS

Read more about NATURAL TTS SYNTHESIS BY CONDITIONING WAVENET ON MEL SPECTROGRAM PREDICTIONS
Log in to post comments

ICASSP 2018 - Tacotron 2.pdf

ICASSP 2018 - Tacotron 2.pdf (1608)

Categories:: Speech Synthesis and Generation, including TTS (SPE-SYNT)

80 Views

SVSGAN: SINGING VOICE SEPARATION VIA GENERATIVE ADVERSARIAL NETWORK

Read more about SVSGAN: SINGING VOICE SEPARATION VIA GENERATIVE ADVERSARIAL NETWORK
Log in to post comments

fan18icassp_poster.pdf

fan18icassp_poster.pdf (506)

Categories:: Music Signal Processing

35 Views

Attention-based End-to-end Speech Recognition on Voice Search

Read more about Attention-based End-to-end Speech Recognition on Voice Search
Log in to post comments

SP-L1.4.pdf

SP-L1.4.pdf (721)

Categories:: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

16 Views

AN END-TO-END APPROACH TO JOINT SOCIAL SIGNAL DETECTION AND AUTOMATIC SPEECH RECOGNITION

Social signals such as laughter and fillers are often observed in natural conversation, and they play various roles in human-to-human communication. Detecting these events is useful for transcription systems to generate rich transcription and for dialogue systems to behave as we do such as synchronized laughing or attentive listening. We have studied an end-to-end approach to directly detect social signals from speech by using connectionist temporal classification (CTC), which is one of the end-to-end sequence labelling models.