ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

UNSUPERVISED LEARNING APPROACH TO FEATURE ANALYSIS FOR AUTOMATIC SPEECH EMOTION RECOGNITION

The scarcity of emotional speech data is a bottleneck of developing automatic speech emotion recognition (ASER) systems. One way to alleviate this issue is to use unsupervised feature learning techniques to learn features from the widely available general speech and use these features to train emotion classifiers. These unsupervised methods, such as denoising autoencoder (DAE), variational autoencoder (VAE), adversarial autoencoder (AAE) and adversarial variational Bayes (AVB), can capture the intrinsic structure of the data distribution in the learned feature representation.

icassp-2018-poster.pdf

icassp-2018-poster.pdf (620)

Categories:: Audio and Acoustic Signal Processing

68 Views

Robust Spoken Language Understanding with unsupervised ASR-error adaptation

Read more about Robust Spoken Language Understanding with unsupervised ASR-error adaptation
Log in to post comments

Robustness to errors produced by automatic speech recognition (ASR) is essential for Spoken Language Understanding (SLU). Traditional robust SLU typically needs ASR hypotheses with semantic annotations for training. However, semantic annotation is very expensive, and the corresponding ASR system may change frequently. Here, we propose a novel unsupervised ASR-error adaptation method, obviating the need of annotated ASR hypotheses.

zhu-icassp18-poster.pdf

zhu-icassp18-poster.pdf (758)

Categories:: Spoken Language Processing

82 Views

An Instrumental Intelligibility Metric Based on Information Theory

Read more about An Instrumental Intelligibility Metric Based on Information Theory

kuykkleijnhendriks2018 (3).pdf

kuykkleijnhendriks2018 (3).pdf (537)

Categories:: Speech Processing

12 Views

MANIFOLD-BASED ANALYSIS OF NATURAL STOCHASTIC TEXTURES WITH APPLICATION IN TEXTURE SYNTHESIS

Embedding textured images in manifolds reveals latent information regarding texture structure and allows useful analysis of these high dimensional images in a low dimensional space. We present a framework for analysis and synthesis of natural stochastic textures (NST) which constitute an important subset of textures that are modelled as realizations of random processes. The randomness of NST differentiates them from other types of images and requires a dedicated method for analysis and synthesis. We demonstrate several applications of this framework.

ido-z-icassp18-talk.pdf

ido-z-icassp18-talk.pdf (396)

Categories:: Image/Video Processing

10 Views

EFFICIENT CONVOLUTIONAL DICTIONARY LEARNING USING PARTIAL UPDATE FAST ITERATIVE SHRINKAGE-THRESHOLDING ALGORITHM

Convolutional sparse representations allow modeling an entire image as an alternative to the more common independent patch-based
formulations. Although many approaches have been proposed to efficiently solve the convolutional dictionary learning (CDL) problem,
their computational performance is constrained by the dictionary update stage. In this work, we include two improvements to existing

poster_3908.pdf

poster_3908.pdf (470)

Categories:: Signal Processing Theory and Methods

12 Views

Secrecy Capacity Under List Decoding for a Channel With a Passive Eavesdropper and an Active Jammer

Presentation.pdf

Presentation.pdf (940)

Categories:: Communications and Network Security

11 Views

Learning-Based Acoustic Source-Microphone Distance Estimation using the Coherent-to-Diffuse Power Ratio

presentation_final.pdf

presentation_final.pdf (803)

Categories:: Audio and Acoustic Signal Processing

36 Views

ESTIMATION OF THE SOUND FIELD AT ARBITRARY POSITIONS IN DISTRIBUTED MICROPHONE NETWORKS BASED ON DISTRIBUTED RAY SPACE TRANSFORM

Paper #3031.pdf

Paper #3031.pdf (343)

Categories:: Source Separation and Signal Enhancement

18 Views

Mobile Bayesian Spectrum Learning for Heterogeneous Networks

Read more about Mobile Bayesian Spectrum Learning for Heterogeneous Networks
Log in to post comments

Spectrum sensing in heterogeneous networks is very challenging as it usually requires a large number of static secondary users (SUs) to obtain the global spectrum states. In this paper, we tackle the spectrum sensing in heterogeneous networks from a new perspective. We exploit the mobility of multiple SUs to simultaneously collect spatial-temporal spectrum sensing data. Then, we propose a novel non-parametric Bayesian learning model, referred to as beta process hidden Markov model to capture the spatio-temporal correlation in the collected spectrum data.