ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

Weighted Block Sparse Bayesian Learning for Basis Selection

Read more about Weighted Block Sparse Bayesian Learning for Basis Selection
Log in to post comments

poster-weighted-block.pdf

poster-weighted-block.pdf (391)

Categories:: Sampling and Reconstruction

19 Views

Improved Algorithms for Differentially private Orthogonal Tensor Decomposition

Read more about Improved Algorithms for Differentially private Orthogonal Tensor Decomposition
Log in to post comments

Tensor decompositions have applications in many areas including signal processing, machine learning, computer vision and neuroscience. In this paper, we propose two new differentially private algorithms for orthogonal decomposition of symmetric tensors from private or sensitive data; these arise in applications such as latent variable models. Differential privacy is a formal privacy framework that guarantees protections against adversarial inference.

Imtiaz_Sarwate_dpOTD_slides_ICASSP2018.pdf

Presentation slides (427)

Categories:: Learning theory and algorithms (MLR-LEAR)

19 Views

Differentially private Distributed Principal Component Analysis

Read more about Differentially private Distributed Principal Component Analysis
Log in to post comments

Differential privacy is a cryptographically-motivated formal privacy definition that is robust against strong adversaries. The principal component analysis (PCA) algorithm is frequently used in signal processing, machine learning, and statistics pipelines. In many scenarios, private or sensitive data is distributed across different sites: in this paper we propose a differentially private distributed PCA scheme to enable collaborative dimensionality reduction.

Imtiaz_Sarwate_dpPCA_slides_ICASSP2018.pdf

Presentation slides (499)

Categories:: Learning theory and algorithms (MLR-LEAR)

38 Views

EXPLORING THE USE OF GROUP DELAY FOR GENERALISED VTS BASED NOISE COMPENSATION

Read more about EXPLORING THE USE OF GROUP DELAY FOR GENERALISED VTS BASED NOISE COMPENSATION
Log in to post comments

In earlier work we studied the effect of statistical normalisation for phase-based features and observed it leads to a significant robustness improvement. This paper explores the extension of the generalised Vector Taylor Series (gVTS) noise compensation approach to the group delay (GD) domain. We discuss the problems it presents, propose some solutions and derive the corresponding formulae. Furthermore, the effects of additive and channel noise in the GD domain were studied.

ICASSP2018_Slides.pdf

Presentation.SLIDES (511)

ICASSP18_Erfan_128k.zip

Presentation.MP3 (484)

Categories:: Robust Speech Recognition (SPE-ROBU)

16 Views

FAST ROBUST TRACKING VIA DOUBLE CORRELATION FILTER FORMULATION

Read more about FAST ROBUST TRACKING VIA DOUBLE CORRELATION FILTER FORMULATION
Log in to post comments

Over the past few years, fast and robust trackers based on Kernelized Correlation Filters have shown top notch performance on the Visual Object Tracking challenge. However there is still scope for obtaining higher performance through the use of reasonable approximations that can easily be shown to work through empirical methods. We study some variants derived from the Discriminative Scale Space Tracker and show significant improvement in tracking performance.

Wipro_ICASSP_2018_poster.pdf

Poster (387)

Categories:: Image/Video Processing

55 Views

Common and Individual Feature Extraction using Tensor Decompositions: A Remedy for the Curse of Dimensionality?

A novel method for common and individual feature analysis from exceedingly large-scale data is proposed, in order to ensure the tractability of both the computation and storage and thus mitigate the curse of dimensionality, a major bottleneck in modern data science. This is achieved by making use of the inherent redundancy in so-called multi-block data structures, which represent multiple observations of the same phenomenon taken at different times, angles or recording conditions.

KISIL_ICASSP_2018.pdf

KISIL_ICASSP_2018.pdf (572)

Categories:: Emerging: Big Data

30 Views

A Novel Method for Human Bias Correction of Continuous-time Annotations

Read more about A Novel Method for Human Bias Correction of Continuous-time Annotations
Log in to post comments

Human annotations are of integral value in human behavior studies and in particular for the generation of ground truth for behavior prediction using various machine learning methods. These often subjective human annotations are especially required for studies involving measuring and predicting hidden mental states (e.g. emotions) that cannot effectively be measured or assessed by other means. Human annotations are noisy and prone to the influence of several factors including personal bias, task ambiguity, environmental distractions, and health state.

booth_ICASSP_2018.pdf

booth_ICASSP_2018.pdf (592)

Categories:: Signal Processing Theory and Methods

93 Views

MULTI-VIEW AUDIO-ARTICULATORY FEATURES FOR PHONETIC RECOGNITION ON RTMRI-TIMIT DATABASE

In this paper, we investigate the use of articulatory informa-
tion, and more specifically real time Magnetic Resonance
Imaging (rtMRI) data of the vocal tract, to improve speech
recognition performance. For the purpose of our experiments,
we use data from the rtMRI-TIMIT database. Firstly, Scale
Invariant Feature Transform (SIFT) features are extracted for
each video frame. Afterwards, the SIFT descriptors of each
frame are transformed to a single histogram per picture, by
using the Bag of Visual Words methodology. Since this kind

ICASSP_2018_poster_final.pdf

ICASSP_2018_poster_final.pdf (1105)

Categories:: Audio and Acoustic Signal Processing

12 Views

Framework For Evaluation Of Sound Event Detection In Web Videos

Read more about Framework For Evaluation Of Sound Event Detection In Web Videos
Log in to post comments

The largest source of sound events is web videos. Most videos lack sound event labels at segment level, however, a significant number of them do respond to text queries, from a match found using metadata by search engines. In this paper we explore the extent to which a search query can be used as the true label for detection of sound events in videos. We present a framework for large-scale sound event recognition on web videos. The framework crawls videos using search queries corresponding to 78 sound event labels drawn from three datasets.

icassp-poster-framework.pdf

ICASSP 2018 Framework for Evaluation of Sound Event Detection in Web Videos Poster (593)

icassp-poster-framework.pdf

icassp-poster-framework.pdf (483)

Categories:: Multimedia databases and digital libraries

17 Views

A CONVERSATIONAL NEURAL LANGUAGE MODEL FOR SPEECH RECOGNITION IN DIGITAL ASSISTANTS

Read more about A CONVERSATIONAL NEURAL LANGUAGE MODEL FOR SPEECH RECOGNITION IN DIGITAL ASSISTANTS
Log in to post comments

Speech recognition in digital assistants such as Google Assistant can
potentially benefit from the use of conversational context consisting of user
queries and responses from the agent. We explore the use of recurrent,
Long Short-Term Memory (LSTM), neural language models (LMs) to model the conversations
in a digital assistant. Our proposed methods effectively capture the context of
previous utterances in a conversation without modifying the underlying LSTM
architecture. We demonstrate a 4% relative improvement in recognition performance

conversation.pdf

conversation.pdf (436)

Categories:: Audio and Acoustic Signal Processing

64 Views

Pages