ICASSP 2019

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

A DOUBLE-CROSS-CORRELATION PROCESSOR FOR BLIND SAMPLING RATE OFFSET ESTIMATION IN ACOUSTIC SENSOR NETWORKS

Signal synchronization in wireless acoustic sensor networks requires an accurate estimation of the sampling rate offset (SRO) inevitably present in signals acquired by sensors of ad-hoc networks. Although some sophisticated methods for blind SRO estimation have been recently proposed in this very young field of research, there is still a need for the development of new ideas and concepts especially regarding robust approaches with low computational complexity.

ICASSP_2019_ID_1899_poster.pdf

ICASSP_2019_ID_1899_poster.pdf (450)

Categories:: Loudspeaker and Microphone Array Signal Processing

47 Views

Estimation of gaze region using two dimensional probabilistic maps constructed using convolutional neural networks

Predicting the gaze of a user can have important applications in hu- man computer interactions (HCI). They find applications in areas such as social interaction, driver distraction, human robot interaction and education. Appearance based models for gaze estimation have significantly improved due to recent advances in convolutional neural network (CNN). This paper proposes a method to predict the gaze of a user with deep models purely based on CNNs.

Jha_2019-poster.pdf

Jha_2019-poster.pdf (483)

Categories:: Image/Video Processing

34 Views

Retrieving speech samples with similar emotional content using a triplet loss function

The ability to identify speech with similar emotional content is valuable to many applications, including speech retrieval, surveil- lance, and emotional speech synthesis. While current formulations in speech emotion recognition based on classification or regression are not appropriate for this task, solutions based on preference learn- ing offer appealing approaches for this task. This paper aims to find speech samples that are emotionally similar to an anchor speech sample provided as a query. This novel formulation opens interest- ing research questions.

Harvill_2019-poster.pdf

Harvill_2019-poster.pdf (551)

Categories:: Speech Analysis (SPE-ANLS)

44 Views

On the Transferability of Adversarial Examples Against CNN-Based Image Forensics

Read more about On the Transferability of Adversarial Examples Against CNN-Based Image Forensics
Log in to post comments

Recent studies have shown that Convolutional Neural Networks (CNN) are relatively easy to attack through the generation of so-called adversarial examples. Such vulnerability also affects CNN-based image forensic tools. Research in deep learning has shown that adversarial examples exhibit a certain degree of transferability, i.e., they maintain part of their effectiveness even against CNN models other than the one targeted by the attack. This is a very strong property undermining the usability of CNN’s in security-oriented applications.

ICASSP 2019.pdf

ICASSP 2019.pdf (570)

Categories:: Multimedia security and content protection

119 Views

LMS: PAST, PRESENT AND FUTURE: Puzzles, Problems and Potentials

Read more about LMS: PAST, PRESENT AND FUTURE: Puzzles, Problems and Potentials
Log in to post comments

We give a brief history of the performance analysis of LMS.
Using averaging theory we show when and why the ‘independence
assumption’ ‘works’; we preface this with a fast
heuristic explanation of averaging methods, clarifying their
connection to the ‘ODE’ method. We then extend the discussion
to more recent distributed versions such as diffusion
LMS and consensus. While single node LMS is a single timescale
algorithm it turns out that distributed versions are twotime
scale systems, something that is not yet widely understood.

vsLMS-ICASSP19.pdf

vsLMS-ICASSP19.pdf (842)

Categories:: Signal Processing Education

133 Views

Speech Landmark Bigrams for Depression Detection from Naturalistic Smartphone Speech

Read more about Speech Landmark Bigrams for Depression Detection from Naturalistic Smartphone Speech
Log in to post comments

Detection of depression from speech has attracted significant research attention in recent years but remains a challenge, particularly for speech from diverse smartphones in natural environments. This paper proposes two sets of novel features based on speech landmark bigrams associated with abrupt speech articulatory events for depression detection from smartphone audio recordings. Combined with techniques adapted from natural language text processing, the proposed features further exploit landmark bigrams by discovering latent articulatory events.

ICASSP2019_Huang_V01_uploaded.pdf

ICASSP2019_Huang_V01_uploaded.pdf (845)

Categories:: Speech Processing

85 Views

Graph Signal Sampling via Reinforcement Learning

Read more about Graph Signal Sampling via Reinforcement Learning
Log in to post comments

We model the sampling and recovery of clustered graph signals as a reinforcement learning (RL) problem. The signal sampling is carried out by an agent which crawls over the graph and selects the most relevant graph nodes to sample. The goal of the agent is to select signal samples which allow for the most accurate recovery. The sample selection is formulated as a multi-armed bandit (MAB) problem, which lends naturally to learning efficient sampling strategies using the well-known gradient MAB algorithm.

Poster_Abramenko_Jung.pdf

Poster_Abramenko_Jung.pdf (633)

Categories:: Other applications of machine learning (MLR-APPL)

111 Views

ROBUST M-ESTIMATION BASED MATRIX COMPLETION

Read more about ROBUST M-ESTIMATION BASED MATRIX COMPLETION
Log in to post comments

Conventional approaches to matrix completion are sensitive to outliers and impulsive noise. This paper develops robust and computationally efficient M-estimation based matrix completion algorithms. By appropriately arranging the observed entries, and then applying alternating minimization, the robust matrix completion problem is converted into a set of regression M-estimation problems. Making use of differ- entiable loss functions, the proposed algorithm overcomes a weakness of the lp-loss (p ≤ 1), which easily gets stuck in an inferior point.

ICASSP_2019_Robust_M_Estimation_Based_Matrix_Completion_Poster.pdf

ICASSP_2019_Robust_M_Estimation_Based_Matrix_Completion_Poster.pdf (599)

Categories:: Statistical Signal Processing

107 Views

When can a System of Subnetworks be Registered Uniquely?

Read more about When can a System of Subnetworks be Registered Uniquely?
Log in to post comments

Consider a network with N nodes in d dimensions, and M overlapping subsets P_1,...,P_M (subnetworks). Assume that the nodes in a given P_i are observed in a local coordinate system. We wish to register the subnetworks using the knowledge of the observed coordinates. More precisely, we want to compute the positions of the N nodes in a global coordinate system, given P_1,...,P_M and the corresponding local coordinates. Among other applications, this problem arises in divide-and-conquer algorithms for localization of adhoc sensor networks.

SinghICASSP19.pdf

Unique Point Cloud Registration (537)

Categories:: Communications and Networking

27 Views

Speech Emotion Recognition Using Multi-hop Attention Mechanism

Read more about Speech Emotion Recognition Using Multi-hop Attention Mechanism
Log in to post comments

In this paper, we are interested in exploiting textual and acoustic data of an utterance for the speech emotion classification task. The baseline approach models the information from audio and text independently using two deep neural networks (DNNs). The outputs from both the DNNs are then fused for classification. As opposed to using knowledge from both the modalities separately, we propose a framework to exploit acoustic information in tandem with lexical data.

yoon2019speech_slide.pdf

presentation slide (936)

Categories:: Neural network learning (MLR-NNLR)

265 Views

Pages