ICASSP 2017

ICASSP is the world's largest and most comprehensive technical conference on signal processing and its applications. It provides a networking opportunity for like-minded professionals from around the world. The ICASSP 2017 conference will feature presentations by internationally renowned speakers and cutting-edge session topics.

ON DNN POSTERIOR PROBABILITY COMBINATION IN MULTI-STREAM SPEECH RECOGNITION FOR REVERBERANT ENVIRONMENTS

A multi-stream framework with deep neural network (DNN) classifiers has been applied in this paper to improve automatic speech recognition (ASR) performance in environments with different reverberation characteristics. We propose a room parameter estimation model to determine the stream weights for DNN posterior probability combination with the aim of obtaining reliable log-likelihoods for decoding. The model is implemented by training a multi-layer

poster_icassp17_xiongetal.pdf (639)

Categories: Robust Speech Recognition (SPE-ROBU)

6 Views

HEARTMATE: AUTOMATED INTEGRATED ANOMALY ANALYSIS FOR EFFECTIVE REMOTE CARDIAC HEALTH MANAGEMENT

Remote cardiac health management is an important healthcare application. We have developed Heartmate, which enables basic screening of cardiac health using low-cost sensors or sensors built into smartphones, without manual intervention. It consists of a robust denoising algorithm along with effective anomaly analytics for physiological signals. Heartmate identifies and eliminates signal corruption and detects cardiac anomalies from physiological cardiac signals such as heart sound, or phonocardiogram (PCG), and photoplethysmogram (PPG).

chetanya_poster.pdf

Poster for the demo to be shown at ICASSP 2017 (306)

Categories: Other applications of machine learning (MLR-APPL)

20 Views

RECONSTRUCTION OF 3D SURFACE FROM 2D HOLOGRAPHIC SIGNAL BASED ON KALMAN FILTER

RECONSTRUCTION OF 3D SURFACE FROM 2D HOLOGRAPHIC SIGNAL BASED ON KALMAN FILTER.pdf (80)

Categories: Image, Video, and Multidimensional Signal Processing

13 Views

DEEP LEARNING BASED AUTOMATIC VOLUME CONTROL AND LIMITER SYSTEM

Automatic speech recognition now plays an important role in volume control and adjustment in modern smart speakers. Based on recognition results obtained with deep neural networks, this paper proposes an efficient processing system for automatic volume control (AVC) and limiter. Theoretical analysis and subjective and objective test results show that the proposed system significantly improves speech recognition performance during audio playback and also improves audio playback performance in smart speakers.

ICASSP2017_poster_paper1196.pdf (331)

Categories: Emerging DSP Applications

20 Views

Fast Exemplar Selection Algorithm for Matrix Approximation and Representation: A Variant oASIS Algorithm

Extracting inherent patterns from large data by decomposing the data matrix with a sampled subset of exemplars has found many applications in machine learning. We propose a computationally efficient algorithm for adaptive exemplar sampling, called fast exemplar selection (FES). The proposed algorithm can be seen as an efficient variant of the oASIS algorithm (Patel et al.). FES iteratively selects incoherent exemplars based on the exemplars that are already sampled. This is done by ensuring that the selected exemplars form a positive

conference_poster_4.pdf (321)

Categories: Machine Learning for Signal Processing

8 Views

Image Denoising via Group Sparsity Residual Constraint

Group sparsity, or nonlocal image representation, has shown great potential in image denoising. However, most existing methods consider only the nonlocal self-similarity (NSS) prior of the noisy input image; that is, similar patches are collected only from the degraded input, which makes the quality of image denoising depend largely on the input itself. In this paper we propose a new prior model for image denoising, called group sparsity residual constraint (GSRC).

ICASSP--2017.pdf

123 (532)

Categories: Image, Video, and Multidimensional Signal Processing

2 Views

AUTOMATIC DETECTION OF SYLLABLE STRESS USING SONORITY BASED PROMINENCE FEATURES FOR PRONUNCIATION EVALUATION

Automatic syllable stress detection is useful for automatically assessing and diagnosing the pronunciation quality of second language (L2) learners. Typically, syllable stress depends on three prominence measures (intensity level, duration, and pitch) around the sound unit with the highest sonority in the respective syllable. Stress detection is often formulated as a binary classification task using cues from the feature contours representing the prominence measures.

ICASSP17.pdf (549)

Categories: Speech Analysis (SPE-ANLS)

13 Views

TWO-DIMENSIONAL ANTI-JAMMING COMMUNICATION BASED ON DEEP REINFORCEMENT LEARNING

poster_ICASSP17_paper1551.pdf (345)

Categories: Communications and Network Security

8 Views

ENHANCED DEPTH ESTIMATION FOR HAND-HELD LIGHT FIELD CAMERAS

ENHANCED DEPTH ESTIMATION FOR HAND-HELD LIGHT FIELD CAMERAS_YANWEN QIN.pdf (349)

Categories: Image/Video Processing

13 Views

ROBUST VISUAL TRACKING VIA DEEP DISCRIMINATIVE MODEL

In this paper, we exploit deep convolutional features for object appearance modeling and propose a simple yet effective deep discriminative model (DDM) for visual tracking. The proposed DDM takes the deep features as input and outputs an object-background confidence map. Because both spatial information from lower convolutional layers and semantic information from higher layers benefit object tracking, we construct multiple deep discriminative models (DDMs) for each layer and combine the confidence maps from each layer to obtain the final object-background confidence map.
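
The abstract does not say how the per-layer confidence maps are combined. As a minimal sketch only, assuming a normalized weighted average (the function names, the `weights` parameter, and the toy maps are hypothetical and not taken from the paper), the fusion step could look like this:

```python
def combine_confidence_maps(maps, weights=None):
    """Fuse per-layer confidence maps by a normalized weighted average.

    maps: list of equally sized 2D lists, one map per convolutional layer.
    weights: optional per-layer weights (assumption; uniform if omitted).
    """
    if weights is None:
        weights = [1.0] * len(maps)
    total = float(sum(weights))
    rows, cols = len(maps[0]), len(maps[0][0])
    fused = [[0.0] * cols for _ in range(rows)]
    for layer_map, weight in zip(maps, weights):
        for r in range(rows):
            for c in range(cols):
                fused[r][c] += (weight / total) * layer_map[r][c]
    return fused


def locate_object(confidence_map):
    """Return the (row, col) position with the highest fused confidence."""
    positions = [
        (r, c)
        for r in range(len(confidence_map))
        for c in range(len(confidence_map[0]))
    ]
    return max(positions, key=lambda rc: confidence_map[rc[0]][rc[1]])


# Toy maps standing in for a lower (spatial) and a higher (semantic) layer.
low_layer = [[0.2, 0.8], [0.1, 0.1]]
high_layer = [[0.0, 1.0], [0.0, 0.0]]
fused = combine_confidence_maps([low_layer, high_layer])
target = locate_object(fused)
```

A weighted average is only one possible fusion rule; a tracker could equally use a product of maps or learned weights, and the paper may do something different.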

ICASSP_17_Tracking_Poster.pdf (814)

Categories: Image/Video Processing

7 Views
