ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

ICASSP2018-SADL

Read more about ICASSP2018-SADL
Log in to post comments

ICASSP2018-SADL.pdf

ICASSP2018-SADL.pdf (476)

Categories:: Machine Learning for Signal Processing

5 Views

Dropout approaches for LSTM based speech recognition systems

Read more about Dropout approaches for LSTM based speech recognition systems
Log in to post comments

In this paper we examine dropout approaches in a Long Short Term Memory (LSTM) based automatic speech recognition (ASR) system trained with the Connectionist Temporal Classification (CTC) loss function. In particular, using an Eesen based LSTM-CTC speech recognition system, we present dropout implementations that result in significant improvements in speech recognizer performance on Librispeech and GALE Arabic datasets, with 24.64% and 13.75% relative reduction in word error rates (WER) from their respective baselines.

ICASSP2018-dropout poster.pdf

ICASSP2018 Poster (438)

Categories:: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

45 Views

Distributed Maximum Likelihood using Dynamic Average Consensus

Read more about Distributed Maximum Likelihood using Dynamic Average Consensus
Log in to post comments

This paper presents the formulation and analysis of a novel distributed maximum likelihood algorithm that utilizes a first-order optimization scheme. The proposed approach utilizes a static average consensus algorithm to reach agreement on the initial condition to the iterative optimization scheme and a dynamic average consensus algorithm to reach agreement on the gradient direction. The current distributed algorithm is guaranteed to exponentially recover the performance of the centralized algorithm.

George_ICASSP_v1.pdf

George_ICASSP_v1.pdf (400)

Categories:: Audio and Acoustic Signal Processing

8 Views

Trade-offs in Data-Driven False Data Injection Attacks Against the Power Grid

Read more about Trade-offs in Data-Driven False Data Injection Attacks Against the Power Grid
Log in to post comments

We address the problem of constructing false data injection (FDI) attacks that can bypass the bad data detector (BDD) of a power grid. The attacker is assumed to have access to only power flow measurement data traces (collected over a limited period of time) and no other prior knowledge about the grid. Existing related algorithms are formulated under the assumption that the attacker has access to measurements collected over a long (asymptotically infinite) time period, which may not be realistic.

ICASSP_DataDriven_FDI.pdf

Poster presentation (568)

Categories:: Applications
Emerging: Smart Grid & Energy Management

15 Views

RECOGNIZING MINIMAL FACIAL SKETCH BY GENERATING PHOTOREALISTIC FACES WITH THE GUIDANCE OF DESCRIPTIVE ATTRIBUTES

Cross-modal sketch-photo recognition is of vital importance
in law enforcement and public security. Most existing methods
are dedicated to bridging the gap between the low-level
visual features of sketches and photo images, which is limited
due to intrinsic differences in pixel values. In this paper, based
on the intuition that sketches and photo images are highly correlated
in the semantic domain, we propose to jointly utilize
the low-level visual features and high-level facial attributes to

xiao_yang.pdf

xiao_yang.pdf (484)

Categories:: Image, Video, and Multidimensional Signal Processing

10 Views

RECOGNIZING MINIMAL FACIAL SKETCH BY GENERATING PHOTOREALISTIC FACES WITH THE GUIDANCE OF DESCRIPTIVE ATTRIBUTES

xiao_yang.pdf

xiao_yang.pdf (481)

Categories:: Image, Video, and Multidimensional Signal Processing

6 Views

END-TO-END NEURAL NETWORK BASED AUTOMATED SPEECH SCORING

Read more about END-TO-END NEURAL NETWORK BASED AUTOMATED SPEECH SCORING
Log in to post comments

icassp2018_final.pdf

icassp2018_final.pdf (531)

Categories:: Audio and Acoustic Signal Processing

41 Views

Speech Enhancement with Convolutional-Recurrent Networks

Read more about Speech Enhancement with Convolutional-Recurrent Networks
Log in to post comments

We propose an end-to-end model based on convolutional and recurrent neural networks for speech enhancement. Our model is purely data-driven and does not make any assumptions about the type or the stationarity of the noise. In contrast to existing methods that use multilayer perceptrons (MLPs), we employ both convolutional and recurrent neural network architectures. Thus, our approach allows us to exploit local structures in both the frequency and temporal domains.

keynote_slides.pdf

icassp2018-3744 (639)

Categories:: Source Separation and Signal Enhancement

41 Views

Robust PCA via Dictionary Based Outlier Pursuit

Read more about Robust PCA via Dictionary Based Outlier Pursuit
Log in to post comments

ICASSP18_poster.pdf

ICASSP18_poster.pdf (567)

Categories:: Signal Processing Theory and Methods

6 Views

Deformation Stability of Deep Convolutional Neural Networks on Sobolev Spaces

Read more about Deformation Stability of Deep Convolutional Neural Networks on Sobolev Spaces
Log in to post comments

Our work is based on a recently introduced mathematical theory of deep convolutional neural networks (DCNNs).
It was shown that DCNNs are stable with respect to deformations of bandlimited input functions.
In the present paper, we generalize this result: We prove deformation stability on Sobolev spaces.
Further, we show a weak form of deformation stability for the whole input space L2.
The basic components of DCNNs are semi-discrete frames.
For practical applications, a concrete choice is necessary.

talk.pdf

talk.pdf (515)

Categories:: Machine Learning for Signal Processing

153 Views

Pages