ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

An Unsupervised Anomalous Event Detection Framework with Class-Aware Source Separation

This paper presents a novel problem of detection and localization of anomalous events due to a certain class of objects in video data with applications to smart surveillance. A baseline system is proposed that uses a convolutional neural network (CNN) to generate pixel level masks corresponding to objects of a class of interest. A Restricted Boltzmann Machine (RBM) is then trained on the mask to learn patterns of normal behavior. The free energy of the RBM is used to detect the presence of an anomaly while the reconstruction error is used to localize the anomaly.

ICASSP 2018 Poster.pptx

ICASSP 2018 Poster.pptx (377)

Categories:: Machine Learning for Signal Processing

72 Views

Towards online spike sorting for high-density neural probes using discriminative template matching with suppression of interfering spikes

Spike sorting is the process of assigning each detected neuronal spike in an extracellular recording to its putative source neuron. A linear filter design is proposed where the filter output allows for threshold-based spike sorting of high-density neural probe data. The proposed filter design is based on optimizing the signal-to-peak-interference ratio for each detectable neuron in a data-driven way.

ICASSP_POSTER.pdf

ICASSP_POSTER.pdf (1701)

Categories:: Biomedical signal processing

28 Views

A Novel Thresholding Technique for the Denoising of Multicomponent Signals

Read more about A Novel Thresholding Technique for the Denoising of Multicomponent Signals
Log in to post comments

This paper addresses the issues of the denoising and retrieval of the components of multicomponent signals from their short-time Fourier transform (STFT). After having recalled the hard-thresholding technique, in the STFT context, we develop a new thresholding technique by exploiting some limitations of the former. Numerical experiments illustrating the benefits of the proposed method to retrieve the modes of noisy multicomponent signals conclude the paper.

presentation_cor_Sylvain.pdf

presentation_cor_Sylvain.pdf (403)

Categories:: Audio and Acoustic Signal Processing

25 Views

Benchmarking Uncertainty Estimates with Deep Reinforcement Learning for Dialogue Policy Optimisation

ICASSP Presentation (1).pdf

ICASSP Presentation (1).pdf (610)

Categories:: Audio and Acoustic Signal Processing

29 Views

A Study of Training Targets for Deep Neural Network-Based Speech Enhancement Using Noise Prediction

ICASSP_2018_Poster_Paper_4035v1.pdf

ICASSP_2018_Poster_Paper_4035v1.pdf (579)

Categories:: Speech Enhancement (SPE-ENHA)

48 Views

SCALABLE SENTIMENT FOR SEQUENCE-TO-SEQUENCE CHATBOT RESPONSE WITH PERFORMANCE ANALYSIS

Conventional seq2seq chatbot models only try to find the sentences with the highest probabilities conditioned on the input sequences, without considering the sentiment of the output sentences. Some research works trying to modify the sentiment of the output sequences were reported. In this paper, we propose five models to scale or adjust the sentiment of the chatbot response: persona-based model, reinforcement learning, plug and play model, sentiment transformation network and cycleGAN, all based on the conventional seq2seq model.

lee.pdf

lee.pdf (505)

Categories:: Audio and Acoustic Signal Processing

16 Views

Deep Learning Based Speech Beamforming

Read more about Deep Learning Based Speech Beamforming
Log in to post comments

Multi-channel speech enhancement with ad-hoc sensors has been a challenging task. Speech model guided beamforming algorithms are able to recover natural sounding speech, but the speech models tend to be oversimplified or the inference would otherwise be too complicated. On the other hand, deep learning based enhancement approaches are able to learn complicated speech distributions and perform efficient inference, but they are unable to deal with variable number of input channels.

deep learning based speech beamforming.pdf

DeepBeam (377)

Categories:: Source Separation and Signal Enhancement

46 Views

Improved TDNNs using Deep Kernels and Frequency Dependent Grid-RNNs

Read more about Improved TDNNs using Deep Kernels and Frequency Dependent Grid-RNNs
Log in to post comments

Time delay neural networks (TDNNs) are an effective acoustic model for large vocabulary speech recognition. The strength of the model can be attributed to its ability to effectively model long temporal contexts. However, current TDNN models are relatively shallow, which limits the modelling capability. This paper proposes a method of increasing the network depth by deepening the kernel used in the TDNN temporal convolutions. The best performing kernel consists of three fully connected layers with a residual (ResNet) connection from the output of the first to the output of the third.

tdnn_lecture_4.pdf

tdnn_lecture_4.pdf (561)

Categories:: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

12 Views

High-order Tensor Completion for Data Recovery via Sparse Tensor-train OptimizationICASSP18001

In this paper, we aim at the problem of tensor data completion. Tensor-train decomposition is adopted because of its powerful representation ability and linear scalability to tensor order. We propose an algorithm named Sparse Tensor-train Optimization (STTO) which considers incomplete data as sparse tensor and uses first-order optimization method to find the factors of tensor-train decomposition. Our algorithm is shown to perform well in simulation experiments at both low-order cases and high-order cases.

ICASSP_PPT_Yuan.pdf

ICASSP18001 (409)

20 Views

A PRAGMATIC AUTHENTICATION SYSTEM USING ELECTROENCEPHALOGRAPHY SIGNALS

Read more about A PRAGMATIC AUTHENTICATION SYSTEM USING ELECTROENCEPHALOGRAPHY SIGNALS
Log in to post comments

EEG-based authentication is an emerging research field. In this work, a realistic authentication system using Electroencephalography signals, was developed aiming to show that brain signals contain sufficient information to be used in security systems. The dataset used was composed of 29 users on 4 different days via the cheap Neurosky Mindwave headset with a single dry electrode, and 10 users on 3 different days via Emotiv with 14 electrodes. Various techniques, features, and algorithms were examined to achieve the highest security.

brainlock_poster.pdf

brainlock_poster.pdf (493)

Categories:: Biomedical signal processing

39 Views

Error message