ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

MITIGATION OF NONLINEAR DISTORTION IN SOUND ZONE CONTROL BY CONSTRAINING INDIVIDUAL LOUDSPEAKER DRIVER CONTROL EFFORTS

Loudspeaker drivers are subject to nonlinear distortion in the low frequency range at high input levels. In sound zone control, distortion not only reduces the acoustic contrast between zones, but also gives perceived artefacts. Standard sound zone methods, such as acoustic contrast control, apply a constraint to the overall input power, but individual loudspeaker drivers are not controlled and the nonlinear distortion is mainly produced by the loudspeaker drivers with the highest input power.

Poster1.pdf

Poster1.pdf (1064)

Categories:: Loudspeaker and Microphone Array Signal Processing

47 Views

A Random Matrix and Concentration Inequalities framework for Neural Networks Analysis

Read more about A Random Matrix and Concentration Inequalities framework for Neural Networks Analysis
Log in to post comments

Our article provides a theoretical analysis of the asymptotic performance of a regression or classification task performed by a simple random neural network. This result is obtained by leveraging a new framework at the crossroads between random matrix theory and the concentration of measure theory. This approach is of utmost interest for neural network analysis at large in that it naturally dismisses the difficulty induced by the non-linear activation functions, so long that these are Lipschitz functions.

conc_measure_NN_ICASSP18(3).pdf

conc_measure_NN_ICASSP18(3).pdf (354)

Categories:: Neural network learning (MLR-NNLR)

40 Views

CLASSIFIER CASCADE TO AID IN DETECTION OF EPILEPTIFORM TRANSIENTS IN INTERICTAL EEG

Read more about CLASSIFIER CASCADE TO AID IN DETECTION OF EPILEPTIFORM TRANSIENTS IN INTERICTAL EEG
Log in to post comments

Presence of interictal epileptiform discharges (IED) in the electroencephalogram (EEG) is indicative of epilepsy. Automated
software for annotating EEGs of Patients with suspected epilepsy is substantial for diagnosis and management of epilepsy.
A large amount of data is needed for training and evaluating the performance of an effective IED detection system. IEDs occur
infrequently in the EEG of most patients, hence, interictal EEG recordings contain mostly background waveforms. As the first

ICASSP2018 Poster Elham A.pdf

ICASSP2018 Poster Elham A.pdf (350)

Categories:: Biomedical signal processing

11 Views

A JOINT SOURCE CHANNEL ARITHMETIC MAP DECODER USING PROBABILISTIC RELATIONS AMONG INTRA MODES IN PREDICTIVE VIDEO COMPRESSION

In this paper, residual redundancy in compressed videos is exploited to alleviate transmission errors using joint source channel arithmetic decoding. A new method is proposed to estimate a priori probability in MAP metric of H.264 intra modes decoder. The decoder generates a decoding tree using a breadth first search algorithm. An introduced statistical model is then implemented stage by stage over the decoding tree.

ICASSP_Presentation_20180409.pdf

ICASSP_Presentation_20180409.pdf (507)

Categories:: Image/Video Coding

20 Views

QUANTISATION EFFECTS IN DISTRIBUTED OPTIMISATION

Read more about QUANTISATION EFFECTS IN DISTRIBUTED OPTIMISATION
Log in to post comments

In this presentation, the effects of quantisation on distributed convex optimisation algorithms are explored via the lens of monotone operator theory. Specifically, by representing transmission quantisation via an additive noise model, we demonstrate how quantisation can be viewed as an instance of an inexact Krasnoselskii-Mann scheme. In the case of two distributed solvers, the Alternating Direction Method of Multipliers and the Primal Dual Method of Multipliers, we further demonstrate how an adaptive quantisation scheme can be constructed to reduce transmission costs between nodes.

Presentation QUANTISATION EFFECTS IN DISTRIBUTED OPTIMISATION.pdf

Presentation QUANTISATION EFFECTS IN DISTRIBUTED OPTIMISATION.pdf (301)

Categories:: Audio and Acoustic Signal Processing

5 Views

ENTROPY BASED PRUNING OF BACKOFF MAXENT LANGUAGE MODELS WITH CONTEXTUAL FEATURES

Read more about ENTROPY BASED PRUNING OF BACKOFF MAXENT LANGUAGE MODELS WITH CONTEXTUAL FEATURES
Log in to post comments

In this paper, we present a pruning technique for maximum en- tropy (MaxEnt) language models. It is based on computing the exact entropy loss when removing each feature from the model, and it ex- plicitly supports backoff features by replacing each removed feature with its backoff. The algorithm computes the loss on the training data, so it is not restricted to models with n-gram like features, al- lowing models with any feature, including long range skips, triggers, and contextual features such as device location.

poster.pdf

poster.pdf (453)

Categories:: Language Modeling, for Speech and SLP (SLP-LANG)

19 Views

FAST DECENTRALIZED LEARNING VIA HYBRID CONSENSUS ADMM

Read more about FAST DECENTRALIZED LEARNING VIA HYBRID CONSENSUS ADMM
Log in to post comments

The present work introduces the hybrid consensus alternating direction method of multipliers (H-CADMM), a novel framework for optimization over networks which unifies existing distributed optimization approaches, including the centralized and the decentralized consensus ADMM. H-CADMM provides a flexible tool that leverages the underlying graph topology in order to achieve a desirable sweet-spot between node-to-node communication overhead and rate of convergence -- thereby alleviating known limitations of both C-CADMM and D-CADMM.

ICASSP2018.pdf

H-CADMM (696)

Categories:: Distributed and Cooperative Learning (MLR-DIST)
Communication and Sensing aspects of Sensor Networks, Wireless and Ad-Hoc Networks

10 Views

On the analysis of training data for wavenet-based speech synthesis

Read more about On the analysis of training data for wavenet-based speech synthesis
Log in to post comments

In this paper, we analyze how much, how consistent and how accurate data WaveNet-based speech synthesis method needs to be abletogeneratespeechofgoodquality. Wedothisbyaddingartiﬁcial noise to the description of our training data and observing how well WaveNet trains and produces speech. More speciﬁcally, we add noise to both phonetic segmentation and annotation accuracy, and we also reduce the size of training data by using a fewer number of sentences during training of a WaveNet model.

poster.pdf

poster.pdf (597)

Categories:: Speech Synthesis and Generation, including TTS (SPE-SYNT)

36 Views

Joint Verification-Identification in End-to-End Multi-Scale CNN Framework for Topic Identification

We present an end-to-end multi-scale Convolutional Neural
Network (CNN) framework for topic identification (topic ID).
In this work, we examined multi-scale CNN for classification
using raw text input. Topical word embeddings are learnt at
multiple scales using parallel convolutional layers. A technique
to integrate verification and identification objectives is
examined to improve topic ID performance. With this approach,
we achieved significant improvement in identification
task. We evaluated our framework on two contrasting

Final.pdf

Final.pdf (446)

Categories:: Spoken Language Understanding (SLP-UNDE)
Neural network learning (MLR-NNLR)

44 Views

REMOTE PHOTOPLETHYSMOGRAPHY USING NONLINEAR MODE DECOMPOSITION

Read more about REMOTE PHOTOPLETHYSMOGRAPHY USING NONLINEAR MODE DECOMPOSITION
Log in to post comments

Remote Photoplethysmography (rPPG) is a contactless noninvasive
method for measuring physiological signals such as
the heart rate (HR) using the light reflected from the facial
tissue. Signal decomposition approaches are used to extract
the heart rate signal from the subtle changes in the skin color.
In this paper, we show that a recently proposed signal decomposition
method, namely nonlinear mode decomposition
(NMD), is quite successful in estimating the heart rate signal
from face videos in the presence of subject motion. Experimental

NMD_Halil_PDF.pdf

NMD_Halil_PDF.pdf (431)

Categories:: Medical image analysis

19 Views

Pages