ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.
- Read more about MITIGATION OF NONLINEAR DISTORTION IN SOUND ZONE CONTROL BY CONSTRAINING INDIVIDUAL LOUDSPEAKER DRIVER CONTROL EFFORTS
- 1 comment
- Log in to post comments
Loudspeaker drivers are subject to nonlinear distortion in the low frequency range at high input levels. In sound zone control, distortion not only reduces the acoustic contrast between zones, but also gives perceived artefacts. Standard sound zone methods, such as acoustic contrast control, apply a constraint to the overall input power, but individual loudspeaker drivers are not controlled and the nonlinear distortion is mainly produced by the loudspeaker drivers with the highest input power.
Poster1.pdf
- Categories:
- Read more about A Random Matrix and Concentration Inequalities framework for Neural Networks Analysis
- Log in to post comments
Our article provides a theoretical analysis of the asymptotic performance of a regression or classification task performed by a simple random neural network. This result is obtained by leveraging a new framework at the crossroads between random matrix theory and the concentration of measure theory. This approach is of utmost interest for neural network analysis at large in that it naturally dismisses the difficulty induced by the non-linear activation functions, so long that these are Lipschitz functions.
- Categories:
- Read more about CLASSIFIER CASCADE TO AID IN DETECTION OF EPILEPTIFORM TRANSIENTS IN INTERICTAL EEG
- Log in to post comments
Presence of interictal epileptiform discharges (IED) in the electroencephalogram (EEG) is indicative of epilepsy. Automated
software for annotating EEGs of Patients with suspected epilepsy is substantial for diagnosis and management of epilepsy.
A large amount of data is needed for training and evaluating the performance of an effective IED detection system. IEDs occur
infrequently in the EEG of most patients, hence, interictal EEG recordings contain mostly background waveforms. As the first
- Categories:
- Read more about A JOINT SOURCE CHANNEL ARITHMETIC MAP DECODER USING PROBABILISTIC RELATIONS AMONG INTRA MODES IN PREDICTIVE VIDEO COMPRESSION
- Log in to post comments
In this paper, residual redundancy in compressed videos is exploited to alleviate transmission errors using joint source channel arithmetic decoding. A new method is proposed to estimate a priori probability in MAP metric of H.264 intra modes decoder. The decoder generates a decoding tree using a breadth first search algorithm. An introduced statistical model is then implemented stage by stage over the decoding tree.
- Categories:
- Read more about QUANTISATION EFFECTS IN DISTRIBUTED OPTIMISATION
- Log in to post comments
In this presentation, the effects of quantisation on distributed convex optimisation algorithms are explored via the lens of monotone operator theory. Specifically, by representing transmission quantisation via an additive noise model, we demonstrate how quantisation can be viewed as an instance of an inexact Krasnoselskii-Mann scheme. In the case of two distributed solvers, the Alternating Direction Method of Multipliers and the Primal Dual Method of Multipliers, we further demonstrate how an adaptive quantisation scheme can be constructed to reduce transmission costs between nodes.
- Categories:
- Read more about ENTROPY BASED PRUNING OF BACKOFF MAXENT LANGUAGE MODELS WITH CONTEXTUAL FEATURES
- Log in to post comments
In this paper, we present a pruning technique for maximum en- tropy (MaxEnt) language models. It is based on computing the exact entropy loss when removing each feature from the model, and it ex- plicitly supports backoff features by replacing each removed feature with its backoff. The algorithm computes the loss on the training data, so it is not restricted to models with n-gram like features, al- lowing models with any feature, including long range skips, triggers, and contextual features such as device location.
poster.pdf
- Categories:
The present work introduces the hybrid consensus alternating direction method of multipliers (H-CADMM), a novel framework for optimization over networks which unifies existing distributed optimization approaches, including the centralized and the decentralized consensus ADMM. H-CADMM provides a flexible tool that leverages the underlying graph topology in order to achieve a desirable sweet-spot between node-to-node communication overhead and rate of convergence -- thereby alleviating known limitations of both C-CADMM and D-CADMM.
ICASSP2018.pdf
- Categories:
- Read more about On the analysis of training data for wavenet-based speech synthesis
- Log in to post comments
In this paper, we analyze how much, how consistent and how accurate data WaveNet-based speech synthesis method needs to be abletogeneratespeechofgoodquality. Wedothisbyaddingartificial noise to the description of our training data and observing how well WaveNet trains and produces speech. More specifically, we add noise to both phonetic segmentation and annotation accuracy, and we also reduce the size of training data by using a fewer number of sentences during training of a WaveNet model.
poster.pdf
- Categories:
- Read more about Joint Verification-Identification in End-to-End Multi-Scale CNN Framework for Topic Identification
- Log in to post comments
We present an end-to-end multi-scale Convolutional Neural
Network (CNN) framework for topic identification (topic ID).
In this work, we examined multi-scale CNN for classification
using raw text input. Topical word embeddings are learnt at
multiple scales using parallel convolutional layers. A technique
to integrate verification and identification objectives is
examined to improve topic ID performance. With this approach,
we achieved significant improvement in identification
task. We evaluated our framework on two contrasting
- Categories:
- Read more about REMOTE PHOTOPLETHYSMOGRAPHY USING NONLINEAR MODE DECOMPOSITION
- Log in to post comments
Remote Photoplethysmography (rPPG) is a contactless noninvasive
method for measuring physiological signals such as
the heart rate (HR) using the light reflected from the facial
tissue. Signal decomposition approaches are used to extract
the heart rate signal from the subtle changes in the skin color.
In this paper, we show that a recently proposed signal decomposition
method, namely nonlinear mode decomposition
(NMD), is quite successful in estimating the heart rate signal
from face videos in the presence of subject motion. Experimental
- Categories: