
ICASSP is the world's largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world.

This article proposes an automatic approach, based on nonverbal speech features, for discriminating between depressed and non-depressed speakers. The experiments were performed on one of the largest corpora collected for this task in the literature (62 patients diagnosed with depression and 54 healthy control subjects), especially among datasets where the depressed speakers were diagnosed as such by professional psychiatrists.


We propose a novel latent variable model for learning latent bases for time-varying non-negative data. Our model uses a mixture multinomial as the likelihood function and proposes a Dirichlet distribution with dynamic parameters as a prior, which we call the dynamic Dirichlet prior. An expectation maximization (EM) algorithm is developed for estimating the parameters of the proposed model.
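As a point of reference for the likelihood described above, the following is a minimal sketch of EM for a plain mixture-of-multinomials model (without the dynamic Dirichlet prior, which is the paper's contribution; all function and variable names here are illustrative):

```python
import numpy as np

def em_multinomial_mixture(X, K, n_iter=100, seed=0):
    """EM for a mixture of multinomials.

    X: (N, V) matrix of count vectors; K: number of mixture components.
    Returns mixing weights pi (K,) and component distributions B (K, V).
    """
    rng = np.random.default_rng(seed)
    N, V = X.shape
    pi = np.full(K, 1.0 / K)
    B = rng.dirichlet(np.ones(V), size=K)            # rows sum to 1
    for _ in range(n_iter):
        # E-step: responsibilities, computed in the log domain for stability
        log_r = np.log(pi)[None, :] + X @ np.log(B).T    # (N, K)
        log_r -= log_r.max(axis=1, keepdims=True)
        r = np.exp(log_r)
        r /= r.sum(axis=1, keepdims=True)
        # M-step: re-estimate weights and component distributions
        pi = r.mean(axis=0)
        B = r.T @ X + 1e-12                          # small floor avoids log(0)
        B /= B.sum(axis=1, keepdims=True)
    return pi, B
```

The paper's model would replace the static prior over B with a Dirichlet whose parameters evolve over time, changing the M-step updates accordingly.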


End-to-end systems using deep neural networks have been widely studied in the field of speaker verification. Raw audio signal processing has also been widely studied in the fields of automatic music tagging and speech recognition. However, as far as we know, end-to-end systems operating on raw audio signals have not been explored in speaker verification. In this paper, a complete end-to-end speaker verification system is proposed, which takes raw audio signals as input and outputs verification results. We mainly investigate a pre-processing layer and the embedded speaker feature extraction models.
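The abstract does not specify what the pre-processing layer computes; the following is a hypothetical sketch of the kind of fixed front end that often precedes learned layers in raw-waveform systems (pre-emphasis, framing, per-frame normalization). The function name and parameters are illustrative, not the paper's:

```python
import numpy as np

def preprocess_raw(signal, frame_len=400, hop=160, alpha=0.97):
    """Hypothetical pre-processing for a raw-waveform front end:
    pre-emphasis, framing, and per-frame mean/variance normalization."""
    # Pre-emphasis boosts high frequencies: y[t] = x[t] - alpha * x[t-1]
    emphasized = np.append(signal[0], signal[1:] - alpha * signal[:-1])
    # Slice the signal into overlapping frames
    n_frames = 1 + (len(emphasized) - frame_len) // hop
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = emphasized[idx]
    # Per-frame normalization before the learned layers
    frames = frames - frames.mean(axis=1, keepdims=True)
    frames = frames / (frames.std(axis=1, keepdims=True) + 1e-8)
    return frames  # (n_frames, frame_len)
```

In an end-to-end system, the output frames would then feed 1-D convolutional layers that learn filterbank-like representations directly from the waveform.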


Robust speech processing in multi-talker environments requires effective speech separation. Recent deep learning systems have made significant progress toward solving this problem, yet it remains challenging, particularly in real-time, short-latency applications. Most methods attempt to construct a mask for each source in the time-frequency representation of the mixture signal, which is not necessarily an optimal representation for speech separation.
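To make the time-frequency masking idea concrete, here is a minimal sketch of the oracle version of such masks (an "ideal ratio mask"), assuming access to the clean sources; a learned separator would instead predict these masks from the mixture alone. The helper names are illustrative:

```python
import numpy as np

def stft(x, n_fft=512, hop=128):
    """Naive analysis STFT with a Hann window."""
    win = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    return np.fft.rfft(x[idx] * win, axis=1)       # (frames, freq bins)

def ideal_ratio_masks(sources, eps=1e-8):
    """One soft T-F mask per source: |S_i| / sum_j |S_j|.

    Applying mask i to the mixture spectrogram approximately
    recovers source i's magnitude in each time-frequency bin.
    """
    mags = np.abs(np.stack([stft(s) for s in sources]))
    return mags / (mags.sum(axis=0) + eps)
```

Masks of this kind lie in [0, 1] and sum to (approximately) one across sources in each bin, which is what mask-predicting networks are typically trained to reproduce.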


Self-attention, an attention mechanism where the input and output sequence lengths are the same, has recently been successfully applied to machine translation, caption generation, and phoneme recognition. In this paper we apply a restricted self-attention mechanism (with multiple heads) to speech recognition. By "restricted" we mean that the mechanism at a particular frame only sees input from a limited number of frames to the left and right. Restricting the context makes it easier to…


We introduce the notion of Quality of Indicator (QoI) to assess the level of contribution by participants in threat intelligence sharing. We exemplify QoI by metrics of the correctness, relevance, utility, and uniqueness of indicators. We build a system that extrapolates the metrics using a machine learning process over a reference set of indicators.
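The abstract names four component metrics but not how they combine; a simple hypothetical aggregation (a weighted mean, with made-up weights, not the paper's learned model) might look like:

```python
def qoi_score(correctness, relevance, utility, uniqueness,
              weights=(0.4, 0.2, 0.2, 0.2)):
    """Hypothetical QoI aggregation: weighted mean of four metrics,
    each assumed to be normalized to [0, 1]."""
    metrics = (correctness, relevance, utility, uniqueness)
    assert all(0.0 <= m <= 1.0 for m in metrics), "metrics must be in [0, 1]"
    return sum(w * m for w, m in zip(weights, metrics))
```

In the paper's setting, the individual metric values would come from the machine learning process trained on the reference set of indicators, rather than being supplied directly.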

