ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

Low Rank Fourier Ptychography

Read more about Low Rank Fourier Ptychography
Log in to post comments

poster_lrptych_new.pdf

poster_lrptych_new.pdf (417)

Categories:: Image/Video Processing

13 Views

ON COMPRESSIVE SENSING OF SPARSE COVARIANCE MATRICES USING DETERMINISTIC SENSING MATRICES

Talk_StRIP-Kronecker.pdf

Talk_StRIP-Kronecker.pdf (566)

Categories:: Sampling and Reconstruction

11 Views

MMSE Adaptive Waveform Design for a MIMO Active Sensing System Tracking Multiple Moving Targets

SJH_ICASSP_2018.pdf

SJH_ICASSP_2018.pdf (463)

Categories:: Adaptive Array Signal Processing

6 Views

Pyroomacoustics: A Python package for audio room simulation and array processing algorithms

We present pyroomacoustics, a software package aimed at the rapid development and testing of audio array processing algorithms.

poster.pdf

poster.pdf (736)

Categories:: Spatial and Multichannel Audio

171 Views

Mutual-Information-Private Online Gradient Descent Algorithm

Read more about Mutual-Information-Private Online Gradient Descent Algorithm
Log in to post comments

A user implemented privacy preservation mechanism is proposed for the online gradient descent (OGD) algorithm. Privacy is measured through the information leakage as quantified by the mutual information between the usersʼ outputs and learnerʼs inputs. The input perturbation mechanism proposed can be implemented by individual users with a space and time complexity that is independent of the horizon T. For the proposed mechanism, the information leakage is shown to be bounded by the Gaussian channel capacity in the full information setting.

newfile5.pdf

newfile5.pdf (566)

Categories:: Signal Processing and Cryptography

30 Views

GENERALISED DISCRIMINATIVE TRANSFORM VIA CURRICULUM LEARNING FOR SPEAKER RECOGNITION

Read more about GENERALISED DISCRIMINATIVE TRANSFORM VIA CURRICULUM LEARNING FOR SPEAKER RECOGNITION
Log in to post comments

In this paper we introduce a speaker verification system deployed on mobile devices that can be used to personalise a keyword spotter. We describe a baseline DNN system that maps an utterance to a speaker embedding, which is used to measure speaker differences via cosine similarity. We then introduce an architectural modification which uses an LSTM system where the parameters are optimised via a curriculum learning procedure to reduce the detection error and improve its generalisability across various conditions.

Siri_PHS_CurriculumLearning_ICASSP18v3.pdf

Siri_PHS_CurriculumLearning_ICASSP18v3.pdf (1048)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

118 Views

ON THE GEOMETRY OF MIXTURES OF PRESCRIBED DISTRIBUTIONS

Read more about ON THE GEOMETRY OF MIXTURES OF PRESCRIBED DISTRIBUTIONS
Log in to post comments

GeometryMixtures-Poster-ICASSP2018.pdf

GeometryMixtures-Poster-ICASSP2018.pdf (477)

Categories:: Information-theoretic learning (MLR-INFO)

18 Views

MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK

Read more about MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK
Log in to post comments

WenjieGuan-3304-2018_ICASSP_POSTER.pdf

WenjieGuan-3304-2018_ICASSP_POSTER.pdf (547)

Categories:: Audio and Acoustic Signal Processing

7 Views

Software Defined Resource Allocation for Service-Oriented Networks

Read more about Software Defined Resource Allocation for Service-Oriented Networks
Log in to post comments

icassp2018_poster_v4.pdf

icassp2018_poster_v4.pdf (895)

Categories:: Signal Processing for Communications and Networking

4 Views

CBLDNN-BASED SPEAKER-INDEPENDENT SPEECH SEPARATION VIA GENERATIVE ADVERSARIAL TRAINING

In this paper, we propose a speaker-independent multi-speaker monaural speech separation system (CBLDNN-GAT) based on convolutional, bidirectional long short-term memory, deep feed-forward neural network (CBLDNN) with generative adversarial training (GAT). Our system aims at obtaining better speech quality instead of only minimizing a mean square error (MSE). In the initial phase, we utilize log-mel filterbank and pitch features to warm up our CBLDNN in a multi-task manner.

conference_poster_4.pdf

conference_poster_4.pdf (535)

Categories:: Source Separation and Signal Enhancement

92 Views

Pages