ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics, and will provide an opportunity to network with like-minded professionals from around the world.

OCT VOLUMETRIC DATA RESTORATION VIA PRIMAL-DUAL PLUG-AND-PLAY METHOD

This work proposes a volumetric data restoration method, particularly for data acquired with an optical coherence tomography (OCT) device. OCT is a technique for acquiring tomographic images of a specimen at a scale of a few $\mu$m by using a near-infrared laser. The authors have been attempting dynamic observation of the epithelium in the cochlea of the inner ear. A current challenge is removing the influence of the measurement process as well as the noise due to image sensor sensitivity.

oct-volumetric-data.pdf (594)

Categories: Bioimaging and microscopy

147 Views

A Dimension-Independent Discriminant between Distributions

Henze-Penrose divergence is a non-parametric divergence measure that can be used to estimate a bound on the Bayes error in a binary classification problem. In this paper, we show that a cross-match statistic based on optimal weighted matching can be used to directly estimate Henze-Penrose divergence. Unlike an earlier approach based on the Friedman-Rafsky minimal spanning tree statistic, the proposed method is dimension-independent. The new approach is evaluated using simulation and applied to real datasets to obtain Bayes error estimates.

icassp2018.pdf (795)

Categories: Signal and System Modeling, Representation and Estimation

16 Views

BSS EVAL OR PEASS? PREDICTING THE PERCEPTION OF SINGING-VOICE SEPARATION

There is some uncertainty as to whether objective metrics for predicting the perceived quality of audio source separation are sufficiently accurate. This issue was investigated by employing a revised experimental methodology to collect subjective ratings of sound quality and interference of singing-voice recordings that have been extracted from musical mixtures using state-of-the-art audio source separation. A correlation analysis between the experimental data and the measures of two objective evaluation toolkits, BSS Eval and PEASS, was performed to assess their performance.

icassp18_poster_ward_et_al.pdf (458)

Categories: Source Separation and Signal Enhancement

13 Views

Language and Noise Transfer in Speech Enhancement Generative Adversarial Network

language-noise-transfer.pdf (404)

Categories: Source Separation and Signal Enhancement
Machine Learning for Signal Processing

10 Views

Matching Pursuit Based Convolutional Sparse Coding

Sparse coding techniques for image processing traditionally process small overlapping patches separately and then average the results. This has the disadvantage that the reconstructed image no longer obeys the sparsity prior used in the processing. To address this, convolutional sparse coding has been introduced, where a shift-invariant dictionary is used and the sparsity of the recovered image is maintained. Most such strategies target the $\ell_0$ ``norm'' of the whole image, which may create an imbalanced sparsity across various regions in the image.

ICASSP2018_ConvolutionalSparseCoding.pptx (498)

Categories: Signal and System Modeling, Representation and Estimation

57 Views

A sparse coding framework for gaze prediction in egocentric video

20180418ICASSP_official_.pdf (581)

Categories: Image/Video Processing

13 Views

Spatial audio feature discovery with convolutional neural networks

The advent of mixed reality consumer products brings about a pressing need to develop and improve spatial sound rendering techniques for a broad user base. Despite a large body of prior work, the precise nature and importance of various sound localization cues, and how they should be personalized for an individual user to improve localization performance, remain an open research problem. Here we propose training a convolutional neural network (CNN) to classify the elevation angle of spatially rendered sounds and employing Layerwise Relevance Propagation (LRP) on the trained CNN model.

Spatial_audio_feature_discovery_ICASSP_2018.pdf (509)

Categories: Spatial and Multichannel Audio

37 Views

Soft-Target Training with Ambiguous Emotional Utterances for DNN-based Speech Emotion Classification

ICASSP2018_ando_EmoSoftTarget_v5_pub.pdf (503)

ICASSP2018_ando_EmoSoftTarget_v6_pub.pdf (575)

Categories: Speech Analysis (SPE-ANLS)

59 Views

SEQUENTIAL MAXIMUM MARGIN CLASSIFIERS FOR PARTIALLY LABELED DATA

poster.pdf (491)

Categories: Sequential learning; sequential decision methods (MLR-SLER)

8 Views

Considerations regarding individualization of head-related transfer functions

This paper provides some considerations regarding using individualized head-related transfer functions for rendering binaural spatial audio over headphones. It briefly considers the degree of benefit that individualization may provide. It then examines the degree of variation existing within the ear morphology across listeners within the Sydney-York Morphological and Recording of Ears (SYMARE) database using kernel principal component analysis and the large deformation diffeomorphic metric mapping framework.

CJinICASSP2018.pdf (499)

Categories: Spatial and Multichannel Audio

31 Views
