ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

Sound field reproduction with exterior cancellation using analytical weighting of harmonic coefficients

A method for sound field reproduction with the suppression of exterior radiation is proposed, which makes it possible to synthesize a desired sound field in a reverberant environment without prior knowledge of the transfer functions of the multiple loudspeakers. The objective function used to achieve this is formulated as the weighted sum of the interior reproduction error and exterior radiation power. The optimal driving signals are derived by harmonic expansion of both the interior and exterior sound fields.

20180413.pdf

20180413.pdf (502)

Categories:: Spatial and Multichannel Audio

14 Views

POLYPHONIC MUSIC SEQUENCE TRANSDUCTION WITH METER-CONSTRAINED LSTM NETWORKS

Read more about POLYPHONIC MUSIC SEQUENCE TRANSDUCTION WITH METER-CONSTRAINED LSTM NETWORKS
Log in to post comments

Automatic transcription of polyphonic music remains a challenging task in the field of Music Information Retrieval. In this paper, we propose a new method to post-process the output of a multi-pitch detection model using recurrent neural networks. In particular, we compare the use of a fixed sample rate against a meter-constrained time step on a piano performance audio dataset. The metric ground truth is estimated using automatic symbolic alignment, which we make available for further study.

Adrien Ycart ICASSP 2018 poster A0.pdf

Adrien Ycart ICASSP 2018 poster A0.pdf (580)

Categories:: Music Signal Processing

21 Views

MOTOR IMAGERY FOR EEG BIOMETRICS USING CONVOLUTIONAL NEURAL NETWORK

Read more about MOTOR IMAGERY FOR EEG BIOMETRICS USING CONVOLUTIONAL NEURAL NETWORK
Log in to post comments

This paper deals with electroencephalography (EEG)-based biometric identification, using a motor imagery task, specifically
performing imaginary arms and legs movements. Deep learning methods such as convolutional neural network (CNN) is used for automatic discriminative feature extraction and person identification. An extensive set of experimental tests, performed on a large database comprising EEG data collected from 40 subjects over two different sessions taken at a week distance, shows the existence of repeatable discriminative characteristics in individuals’ brain signals.

Rig Das_ICASSP_2018-48x96.pdf

Poster (508)

Categories:: Biometrics

18 Views

AUDIO-VISUAL PERSON RECOGNITION IN MULTIMEDIA DATA FROM THE IARPA JANUS PROGRAM

Read more about AUDIO-VISUAL PERSON RECOGNITION IN MULTIMEDIA DATA FROM THE IARPA JANUS PROGRAM
Log in to post comments

ICASSP18_Janus_slides.pptx

ICASSP18_Janus_slides.pptx (515)

Categories:: Audio and Acoustic Signal Processing

15 Views

3-D CNN Models FOR FAR-FIELD MULTI-CHANNEL Speech Recognition

Read more about 3-D CNN Models FOR FAR-FIELD MULTI-CHANNEL Speech Recognition
Log in to post comments

conference_poster_5.pdf

conference_poster_5.pdf (718)

Categories:: Speech Processing

18 Views

Bayesian inference for multi-line spectra in linear sensor array

Read more about Bayesian inference for multi-line spectra in linear sensor array
Log in to post comments

VietHung_ICASSP_2018.pdf

VietHung_ICASSP_2018.pdf (458)

Categories:: Machine Learning for Signal Processing

6 Views

Spectral feature mapping with mimic loss for robust speech recognition

Read more about Spectral feature mapping with mimic loss for robust speech recognition
Log in to post comments

For the task of speech enhancement, local learning objectives are agnostic to phonetic structures helpful for speech recognition. We propose to add a global criterion to ensure de-noised speech is useful for downstream tasks like ASR. We first train a spectral classifier on clean speech to predict senone labels. Then, the spectral classifier is joined with our speech enhancer as a noisy speech recognizer. This model is taught to imitate the output of the spectral classifier alone on clean speech.

icassp-2018-poster_deblin.pdf

icassp-2018-poster_deblin.pdf (542)

Categories:: Robust Speech Recognition (SPE-ROBU)

8 Views

SIMULTANEOUS SPEECH RECOGNITION AND ACOUSTIC EVENT DETECTION USING AN LSTM-CTC ACOUSTIC MODEL AND A WFST DECODER

ICASSP2018_simultaneous__poster_final.pdf

ICASSP2018_simultaneous__poster_final.pdf (761)

Categories:: Large Vocabulary Continuous Recognition/Search (SPE-LVCR)

67 Views

RATE-OPTIMAL META LEARNING OF CLASSIFICATION ERROR

Read more about RATE-OPTIMAL META LEARNING OF CLASSIFICATION ERROR
Log in to post comments

Meta learning of optimal classifier error rates allows an experimenter to empirically estimate the intrinsic ability of any estimator to discriminate between two populations, circumventing the difficult problem of estimating the optimal Bayes classifier. To this end we propose a weighted nearest neighbor (WNN) graph estimator for a tight bound on the Bayes classification error; the Henze-Penrose (HP) divergence. Similar to recently proposed HP estimators [berisha2016], the proposed estimator is non-parametric and does not require density estimation.