ICASSP 2019

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

The Direction Cosine Matrix Algorithm in Fixed-Point: Implementation and Analysis

Read more about The Direction Cosine Matrix Algorithm in Fixed-Point: Implementation and Analysis
Log in to post comments

Inertial navigation allows tracking and updating the position and orientation of a moving object based on accelerometer and gyroscope data without external positioning aid, such as GPS. Therefore, inertial navigation is an essential technique for, e.g., indoor positioning. As inertial navigation is based on integration of acceleration vector components, computation errors accumulate and make the position and orientation estimate drift.

poster_mbc.pdf

poster_mbc.pdf (473)

Categories:: DSP algorithm implementation in hardware and software

96 Views

Audio Caption: Listen and Tell

Read more about Audio Caption: Listen and Tell
Log in to post comments

ICASSP POSTER.pdf

ICASSP POSTER.pdf (198)

Categories:: Audio and Acoustic Signal Processing

27 Views

On the Fourier Representation of Computable Continuous Signals

Read more about On the Fourier Representation of Computable Continuous Signals
Log in to post comments

icassp2019_uniform.pdf

icassp2019_uniform.pdf (458)

Categories:: Signal and System Modeling, Representation and Estimation

22 Views

FRI MODELLING OF FOURIER DESCRIPTORS

Read more about FRI MODELLING OF FOURIER DESCRIPTORS
Log in to post comments

Fourier descriptors are used to parametrically represent closed contours. In practice, a finite set of Fourier descriptors can model a large class of smooth contours. In this paper, we propose a method for estimating the Fourier descriptors of a given contour from its partial samples. We take a sampling-theoretic approach to model the x and y coordinate functions of the shape and express them as a sum of weighted complex exponentials, which belong to the class of finite-rate-of-innovation (FRI) signals. The weights represent the Fourier descriptors of the shape.

ICASSP2019L_FRI-FD.pdf

ICASSP2019L_FRI-FD.pdf (395)

Categories:: Sampling and Reconstruction

20 Views

Analytic Properties of Downsampling for Bandlimited Signals

Read more about Analytic Properties of Downsampling for Bandlimited Signals
Log in to post comments

icassp2019_downsampling.pdf

icassp2019_downsampling.pdf (596)

Categories:: Sampling and Reconstruction
Multirate Signal Processing

40 Views

Semi-supervised Monaural Singing Voice Separation with a Masking Network Trained on Synthetic Mixtures - Poster

We study the problem of semi-supervised singing voice separation, in which the training data contains a set of samples of mixed music (singing and instrumental) and an unmatched set of instrumental music. Our solution employs a single mapping function g, which, applied to a mixed sample, recovers the underlying instrumental music, and, applied to an instrumental sample, returns the same sample. The network g is trained using purely instrumental samples, as well as on synthetic mixed samples that are created by mixing reconstructed singing voices with random instrumental samples.

poster_icassp_2019_singing.pdf

poster_icassp_2019_singing.pdf (375)

Categories:: Source Separation and Signal Enhancement

14 Views

A Pitch-Aware Approach to Single-Channel Speech Separation

Read more about A Pitch-Aware Approach to Single-Channel Speech Separation
Log in to post comments

Despite significant advancements of deep learning on separating speech sources mixed in a single channel, same gender speaker mix, i.e., male-male or female-female, is still more difficult to separate than the case of opposite gender mix. In this study, we propose a pitch-aware speech separation approach to improve the speech separation performance.

ICASSP2019_Poster_KeWang.pdf

Poster of "A Pitch-Aware Approach to Single-Channel Speech Separation" (361)

Categories:: Source Separation and Signal Enhancement

21 Views

CROSS-LINGUAL VOICE CONVERSION WITH BILINGUAL PHONETIC POSTERIORGRAM AND AVERAGE MODELING

This paper presents a cross-lingual voice conversion approach using bilingual Phonetic PosteriorGram (PPG) and average modeling. The proposed approach makes use of bilingual PPGs to represent speaker-independent features of speech signals from different languages in the same feature space. In particular, a bilingual PPG is formed by stacking two monolingual PPG vectors, which are extracted from two monolingual speech recognition systems. The conversion model is trained to learn the relationship between bilingual PPGs and the corresponding acoustic features.

Poster_ICASSP2019.pdf

cross lingual voice conversion with bilingual PPG (532)

Categories:: Speech Synthesis and Generation, including TTS (SPE-SYNT)

72 Views

MIRAGE: 2D sound source localization using microphone pair augmentation with echoes

Read more about MIRAGE: 2D sound source localization using microphone pair augmentation with echoes
Log in to post comments

It is commonly observed that acoustic echoes hurt perfor-mance of sound source localization (SSL) methods. We in-troduce the concept of microphone array augmentation withechoes (MIRAGE) and show how estimation of early-echocharacteristics can in fact benefit SSL. We propose a learning-based scheme for echo estimation combined with a physics-based scheme for echo aggregation.

2019_icassp_mirage_poster.pdf

Poster (515)

Categories:: Loudspeaker and Microphone Array Signal Processing
Room Acoustics and Acoustic System Modeling

150 Views

Short-Segment Heart Sound Classification Using an Ensemble of Deep Convolutional Neural Networks

This paper proposes a framework based on deep convolutional neural networks (CNNs) for automatic heart sound classification using short-segments of individual heart beats. We design a 1D-CNN that directly learns features from raw heart-sound signals, and a 2D-CNN that takes inputs of two-dimensional time-frequency feature maps based on Mel-frequency cepstral coefficients. We further develop a time-frequency CNN ensemble (TF-ECNN) combining the 1D-CNN and 2D-CNN based on score-level fusion of the class probabilities.

ICASSP19Poster-May7.pdf

ICASSP19Poster-May7.pdf (577)

Categories:: Biomedical signal processing

47 Views

Pages