ICASSP 2019

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION

Read more about TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION
Log in to post comments

We consider the problem of detecting whether a tensor signal having many missing entities lies within a given low dimensional Kronecker-Structured (KS) subspace. This is a matched subspace detection problem. Tensor matched subspace detection problem is more challenging because of the intertwined signal dimensions. We solve this problem by projecting the signal onto the KS subspace, which is a Kronecker product of different subspaces corresponding to each signal dimension. Under this framework, we define the KS subspaces and the orthogonal projection of the signal onto the KS subspace.

ICASSP.poster.pdf

Poster for the paper titled TENSOR MATCHED KRONECKER-STRUCTURED SUBSPACE DETECTION FOR MISSING INFORMATION (360)

Categories:: Pattern recognition and classification (MLR-PATT)

7 Views

FAST COMPRESSIVE SENSING RECOVERY USING GENERATIVE MODELS WITH STRUCTURED LATENT VARIABLES

Deep learning models have significantly improved the visual quality and accuracy on compressive sensing recovery. In this paper, we propose an algorithm for signal reconstruction from compressed measurements with image priors captured by a generative model. We search and constrain on latent variable space to make the method stable when the number of compressed measurements is extremely limited. We show that, by exploiting certain structures of the latent variables, the proposed method produces improved reconstruction accuracy and preserves realistic and non-smooth features in the image.

Xu, Shaojie ICCASP 2019 Presentation Slides.pdf

Xu, Shaojie ICCASP 2019 Presentation Slides.pdf (383)

Categories:: Other applications of machine learning (MLR-APPL)

18 Views

ICASSP 2019 Paper #4001: INCREASE APPARENT PUBLIC SPEAKING FLUENCY BY SPEECH AUGMENTATION

Fluent and confident speech is desirable to every speaker. But professional speech delivering requires a great deal of experience and practice. In this paper, we propose a speech stream manipulation system which can help non-professional speakers to produce fluent, professional-like speech content, in turn contributing towards better listener engagement and comprehension. We propose to achieve this task by manipulating the disfluencies in human speech, like the sounds uh and um, the filler words and awkward long silences.

poster_v2.0.pdf

poster_v2.0.pdf (410)

Categories:: Speech Enhancement (SPE-ENHA)

70 Views

Speaker Diarisation Using 2D Self-attentive Combination of Embeddings

Read more about Speaker Diarisation Using 2D Self-attentive Combination of Embeddings
Log in to post comments

Speaker diarisation systems often cluster audio segments using speaker embeddings such as i-vectors and d-vectors. Since different types of embeddings are often complementary, this paper proposes a generic framework to improve performance by combining them into a single embedding, referred to as a c-vector. This combination uses a 2-dimensional (2D) self-attentive structure, which extends the standard self-attentive layer by averaging not only across time but also across different types of embeddings.

DiarisationPresentation3.pdf

DiarisationPresentation3.pdf (351)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

13 Views

Divergence Based Weighting for Information Channels in Deep Convolutional Neural Networks for Bird Audio Detection

In this paper, we address the problem of bird audio detec-
tion and propose a new convolutional neural network archi-
tecture together with a divergence based information channel
weighing strategy in order to achieve improved state-of-the-
art performance and faster convergence. The effectiveness of
the methodology is shown on the Bird Audio Detection Chal-
lenge 2018 (Detection and Classification of Acoustic Scenes
and Events Challenge, Task 3) development data set.

2019 05 - Bird Audio DNN Divergence Poster- ICASSP19.pdf

https://ieeexplore.ieee.org/document/8682483 (346)

Categories:: Neural network learning (MLR-NNLR)

22 Views

Learning to Dequantize Speech Signals by Primal-Dual Networks: An Approach for Acoustic Sensor Networks

We introduce a method to improve the quality of simple scalar quantization in the context of acoustic sensor networks by combining ideas from sparse reconstruction, artificial neural networks and weighting filters. We start from the observation that optimization methods based on sparse reconstruction resemble the structure of a neural network. Hence, building upon a successful enhancement method, we unroll the algorithms and use this to build a neural network which we train to obtain enhanced decoding.

icassp_poster.pdf

icassp_poster.pdf (577)

Categories:: Speech Enhancement (SPE-ENHA)

21 Views

MULTI-FRAME SUPER-RESOLUTION FOR TIME-OF-FLIGHT IMAGING

Read more about MULTI-FRAME SUPER-RESOLUTION FOR TIME-OF-FLIGHT IMAGING
Log in to post comments

ICASSP_2019_v4.pdf

ICASSP_2019_v4.pdf (342)

Categories:: Image/Video Processing

61 Views

A DEEP NEURAL NETWORK BASED MANEUVERING-TARGET TRACKING ALGORITHM

Read more about A DEEP NEURAL NETWORK BASED MANEUVERING-TARGET TRACKING ALGORITHM
Log in to post comments

ICASSP2019_poster_deepMTT.pdf

ICASSP2019_poster_deepMTT.pdf (659)

Categories:: Signal and System Modeling, Representation and Estimation

20 Views

Universal Acoustic Using Neural Mixture Models

Read more about Universal Acoustic Using Neural Mixture Models
Log in to post comments

UAM_v3.pdf

UAM_v3.pdf (458)

Categories:: Audio and Acoustic Signal Processing

36 Views

BLIND QUALITY EVALUATOR FOR SCREEN CONTENT IMAGES VIA ANALYSIS OF STRUCTURE

Read more about BLIND QUALITY EVALUATOR FOR SCREEN CONTENT IMAGES VIA ANALYSIS OF STRUCTURE
Log in to post comments

Existing blind evaluators for screen content images (SCIs) are mainly learning-based and require a number of training images with co-registered human opinion scores. However, the size of existing databases is small, and it is labor-, timeconsuming and expensive to largely generate human opinion scores. In this study, we propose a novel blind quality evaluator without training.

icassp 2019 poster 2875.pdf

icassp 2019 poster 2875.pdf (462)

Categories:: Quality Assessment

23 Views

Pages