ICASSP 2016

ICASSP is the world's largest and most comprehensive technical conference on signal processing and its applications. It provides a fantastic networking opportunity for like-minded professionals from around the world. ICASSP 2016 conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics.

Sparse Reconstruction of Quantized Speech Signals

Read more about Sparse Reconstruction of Quantized Speech Signals
Log in to post comments

We propose sparse reconstruction techniques to improve the quality and / or reduce the bit-rate of standard speech coders. To that end, we assume signal sparsity in some transform domain and formulate the problem of reconstructing the original signal in terms of constrained l1-norm minimization. We use modern primal-dual methods in order to solve the resulting non-smooth convex optimization problem. Experiments show that with the proposed sparse reconstruction method the instrumentally predicted speech quality can be largely improved.

icassp_poster_brauer.pdf

icassp_poster_brauer.pdf (821)

Categories:: Speech Enhancement (SPE-ENHA)
Speech Coding (SPE-CODI)

12 Views

Low-Complexity Recursive Convolutional Precoding for OFDM-based Large-Scale Antenna Systems

Large-scale antenna (LSA) has gained a lot of attention recently since it can significantly improve
the performance of wireless systems. Similar to multiple-input multiple-output (MIMO) orthogonal
frequency division multiplexing (OFDM) or MIMO-OFDM, LSA can be also combined with OFDM to
deal with frequency selectivity in wireless channels. However, such combination suffers from substantially
increased complexity proportional to the number of antennas in LSA systems. For the conventional

ICASSP2016.pptx

ICASSP2016.pptx (307)

Categories:: MIMO Communications and Signal Processing

4 Views

Adaptive algorithms for hypergraph learning

Read more about Adaptive algorithms for hypergraph learning
Log in to post comments

ICASSP2016_Presenation1.pdf

ICASSP2016_Presenation1.pdf (330)

Categories:: Image/Video Storage, Retrieval

12 Views

Deep convolutional acoustic word embeddings using word-pair side information

Read more about Deep convolutional acoustic word embeddings using word-pair side information
Log in to post comments

Recent studies have been revisiting whole words as the basic modelling unit in speech recognition and query applications, instead of phonetic units. Such whole-word segmental systems rely on a function that maps a variable-length speech segment to a vector in a fixed-dimensional space; the resulting acoustic word embeddings need to allow for accurate discrimination between different word types, directly in the embedding space. We compare several old and new approaches in a word discrimination task.

kamper+wang+livescu_icassp2016_talk.pdf

kamper+wang+livescu_icassp2016_talk.pdf (86)

Categories:: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

5 Views

Xerox Conversational AI Agent (XCAI) for Enterprise Knowledgebase Q&A

Read more about Xerox Conversational AI Agent (XCAI) for Enterprise Knowledgebase Q&A
Log in to post comments

In the past 5 years significant advances in Large Vocabulary Speech Recognition (LVSR), Deep Learning (DL) and Spoken Language Understanding (SLU), along with the explosive growth of wireless network bandwidth have given rise to three compelling Conversational AI agents that are available on the Andriod, iOS and Microsoft Smartphones. Conversational AI agents such as Google Now, Apple Siri and Microsoft Cortana are now the most preferred way of mobile web search and to perform command and control of the various smartphone apps.

Xerox XCAI ICASSP 2016.pdf

Xerox XCAI ICASSP 2016.pdf (80)

Categories:: Large Vocabulary Continuous Recognition/Search (SPE-LVCR)

126 Views

Accurate Recovery of a Specularity from a few Samples of the Reflectance Function

Read more about Accurate Recovery of a Specularity from a few Samples of the Reflectance Function
Log in to post comments

Poster_vertical_3.pdf

Poster_vertical_3.pdf (421)

Categories:: Image Scanning, Display, and Printing

9 Views

SPARSITY-BASED RECONSTRUCTION METHOD FOR SIGNALS WITH FINITE RATE OF INNOVATION

Read more about SPARSITY-BASED RECONSTRUCTION METHOD FOR SIGNALS WITH FINITE RATE OF INNOVATION
Log in to post comments

In the last decade, it was shown that it is possible to reconstruct signals with finite rate of innovation (FRI signals) from the samples of their filtered versions. However, when noise is present, the present reconstruction algorithms tend to be low accuracy. In this work, a new sparsity-based reconstruction method for FRI signals is put forward. The streams of Diracs and exponential reproducing kernel are considered. Firstly, the analog time axis is quantified and aligned to grids.

ICASSP2016_poster.pdf

ICASSP2016_poster.pdf (993)

Categories:: Sampling and Reconstruction

20 Views

Two-Stage Noise Aware Training Using Asymmetric Deep Denoising Autoencoder

Read more about Two-Stage Noise Aware Training Using Asymmetric Deep Denoising Autoencoder
Log in to post comments

Ever since the deep neural network (DNN)-based acoustic model appeared, the recognition performance of automatic peech recognition has been greatly improved. Due to this achievement, various researches on DNN-based technique for noise robustness are also in progress. Among these approaches, the noise-aware training (NAT) technique which aims to improve the inherent robustness of DNN using noise estimates has shown remarkable performance. However, despite the great performance, we cannot be certain whether NAT is an optimal method for sufficiently utilizing the inherent robustness of DNN.

ICASSP2016_포스터_이강현_그래프2.pdf

ICASSP2016_포스터_이강현_그래프2.pdf (69)

Categories:: Robust Speech Recognition (SPE-ROBU)
Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

30 Views

THE SPHERICAL HARMONICS ROOT-MUSIC

Read more about THE SPHERICAL HARMONICS ROOT-MUSIC
Log in to post comments

Spherical harmonics root-MUSIC (MUltiple SIgnal Classification) technique for source localization using spherical microphone array is presented in this paper. Earlier work on root-MUSIC is limited to linear and planar arrays. Root-MUSIC for planar array utilizes the concept of manifold separation and beamspace transformation. In this paper, the Vandermonde structure of array manifold for a particular order is proved. Hence, the validity of root-MUSIC in the spherical harmonics domain is confirmed. The proposed method is evaluated by using simulated experiments on source localization.

ICASSP2016_POS1.pdf

ICASSP2016_POS1.pdf (738)

Categories:: Sensor Array Processing

13 Views

Terrain-Scattered Jammer Suppression in MIMO Radar Using Space-(Fast) Time Adaptive Processing

Jammer suppression, MIMO radar, Space-time adaptive processing

We address the problem of terrain-scattered jammer suppression in multiple-input multiple-output (MIMO) radar using space-(fast) time adaptive processing (SFTAP). The correlation function of jamming components after matched filtering at the receiving end of MIMO radar is derived, and its relationship to the correlation matrix of the transmitted waveforms is established. This correlation function serves as a theoretical measure of evaluating the matched filtering effect on the received jamming signals.

JammerSupp_Yongzhe.pdf

JammerSupp_Yongzhe.pdf (765)

Categories:: Sensor Array Processing

18 Views

Pages