ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

IMAGE ALIGNMENT VIA MULTI-MODEL GEOMETRIC FIFTING AND HIERARCHICAL HOMOGRAPHY ESTIMATION

ICASSP18001.ppt

the slides for image alignment (406)

Categories:: Image/Video Processing

16 Views

SUBSET SELECTION FOR KERNEL-BASED SIGNAL RECONSTRUCTION

Read more about SUBSET SELECTION FOR KERNEL-BASED SIGNAL RECONSTRUCTION
Log in to post comments

In this work, we introduce subset selection strategies for signal reconstruction based on kernel methods, particularly for the case of kernel-ridge regression. Typically, these methods are employed for exploiting known prior information about the structure of the signal of interest. We use the mean squared error and a scalar function of the covariance matrix of the kernel regressors to establish metrics for the subset selection problem. Despite the NP-hard nature of the problem, we introduce efficient algorithms for finding approximate solutions for the proposed metrics.

ppt_icassp2018_kernel.pdf

ppt_icassp2018_kernel.pdf (488)

Categories:: Sampling and Reconstruction

15 Views

THE CHORD GAP DIVERGENCE AND A GENERALIZATION OF THE BHATTACHARYYA DISTANCE

Read more about THE CHORD GAP DIVERGENCE AND A GENERALIZATION OF THE BHATTACHARYYA DISTANCE
Log in to post comments

Slides-ChordDivergence18April2018.pdf

Slides-ChordDivergence18April2018.pdf (442)

Categories:: Learning theory and algorithms (MLR-LEAR)

22 Views

MIMO RADAR TARGET DETECTION USING LOW-COMPLEXITY RECEIVER

Read more about MIMO RADAR TARGET DETECTION USING LOW-COMPLEXITY RECEIVER
Log in to post comments

ICASSP2018-poster.pdf

MIMO RADAR, DETECTION,TRANSMITTER SELECTION (479)

Categories:: Adaptive Array Signal Processing

17 Views

Hi, BCD! Hybrid Inexact Block Coordinate Descent for Hyperspectral Super-Resolution

Read more about Hi, BCD! Hybrid Inexact Block Coordinate Descent for Hyperspectral Super-Resolution
Log in to post comments

Hyperspectral super-resolution (HSR) is a problem of recovering a high-spectral-spatial-resolution image from a multispectral measurement and a hyperspectral measurement, which have low spectral and spatial resolutions, respectively. We consider a low-rank structured matrix factorization formulation for HSR, which is a non-convex large-scale optimization problem. Our contributions contain both computational and theoretical aspects.

ICASSP 2018 modified.pdf

ICASSP 2018 modified.pdf (436)

Categories:: Machine Learning for Signal Processing
Signal Processing Theory and Methods

57 Views

MUSIC CHORD RECOGNITION BASED ON MIDI-TRAINED DEEP FEATURE AND BLSTM-CRF HYBIRD DECODING

In this paper, we design a novel deep learning based hybrid system for automatic chord recognition. Currently, there is a bottleneck in the amount of enough annotated data for training robust acoustic models, as hand annotating time-synchronized chord labels requires professional musical skills and considerable labor. As a solution to this problem, we construct a large set of time synchronized MIDI-audio pairs, and use these data to train a Deep Residual Network (DRN) feature extractor, which can then estimate pitch class activations of real-world music audio recordings.

ICASSP2018Poster_WuYiming.pdf

ICASSP2018Poster_WuYiming.pdf (878)

Categories:: Music Signal Processing

93 Views

ROBUST FEATURE CLUSTERING FOR UNSUPERVISED SPEECH ACTIVITY DETECTION

Read more about ROBUST FEATURE CLUSTERING FOR UNSUPERVISED SPEECH ACTIVITY DETECTION
Log in to post comments

In certain applications such as zero-resource speech processing
or very-low resource speech-language systems, it might
not be feasible to collect speech activity detection (SAD) annotations.
However, the state-of-the-art supervised SAD techniques
based on neural networks or other machine learning
methods require annotated training data matched to the target
domain. This paper establish a clustering approach for fully
unsupervised SAD useful for cases where SAD annotations
are not available. The proposed approach leverages Hartigan

FINAL_April13_7.ICASSP-Poster-2018-DipSAD_v2.pdf

FINAL_April13_7.ICASSP-Poster-2018-DipSAD_v2.pdf (438)

7 Views

HYBRID LSTM-FSMN NETWORKS FOR ACOUSTIC MODELING

Read more about HYBRID LSTM-FSMN NETWORKS FOR ACOUSTIC MODELING
Log in to post comments

FLMN Poster.pdf

FLMN Poster.pdf (338)

Categories:: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

24 Views

COMPARING THE INFLUENCE OF DEPTH AND WIDTH OF DEEP NEURAL NETWORK BASED ON FIXED NUMBER OF PARAMETERS FOR AUDIO EVENT DETECTION

Deep Neural Network (DNN) is a basic method used for the rare Acoustic Event Detection (AED) in synthesised audio. The structure of DNNs including Multi-Layer Perceptron (MLP) and Recurrent Neural Network (RNN) for AED tasks has rather fewer hidden layers compared with computer vision systems. This paper tries to demonstrate that a DNN with more hidden layers does not necessarily guarantee a better performance in AED tasks.

COMPARING THE INFLUENCE OF DEPTH AND WIDTH OF DEEP NEURAL NETWORK BASED ON FIXED NUMBER OF PARAMETERS FOR AUDIO EVENT DETECTION.pdf

COMPARING THE INFLUENCE OF DEPTH AND WIDTH OF DEEP NEURAL NETWORK BASED ON FIXED NUMBER OF PARAMETERS FOR AUDIO EVENT DETECTION.pdf (469)

Categories:: Other applications of machine learning (MLR-APPL)

125 Views

L0-REGULARIZED HYBRID GRADIENT SPARSITY PRIORS FOR ROBUST SINGLE-IMAGE BLIND DEBLURRING

Single-image blind deblurring is a challenging ill-posed in- verse problem which aims to estimate both blur kernel and latent sharp image from only one observation. This paper fo- cuses on first estimating the blur kernel alone and then restor- ing the latent image since it has been proven to be more feasi- ble to handle the ill-posed nature during blind deblurring. To estimate an accurate blur kernel, L0-norm of both first- and second-order image gradients is proposed to regularize the final estimation result.

ICASSP2018-LECTURE .pdf

ICASSP2018-LECTURE .pdf (385)

Categories:: Image/Video Storage, Retrieval

38 Views

Pages