ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

COVER SONG IDENTIFICATION USING SONG-TO-SONG CROSS-SIMILARITY MATRIX WITH CONVOLUTIONAL NEURAL NETWORK

In this paper, we propose a cover song identification algorithm using a convolutional neural network (CNN). We first train the CNN model to classify any non-/cover relationship, by feeding a cross-similarity matrix that is generated from a pair of songs as an input. Our main idea is to use the CNN output–the cover-probabilities of one song to all other candidate songs–as a new representation vector for measuring the distance between songs. Based on this, the present algorithm searches cover songs by applying several ranking methods: 1. sorting without using the representation vectors; 2.

ICASSP POSTER_수정8.pdf

ICASSP POSTER_수정8.pdf (370)

Categories:: Music Signal Processing

36 Views

NO-REFERENCE HDR IMAGE QUALITY ASSESSMENT METHOD BASED ON TENSOR SPACE

Read more about NO-REFERENCE HDR IMAGE QUALITY ASSESSMENT METHOD BASED ON TENSOR SPACE
Log in to post comments

Feifan Guan_ICASSP2018_Paper#1018.pdf

Feifan Guan_ICASSP2018_Paper#1018.pdf (305)

Categories:: Image/Video Processing

26 Views

Compressive Regularized Discriminant Analysis of High-Dimensional Data with Applications to Microarray Studies

We propose a modification of linear discriminant analysis, referred to as compressive regularized discriminant analysis (CRDA), for analysis of high-dimensional datasets. CRDA is specially designed for feature elimination purpose and can be used as gene selection method in microarray studies. CRDA lends ideas from ℓq,1 norm minimization algorithms in the multiple measurement vectors (MMV) model and utilizes joint-sparsity promoting hard thresholding for feature elimination.

tabassum_icassp18P.pdf

tabassum_crda-hd_poster (506)

Categories:: Statistical Signal Processing
Bioinformatics

10 Views

SINGLE DEPTH IMAGE SUPER-RESOLUTION USING CONVOLUTIONAL NEURAL NETWORKS

Read more about SINGLE DEPTH IMAGE SUPER-RESOLUTION USING CONVOLUTIONAL NEURAL NETWORKS
Log in to post comments

In this paper, we propose single depth image super-resolution using convolutional neural networks (CNN). We adopt CNN to acquire a high-quality edge map from the input low-resolution (LR) depth image. We use the high-quality edge map as the weight of the regularization term in a total variation (TV) model for super-resolution. First, we interpolate the LR depth image using bicubic interpolation and extract its low-quality edge map. Then, we get the high-quality edge map from the low-quality one using CNN.

ICASSP2018poster_Depth_rev_final .pdf

ICASSP2018poster_Depth_rev_final .pdf (384)

Categories:: Signal and System Modeling, Representation and Estimation

43 Views

Attention-based Dialog State Tracking for Conversational Interview Coaching

Read more about Attention-based Dialog State Tracking for Conversational Interview Coaching
Log in to post comments

This study proposes an approach to dialog state tracking (DST) in a conversational interview coaching system. For the interview coaching task, the semantic slots, used mostly in traditional dialog systems, are difficult to define manually. This study adopts the topic profile of the response from the interviewee as the dialog state representation. In addition, as the response generally consists of several sentences, the summary vector obtained from a long short-term memory neural network (LSTM) is likely to contain noisy information from many irrelevant sentences.

ICASSP2018_Poster_20180410-3_Wu.pdf

ICASSP2018_Poster_20180410-3_Wu.pdf (598)

Categories:: Spoken and Multimodal Dialog Systems and Applications (SLP-SMMD)

11 Views

ENVELOPE ESTIMATION BY TANGENTIALLY CONSTRAINED SPLINE

Read more about ENVELOPE ESTIMATION BY TANGENTIALLY CONSTRAINED SPLINE
Log in to post comments

Estimating envelope of a signal has various applications including empirical mode decomposition (EMD) in which the cubic $C^2$-spline based envelope estimation is generally used. While such functional approach can easily control smoothness of an estimated envelope, the so-called undershoot problem often occurs that violates the basic requirement of envelope. In this paper, a tangentially constrained spline with tangential points optimization is proposed for avoiding the undershoot problem while maintaining smoothness.

ICASSP2018kusano.pdf

ICASSP2018kusano.pdf (397)

Categories:: Signal and System Modeling, Representation and Estimation

8 Views

PARAMETRIC APPROXIMATION OF PIANO SOUND BASED ON KAUTZ MODEL WITH SPARSE LINEAR PREDICTION

The piano is one of the most popular and attractive musical instruments that leads to a lot of research on it.
To synthesize the piano sound in a computer, many modeling methods have been proposed from full physical models to approximated models. The focus of this paper is on the latter, approximating piano sound by an IIR filter.

ICASSP2018_Kobayashi_04_03.pdf

ICASSP2018_Kobayashi_04_03.pdf (408)

Categories:: Music Signal Processing

12 Views

SINGLE DEPTH IMAGE SUPER-RESOLUTION USING CONVOLUTIONAL NEURAL NETWORKS

Read more about SINGLE DEPTH IMAGE SUPER-RESOLUTION USING CONVOLUTIONAL NEURAL NETWORKS
Log in to post comments

In this paper, a novel framework for the single depth image superresolution is proposed. In our framework, we ﬁrst extract a low-quality edge map from an interpolated depth map.Then we transform the low-quality edge map to a high quality one by our trained deep convolution neural network (CNN) with two-step postprocessing. Guided by the high-quality edge map, we ﬁnally utilize a total variation (TV) based model to upsample the initial depth map.

ICASSP2018poster_Depth_rev_final.pdf

ICASSP2018poster_Depth_rev_final.pdf (397)

Categories:: Signal and System Modeling, Representation and Estimation

21 Views

REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION

Read more about REDUCING MODEL COMPLEXITY FOR DNN BASED LARGE-SCALE AUDIO CLASSIFICATION
Log in to post comments

icassp2018_yzwu_poster_ver5.pdf

icassp2018_yzwu_poster_ver5.pdf (489)

Categories:: Audio and Acoustic Signal Processing

17 Views

AUTOMATIC SPEECH ASSESSMENT FOR APHASIC PATIENTS BASED ON SYLLABLE-LEVEL EMBEDDING AND SUPRA-SEGMENTAL DURATION FEATURES

Aphasia is a type of acquired language impairment resulting from brain injury. Speech assessment is an important part of the comprehensive assessment process for aphasic patients. It is based on the acoustical and linguistic analysis of patients’ speech elicited through pre-defined story-telling tasks. This type of narrative spontaneous speech embodies multi-fold atypical characteristics related to the underlying language impairment.

poster_QinYing_ICASSP2018_final.pdf

poster_QinYing_ICASSP2018_final.pdf (555)

Categories:: Speech Processing

5 Views

Pages