ICASSP 2017

ICASSP is the world's largest and most comprehensive technical conference on signal processing and its applications. It provides a fantastic networking opportunity for like-minded professionals from around the world. ICASSP 2017 conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics. Visit ICASSP 2017

THE SHEFFIELD SEARCH AND RESCUE CORPUS

Read more about THE SHEFFIELD SEARCH AND RESCUE CORPUS
Log in to post comments

As part of an ongoing research into extracting mission-critical information from Search and Rescue speech communications, a corpus of unscripted, goal-oriented, two-party spoken conversations has been designed and collected. The Sheffield Search and Rescue (SSAR) corpus comprises about 12 hours of data from 96 conversations by 24 native speakers of British English with a southern accent. Each conversation is about a collaborative task of exploring and estimating a simulated indoor environment.

posterA0.pdf

Poster: THE SHEFFIELD SEARCH AND RESCUE CORPUS (333)

Categories:: Spoken language resources and annotation (SLP-REAN)
Spoken Language Understanding (SLP-UNDE)

8 Views

NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION

Read more about NOVEL AMPLITUDE SCALING METHOD FOR BILINEAR FREQUENCY WARPING-BASED VOICE CONVERSION
Log in to post comments

In frequency warping (FW)-based Voice Conversion (VC), the source spectrum is modified to match the frequency-axis of the target spectrum followed by an Amplitude Scaling (AS) to compensate the amplitude differences between the warped spectrum and the actual target spectrum. In this paper, we propose a novel AS technique which linearly transfers the amplitude of the frequency warped spectrum using the knowledge of a Gaussian Mixture Model (GMM)-based converted spectrum without adding any spurious peaks.

ICASSP_2017_NH.pdf

ICASSP_2017_NH.pdf (637)

Categories:: Speech Synthesis and Generation, including TTS (SPE-SYNT)

20 Views

Dual-Tree Wavelet Scattering Network with Parametric Log Transformation for Object Classification

ICASSP-Final.pdf

ICASSP-Final.pdf (322)

Categories:: Machine Learning for Signal Processing

16 Views

Online Empirical Mode Decomposition

Read more about Online Empirical Mode Decomposition
Log in to post comments

The success of Empirical Mode Decomposition (EMD) resides in its practical approach to dissect non-stationary data. EMD repetitively goes through the entire data span to iteratively extract Intrinsic Mode Functions (IMFs). This approach, however, is not suitable for data stream as the entire data set has to be reconsidered every time a new point is added. To overcome this, we propose Online EMD, an algorithm that extracts IMFs on the fly.

icassp17_poster_onlineEMD.pdf

icassp17_poster_onlineEMD.pdf (394)

Categories:: Signal Processing Theory and Methods

87 Views

SPEAKER SEGMENTATION USING DEEP SPEAKER VECTORS FOR FAST SPEAKER CHANGE SCENARIOS

Read more about SPEAKER SEGMENTATION USING DEEP SPEAKER VECTORS FOR FAST SPEAKER CHANGE SCENARIOS
Log in to post comments

A novel speaker segmentation approach based on deep neural network is proposed and investigated. This approach uses deep speaker vectors (d-vectors) to represent speaker characteristics and to ﬁnd speaker change points. The d-vector is a kind of frame-level speaker recognition feature, whose discriminative training process corresponds to the goal of discriminating a speaker change point from a single speaker speech segment in a short time window.

170223-icassp-poster-wry[02-wry].pdf

SPEAKER SEGMENTATION USING DEEP SPEAKER VECTORS FOR FAST SPEAKER CHANGE SCENARIOS (25)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

13 Views

Double Relay Communication Protocol with Power Control for Achieving Fairness in Cellular Systems

The growing demand for wireless connectivity has turned bandwidth into a scarce resource that has to be carefully managed and fairly distributed to users. However, the variability of the wireless channel can severely degrade the service received by each user. The Double Relay Communication Protocol (DRCP) is a transmission scheme that addresses these problems by exploiting spatial diversity to enhance the fairness of the system without requiring any additional infrastructure (i.e relay nodes or a backhaul connection).

PowerContol_ICASSP_poster.pdf

PowerContol_ICASSP_poster.pdf (757)

Categories:: Communications and Networking

22 Views

Wirtinger Flow Method with Optimal Stepsize for Phase Retrieval

Read more about Wirtinger Flow Method with Optimal Stepsize for Phase Retrieval
Log in to post comments

The recently reported Wirtinger flow (WF) algorithm has been demonstrated as a promising method for solving the problem of phase retrieval by applying a gradient descent scheme. An empirical choice of stepsize is suggested in practice. However, this heuristic stepsize selection rule is not optimal. In order to accelerate the convergence rate, we propose an improved WF with optimal stepsize. It is revealed that this optimal stepsize is the solution of a univariate cubic equation with real-valued coefficients.