Sorry, you need to enable JavaScript to visit this website.

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

A common problem in audio forensics is the difficulty to
authenticate an audio recording. In this paper we provide a novel
and reliable solution to this problem by making use of a control
signal, visible and audible on the actual recording, yet ignored by
the listener, the TIC-TAC signal. We describe our live watermark
solution, we incorporate it in an integrity check algorithm and we
provide meaningful preliminary tests. Their results, computed in
terms of precision show an outstanding performance: 100%

Categories:
39 Views

In this paper, we propose a privacy-preserving framework for outsourced media search applications. Considering three parties, a data owner, clients and a server, the data owner outsources the description of his data to an external server, which provides a search service to clients on the behalf of the data owner. The proposed framework is based on a sparsifying transform with ambiguization, which consists of a trained linear map, an element-wise nonlinearity and a privacy amplification.

Categories:
5 Views

A new technique for representing speech articulation with
an ultrasound-driven finite element model of the tongue is
presented. By using a snake contour extraction algorithm
with anatomically motivated constraints and a common
coordinate system between the ultrasound and the tongue
model, it is possible for the first time to obtain a realistic 3D
simulation of the tongue directly from a non-invasive sensor
(ultrasound), without mapping through any intermediate
sensor modalities, and at near real-time frame rates.

Categories:
19 Views

We introduce a model-based reconstruction
framework with deep learned (DL) and smoothness regularization
on manifolds (STORM) priors to recover free
breathing and ungated (FBU) cardiac MRI from highly undersampled
measurements. The DL priors enable us to exploit
the local correlations, while the STORM prior enables
us to make use of the extensive non-local similarities that are
subject dependent. We introduce a novel model-based formulation
that allows the seamless integration of deep learning

Categories:
4 Views

This paper focuses on face spoofing detection using video. The purpose is to find out the best scheme for this task in the end-to-end learning manner. We investigate 4 different types of structure to fully exploit the raw data in its spatial-temporal domain, which are the pure CNN, CNN with 3D convolu-tion, CNN+LSTM and CNN+Conv-LSTM. Moreover, anoth-er stream built on optical flow is also used, and with a proper fusion method, it can improve the accuracy. In experiments, we compare schemes on the raw data in single stream and fusion methods with optical flow in two streams.

Categories:
10 Views

There is growing interest in understanding the impact of architectural parameters such as depth, width, and the type of
activation function on the performance of a neural network. We provide an upper-bound on the number of free parameters
a ReLU-type neural network needs to exactly fit the training data. Whether a net of this size generalizes to test data will
be governed by the fidelity of the training data and the applicability of the principle of Occam’s Razor. We introduce the

Categories:
128 Views

This paper proposes an algorithm for the design of entropy-constrained unrestricted polar quantizer (ECUPQ) for bivariate circularly symmetric sources. The algorithm is globally optimal for the class of ECUPQs with magnitude quantizer thresholds confined to a finite set. The optimization problem is formulated as the minimization of a weighted sum of the distortion and entropy and the proposed solution is based on modeling the problem as a minimum-weight path problem in a certain weighted directed acyclic graph. The proposed algorithm enables solving the overall problem in

Categories:
1 Views

Extracting spatial information from an audio recording is a necessary step for upmixing stereo tracks to be played on surround systems. One important spatial feature is the perceived direction of the different audio sources in the recording, which determines how to remix the different sources in the surround system. The focus of this paper is the separation of two types of audio sources: primary (direct) and ambient (surrounding) sources. Several approaches have been proposed to solve the problem, based mainly on the correlation between the two channels in the stereo recording.

Categories:
66 Views

Pages