ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

A Coupled Compressive Sensing Scheme for Unsourced Multiple Access

Read more about A Coupled Compressive Sensing Scheme for Unsourced Multiple Access
Log in to post comments

slides_ICASSP.pdf

slides_ICASSP.pdf (449)

Categories:: Communication and Sensing aspects of Sensor Networks, Wireless and Ad-Hoc Networks

13 Views

SEQUENCE-BASED MULTI-LINGUAL LOW RESOURCE SPEECH RECOGNITION

Read more about SEQUENCE-BASED MULTI-LINGUAL LOW RESOURCE SPEECH RECOGNITION
Log in to post comments

Techniques for multi-lingual and cross-lingual speech recognition can help in low resource scenarios, to bootstrap systems and enable analysis of new languages and domains. End-to-end approaches, in particular sequence-based techniques, are attractive because of their simplicity and elegance. While it is possible to integrate traditional multi-lingual bottleneck feature extractors as front-ends, we show that end-to-end multi-lingual training of sequence models is effective on context independent models trained using Connectionist Temporal Classification (CTC) loss.

Dalmia_ICASSP_2018.pdf

Dalmia_ICASSP_2018.pdf (505)

Categories:: Multilingual Recognition and Identification (SPE-MULT)

15 Views

END-TO-END DNN BASED SPEAKER RECOGNITION INSPIRED BY I-VECTOR AND PLDA

Read more about END-TO-END DNN BASED SPEAKER RECOGNITION INSPIRED BY I-VECTOR AND PLDA
Log in to post comments

End-to-End_ICASSP_2018.pdf

End-to-End_ICASSP_2018.pdf (589)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

16 Views

Scheduling of Multistatic Sonobuoy Fields using Multi-Objective Optimization

Read more about Scheduling of Multistatic Sonobuoy Fields using Multi-Objective Optimization
Log in to post comments

Sonobuoy fields, comprising a network of transmitters and receivers, are commonly deployed to find and track underwater targets. For a given environment and sonobuoy field layout, the performance of such a field depends on the scheduling, that is, deciding which source should transmit, and which from a library of available waveforms should be transmitted at any given time. In this paper, we propose a novel scheduling framework based on multi-objective optimization. Specifically, we pose the two tasks of the sonobuoy field—tracking and searching—as separate, competing, objective functions.

Sonar_Slides.pdf

Slides (640)

Categories:: Sensor and Relay Networks

104 Views

Virtual Pulse Design for IEEE 802.11ad-Based Joint Communication-Radar

Read more about Virtual Pulse Design for IEEE 802.11ad-Based Joint Communication-Radar
Log in to post comments

The millimeter wave WLAN standard can be used for joint communication-radar by exploiting the waveform preamble as a radar pulse. The velocity estimation accuracy with this approach, however, is limited due to the short integration time. A physical increase in the radar pulse integration duration, however, leads to a decrease in the communication data rate.

Poster_VIRTUAL PULSE DESIGN FOR IEEE 802.11AD-BASED JOINT COMMUNICATION-RADAR.pdf

Poster_VIRTUAL PULSE DESIGN FOR IEEE 802.11AD-BASED JOINT COMMUNICATION-RADAR.pdf (396)

Categories:: Applications of Sensor Array and Multi-channel Signal Processing

28 Views

A MULTI-PERSPECTIVE APPROACH TO ANOMALY DETECTION FOR SELF-AWARE EMBODIED AGENTS

Read more about A MULTI-PERSPECTIVE APPROACH TO ANOMALY DETECTION FOR SELF-AWARE EMBODIED AGENTS
Log in to post comments

This paper focuses on multi-sensor anomaly detection for moving cognitive agents using both external and private first-person visual observations. Both observation types are used to characterize agents’ motion in a given environment. The proposed method generates locally uniform motion models by dividing a Gaussian process that approximates agents’ displacements on the scene and provides a Shared Level (SL) self-awareness based on Environment Centered (EC) models.

SS-L2.5 A MULTI-PERSPECTIVE APPROACH TO ANOMALY DETECTION FOR SELF-AWARE EMBODIED AGENTS.pdf

SS-L2.5 A MULTI-PERSPECTIVE APPROACH TO ANOMALY DETECTION FOR SELF-AWARE EMBODIED AGENTS.pdf (612)

Categories:: Applications in Data Fusion (MLR-FUSI)
Bio-inspired multimedia systems and signal processing
Image/Video Processing

24 Views

Regularized SVD-based Video Frame Saliency for Unsupervised Activity Video Summarization

Storage, browsing and analysis of human activity videos can be significantly facilitated by automated video summarization. Unsupervised key-frame extraction remains the most widely applicable technique for summarizing activity videos. However, their specific properties make the problem difficult to solve. Typical relevant algorithms fall under the video frame clustering or the dictionary-of-representatives families, with salient dictionary learning having been recently proposed.

Poster.pdf

Regularized SVD-based Video Frame Saliency for Unsupervised Activity Video Summarization (821)

Categories:: Image/Video Storage, Retrieval

6 Views

A NOVEL SELECTIVE ACTIVE NOISE CONTROL ALGORITHM TO OVERCOME PRACTICAL IMPLEMENTATION ISSUE

Selective active noise control (SANC) is a method
to select a pre-trained control filter for different
primary noises, instead of using conventional
real-time computation of the control filter coefficients.
This paper:
1. Proves the frequency-band-match method.
2. Propose a SANC based on a partitioned frequency
domain filter.
3. Both simulation and real-time experiment is
carried out to validate the algorithm.

ICASSP2018_Poster_DY2.pdf

ICASSP2018_Poster_DY2.pdf (558)

Categories:: Active Noise Control

30 Views

AN IMMERSIVE 3D AUDIO HEADSET FOR VIRTUAL AND AUGMENTED REALITY

Read more about AN IMMERSIVE 3D AUDIO HEADSET FOR VIRTUAL AND AUGMENTED REALITY
Log in to post comments

ICASSP 2018 POSTER_Final.pdf

ICASSP 2018 POSTER_Final.pdf (417)

Categories:: Spatial and Multichannel Audio

35 Views

Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification

Read more about Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification
Log in to post comments

Phonetic variability is one of the primary challenges in short duration speaker verification. This paper proposes a novel method that modifies the standard normal distribution prior in the total variability model to use a mixture of Gaussians as the prior distribution. The proposed speaker-phonetic vectors are then estimated from the posterior probability of latent variables, and each vector has a phonetic meaning.

JIANBOMA_ICASSP_2018.pdf

JIANBOMA_ICASSP_2018.pdf (534)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

46 Views

Pages