ICASSP 2020

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2020 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

Speaker-aware Training of Attention-based End-to-End Speech Recognition using Neural Speaker Embeddings

In speaker-aware training, a speaker embedding is appended to DNN input features. This allows the DNN to effectively learn representations, which are robust to speaker variability.
We apply speaker-aware training to attention-based end- to-end speech recognition. We show that it can improve over a purely end-to-end baseline. We also propose speaker-aware training as a viable method to leverage untranscribed, speaker annotated data.

icassp2020-slides.pdf

icassp2020-slides.pdf (341)

Categories:: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

53 Views

Blood Pressure Estimation from PPG Signals Using Convolutional Neural Networks and Siamese Network

Blood pressure (BP) is a vital sign of the human body and an important parameter for early detection of cardiovascular diseases. It is usually measured using cuff-based devices or monitored invasively in critically-ill patients. This paper presents two techniques that enable continuous and noninvasive cuff-less BP estimation using photoplethysmography (PPG) signals with Convolutional Neural Networks (CNNs). The first technique is calibration-free.

ICASSP Presentation - Blood Pressure Estimation from PPG Signals.pdf

ICASSP Presentation - Blood Pressure Estimation from PPG Signals.pdf (634)

Categories:: Biomedical signal processing

522 Views

Learning with Out of Distribution Data for Audio Classification

Read more about Learning with Out of Distribution Data for Audio Classification
Log in to post comments

In supervised machine learning, the assumption that training data is labelled correctly is not always satisfied. In this paper, we investigate an instance of labelling error for classification tasks in which the dataset is corrupted with out-of-distribution (OOD) instances: data that does not belong to any of the target classes, but is labelled as such. We show that detecting and relabelling certain OOD instances, rather than discarding them, can have a positive effect on learning.

Presentation.pdf

Presentation.pdf (581)

Categories:: Audio and Acoustic Signal Processing

18 Views

Statistical Signal Processing Approach For Rain Estimation Based on Measurements From Network Management Systems

In this talk we present statistical signal processing methodologies on a real-world application of using Commercial Microwave Links (CMLs) as opportunistic sensors for rain monitoring. We formulate an appropriate parameter estimation problem, taking advantage on the empirically evaluated statistics of the rain, and present a new methodology for rain estimation given only the quantized minimum and maximum radio signal level measurements, which are being logged regularly by the network management systems.

ICASSP2020_JO_V6_NONaration.pdf

Presentation Slides (416)

Categories:: Nonlinear Systems and Signal Processing

15 Views

AV(SE)²: AUDIO-VISUAL SQUEEZE-EXCITE SPEECH ENHANCEMENT

Read more about AV(SE)²: AUDIO-VISUAL SQUEEZE-EXCITE SPEECH ENHANCEMENT
Log in to post comments

AVSE2 Presentation.pdf

AVSE2 Presentation.pdf (468)

Categories:: Speech Enhancement (SPE-ENHA)

77 Views

A segmentation based deep learning framework for multimodal retinal image registration

Multimodal image registration plays an important role in diagnosing and treating ophthalmologic diseases. In this paper, a deep learning framework for multimodal retinal image registration is proposed. The framework consists of a segmentation network, feature detection and description network, and an outlier rejection network, which focuses only on the globally coarse alignment step using the perspective transformation.

ICASSP_slides_final.pdf

ICASSP_slides_final.pdf (439)

Categories:: Medical image analysis

45 Views

BP-VB-EP Based Static and Dynamic Sparse Bayesian Learning

Read more about BP-VB-EP Based Static and Dynamic Sparse Bayesian Learning
Log in to post comments

ICASSP20_presentation.pdf

ICASSP20_presentation.pdf (305)

Categories:: Statistical Signal Processing

15 Views

Hybrid Precoding for Secure Transmission in Reflect-Array-Assisted Massive MIMO Systems

Recently, a hybrid analog-digital architecture has been proposed for multiuser MIMO transmission in the millimeter-wave spectrum using reflect-arrays. The architecture exhibits scalability and high energy-efficiency while keeping the transmitter cost-efficient. Inspired by this architecture, we design a secure multiuser hybrid analog-digital precoding scheme. This scheme utilizes the method of regularized least-squares to shape the downlink beamformers, such that the signal received via malicious terminals is effectively suppressed.

icassp20_irs_tak.pdf

Presentation Slides (381)

Categories:: MIMO Communications and Signal Processing
Communications and Network Security

107 Views

Primary Path Estimator based on Individual Secondary Path for ANC Headphones

Read more about Primary Path Estimator based on Individual Secondary Path for ANC Headphones
Log in to post comments

2020_ICASSP_PPE.pdf

2020_ICASSP_PPE.pdf (505)

Categories:: Active Noise Control

76 Views

Revisiting Fast Spectral Clustering with Anchor Graph

Read more about Revisiting Fast Spectral Clustering with Anchor Graph
Log in to post comments

In this paper, we revisit the popular affinity matrix based on the anchor graph and point out that the spectral embedding obtained using symmetric normalized Laplacian is only a side view of the bipartite structure. Based on the analysis, we propose Fast Spectral Clustering based on the Random Walk Laplacian (FRWL) method to explicitly balance the popularity of anchors and the independence of data points, which is especially important for clustering of boundary points.

Icassp_Slides.pdf

large-scale spectral clustering (1033)

Categories:: Graphical and kernel methods (MLR-GRKN)

49 Views

Pages