ICASSP 2021

ICASSP 2021 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2021 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

Federated Learning With Local Differential Privacy: Trade-Offs Between Privacy, Utility, and Communication

Federated learning (FL) allows to train a massive amount of data privately due to its decentralized structure. Stochastic gradient descent (SGD) is commonly used for FL due to its good empirical performance, but sensitive user information can still be inferred from weight updates shared during FL iterations. We consider Gaussian mechanisms to preserve local differential privacy (LDP) of user data in the FL model with SGD. The trade-offs between user privacy, global utility, and transmission rate are proved by defining appropriate metrics for FL with LDP.

ICASSP 2021 poster v3.pdf

The pdf file of the poster used for the poster session. (373)

ICASSP 2021 presentation wo video.pptx

The slides (.pptx file) used for the recorded presentation. (283)

Categories:: Other

86 Views

Speech Emotion Recognition based on Listener Adaptive Models

Read more about Speech Emotion Recognition based on Listener Adaptive Models
Log in to post comments

ICASSP21_EmotionListenerAdaptiveModels_v4.pdf

ICASSP21_EmotionListenerAdaptiveModels_v4.pdf (276)

Categories:: Speech Processing

26 Views

ON THE PREDICTABILITY OF HRTFS FROM EAR SHAPES USING DEEP NETWORKS

Read more about ON THE PREDICTABILITY OF HRTFS FROM EAR SHAPES USING DEEP NETWORKS
Log in to post comments

Head-Related Transfer Function (HRTF) individualization is critical for immersive and realistic spatial audio rendering in augmented/virtual reality. Neither measurements nor simulations using 3D scans of head/ear are scalable for practical applications. More efficient machine learning approaches are being explored recently, to predict HRTFs from ear images or anthropometric features. However, it is not yet clear whether such models can provide an alternative for direct measurements or high-fidelity simulations. Here, we aim to address this question.

ICASSP2021poster.pdf

poster (272)

ICASSP2021slides.pdf

presentation slides (259)

Categories:: Spatial and Multichannel Audio

49 Views

Single-Point Array Response Control with Minimum Pattern Deviation

Read more about Single-Point Array Response Control with Minimum Pattern Deviation
Log in to post comments

Presentation Slides_Xiaoyu Ai.pptx

Slides (368)

1597XiaoyuAi.pdf

Poster (265)

Categories:: Sensor Array Processing
Adaptive Array Signal Processing

8 Views

CHANNEL-WISE MIX-FUSION DEEP NEURAL NETWORKS FOR ZERO-SHOT LEARNING

Read more about CHANNEL-WISE MIX-FUSION DEEP NEURAL NETWORKS FOR ZERO-SHOT LEARNING
Log in to post comments

ICASSP 2021 Prez.pdf

ICASSP 2021 Prez.pdf (218)

Categories:: Neural network learning (MLR-NNLR)

8 Views

Training Neural Networks with Domain Pattern-Aware Auxiliary Task for Sleep Staging

Read more about Training Neural Networks with Domain Pattern-Aware Auxiliary Task for Sleep Staging
Log in to post comments

poster_domain_pattern.pdf

poster_domain_pattern.pdf (231)

Categories:: Biomedical signal processing

20 Views

A Unified Approach to Translate Classical Bandit algorithms to Structured Bandits

Read more about A Unified Approach to Translate Classical Bandit algorithms to Structured Bandits
Log in to post comments

ClassMLStructured.pdf

ClassMLStructured.pdf (226)

Categories:: Sequential learning; sequential decision methods (MLR-SLER)
Learning theory and algorithms (MLR-LEAR)

5 Views

A Causal Deep Learning Framework for Classifying Phonemes in Cochlear Implants

Read more about A Causal Deep Learning Framework for Classifying Phonemes in Cochlear Implants
Log in to post comments

Chu_ICASSP2021_Poster_v2.pdf

ICASSP poster (276)

Categories:: General Topics in Speech Recognition (SPE-GASR)

37 Views

A classifier for improving cause and effect in SSVEP-based BCIs for individuals with complex communication disorders

We present CCACUSUM, a classifier for steady-state visual evoked potential (SSVEP)-based brain-computer interfaces (BCIs) that determines whether a user is attending to a flickering stimulus or is at rest. Correct classification of these two states establishes cause and effect between the BCI and its user, which is essential for helping individuals with complex communication disorders (CCDs) communicate.

slides_ICASSP.pdf

ICASSP 2021 Presentation slides (210)

habib_poster_ICASSP_3.pdf

ICASSP 2021 Poster (236)

Categories:: Biomedical signal processing

54 Views

Wake Word Detection with Streaming Transformers

Read more about Wake Word Detection with Streaming Transformers
Log in to post comments

Modern wake word detection systems usually rely on neural networks for acoustic modeling. Transformers has recently shown superior performance over LSTM and convolutional networks in various sequence modeling tasks with their better temporal modeling power. However it is not clear whether this advantage still holds for short-range temporal modeling like wake word detection. Besides, the vanilla Transformer is not directly applicable to the task due to its non-streaming nature and the quadratic time and space complexity.

ICASSP2021_poster.pdf

ICASSP2021_poster.pdf (345)

Categories:: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

20 Views

Pages