ICASSP 2021

ICASSP 2021, the IEEE International Conference on Acoustics, Speech and Signal Processing, is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics, and will provide an opportunity to network with like-minded professionals from around the world.

A NOISE-ROBUST SIGNAL PROCESSING STRATEGY FOR COCHLEAR IMPLANTS USING NEURAL NETWORKS

icassp_2021_poster_v2.pdf (334)

Categories:: Speech Perception and Psychoacoustics (SPE-SPER)

16 Views

Dense Feature Pyramid Grids Network for Single Image Deraining

Rain degrades the visual quality of images, which can reduce the accuracy of various applications. In this paper, we propose a novel densely connected network with Dense Feature Pyramid Grids (DFPG) modules, called DFPGN, to solve the rain removal task. Specifically, in the proposed DFPG module, each layer takes as input five operations from different layers with various pathways and scales, so that it can fuse features from both shallower and deeper layers to improve the deraining ability of the network.

ICASSP2021.pptx (261)

Categories:: Image/Video Coding

10 Views

ICASSP2021_slides

20210410_ICASSP2021.pptx (293)

Categories:: Speech Enhancement (SPE-ENHA)

11 Views

Improved Probabilistic Context-free Grammars for Passwords Using Word Extraction

Probabilistic context-free grammars (PCFGs) have been proposed to capture password distributions, and have further been used in password guessing attacks and password strength meters. However, current PCFGs segment passwords inaccurately, which leads to misestimation of password probabilities and thus seriously affects their performance. In this paper, we propose a word extraction approach for passwords, and further present an improved PCFG model, called WordPCFG.
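To make the segmentation issue concrete, here is a minimal sketch of how a conventional PCFG might score a password. It is not the WordPCFG model from the paper. It assumes the classic scheme in which a password is split into runs of letters (L), digits (D), and symbols (S), and its probability is the product of the structure probability and the probability of each segment. The function names and the toy probability tables are hypothetical.

```python
from itertools import groupby


def char_class(ch):
    """Map a character to a coarse class: letter, digit, or symbol."""
    if ch.isalpha():
        return "L"
    if ch.isdigit():
        return "D"
    return "S"


def segment(password):
    """Split a password into (tag, text) runs, e.g. ("L8", "password")."""
    segments = []
    for cls, run in groupby(password, key=char_class):
        text = "".join(run)
        segments.append((f"{cls}{len(text)}", text))
    return segments


def probability(password, structure_probs, terminal_probs):
    """P(password) = P(structure) * product of P(segment text | tag).

    Both tables are assumed to have been estimated from a training set.
    Unseen structures or segments get probability 0 (no smoothing).
    """
    segs = segment(password)
    structure = tuple(tag for tag, _ in segs)
    p = structure_probs.get(structure, 0.0)
    for tag, text in segs:
        p *= terminal_probs.get(tag, {}).get(text, 0.0)
    return p


# Toy tables, invented purely for illustration.
structure_probs = {("L8", "D3"): 0.5}
terminal_probs = {"L8": {"password": 0.5}, "D3": {"123": 0.25}}
print(segment("password123"))
print(probability("password123", structure_probs, terminal_probs))
```

Under this scheme a password such as "iloveyou" is treated as one eight-letter run rather than three words, which is the kind of coarse segmentation that a word extraction step would aim to refine.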

ICASSP2021_slides.pdf (674)

poster.pdf (538)

Categories:: Communications and Network Security

15 Views

Have You Made A Decision? Where? A Pilot Study on Interpretability of Polarity Analysis Based on Advising Problem

General approaches to polarity analysis in dialogue, e.g., Multiple Instance Learning (MIL), have achieved significant progress.
However, one significant drawback of current approaches is that the contribution of an utterance towards the overall polarity is a black box.
For existing methods, the polarity contained in each utterance, which we call meta-polarity, is not explicitly utilized.
In this paper, we study the problem of adding interpretability to the overall polarity by predicting the meta-polarity at the same time.

ICASSP2021.zip

ICASSP2021 presentation and poster (288)

Categories:: Speech Processing

8 Views

Application-Layer DDoS Attacks with Multiple Emulation Dictionaries

We consider the problem of identifying the members of a botnet under an application-layer (L7) DDoS attack, where a target site is flooded with a large number of requests that emulate legitimate users’ patterns. This challenging problem has recently been addressed with reference to two simplified scenarios, where either all bots pick requests from the same emulation dictionary (total overlap), or they are divided into separate clusters corresponding to distinct emulation dictionaries (no overlap at all).

poster_icassp_v1bis.pdf

Poster (338)

icassp_slides.pdf

Slides (386)

Categories:: Communications and Network Security

55 Views

Consensus Based Distributed Spectral Radius Estimation

A consensus based distributed algorithm to compute
the spectral radius of a network is proposed. The spectral radius
of the graph is the largest eigenvalue of the adjacency matrix, and
is a useful characterization of the network graph. Conventionally,
centralized methods are used to compute the spectral radius, which
involve eigenvalue decomposition of the adjacency matrix of the
underlying graph. Our distributed algorithm uses a simple update
rule to reach consensus on the spectral radius, using only local

IEEE-ICASSP-Paper-5626.pdf

Presentation (317)

Categories:: Communication and Sensing aspects of Sensor Networks, Wireless and Ad-Hoc Networks
Signal and System Modeling, Representation and Estimation

5 Views

Large Margin Training Improves Language Models for ASR

Language models (LMs) have been widely deployed in modern ASR systems. An LM is often trained by minimizing its perplexity on speech transcripts. However, few studies try to discriminate a "gold" reference against inferior hypotheses. In this work, we propose a large margin language model (LMLM). LMLM is a general framework that forces an LM to assign a higher score to the "gold" reference and a lower one to the inferior hypothesis. The general framework is applied to three pretrained LM architectures: left-to-right LSTM, transformer encoder, and transformer decoder.
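The idea of scoring the reference above the inferior hypotheses can be illustrated with a hinge-style margin loss. The sketch below is an assumption about one common way to express such an objective, not the loss defined in the paper. The function name, the margin value, and the example scores are all hypothetical.

```python
def large_margin_loss(gold_score, hypothesis_scores, margin=1.0):
    """Hinge-style loss (assumed form, not taken from the paper).

    Each inferior hypothesis contributes max(0, margin - (gold - hyp)),
    so the loss is zero once the gold reference outscores every
    hypothesis by at least `margin`.
    """
    losses = [max(0.0, margin - (gold_score - s)) for s in hypothesis_scores]
    return sum(losses) / len(losses) if losses else 0.0


# Hypothetical LM scores (e.g. log-probabilities shifted for readability).
print(large_margin_loss(2.0, [0.0, 1.5], margin=1.0))
```

In this example the first hypothesis is already separated from the reference by more than the margin and contributes nothing, while the second is too close and is penalized.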

ICASSP_LM_Jilin_Wang_04212021_slide_upload.pdf

Poster for "Large Margin Training Improves Language Models for ASR" (422)

Categories:: Language Modeling, for Speech and SLP (SLP-LANG)

18 Views

COMPLEX RATIO MASKING FOR SINGING VOICE SEPARATION

Music source separation is important for applications such as karaoke and remixing. Much previous research
focuses on estimating the magnitude of the short-time Fourier transform (STFT) and discards phase information. We observe that,
for singing voice separation, phase has the potential to make considerable improvement in separation quality. This paper
proposes a complex-domain deep learning method for voice and accompaniment separation. The proposed method employs