BLOCK-SPARSE ADVERSARIAL ATTACK TO FOOL TRANSFORMER-BASED TEXT CLASSIFIERS
Recently, it has been shown that, despite their significant performance in many fields, deep neural networks are vulnerable to adversarial examples. In this paper, we propose a gradient-based adversarial attack against transformer-based text classifiers. The adversarial perturbation in our method is constrained to be block-sparse, so that the resulting adversarial example differs from the original sentence in only a few words. Because textual data are discrete, we perform gradient projection to find the minimizer of our proposed optimization problem.
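The block-sparsity constraint described above can be illustrated with a minimal sketch. This is not the authors' implementation; it only shows, under assumed shapes, how a perturbation on word embeddings could be projected so that only `k` word blocks remain nonzero (keeping the blocks with the largest norm):

```python
import numpy as np

def project_block_sparse(delta, k):
    """Project a perturbation onto the block-sparse set: keep only the k
    word blocks with the largest L2 norm and zero out the rest.

    delta: array of shape (num_words, embed_dim), a perturbation applied
           to the word embeddings of a sentence (assumed layout).
    k:     number of words allowed to change.
    """
    norms = np.linalg.norm(delta, axis=1)   # one L2 norm per word block
    keep = np.argsort(norms)[-k:]           # indices of the k largest blocks
    projected = np.zeros_like(delta)
    projected[keep] = delta[keep]           # retain only the selected blocks
    return projected
```

In a projected-gradient loop, this step would follow each gradient update on the embedding perturbation, ensuring the final adversarial sentence differs from the original in at most `k` words.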
Defending DNN Adversarial Attacks with Pruning and Logits Augmentation
Deep neural networks (DNNs) have been shown to be powerful models that perform extremely well on many complicated artificial intelligence tasks. However, recent research has found that these powerful models are vulnerable to adversarial attacks: imperceptible perturbations intentionally added to DNN inputs can easily mislead the networks with extremely high confidence. In this work, we enhance the robustness of DNNs under adversarial attack using pruning and logits augmentation, achieving both an effective defense against adversarial examples and DNN model compression.
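The pruning component of such a defense can be sketched as simple magnitude pruning. This is an illustrative assumption, not the paper's exact procedure: weights whose magnitude falls in the smallest fraction `sparsity` of a layer are zeroed, which compresses the model while removing parameters an attacker's gradient could exploit:

```python
import numpy as np

def prune_by_magnitude(weights, sparsity):
    """Zero out the smallest-magnitude fraction `sparsity` of the weights.

    weights:  array of layer weights (any shape).
    sparsity: fraction in [0, 1) of weights to remove.
    """
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)           # number of weights to prune
    if k == 0:
        return weights.copy()
    threshold = np.partition(flat, k - 1)[k - 1]  # k-th smallest magnitude
    pruned = weights.copy()
    pruned[np.abs(weights) <= threshold] = 0.0    # drop weights at/below it
    return pruned
```

In practice, pruning is usually followed by fine-tuning to recover accuracy; the logits-augmentation part of the defense would additionally modify the training targets, which is outside the scope of this sketch.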