Sorry, you need to enable JavaScript to visit this website.

TEN-GUARD: TENSOR DECOMPOSITION FOR BACKDOOR ATTACK DETECTION IN DEEP NEURAL NETWORKS

DOI:
10.60864/hwys-6958
Citation Author(s):
Khondoker Murad Hossain, Tim Oates
Submitted by:
Khondoker Hossain
Last updated:
6 June 2024 - 10:23am
Document Type:
Poster
Document Year:
2024
Event:
Paper Code:
MLSP-P20.7
 

As deep neural networks and the datasets used to train them get larger, the default approach to integrating them into re-
search and commercial projects is to download a pre-trained model and fine tune it. But these models can have uncertain
provenance, opening up the possibility that they embed hidden malicious behavior such as trojans or backdoors, where
small changes to an input (triggers) can cause the model toproduce incorrect outputs (e.g., to misclassify). This paper
introduces a novel approach to backdoor detection that uses two tensor decomposition methods applied to network activations. This has a number of advantages relative to existing
detection methods, including the ability to analyze multiple models at the same time, working across a wide variety of network architectures, making no assumptions about the nature of triggers used to alter network behavior, and being computationally efficient. We provide a detailed description of the detection pipeline along with results on models trained on
the MNIST digit dataset, CIFAR-10 dataset, and two difficult
datasets from NIST’s TrojAI competition. These results show
that our method detects backdoored networks more accurately
and efficiently than current state-of-the-art methods.

up
0 users have voted: