Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips

Bai, Jiawang; Gao, Kuofeng; Gong, Dihong; Xia, Shu-Tao; Li, Zhifeng; Liu, Wei

Computer Science > Computer Vision and Pattern Recognition

arXiv:2207.13417 (cs)

[Submitted on 27 Jul 2022]

Title:Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips

Authors:Jiawang Bai, Kuofeng Gao, Dihong Gong, Shu-Tao Xia, Zhifeng Li, Wei Liu

View PDF

Abstract:The security of deep neural networks (DNNs) has attracted increasing attention due to their widespread use in various applications. Recently, the deployed DNNs have been demonstrated to be vulnerable to Trojan attacks, which manipulate model parameters with bit flips to inject a hidden behavior and activate it by a specific trigger pattern. However, all existing Trojan attacks adopt noticeable patch-based triggers (e.g., a square pattern), making them perceptible to humans and easy to be spotted by machines. In this paper, we present a novel attack, namely hardly perceptible Trojan attack (HPT). HPT crafts hardly perceptible Trojan images by utilizing the additive noise and per pixel flow field to tweak the pixel values and positions of the original images, respectively. To achieve superior attack performance, we propose to jointly optimize bit flips, additive noise, and flow field. Since the weight bits of the DNNs are binary, this problem is very hard to be solved. We handle the binary constraint with equivalent replacement and provide an effective optimization algorithm. Extensive experiments on CIFAR-10, SVHN, and ImageNet datasets show that the proposed HPT can generate hardly perceptible Trojan images, while achieving comparable or better attack performance compared to the state-of-the-art methods. The code is available at: this https URL.

Comments:	Accepted to ECCV2022; Code: this https URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2207.13417 [cs.CV]
	(or arXiv:2207.13417v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2207.13417

Submission history

From: Jiawang Bai [view email]
[v1] Wed, 27 Jul 2022 09:56:17 UTC (9,663 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Computer Vision and Pattern Recognition

Title:Hardly Perceptible Trojan Attack against Neural Networks with Bit Flips

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.