Stealthy Imitation: Reward-guided Environment-free Policy Stealing

Zhuang, Zhixiong; Nicolae, Maria-Irina; Fritz, Mario

Computer Science > Cryptography and Security

arXiv:2405.07004 (cs)

[Submitted on 11 May 2024]

Title:Stealthy Imitation: Reward-guided Environment-free Policy Stealing

Authors:Zhixiong Zhuang, Maria-Irina Nicolae, Mario Fritz

View PDF HTML (experimental)

Abstract:Deep reinforcement learning policies, which are integral to modern control systems, represent valuable intellectual property. The development of these policies demands considerable resources, such as domain expertise, simulation fidelity, and real-world validation. These policies are potentially vulnerable to model stealing attacks, which aim to replicate their functionality using only black-box access. In this paper, we propose Stealthy Imitation, the first attack designed to steal policies without access to the environment or knowledge of the input range. This setup has not been considered by previous model stealing methods. Lacking access to the victim's input states distribution, Stealthy Imitation fits a reward model that allows to approximate it. We show that the victim policy is harder to imitate when the distribution of the attack queries matches that of the victim. We evaluate our approach across diverse, high-dimensional control tasks and consistently outperform prior data-free approaches adapted for policy stealing. Lastly, we propose a countermeasure that significantly diminishes the effectiveness of the attack.

Comments:	Accepted at ICML 2024. Project page: this https URL
Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2405.07004 [cs.CR]
	(or arXiv:2405.07004v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2405.07004

Submission history

From: Zhixiong Zhuang [view email]
[v1] Sat, 11 May 2024 12:55:10 UTC (6,996 KB)

Computer Science > Cryptography and Security

Title:Stealthy Imitation: Reward-guided Environment-free Policy Stealing

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Computer Science > Cryptography and Security

Title:Stealthy Imitation: Reward-guided Environment-free Policy Stealing

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.