MEW: Evading Ownership Detection Against Deep Learning Models

  • Wenxuan Yin
  • , Haifeng Qian*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Training deep neural network (DNNs) requires massive computing resources and data, hence the trained models belong to the model owners’ Intellectual Property (IP), and it is very important to defend against the model stealing attack. Recently, a well-known approach named Dataset Inference (DI) claimed that by measuring the distance from the sample to the decision boundary, it can be determined whether the theft has occurred. In this paper, we show that DI is not enough for IP protection. To demonstrate this, we propose a new system called MEW, which combines the Model Inversion (MI) attack and Elastic Weight Consolidation (EWC) to evade the detection of DI. We first use the pre-trained adversary model to generate a data pool and adaptively select samples to approximate the Fisher Information Matrix of the adversary model. Then we use an adaptation of EWC to slightly fine-tune the adversary model which moves it decision boundary slightly. Our empirical results demonstrate that the adversary model evaded the DI detection with 40 samples. We also lay out the limitations of MEW and discuss them at last.

Original languageEnglish
Title of host publicationNeural Information Processing - 29th International Conference, ICONIP 2022, Proceedings
EditorsMohammad Tanveer, Sonali Agarwal, Seiichi Ozawa, Asif Ekbal, Adam Jatowt
PublisherSpringer Science and Business Media Deutschland GmbH
Pages127-136
Number of pages10
ISBN (Print)9789819916443
DOIs
StatePublished - 2023
Event29th International Conference on Neural Information Processing, ICONIP 2022 - Virtual, Online
Duration: 22 Nov 202226 Nov 2022

Publication series

NameCommunications in Computer and Information Science
Volume1793 CCIS
ISSN (Print)1865-0929
ISSN (Electronic)1865-0937

Conference

Conference29th International Conference on Neural Information Processing, ICONIP 2022
CityVirtual, Online
Period22/11/2226/11/22

Keywords

  • Dataset inference
  • Deep learning
  • Model Stealing attack

Fingerprint

Dive into the research topics of 'MEW: Evading Ownership Detection Against Deep Learning Models'. Together they form a unique fingerprint.

Cite this