跳到主要导航 跳到搜索 跳到主要内容

An Explainable Intellectual Property Protection Method for Deep Neural Networks Based on Intrinsic Features

  • Mingfu Xue*
  • , Xin Wang
  • , Yinghao Wu
  • , Shifeng Ni
  • , Leo Yu Zhang
  • , Yushu Zhang
  • , Weiqiang Liu
  • *此作品的通讯作者
  • Nanjing University of Aeronautics and Astronautics
  • Griffith University Queensland

科研成果: 期刊稿件文章同行评审

摘要

Intellectual property (IP) protection for deep neural networks (DNNs) has raised serious concerns in recent years. Most existing works embed watermarks in the DNN model for IP protection, which need to modify the model and do not consider/mention interpretability. In this article, for the first time, we propose an interpretable IP protection method for DNN based on explainable artificial intelligence. Compared with existing works, the proposed method does not modify the DNN model, and the decision of the ownership verification is interpretable. We extract the intrinsic features of the DNN model by using deep Taylor decomposition. Since the intrinsic feature is composed of unique interpretation of the model's decision, the intrinsic feature can be regarded as fingerprint of the model. If the fingerprint of a suspected model is the same as the original model, the suspected model is considered as a pirated model. Experimental results demonstrate that the fingerprints can be successfully used to verify the ownership of the model and the test accuracy of the model is not affected. Furthermore, the proposed method is robust to fine-tuning attack, pruning attack, watermark overwriting attack, and adaptive attack.

源语言英语
页(从-至)4649-4659
页数11
期刊IEEE Transactions on Artificial Intelligence
5
9
DOI
出版状态已出版 - 2024

指纹

探究 'An Explainable Intellectual Property Protection Method for Deep Neural Networks Based on Intrinsic Features' 的科研主题。它们共同构成独一无二的指纹。

引用此