Accelerate Black-Box Attack with White-Box Prior Knowledge

  • Jinghui Cai
  • , Boyang Wang
  • , Xiangfeng Wang*
  • , Bo Jin
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Scopus citations

Abstract

We propose an efficient adversarial attack method in the black-box setting. Our Multi-model Efficient Query Attack (MEQA) method takes advantage of the prior knowledge on different models’ relationship to guide the construction of black-box adversarial instances. The MEQA method employs several gradients from different white-box attack models and further the “best” one is selected to replace the gradient of black-box model in each step. The gradient composed by different model gradients will lead a significant loss to the black-box model on these adversarial pictures and then cause misclassification. Our key motivation is to estimate the black-box model with several existing white-box models, which can significantly increase the efficiency from the perspectives of both query sampling and calculating. Compared with gradient estimation based black-box adversarial attack methods, our MEQA method reduces the number of queries from 10000 to 40, which greatly accelerates the black-box adversarial attack. Compared with the zero query black-box adversarial attack method, which also called transfer attack method, MEQA boosts the attack success rate by 30%. We evaluate our method on several black-box models and achieve remarkable performance which proves that MEQA can serve as a baseline method for fast and effective black-box adversarial attacks.

Original languageEnglish
Title of host publicationIntelligence Science and Big Data Engineering. Big Data and Machine Learning - 9th International Conference, IScIDE 2019, Proceedings, Part 2
EditorsZhen Cui, Jinshan Pan, Shanshan Zhang, Liang Xiao, Jian Yang
PublisherSpringer
Pages394-405
Number of pages12
ISBN (Print)9783030362034
DOIs
StatePublished - 2019
Event9th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2019 - Nanjing, China
Duration: 17 Oct 201920 Oct 2019

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume11936 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference9th International Conference on Intelligence Science and Big Data Engineering, IScIDE 2019
Country/TerritoryChina
CityNanjing
Period17/10/1920/10/19

Keywords

  • Efficient black-box attack
  • Gradient estimation
  • Model robustness
  • Transfer attack

Fingerprint

Dive into the research topics of 'Accelerate Black-Box Attack with White-Box Prior Knowledge'. Together they form a unique fingerprint.

Cite this