PARSE: An Efficient Search Method for Black-box Adversarial Text Attacks

Pengwei Zhan; Chao Zheng; Jing Yang; Yuxiang Wang; Liming Wang; Yang Wu; Yunjian Zhang

2022 COLING COLING 2022

PARSE: An Efficient Search Method for Black-box Adversarial Text Attacks

Abstract

AbstractNeural networks are vulnerable to adversarial examples. The adversary can successfully attack a model even without knowing model architecture and parameters, i.e., under a black-box scenario. Previous works on word-level attacks widely use word importance ranking (WIR) methods and complex search methods, including greedy search and heuristic algorithms, to find optimal substitutions. However, these methods fail to balance the attack success rate and the cost of attacks, such as the number of queries to the model and the time consumption. In this paper, We propose PAthological woRd Saliency sEarch (PARSE) that performs the search under dynamic search space following the subarea importance. Experiments show that PARSE can achieve comparable attack success rates to complex search methods while saving numerous queries and time, e.g., saving at most 74% of queries and 90% of time compared with greedy search when attacking the examples from Yelp dataset. The adversarial examples crafted by PARSE are also of high quality, highly transferable, and can effectively improve model robustness in adversarial training.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — word importance ranking

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Pengwei Zhan , Chao Zheng , Jing Yang , Yuxiang Wang , Liming Wang , Yang Wu , Yunjian Zhang

Topics

Artificial Intelligence > Core AI > AI Safety Machine Learning > Learning Types > Adversarial Learning Natural Language Processing > Applications > Text Classification Natural Language Processing > Applications > Text Generation Artificial Intelligence > Core AI > Adversarial Learning

Keywords

model robustness query efficiency greedy search black-box attack adversarial example word substitution word importance ranking black-box adversarial attack adversarial text attack black-box scenario neural network vulnerability

Download PDF

Related papers

MulZDG: Multilingual Code-Switching Framework for Zero-shot Dialogue Generation 2022

The Role of Context and Uncertainty in Shallow Discourse Parsing 2022

SelfMix: Robust Learning against Textual Label Noise with Self-Mixup Training 2022

Complicate Then Simplify: A Novel Way to Explore Pre-trained Models for Text Classification 2022

Repo4QA: Answering Coding Questions via Dense Retrieval on GitHub Repositories 2022