Combating Adversarial Misspellings with Robust Word Recognition

Danish Pruthi; Bhuwan Dhingra; Zachary C. Lipton

2019 ACL ACL 2019

Combating Adversarial Misspellings with Robust Word Recognition

Abstract

AbstractTo combat adversarial spelling mistakes, we propose placing a word recognition model in front of the downstream classifier. Our word recognition models build upon the RNN semi-character architecture, introducing several new backoff strategies for handling rare and unseen words. Trained to recognize words corrupted by random adds, drops, swaps, and keyboard mistakes, our method achieves 32% relative (and 3.3% absolute) error reduction over the vanilla semi-character model. Notably, our pipeline confers robustness on the downstream classifier, outperforming both adversarial training and off-the-shelf spell checkers. Against a BERT model fine-tuned for sentiment analysis, a single adversarially-chosen character attack lowers accuracy from 90.3% to 45.8%. Our defense restores accuracy to 75%. Surprisingly, better word recognition does not always entail greater robustness. Our analysis reveals that robustness also depends upon a quantity that we denote the sensitivity.

🧭 Keyword Pioneer — adversarial defense

🐣 Hot Topic Early Bird — adversarial robustness

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

Authors

Danish Pruthi , Bhuwan Dhingra , Zachary C. Lipton

Topics

Machine Learning > Learning Types > Adversarial Learning Artificial Intelligence > Core AI > Adversarial Learning Artificial Intelligence > Core AI > Language Artificial Intelligence > Core AI > Natural Language Processing

Keywords

adversarial learning adversarial robustness sentiment analysis text classification word recognition adversarial defense semi-character model spelling correction spell checking spell checker character attack

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019