Abstract
Consistency training regularizes a model by enforcing predictions of original and perturbed inputs to be similar. Previous studies have proposed various augmentation methods for the perturbation but are limited in that they are agnostic to the training model. Thus, the perturbed samples may not aid in regularization due to their ease of classification from the model. In this context, we propose an augmentation method of adding a discrete noise that would incur the highest divergence between predictions. This virtual adversarial discrete noise obtained by replacing a small portion of tokens while keeping original semantics as much as possible efficiently pushes a training model's decision boundary. Experimental results show that our proposed method outperforms other consistency training baselines with text editing, paraphrasing, or a continuous noise on semi-supervised text classification tasks and a robustness benchmark.
| Original language | English |
|---|---|
| Title of host publication | NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics |
| Subtitle of host publication | Human Language Technologies, Proceedings of the Conference |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 5646-5656 |
| Number of pages | 11 |
| ISBN (Electronic) | 9781955917711 |
| DOIs | |
| Publication status | Published - 2022 |
| Event | 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022 - Hybrid, Seattle, United States Duration: 2022 Jul 10 → 2022 Jul 15 |
Publication series
| Name | NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference |
|---|
Conference
| Conference | 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2022 |
|---|---|
| Country/Territory | United States |
| City | Hybrid, Seattle |
| Period | 22/7/10 → 22/7/15 |
Bibliographical note
Publisher Copyright:© 2022 Association for Computational Linguistics.
ASJC Scopus subject areas
- Computer Networks and Communications
- Hardware and Architecture
- Information Systems
- Software
Fingerprint
Dive into the research topics of 'Consistency Training with Virtual Adversarial Discrete Perturbation'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS