Abstract
Transformers have become the de-facto standard for natural language processing. However, dense information flows within transformers pose significant challenges for real-time and resource-constrained devices, as computational complexity grows quadratically with sequence length. To counteract such dense information flows, we propose SPARSEFLOW, a novel efficient method designed to sparsify the dense pathways of token representations across all transformer blocks. To this end, SPARSEFLOW parameterizes the information flows linking token representations to transformer blocks. These parameterized information flows are optimized to be sparse, allowing only the salient information to pass through into the blocks. To validate the efficacy of SPARSEFLOW, we conduct comprehensive experiments across diverse benchmarks (understanding and generation), scales (ranging from millions to billions), architectures (including encoders, decoders, and seq-to-seq models), and modalities (such as language-only and vision-language). The results convincingly demonstrate that sparsifying the dense information flows leads to substantial speedup gains without compromising task accuracy. For instance, SPARSEFLOW reduces computational costs by half on average, without a significant loss in accuracy.
| Original language | English |
|---|---|
| Title of host publication | Long Papers |
| Editors | Lun-Wei Ku, Andre F. T. Martins, Vivek Srikumar |
| Publisher | Association for Computational Linguistics (ACL) |
| Pages | 5937-5948 |
| Number of pages | 12 |
| ISBN (Electronic) | 9798891760943 |
| DOIs | |
| Publication status | Published - 2024 |
| Event | 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 - Bangkok, Thailand Duration: 2024 Aug 11 → 2024 Aug 16 |
Publication series
| Name | Proceedings of the Annual Meeting of the Association for Computational Linguistics |
|---|---|
| Volume | 1 |
| ISSN (Print) | 0736-587X |
Conference
| Conference | 62nd Annual Meeting of the Association for Computational Linguistics, ACL 2024 |
|---|---|
| Country/Territory | Thailand |
| City | Bangkok |
| Period | 24/8/11 → 24/8/16 |
Bibliographical note
Publisher Copyright:© 2024 Association for Computational Linguistics.
ASJC Scopus subject areas
- Language and Linguistics
- Linguistics and Language
- Computer Science Applications