Abstract
The emerging transformers suffer from the memory bottleneck because of their low data locality and large data sizes, and we are actively studying Processing in Memory (PIM), which computes inside memory, to overcome the bottleneck. However, as their model parameters get bigger, a single PIM device would be insufficient because of its limited memory capacity and computing resources. In this paper, we develop a low-overhead data communication method between PIMs on a PCIe-based multi-PIM platform. We adopt an XDMA-based PIM-to-PIM (P2P) direct data communication mechanism to eliminate redundant data movement between CPU and PIMs. Our P2P achieves 1.69x, 1.70x, 1.61x, and 1.63x speedups compared to the DMA using the system memory as a buffer when transferring 16MB, 32MB, 64MB, and 128MB data, respectively.
Original language | English |
---|---|
Title of host publication | 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9798350371888 |
DOIs | |
Publication status | Published - 2024 |
Event | 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 - Taipei, Taiwan, Province of China Duration: 2024 Jan 28 → 2024 Jan 31 |
Publication series
Name | 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 |
---|
Conference
Conference | 2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 |
---|---|
Country/Territory | Taiwan, Province of China |
City | Taipei |
Period | 24/1/28 → 24/1/31 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- Direct Memory Access
- Multi-PIM Platform
- Peer-to-Peer Communication
- Processing-in-Memory
- XDMA
ASJC Scopus subject areas
- Computer Networks and Communications
- Computer Science Applications
- Hardware and Architecture
- Information Systems
- Energy Engineering and Power Technology
- Electrical and Electronic Engineering