Low Overhead PIM-to-PIM Communication on PCIe-based Multi-PIM Platforms for Executing Large-Scale AI Models

Mun Seong Park, Seok Young Kim, Seon Wook Kim

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    The emerging transformers suffer from the memory bottleneck because of their low data locality and large data sizes, and we are actively studying Processing in Memory (PIM), which computes inside memory, to overcome the bottleneck. However, as their model parameters get bigger, a single PIM device would be insufficient because of its limited memory capacity and computing resources. In this paper, we develop a low-overhead data communication method between PIMs on a PCIe-based multi-PIM platform. We adopt an XDMA-based PIM-to-PIM (P2P) direct data communication mechanism to eliminate redundant data movement between CPU and PIMs. Our P2P achieves 1.69x, 1.70x, 1.61x, and 1.63x speedups compared to the DMA using the system memory as a buffer when transferring 16MB, 32MB, 64MB, and 128MB data, respectively.

    Original languageEnglish
    Title of host publication2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    ISBN (Electronic)9798350371888
    DOIs
    Publication statusPublished - 2024
    Event2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 - Taipei, Taiwan, Province of China
    Duration: 2024 Jan 282024 Jan 31

    Publication series

    Name2024 International Conference on Electronics, Information, and Communication, ICEIC 2024

    Conference

    Conference2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
    Country/TerritoryTaiwan, Province of China
    CityTaipei
    Period24/1/2824/1/31

    Bibliographical note

    Publisher Copyright:
    © 2024 IEEE.

    Keywords

    • Direct Memory Access
    • Multi-PIM Platform
    • Peer-to-Peer Communication
    • Processing-in-Memory
    • XDMA

    ASJC Scopus subject areas

    • Computer Networks and Communications
    • Computer Science Applications
    • Hardware and Architecture
    • Information Systems
    • Energy Engineering and Power Technology
    • Electrical and Electronic Engineering

    Fingerprint

    Dive into the research topics of 'Low Overhead PIM-to-PIM Communication on PCIe-based Multi-PIM Platforms for Executing Large-Scale AI Models'. Together they form a unique fingerprint.

    Cite this