Managing fault tolerance information in multi-agents based distributed systems

Dae Won Lee, Kwang Sik Chung, Hwa Min Lee, Sungbin Park, Young Jun Lee, Heon Chang Yu, Won Gyu Lee

    Research output: Contribution to journalArticlepeer-review

    Abstract

    In a fault tolerant system using rollback-recovery protocols, the performance of the system is degraded because of the increment of saved fault tolerance information. To avoid degrading its performance, we propose novel multi-agents based garbage-collection technique that deletes useless fault tolerance information. We define and design a garbage-collection agent for garbage-collection of fault tolerance information, a information agent for management of fault tolerant information, and a facilitator agent for communication between agents. And we propose the garbage-collection algorithm(GCA) using these agents. Our rollback recovery method is based on independent checkpointing protocol and sender based pessimistic message logging protocol. To prove the correctness of the garbage-collection algorithm, we introduce failure injection during operation and compare the domain knowledge of the proposed system using GCA with the domain knowledge of another system without GCA.

    Original languageEnglish
    Pages (from-to)104-108
    Number of pages5
    JournalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume2690
    Publication statusPublished - 2004

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'Managing fault tolerance information in multi-agents based distributed systems'. Together they form a unique fingerprint.

    Cite this