Proximal policy optimization through a deep reinforcement learning framework for remedial action schemes of VSC-HVDC

Sungyoon Song, Yungun Jung, Gilsoo Jang, Seungmin Jung

    Research output: Contribution to journalArticlepeer-review

    9 Citations (Scopus)

    Abstract

    A proximal policy optimization (PPO)-based back-to-back VSC-HVDC emergency control strategy based on multi-agent deep reinforcement learning (DRL) approach is proposed for use in an energy management system (EMS). In this scheme, an advanced DRL algorithm is proposed by implementing both PPO and a communication neural network for large power systems. The PPO modeled as intelligent agents with objective functions have shown a higher convergence performance than have existing DRL algorithms. Further, the model was demonstrated to effectively address voltage variances caused by the high penetration of renewable energy sources. By implementing PPO, the learning procedure is stabilized and made robust to continuous changes in network topology. To escalate the effectiveness of the proposed algorithm, a comprehensive case studies were conducted on an standard test systems and Korean power system considering variations in load and PV generation and a weak centralized communication environment. The results indicate that outstanding control performance and autonomously regulated bus voltage and line flows, thereby validating the effectiveness of the method.

    Original languageEnglish
    Article number109117
    JournalInternational Journal of Electrical Power and Energy Systems
    Volume150
    DOIs
    Publication statusPublished - 2023 Aug

    Bibliographical note

    Publisher Copyright:
    © 2023 Elsevier Ltd

    Keywords

    • Artificial Intelligence
    • Energy Management System
    • Proximal Policy Optimization
    • Remedial Action Schemes
    • VSC-HVDC

    ASJC Scopus subject areas

    • Energy Engineering and Power Technology
    • Electrical and Electronic Engineering

    Fingerprint

    Dive into the research topics of 'Proximal policy optimization through a deep reinforcement learning framework for remedial action schemes of VSC-HVDC'. Together they form a unique fingerprint.

    Cite this