Stable HPC cluster management scheme through performance evaluation

Jun Weon Yoon, Tae Yeong Hong, Chan Yeol Park, Heon Chang Yu

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Citation (Scopus)

Abstract

HPC is representative tools for performing large-scale scientific calculation both academia and industry. Tachyon is a high-performance parallel computing system which constructed based on SUN Blade X6275. It composed of 3,200 computing nodes and infra-facilities. Also, this machine works with various software stacks such as file system, archive manager, compiler, debugger, parallel tools, etc. In this paper, we handle the requirements and requisites to build and manage an HPC cluster environment. In addition, we analyzed the history of batch job which include information performed by scheduler. By doing so, we are able to gauge the needs and performance of the next system to be introduced.

Original languageEnglish
Title of host publicationComputer Science and Its Applications - Ubiquitous Information Technologies
EditorsHwa Young Jeong, Ivan Stojmenovic, James J. Park, Gangman Yi
PublisherSpringer Verlag
Pages1017-1023
Number of pages7
ISBN (Electronic)9783662454015
DOIs
Publication statusPublished - 2015
Event6th FTRA International Conference on Computer Science and its Applications, CSA 2014 - Guam, United States
Duration: 2014 Dec 172014 Dec 19

Publication series

NameLecture Notes in Electrical Engineering
Volume330
ISSN (Print)1876-1100
ISSN (Electronic)1876-1119

Other

Other6th FTRA International Conference on Computer Science and its Applications, CSA 2014
Country/TerritoryUnited States
CityGuam
Period14/12/1714/12/19

Keywords

  • Benchmark
  • Cluster management
  • HPC
  • Scheduler
  • Supercomputer

ASJC Scopus subject areas

  • Industrial and Manufacturing Engineering

Fingerprint

Dive into the research topics of 'Stable HPC cluster management scheme through performance evaluation'. Together they form a unique fingerprint.

Cite this