2PTS: A two-phase task scheduling algorithm for MapReduce

Byungnam Lim, Yeeun Shim, Yon Dohn Chung

Research output: Contribution to journal, Article, peer-review

Abstract

For efficient processing of large data in a distributed system, Hadoop MapReduce schedules tasks with data locality in mind. Data locality, however, is exploited only to a limited extent, because it is pursued one node at a time without considering global optimality. In this paper, we propose a novel task scheduling algorithm that considers data locality globally. Through experiments, we show that our algorithm improves the performance of MapReduce in various situations.

Original language: English
Pages (from-to): 2377-2380
Number of pages: 4
Journal: IEICE Transactions on Information and Systems
Volume: E99D
Issue number: 9
Publication status: Published - Sep 2016

Bibliographical note

Funding Information:
This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (No. NRF-2014R1A2A1A11053657).

Publisher Copyright:
Copyright © 2016 The Institute of Electronics, Information and Communication Engineers.

Keywords

  • Data locality
  • MapReduce
  • Task scheduling algorithm

ASJC Scopus subject areas

  • Software
  • Hardware and Architecture
  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering
  • Artificial Intelligence
