TY - JOUR
T1 - Epsim
T2 - A scalable and parallel marssx86 simulator with exploiting epoch-based execution
AU - Kim, Minseong
AU - Park, Chanhyun
AU - Han, Miseon
AU - Han, Youngsun
AU - Kim, Seon Wook
N1 - Funding Information:
This work was partly supported by the IT R&D program of MOTIE/KEIT [10052653, Development of processing in memory architecture and parallel processing for data bounding applications] and the SK hynix Inc. research project.
Publisher Copyright:
© 2018 IEEE.
PY - 2019
Y1 - 2019
N2 - In general, a detailed modeling and evaluation of computer architectures make a cycle-accurate simulator necessary. As the architectures become increasingly complex for parallel, cloud, and neural computing, nowadays, the complexity of the simulator grows rapidly, and thus its execution is too slow or infeasible for practical use. In order to alleviate the problem, many previous studies have focused on reducing the simulation time in a variety of ways such as using sampling methods, adding hardware accelerators, and so on. In this paper, we propose a new parallel simulation framework, called Epoch-based Parallel SIMulator, to obtain scalable speedup with large number of cores. The framework is based on a well-known cycle-accurate full-system simulator, MARSSx86. From the simulator, we build an epoch, that is an execution interval, where the architectural simulation by PTLSim does not involve any interaction with QEMU. Therefore, we can simulate epochs independently, i.e., execute multiple epochs completely in parallel by PTLSim with their live-in data. Our performance evaluation shows that we achieve 12.8× speed on average with 16-core parallel simulation from the SPEC CPU2006 benchmarks and the PARSEC benchmarks, providing the performance scalability.
AB - In general, a detailed modeling and evaluation of computer architectures make a cycle-accurate simulator necessary. As the architectures become increasingly complex for parallel, cloud, and neural computing, nowadays, the complexity of the simulator grows rapidly, and thus its execution is too slow or infeasible for practical use. In order to alleviate the problem, many previous studies have focused on reducing the simulation time in a variety of ways such as using sampling methods, adding hardware accelerators, and so on. In this paper, we propose a new parallel simulation framework, called Epoch-based Parallel SIMulator, to obtain scalable speedup with large number of cores. The framework is based on a well-known cycle-accurate full-system simulator, MARSSx86. From the simulator, we build an epoch, that is an execution interval, where the architectural simulation by PTLSim does not involve any interaction with QEMU. Therefore, we can simulate epochs independently, i.e., execute multiple epochs completely in parallel by PTLSim with their live-in data. Our performance evaluation shows that we achieve 12.8× speed on average with 16-core parallel simulation from the SPEC CPU2006 benchmarks and the PARSEC benchmarks, providing the performance scalability.
KW - Architectural simulation
KW - epoch-based execution
KW - parallel simulation
UR - http://www.scopus.com/inward/record.url?scp=85058873432&partnerID=8YFLogxK
U2 - 10.1109/ACCESS.2018.2886630
DO - 10.1109/ACCESS.2018.2886630
M3 - Article
AN - SCOPUS:85058873432
SN - 2169-3536
VL - 7
SP - 4782
EP - 4794
JO - IEEE Access
JF - IEEE Access
M1 - 8579541
ER -