TY - JOUR
T1 - Load Partitioning and Trade-Off Study for Large Matrix-Vector Computations in Multicast Bus Networks with Communication Delays
AU - Ghose, Debasish
AU - Kim, Hyoung Joong
N1 - Funding Information:
The work reported in the paper was supported by Nondirected Research Fund, Korea Research Foundation, in 1996. It was also partially supported by the Korea Telecom Research Laboratories while the first author was visiting the Kangwon National University. The authors are grateful to Mr. J. K. Anand, graduate student, Department of Aerospace Engineering, Indian Institute of Science, and Mr. H.W. Kim, graduate student, Department of Control and Instrumentation Engineering, Kangwon National University, Korea, for their expert assistance in generating the plots. The authors also appreciate the many constructive comments of the reviewers which helped in carrying out a meaningful revision of the paper.
Copyright:
Copyright 2004 Elsevier Science B.V., Amsterdam. All rights reserved.
PY - 1998/11/25
Y1 - 1998/11/25
N2 - In this paper we consider the problem of computing a large matrix-vector product in a network-based distributed computing environment comprising computers equipped with communication co-processors that may be used for communication off-loading. Communication delays, which are significant in such systems, are specifically taken into account. The important contribution of this study is to show that the optimal load partitioning, and the subsequent performance of the network, depends critically on many network parameters and load characteristics. In particular, it is shown that the size of the load plays an important role in determining the performance of the network. We consider only row-wise striping of the matrix in order to better allocate the computational burden among the processors. We derive closed-form solutions to the optimal load partitioning problem and show the existence of optimal load sharing conditions. An important and practically relevant trade-off study, from the architecture point of view, between the number of processors and the bus bandwidth is presented. Several practical load distribution strategies are considered and complete analyses for each of them is presented
AB - In this paper we consider the problem of computing a large matrix-vector product in a network-based distributed computing environment comprising computers equipped with communication co-processors that may be used for communication off-loading. Communication delays, which are significant in such systems, are specifically taken into account. The important contribution of this study is to show that the optimal load partitioning, and the subsequent performance of the network, depends critically on many network parameters and load characteristics. In particular, it is shown that the size of the load plays an important role in determining the performance of the network. We consider only row-wise striping of the matrix in order to better allocate the computational burden among the processors. We derive closed-form solutions to the optimal load partitioning problem and show the existence of optimal load sharing conditions. An important and practically relevant trade-off study, from the architecture point of view, between the number of processors and the bus bandwidth is presented. Several practical load distribution strategies are considered and complete analyses for each of them is presented
UR - http://www.scopus.com/inward/record.url?scp=0007399046&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0007399046&partnerID=8YFLogxK
U2 - 10.1006/jpdc.1998.1494
DO - 10.1006/jpdc.1998.1494
M3 - Article
AN - SCOPUS:0007399046
SN - 0743-7315
VL - 55
SP - 32
EP - 59
JO - Journal of Parallel and Distributed Computing
JF - Journal of Parallel and Distributed Computing
IS - 1
ER -