Autor: |
Chatterjee, S., Gilbert, J.R., Long, F.J.E., Schreiber, R., Teng, S.H. |
Zdroj: |
Journal of Parallel and Distributed Computing; April 1995, Vol. 26 Issue: 1 p72-84, 13p |
Abstrakt: |
Generating local addresses and communication sets is an important issue in distributed-memory implementations of data-parallel languages such as High Performance Fortran. We demonstrate a storage scheme for an array Aaffinely aligned to a template that is distributed across pprocessors with a cyclic(k) distribution that does not waste any storage, and show that, under this storage scheme, the local memory access sequence of any processor for a computation involving the regular section A(ℓ:h:s) is characterized by a finite state machine of at most kstates. We present fast algorithms for computing the essential information about these state machines, and we extend the framework to handle multidimensional arrays. We also show how to generate communication sets using the state machine approach. Performance results show that this solution requires very little runtime overhead and acceptable preprocessing time. |
Databáze: |
Supplemental Index |
Externí odkaz: |
|