Dynamic access ordering for streamed computations
Author: | Wm Wulf, M.H. Salinas, Robert H. Klenke, Dee A. B. Weikle, Sally A. McKee, James H. Aylor, S.I. Hong |
---|---|
Year of publication: | 2000 |
Subject: |
Random access memory
Hardware_MEMORYSTRUCTURES; business.industry; Computer science; Access method; Memory bandwidth; Parallel computing; Memory controller; CAS latency; Theoretical Computer Science; Vector processor; Computational Theory and Mathematics; Hardware and Architecture; Embedded system; Memory architecture; Bandwidth (computing); Locality of reference; business; Software; DRAM; DRAM memory |
Source: | IEEE Transactions on Computers. 49:1255-1271 |
ISSN: | 0018-9340 |
DOI: | 10.1109/12.895941 |
Description: | Memory bandwidth is rapidly becoming the limiting performance factor for many applications, particularly for streaming computations such as scientific vector processing or multimedia (de)compression. Although these computations lack the temporal locality of reference that makes traditional caching schemes effective, they have predictable access patterns. Since most modern DRAM components support modes that make it possible to perform some access sequences faster than others, the predictability of the stream accesses makes it possible to reorder them to get better memory performance. We describe a Stream Memory Controller (SMC) system that combines compile-time detection of streams with execution-time selection of the access order and issue. The SMC effectively prefetches read-streams, buffers write-streams, and reorders the accesses to exploit the existing memory bandwidth as much as possible. Unlike most other hardware prefetching or stream buffer designs, this system does not increase bandwidth requirements. The SMC is practical to implement, using existing compiler technology and requiring only a modest amount of special purpose hardware. We present simulation results for fast-page mode and Rambus DRAM memory systems and we describe a prototype system with which we have observed performance improvements for inner loops by factors of 13 over traditional access methods. |
Database: | OpenAIRE |
External link: |
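
The reordering idea in the description can be illustrated with a toy model. The sketch below is not the SMC scheduling policy from the paper; it assumes a single-bank DRAM that keeps one page open, and the page size, hit cost, miss cost, and FIFO depth are all hypothetical values chosen for illustration. It compares the order a processor would naturally issue for y[i] = a*x[i] + y[i] against an order that services each stream in bursts, as a controller with per-stream FIFO buffers could.

```python
# Toy model of access reordering for streams (all constants are assumptions).
PAGE_WORDS = 512   # assumed words per DRAM page
HIT_CYCLES = 1     # assumed cost of accessing the currently open page
MISS_CYCLES = 4    # assumed cost of opening a different page


def cost(addresses):
    """Total cycles for a single-bank DRAM that keeps one page open."""
    open_page = None
    cycles = 0
    for addr in addresses:
        page = addr // PAGE_WORDS
        cycles += HIT_CYCLES if page == open_page else MISS_CYCLES
        open_page = page
    return cycles


def natural_order(x_base, y_base, n):
    """Order a processor would issue: read x[i], read y[i], write y[i]."""
    order = []
    for i in range(n):
        order += [x_base + i, y_base + i, y_base + i]
    return order


def reordered(x_base, y_base, n, fifo_depth):
    """Service each stream in bursts of up to fifo_depth elements."""
    order = []
    for start in range(0, n, fifo_depth):
        block = range(start, min(start + fifo_depth, n))
        order += [x_base + i for i in block]  # prefetch read-stream x
        order += [y_base + i for i in block]  # prefetch read-stream y
        order += [y_base + i for i in block]  # drain buffered writes to y
    return order


n = 1024
natural = natural_order(0, 4096, n)
bursty = reordered(0, 4096, n, fifo_depth=32)

print("natural order cycles:", cost(natural))
print("reordered cycles:   ", cost(bursty))
```

Both orders contain exactly the same accesses, so the reordered version requests no extra bandwidth; it only reduces how often the model switches between pages. Real DRAM parts have multiple banks and more detailed timing, so the ratio printed here says nothing about the speedups reported in the paper.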