Hiding communication latency and coherence overhead in software DSMs
Autor: | M. De Maria, Leonidas Kontothanassis, Raquel Giffoni Pinto, Ricardo Bianchini, Claudio Luis de Amorim, M. Abud |
---|---|
Rok vydání: | 1996 |
Předmět: |
Hardware_MEMORYSTRUCTURES
Reduced instruction set computing Workstation CPU cache business.industry Computer science TreadMarks computer.software_genre Computer Graphics and Computer-Aided Design law.invention Software Shared memory law Embedded system Virtual memory Systems architecture Operating system General Earth and Planetary Sciences business computer General Environmental Science |
Zdroj: | ASPLOS |
ISSN: | 0163-5980 |
DOI: | 10.1145/248208.237185 |
Popis: | In this paper we propose the use of a PCI-based programmable protocol controller for hiding communication and coherence overheads in software DSMs. Our protocol controller provides three different types of overhead tolerance: a) moving basic communication and coherence tasks away from computation processors; b) prefetching of diffs; and c) generating and applying diffs with hardware assistance. We evaluate the isolated and combined impact of these features on the performance of TreadMarks. We also compare performance against two versions of the Shrimp-based AURC protocol. Using detailed execution-driven simulations of a 16-node network of workstations, we show that the greatest performance benefits provided by our protocol controller come from our hardware-supported diffs. Reducing the burden of communication and coherence transactions on the computation processor is also beneficial but to a smaller extent. Prefetching is not always profitable. Our results show that our protocol controller can improve running time performance by up to 50% for TreadMarks, which means that it can double the TreadMarks speedups. The overlapping implementation of TreadMarks performs as well or better than AURC for 5 of our 6 applications. We conclude that the simple hardware support we propose allows for the implementation of high-performance software DSMs at low cost. Based on this conclusion, we are building the NCP 2 parallel system at COPPE/UFRJ. |
Databáze: | OpenAIRE |
Externí odkaz: |