EXOCHI
Autor: | Hong Jiang, Gautham N. Chinya, Guei-Yuan Lueh, Jamison D. Collins, Nick Y. Yang, Hong Wang, Perry Wang, Xinmin Tian, Milind B. Girkar |
---|---|
Rok vydání: | 2007 |
Předmět: |
Multi-core processor
Speedup Computer science Multiprocessing Parallel computing computer.software_genre Computer Graphics and Computer-Aided Design Instruction set Inline assembler Shared memory High-level programming language Multithreading Virtual memory Operating system Compiler computer Execution model Software Debugger |
Zdroj: | PLDI |
ISSN: | 1558-1160 0362-1340 |
Popis: | Future mainstream microprocessors will likely integrate specialized accelerators, such as GPUs, onto a single die to achieve better performance and power efficiency. However, it remains a keen challenge to program such a heterogeneous multicore platform, since these specialized accelerators feature ISAs and functionality that are significantly different from the general purpose CPU cores. In this paper, we present EXOCHI: (1) Exoskeleton Sequencer (EXO), an architecture to represent heterogeneous acceleratorsas ISA-based MIMD architecture resources, and a shared virtual memory heterogeneous multithreaded program execution model that tightly couples specialized accelerator cores with generalpurpose CPU cores, and (2) C for Heterogeneous Integration (CHI), an integrated C/C++ programming environment that supports accelerator-specific inline assembly and domain-specific languages. The CHI compiler extends the OpenMP pragma for heterogeneous multithreading programming, and produces a single fat binary with code sections corresponding to different instruction sets. The runtime can judiciously spread parallel computation across the heterogeneous cores to optimize performance and power. We have prototyped the EXO architecture on a physical heterogeneous platform consisting of an Intel® Core™ 2 Duo processor and an 8-core 32-thread Intel® Graphics Media Accelerator X3000. In addition, we have implemented the CHI integrated programming environment with the Intel® C++ Compiler, runtime toolset, and debugger. On the EXO prototype system, we have enhanced a suite of production-quality media kernels for video and image processing to utilize the accelerator through the CHI programming interface, achieving significant speedup (1.41X to10.97X) over execution on the IA32 CPU alone. |
Databáze: | OpenAIRE |
Externí odkaz: |