OpenCL Optimization and Best Practices for Qualcomm Adreno GPUs
Autor: | Jay Yun, Hongqiang Wang, Alex Bourd |
---|---|
Rok vydání: | 2018 |
Předmět: |
Profiling (computer programming)
CUDA Computer architecture Computer science Best practice 0202 electrical engineering electronic engineering information engineering Graphics processing unit Adreno 020201 artificial intelligence & image processing 02 engineering and technology General-purpose computing on graphics processing units Original equipment manufacturer 020202 computer hardware & architecture |
Zdroj: | IWOCL |
DOI: | 10.1145/3204919.3204935 |
Popis: | As the industry's leading mobile graphics processing unit (GPU) core, Adreno™ in Qualcomm®'s Snapdragon™ SOCs has supported the OpenCL™ standard since its A3x family and all through its A4x, A5x families, and the latest A6x family. How to effectively program and optimize OpenCL applications on Adreno OpenCL is of great interest for many OEMs as well as 3rd party app developers. This paper provides a high level overview of Adreno's compute architecture, introduces Adreno's OpenCL support and general guidance and good practices on programming, optimization and profiling, and illustrates how to apply them and achieve good performance through two use case studies. |
Databáze: | OpenAIRE |
Externí odkaz: |