Effective Performance Modeling and Domain-Specific Compiler Optimization of CNNs for GPUs

Autor: Yufan Xu, Qiwei Yuan, Erik Curtis Barton, Rui Li, P. Sadayappan, Aravind Sukumaran-Rajam
Rok vydání: 2022
Zdroj: Proceedings of the International Conference on Parallel Architectures and Compilation Techniques.
DOI: 10.1145/3559009.3569674
Databáze: OpenAIRE