Zobrazeno 1 - 10
of 139
pro vyhledávání: '"Benkner, Siegfried"'
Autor:
Alves, João N. F., Moustafa, Samir, Benkner, Siegfried, Francisco, Alexandre P., Gansterer, Wilfried N., Russo, Luís M. S.
The inference and training stages of Graph Neural Networks (GNNs) are often dominated by the time required to compute a long sequence of matrix multiplications between the sparse graph adjacency matrix and its embedding. To accelerate these stages, w
Externí odkaz:
http://arxiv.org/abs/2409.02208
Autor:
Petrovič, Filip, Střelák, David, Hozzová, Jana, Oľha, Jaroslav, Trembecký, Richard, Benkner, Siegfried, Filipovič, Jiří
Publikováno v:
Petrovic et al., A benchmark set of highly-efficient CUDA and OpenCL kernels and its dynamic autotuning with Kernel Tuning Toolkit. In Future Generation Computer Systems, Vol. 108, pages 161-177. 2020
Autotuning of performance-relevant source-code parameters allows to automatically tune applications without hard coding optimizations and thus helps with keeping the performance portable. In this paper, we introduce a benchmark set of ten autotunable
Externí odkaz:
http://arxiv.org/abs/1910.08498
Autor:
Dokulil, Jiri, Benkner, Siegfried
We present several proposals for extending the Open Community Runtime (OCR) specification. The extension are identifiers with local validity, which use the concept of futures to provide OCR implementations more optimization opportunities, labeled GUI
Externí odkaz:
http://arxiv.org/abs/1509.03161
Autor:
Amaral, Vasco, Norberto, Beatriz, Goulão, Miguel, Aldinucci, Marco, Benkner, Siegfried, Bracciali, Andrea, Carreira, Paulo, Celms, Edgars, Correia, Luís, Grelck, Clemens, Karatza, Helen, Kessler, Christoph, Kilpatrick, Peter, Martiniano, Hugo, Mavridis, Ilias, Pllana, Sabri, Respício, Ana, Simão, José, Veiga, Luís, Visa, Ari
Publikováno v:
In Parallel Computing March 2020 91
Autor:
Dokulil, Jiri, Bajrovic, Enes, Benkner, Siegfried, Pllana, Sabri, Sandrieser, Martin, Bachmayer, Beverly
The introduction of Intel(R) Xeon Phi(TM) coprocessors opened up new possibilities in development of highly parallel applications. The familiarity and flexibility of the architecture together with compiler support integrated into the Intel C++ Compos
Externí odkaz:
http://arxiv.org/abs/1211.5530
Autor:
Dokulil, Jiri1 (AUTHOR), Benkner, Siegfried1 (AUTHOR) siegfried.benkner@univie.ac.at
Publikováno v:
Journal of Supercomputing. Jul2022, Vol. 78 Issue 10, p12344-12379. 36p.
Autor:
Borckholder, Chris, Heinzel, Andreas, Kaniovskyi, Yuriy, Benkner, Siegfried, Lukas, Arno, Mayer, Bernd
Publikováno v:
In Procedia Computer Science 2013 23:24-35
Autor:
Dokulil, Jiri, Bajrovic, Enes, Benkner, Siegfried, Pllana, Sabri, Sandrieser, Martin, Bachmayer, Beverly
Publikováno v:
In Procedia Computer Science 2013 18:2508-2511
Publikováno v:
In Parallel Computing 2012 38(1):52-65
Publikováno v:
European Technology Platform for High Performance Computing (ETP4HPC). 2021, ⟨10.5281/zenodo.5549731⟩
White paper; International audience; As HPC hardware continues to evolve and diversify and workloads become more dynamic and complex, applications need to be expressed in a way that facilitates high performance across a range of hardware and situatio
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::d89b01dc7a6d12b6438b38775462aa28
https://hal.inria.fr/hal-03368013/document
https://hal.inria.fr/hal-03368013/document