Zobrazeno 1 - 10
of 43
pro vyhledávání: '"Mark Gates"'
Publikováno v:
2022 IEEE/ACM Workshop on Latest Advances in Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH).
Autor:
Mark Gates, Asim YarKhan, Dalal Sukkari, Kadir Akbudak, Sebastien Cayrols, Daniel Bielich, Ahmad Abdelfattah, Mohammed Al Farhan, Jack Dongarra
Publikováno v:
Kadir Akbudak
Autor:
Jakub Kurzak, Mark Gates, Nicholas J. Higham, Azzam Haidar, Jack Dongarra, Stanimire Tomov, Timothy B. Costa, Ahmad Abdelfattah, Mawussi Zounon, Sven Hammarling, Piotr Luszczek
Publikováno v:
ACM Transactions on Mathematical Software. 47:1-23
This article describes a standard API for a set of Batched Basic Linear Algebra Subprograms (Batched BLAS or BBLAS). The focus is on many independent BLAS operations on small matrices that are grouped together and processed by a single routine, calle
Autor:
Erin Carson, Mark Gates, Terry Cojean, Ahmad Abdelfattah, Yaohung M. Tsai, Srikara Pranesh, Jennifer A. Loe, Barry Smith, Stanimire Tomov, Tobias Ribizel, Xiaoye S. Li, Jack Dongarra, Hartwig Anzt, Nicholas J. Higham, Kasia Swirydowicz, Erik G. Boman, Alyson Fox, Siva Rajamanickam, Piotr Luszczek, Ulrike Meier Yang, Stephen Thomas
Publikováno v:
The International Journal of High Performance Computing Applications. 35:344-369
The efficient utilization of mixed-precision numerical linear algebra algorithms can offer attractive acceleration to scientific computing applications. Especially with the hardware integration of low-precision special-function units designed for mac
Autor:
Ahmad Abdelfattah, Dalal Sukkari, Robert Rosenberg, Jack Dongarra, Mark Gates, Azzam Haidar, Mohammed Al Farhan, Stanimire Tomov
Publikováno v:
The International Journal of High Performance Computing Applications. 34:645-658
With the acquisition and widespread use of more resources that rely on accelerator/wide vector–based computing, there has been a strong demand for science and engineering applications to take advantage of these latest assets. This, however, has bee
Autor:
James Demmel, Jack Dongarra, Mark Gates, Greg Henry, Julien Langou, Xiaoye Li, Piotr Luszczek, Weslley Pereira, Jason Riedy, Cindy Rubio-Gonzalez
Numerical exceptions, which may be caused by overflow, operations like division by 0 or sqrt(-1), or convergence failures, are unavoidable in many cases, in particular when software is used on unforeseen and difficult inputs. As more aspects of socie
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::92d989e36f316b438bd1ed6d0f956dd7
Autor:
Jennifer A. Loe, Xiaoye Li, Y Liu, Barry Smith, Stephen Thomas, S Kruger, A Ayala, Kasia Swirydowicz, Jack Dongarra, Hartwig Anzt, Robert D. Falgout, Ulrike Meier Yang, Daniel Osei-Kuffuor, Terry Cojean, Y Tsai, Erin Carson, N Higham, Mark Gates, Ahmad Abdelfattah, T Gruetzmacher, Tobias Ribizel, Ichitaro Yamazaki, S Cayrols, N Lindquist, Piotr Luszczek, Pratik Nayak, Sri Pranesh, Siva Rajamanickam, Erik G. Boman, Stan Tomov
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::32cb9fd6a201bb32eb1b6658edcb54fc
https://doi.org/10.2172/1814447
https://doi.org/10.2172/1814447
Autor:
Mark Gates, Panruo Wu, Maksims Abalenkovs, Azzam Haidar, David Stevens, Negin Bagherpour, Piotr Luszczek, Ichitaro Yamazaki, Jack Dongarra, Jakub Kurzak, Asim YarKhan, Samuel D. Relton, Mawussi Zounon, Jakub Šístek, Sven Hammarling
Publikováno v:
ACM Transactions on Mathematical Software. 45:1-35
The recent version of the Parallel Linear Algebra Software for Multicore Architectures (PLASMA) library is based on tasks with dependencies from the OpenMP standard. The main functionality of the library is presented. Extensive benchmarks are targete
Autor:
Jennifer A. Loe, Sherry Li, Mark Gates, Yaohung M. Tsai, Daniel Osei-Kuffuor, Stan Tomov, Hartwig Antz, Ulrike Meier Yang, Erik G. Boman, Scott Kruger
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::e9f2c62a1b5f095c2cdc83ba733be861
https://doi.org/10.2172/1735694
https://doi.org/10.2172/1735694
Publikováno v:
Proceedings of the IEEE. 106:2040-2055
Computational problems in engineering and scientific disciplines often rely on the solution of many instances of small systems of linear equations, which are called batched solves. In this paper, we focus on the important variants of both batch Chole