Zobrazeno 1 - 10
of 184
pro vyhledávání: '"Nelson, Amaral"'
Publikováno v:
ACM Transactions on Architecture and Code Optimization. 20:1-18
This article introduces YaConv , a new algorithm to compute convolution using GEMM microkernels from a Basic Linear Algebra Subprograms library that is efficient for multiple CPU architectures. Previous approaches either create a copy of each image e
Publikováno v:
IEEE Micro. 42:34-40
Autor:
FERRARI, VICTOR, SOUSA, RAFAEL, PEREIRA, MARCIO, DE CARVALHO, JOÃO P. L., NELSON AMARAL, JOSÉ, MOREIRA, JOSÉ, ARAUJO, GUIDO
Publikováno v:
ACM Transactions on Architecture & Code Optimization; Dec2023, Vol. 20 Issue 4, p1-26, 26p
Autor:
Braedy Kuzma, Ivan Korostelev, João P. L. de Carvalho, José E. Moreira, Christopher Barton, Guido Araujo, José Nelson Amaral
The resurgence of machine learning has increased the demand for high-performance basic linear algebra subroutines (BLAS), which have long depended on libraries to achieve peak performance on commodity hardware. High-performance BLAS implementations r
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::544cf0acc71946e9fb46f0e69516c7c2
http://arxiv.org/abs/2305.18236
http://arxiv.org/abs/2305.18236
Publikováno v:
In Parallel Computing May 2016 54:46-58
Autor:
Caio Salvador Rohwedder, Nathan Henderson, João P. L. de Carvalho, Yufei Chen, J. Nelson Amaral
Publikováno v:
Proceedings of the 21st ACM/IEEE International Symposium on Code Generation and Optimization.
Artifact of the paper "To Pack or Not to Pack: A Generalized Packing Analysis and Transformation". - docker-packing-artifact.tar.gz: docker image for the execution of experiments - llvm-packing-v0.5.zip: LLVM source code with packing implementation (
Publikováno v:
The Journal of Supercomputing. 78:12553-12588
Autor:
Caio Salvador Rohwedder, Nathan Henderson, João P. L. de Carvalho, Yufei Chen, J. Nelson Amaral
Artifact of the paper "To Pack or Not to Pack: A Generalized Packing Analysis and Transformation". - docker-packing-artifact.tar.gz: docker image for the execution of experiments - llvm-packing-v0.5.zip: LLVM source code with packing implementation (
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::49192e3db4492bd8639e392d09fec68e
Autor:
Andre Bauer, Cristina L. Abad, Laurens Versluis, Alexandru Iosup, José Nelson Amaral, Jóakim von Kistowski, Nikolas Herbst, Ahmed Ali-Eldin, Alessandro Vittorio Papadopoulos, Petr Tuma
Publikováno v:
Papadopoulos, A V, Versluis, L, Bauer, A, Herbst, N, Kistowski, J V, Ali-Eldin, A, Abad, C L, Amaral, J N, Tuma, P & Iosup, A 2021, ' Methodological Principles for Reproducible Performance Evaluation in Cloud Computing ', IEEE Transactions on Software Engineering, vol. 47, no. 8, 8758926, pp. 1528-1543 . https://doi.org/10.1109/TSE.2019.2927908
IEEE Transactions on Software Engineering, 47(8):8758926, 1528-1543. Institute of Electrical and Electronics Engineers Inc.
IEEE Transactions on Software Engineering, 47(8):8758926, 1528-1543. Institute of Electrical and Electronics Engineers Inc.
The rapid adoption and the diversification of cloud computing technology exacerbate the importance of a sound experimental methodology for this domain. This work investigates how to measure and report performance in the cloud, and how well the cloud
Autor:
Victor Ferrari, Rafael Sousa, Marcio Pereira, João P. L. de Carvalho, José Nelson Amaral, Guido Araujo
Publikováno v:
Proceedings of the International Conference on Parallel Architectures and Compilation Techniques.