Zobrazeno 1 - 10
of 109
pro vyhledávání: '"Kalé, Laxmikant V."'
Autor:
Wu, Nanmiao, Gonidelis, Ioannis, Liu, Simeng, Fink, Zane, Gupta, Nikunj, Mohammadiporshokooh, Karame, Diehl, Patrick, Kaiser, Hartmut, Kale, Laxmikant V.
Asynchronous Many-Task (AMT) runtime systems take advantage of multi-core architectures with light-weight threads, asynchronous executions, and smart scheduling. In this paper, we present the comparison of the AMT systems Charm++ and HPX with the mai
Externí odkaz:
http://arxiv.org/abs/2207.12127
Asynchronous tasks, when created with over-decomposition, enable automatic computation-communication overlap which can substantially improve performance and scalability. This is not only applicable to traditional CPU-based systems, but also to modern
Externí odkaz:
http://arxiv.org/abs/2202.11819
Python is rapidly becoming the lingua franca of machine learning and scientific computing. With the broad use of frameworks such as Numpy, SciPy, and TensorFlow, scientific computing and machine learning are seeing a productivity boost on systems wit
Externí odkaz:
http://arxiv.org/abs/2111.04872
As an increasing number of leadership-class systems embrace GPU accelerators in the race towards exascale, efficient communication of GPU data is becoming one of the most critical components of high-performance computing. For developers of parallel p
Externí odkaz:
http://arxiv.org/abs/2102.12416
Publikováno v:
In Parallel Computing October 2022 113
We present a hybrid OpenMP/Charm++ framework for solving the $\mathcal{O} (N)$ Self-Consistent-Field eigenvalue problem with parallelism in the strong scaling regime, $P\gg{N}$, where $P$ is the number of cores, and $N$ a measure of system size, i.e.
Externí odkaz:
http://arxiv.org/abs/1403.7458
Autor:
Fazenda, Alvaro Luiz, Mendes, Celso L., Kale, Laxmikant V., Panetta, Jairo, Rodrigues, Eduardo Rocha
The dynamic load-balancing framework in Charm++/AMPI, developed at the University of Illinois, is based on using processor virtualization to allow thread migration across processors. This framework has been successfully applied to many scientific app
Externí odkaz:
http://arxiv.org/abs/1310.4218
Publikováno v:
In Parallel Computing October 2014 40(9):454-470
Publikováno v:
In Parallel Computing October 2014 40(9):536-547
Autor:
Phillips, James C., Hardy, David J., Maia, Julio D. C., Stone, John E., Ribeiro, João V., Bernardi, Rafael C., Buch, Ronak, Fiorin, Giacomo, Hénin, Jérôme, Jiang, Wei, McGreevy, Ryan, Melo, Marcelo C. R., Radak, Brian K., Skeel, Robert D., Singharoy, Abhishek, Wang, Yi, Roux, Benoît, Aksimentiev, Aleksei, Luthey-Schulten, Zaida, Kalé, Laxmikant V.
Publikováno v:
Journal of Chemical Physics; 7/28/2020, Vol. 153 Issue 4, p1-33, 33p, 1 Color Photograph, 10 Diagrams, 10 Graphs