Showing 1 - 10 of 312 for search: '"Loop nest optimization"'
Published in:
Future Generation Computer Systems. 112:1093-1105
With Graphics Processing Units (GPUs) being widely adopted in data centers to provide computing power, efficient support for GPU multitasking attracts significant attention. The prior GPU multitasking works include spatial multitasking and simultaneou
Published in:
PASC
The performance and scalability of computational fluid dynamics (CFD) solvers are essential for many applications, including multidisciplinary design optimization. With the evolution of high-performance computing resources such as Intel's Knights Land
Author:
Hiroshi Horii, Jun Doi
Published in:
2020 IEEE International Conference on Quantum Computing and Engineering (QCE).
Classical computers require large memory resources and computational power to simulate quantum circuits with a large number of qubits. Even supercomputers that can store huge amounts of data face a scalability issue in regard to parallel quantum comp
Published in:
Computer Physics Communications. 271:108193
On modern hardware architectures, the performance of Flux Reconstruction (FR) methods can be limited by memory bandwidth. In a typical implementation, these methods are implemented as a chain of distinct kernels. Often, a dataset which has just been
Author:
Heechul Yun, Michael Bechtel
Published in:
HotSoS
In this paper, we propose memory-aware cache DoS attacks that can induce more effective cache blocking by taking advantage of information of the underlying memory hardware. Like prior cache DoS attacks, our new attacks also generate lots of cache mis
Published in:
PacificVis
Force-directed algorithms are widely used to generate aesthetically pleasing layouts of graphs or networks arising in many scientific disciplines. To visualize large-scale graphs, several parallel algorithms have been discussed in the literature. Howe
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::17eb7f1c652308922bc305bccb4b2ba9
http://arxiv.org/abs/2002.08233
Author:
Shogo Fukushima, Aravind Krishnamoorthy, Manaschai Kunaseth, Subodh Tiwari, Ye Luo, Fuyuki Shimojo, Ken-ichi Nomura, Rajiv K. Kalia, Priya Vashishta, Aiichiro Nakano, Putt Sakdhnagool, Pankaj Rajak
Published in:
HPC Asia
The confluence of extreme-scale quantum dynamics simulations (i.e., quantum@scale) and cutting-edge x-ray free-electron laser experiments is revolutionizing materials science. An archetypal example is the exciting concept of using picosecond light pulses
Published in:
Lecture Notes in Computer Science ISBN: 9783030503703
ICCS (1)
Efficient solvers for partial differential equations are among the most important areas of algorithmic research in high-performance computing. In this paper we present a new optimization for solving linear autonomous partial differential equations. O
External link:
https://explore.openaire.eu/search/publication?articleId=doi_________::68118c2ea3ac5988ecb0a6cd12ea109c
https://doi.org/10.1007/978-3-030-50371-0_10
Published in:
ISPA/BDCloud/SocialCom/SustainCom
Modern heterogeneous high-end computing systems featuring many-core co-processors/accelerators pose tough challenges to parallelize and optimize real-world scientific codes. In this paper, we demonstrate highly scalable 3D Lattice Boltzmann multipha
Published in:
ACM Transactions on Architecture and Code Optimization. 14:1-26
The polyhedron model is a powerful model to systematically identify and apply loop transformations that improve data locality (e.g., via tiling) and enable parallelization. In the polyhedron model, a loop transformation is, essentially, represented a