Zobrazeno 1 - 10
of 13
pro vyhledávání: '"Karthikeyan Vaidyanathan"'
Autor:
Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Jongsoo Park, Carlos Rosales, Christopher S. Daley, Dhiraj D. Kalamkar, Vadim O. Pirogov, Mosotofa Ali Patwary, Cyril Mazauric, Pradeep Dubey, Xing Liu, Alexander Heinecke
Publikováno v:
The International Journal of High Performance Computing Applications. 30:11-27
This paper presents optimizations in a high-performance conjugate gradient benchmark (HPCG) for multi-core Intel® Xeon® processors and many-core Xeon Phi™ coprocessors. Without careful optimization, the HPCG benchmark under-utilizes the compute r
Publikováno v:
Lecture Notes in Computer Science ISBN: 9783319460789
ISC Workshops
ISC Workshops
Lattice Quantumchromodynamics (QCD) is a powerful tool to numerically access the low energy regime of QCD in a straightforward way with quantifyable uncertainties. In this approach, QCD is discretized on a four dimensional, Euclidean space-time grid
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::d5667781a395d8aa81f9184e23221dac
https://doi.org/10.1007/978-3-319-46079-6_30
https://doi.org/10.1007/978-3-319-46079-6_30
Autor:
Dhiraj D. Kalamkar, Karthikeyan Vaidyanathan, Kiran Pamnany, Jeff R. Hammond, Dipankar Das, Pavan Balaji, Jongsoo Park, Bálint Joó
Publikováno v:
SC
We present a new approach for multithreaded communication and asynchronous progress in MPI applications, wherein we offload communication processing to a dedicated thread. The central premise is that given the rapidly increasing core counts on modern
Autor:
Jefferson Amstutz, Cedric Andreolli, Meenakshi Arunachalam, Gaurav Bansal, Martin Berzins, Paul Besl, Ashraf Bhuiyan, Stephen Blair-Chappell, Leonardo Borges, James P. Briggs, Mikhail Brinskiy, Michal Brylinski, Vlad Calina, James Dinan, Jussi Enkovaara, Rob Farber, Julia Fedorova, Wei P. Feinstein, Evan Felix, James R. Fergusson, Evgeny Fiksman, Indraneil Gokhale, Christiaan Gribble, Diana Guttman, Tom Henderson, John Holmen, Allen H.-L. Huang, Bormin Huang, Alan Humphrey, Juha Jäykkä, Jim Jeffers, Ashish Jha, Bálint Joó, Dhiraj D. Kalamkar, Mahmut Taylan Kandemir, Rahul Khanna, Taylor Kidd, Jeongnim Kim, Michael Klemm, Shuo Li, Yongchao Liu, Belinda Liviero, Mark Lubin, Luke Mason, Zakhar A. Matveev, Lawrence Meadows, John Michalakes, Jarno Mielikainen, Ravi A. Murty, Perri Needham, Chris J. Newburn, Matthias Noack, Enda O'Brien, Klaus-Dieter Oertel, Simon J. Pennycook, Dmitry Prohorov, Narayan Ranganathan, George M. Raskulinec, James Reinders, Bertil Schmidt, Michael Seaton, Edward P. Shellard, Mikhail Smelyanskiy, Paulo Souza, Dan Stanzione, Philippe Thierry, Prashanth Thinakaran, Karthikeyan Vaidyanathan, Sergei Vinogradov, Ross C. Walker, Florian Wende, Freddie Witherden, Praveen Yedlapalli
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::8b26b82ab664a3994be005188c3e9903
https://doi.org/10.1016/b978-0-12-803819-2.09998-5
https://doi.org/10.1016/b978-0-12-803819-2.09998-5
Autor:
Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, Alexander Heinecke, Jongsoo Park, Pradeep Dubey, Xing Liu, Md. Mosotofa Ali Patwary, Yutong Lu, Dhiraj D. Kalamkar
Publikováno v:
SC
A new sparse high performance conjugate gradient benchmark (HPCG) has been recently released to address challenges in the design of sparse linear solvers for the next generation extreme-scale computing systems. Key computation, data access, and commu
Autor:
Pradeep Dubey, Balint Joo, Dhiraj D. Kalamkar, Simon Heybrock, Tilo Wettig, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan
Publikováno v:
SC14: International Conference for High Performance Computing, Networking, Storage and Analysis.
The gap between the cost of moving data and the cost of computing continues to grow, making it ever harder to design iterative solvers on extreme-scale architectures. This problem can be alleviated by alternative algorithms that reduce the amount of
Autor:
Sebastian Rettenberger, Alexander Breuer, Arndt Bode, Pradeep Dubey, Michael Bader, William L. Barth, Alice-Agnes Gabriel, Mikhail Smelyanskiy, Alexander Heinecke, Karthikeyan Vaidyanathan, Xiangke Liao, Christian Pelties
Publikováno v:
SC
We present an end-to-end optimization of the innovative Arbitrary high-order DERivative Discontinuous Galerkin (ADER-DG) software SeisSol targeting Intel® Xeon Phi coprocessor platforms, achieving unprecedented earthquake model complexity through co
Autor:
Kiran Pamnany, Alexander Heinecke, Daehyun Kim, Dhiraj D. Kalamkar, Mikhail Smelyanskiy, Jongsoo Park, Bálint Joó, Bharat Kaul, Karthikeyan Vaidyanathan, Aniruddha G. Shet, Pradeep Dubey
Publikováno v:
IPDPS
Intel Xeon Phi coprocessor-based clusters offer high compute and memory performance for parallel workloads and also support direct network access. Many real world applications are significantly impacted by network characteristics and to maximize the
Autor:
Ping Tak Peter Tang, Jongsoo Park, Daehyun Kim, Karthikeyan Vaidyanathan, Ganesh Bikshandi, Pradeep Dubey
Publikováno v:
SC
This paper demonstrates the first tera-scale performance of Intel® Xeon Phi™ coprocessors on 1D FFT computations. Applying a disciplined performance programming methodology of sound algorithm choice, valid performance model, and well-executed opti
Autor:
Alexander Heinecke, Aniruddha G. Shet, Pradeep Dubey, Greg Henry, Alexander Kobotov, Roman S. Dubtsov, Mikhail Smelyanskiy, Karthikeyan Vaidyanathan, George Z. Chrysos
Publikováno v:
IPDPS
Dense linear algebra has been traditionally used to evaluate the performance and efficiency of new architectures. This trend has continued for the past half decade with the advent of multi-core processors and hardware accelerators. In this paper we d