Zobrazeno 1 - 10
of 13
pro vyhledávání: '"Nathan Wichmann"'
Autor:
Nathan Wichmann, Andrew Canning, Karthik Raman, Steven G. Louie, Ruchira Sasanka, Jack Deslippe, Mauro Del Ben, Felipe H. da Jornada, Chao Yang
Publikováno v:
Computer Physics Communications. 235:187-195
The ab initio GW approach is a rigorous Green’s-function-based framework that can be employed to compute electronic excitation properties of a wide variety of materials such as extended systems, molecules, as well as confined and nanostructured mat
Autor:
Brian Austin, Sudheer Chunduri, Nathan Wichmann, Nicholas J. Wright, Steven R. Warren, Taylor Groves, Peter Mendygral, Krishna Kandalla, Glenn K. Lockwood, Kalyan Kumaran, Scott Parker, Jacob Balma
Publikováno v:
SC
Network congestion is one of the biggest problems facing HPC systems today, affecting system throughput, performance, user experience, and reproducibility. Congestion manifests as run-to-run variability due to contention for shared resources (e.g., f
Publikováno v:
Concurrency and Computation: Practice and Experience. 31
Autor:
Paul R. C. Kent, Thorsten Kurth, Pierre Carrier, Nathan Wichmann, David Prendergast, Jack Deslippe, Taylor Barnes
Publikováno v:
ResearcherID
Barnes, TA; Kurth, T; Carrier, P; Wichmann, N; Prendergast, D; Kent, PRC; et al.(2017). Improved treatment of exact exchange in Quantum ESPRESSO. Computer Physics Communications, 214, 52-58. doi: 10.1016/j.cpc.2017.01.008. Lawrence Berkeley National Laboratory: Retrieved from: http://www.escholarship.org/uc/item/0fr829x9
Barnes, TA; Kurth, T; Carrier, P; Wichmann, N; Prendergast, D; Kent, PRC; et al.(2017). Improved treatment of exact exchange in Quantum ESPRESSO. Computer Physics Communications, 214, 52-58. doi: 10.1016/j.cpc.2017.01.008. Lawrence Berkeley National Laboratory: Retrieved from: http://www.escholarship.org/uc/item/0fr829x9
© 2017 We present an algorithm and implementation for the parallel computation of exact exchange in Quantum ESPRESSO (QE) that exhibits greatly improved strong scaling. QE is an open-source software package for electronic structure calculations usin
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2f709855593e7fa415c450e5a93e9ff1
https://escholarship.org/uc/item/0fr829x9
https://escholarship.org/uc/item/0fr829x9
Autor:
Taylor Barnes, Brandon Cook, Jack Deslippe, Douglas Doerfler, Brian Friesen, Yun He, Thorsten Kurth, Tuomas Koskela, Mathieu Lobet, Tareq Malas, Leonid Oliker, Andrey Ovsyannikov, Abhinav Sarje, Jean-Luc Vay, Henri Vincenti, Samuel Williams, Pierre Carrier, Nathan Wichmann, Marcus Wagner, Paul Kent, Christopher Kerr, John Dennis
Publikováno v:
Barnes, T; Cook, B; Deslippe, J; Doerfler, D; Friesen, B; He, Y; et al.(2017). Evaluating and optimizing the NERSC workload on knights landing. Proceedings of PMBS 2016: 7th International Workshop on Performance Modeling, Benchmarking and Simulation of High Performance Computing Systems-Held in conjunction with SC 2016: The International Conference for High Performance Computing, Networking, Storage and Analysis, 43-53. doi: 10.1109/PMBS.2016.010. Lawrence Berkeley National Laboratory: Retrieved from: http://www.escholarship.org/uc/item/75c1571h
© 2016 IEEE. NERSC has partnered with 20 representative application teams to evaluate performance on the Xeon-Phi Knights Landing architecture and develop an application-optimization strategy for the greater NERSC workload on the recently installed
Publikováno v:
IPDPS
We develop a method for improving the parallel scalability of computations that involve asynchronous task execution. We apply this method to the recently developed parallel selected inversion algorithm [Jacquelin, Lin and Yang 2014], named PSelInv, o
Autor:
Pieter Maris, Meiyue Shao, Nathan Wichmann, John O’Neill, Brandon Cook, Thanh N. Phung, Marcus Wagner, Gaurav Bansal
Publikováno v:
Lecture Notes in Computer Science ISBN: 9783319460789
ISC Workshops
ISC Workshops
Initial optimization strategies and results on MFDn, a large-scale nuclear physics application code, running on a single KNL node are presented. This code consists of the construction of a very large sparse real symmetric matrix and computing a few l
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::1f735e539685c2a0334ca3219964d61a
https://doi.org/10.1007/978-3-319-46079-6_26
https://doi.org/10.1007/978-3-319-46079-6_26
Autor:
Steven G. Louie, Nathan Wichmann, Felipe H. da Jornada, Ruchira Sasanka, Karthik Raman, Derek Vigil-Fowler, Jack Deslippe, Taylor Barnes
Publikováno v:
Lecture Notes in Computer Science ISBN: 9783319460789
ISC Workshops
ISC Workshops
We profile and optimize calculations performed with the BerkeleyGW [2, 3] code on the Xeon-Phi architecture. BerkeleyGW depends both on hand-tuned critical kernels as well as on BLAS and FFT libraries. We describe the optimization process and perform
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::556ca5d713af7c971aefdbb8a2bcfa0a
https://doi.org/10.1007/978-3-319-46079-6_29
https://doi.org/10.1007/978-3-319-46079-6_29
Autor:
Nathan Wichmann, John Shalf, Hongzhang Shan, Katherine Yelick, Nicholas J. Wright, Marcus Wagner
Publikováno v:
ACM SIGMETRICS Performance Evaluation Review. 40:92-98
The Gemini interconnect on the Cray XE6 platform provides for lightweight remote direct memory access (RDMA) between nodes, which is useful for implementing partitioned global address space (PGAS) languages like UPC and Co-Array Fortran. In this pape
Publikováno v:
Scientific Programming, Vol 18, Iss 3-4, Pp 139-151 (2010)
Application codes in a variety of areas are being updated for performance on the latest architectures. In this paper we examine an application, which comes from magnetic fusion for performance acceleration with a particular emphasis on methods that a