A robust variant of block Jacobi-Davidson for extracting a large number of eigenpairs: Application to grid-based real-space density functional theory.

Authors: Lee, M., Leiter, K., Eisner, C., Breuer, A., Wang, X.
Subject:
Source: Journal of Chemical Physics; 2017, Vol. 147 Issue 11, p1-8, 8p, 1 Diagram, 2 Charts, 3 Graphs
Abstract: In this work, we investigate a block Jacobi-Davidson (J-D) variant suitable for sparse symmetric eigenproblems where a substantial number of extremal eigenvalues are desired (e.g., ground-state real-space quantum chemistry). Most J-D variants tend to slow down as the number of desired eigenpairs increases, due to frequent orthogonalization against a growing list of solved eigenvectors. In our specification of block J-D, all of the steps of the algorithm are performed in clusters, including the linear solves, which allows us to greatly reduce computational effort with blocked matrix-vector multiplies. In addition, we move orthogonalization against locked eigenvectors and working eigenvectors outside of the inner loop but retain the single Ritz vector projection corresponding to the index of the correction vector. Furthermore, we minimize the computational effort by constraining the working subspace to the current vectors being updated and the latest set of corresponding correction vectors. Finally, we incorporate accuracy thresholds based on the precision required by the Fermi-Dirac distribution. The net result is a significant reduction in computational effort relative to most previous block J-D implementations, especially as the number of wanted eigenpairs grows. We compare our approach with another robust implementation of block J-D (JDQMR) and the state-of-the-art Chebyshev filter subspace (CheFSI) method for various real-space density functional theory systems. Versus CheFSI, for first-row elements, our method yields competitive timings for valence-only systems and 4-6X speedups for all-electron systems with up to 10X fewer matrix-vector multiplies. For all-electron calculations on larger elements (e.g., gold), where the wanted spectrum is quite narrow compared to the full spectrum, we observe a 60X speedup with 200X fewer matrix-vector multiplies vs. [ABSTRACT FROM AUTHOR]
Database: Complementary Index
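
Illustrative sketch (not the authors' implementation): the abstract describes a blocked iteration in which all working vectors are updated together, each correction vector keeps a projection against its own Ritz vector, and the working subspace is limited to the current vectors plus their latest corrections. The NumPy code below is a minimal sketch of that structure under stated assumptions. The function name `block_jd_sketch` and the parameters `tol`, `max_iter`, and `guard` are hypothetical. The inner linear solve is replaced by a one-step, diagonally preconditioned (Olsen-style) correction, and locking of converged eigenvectors and the Fermi-Dirac-based thresholds are omitted.

```python
import numpy as np

def block_jd_sketch(A, X0, tol=1e-8, max_iter=200, guard=1e-3):
    """Approximate the k lowest eigenpairs of a symmetric matrix A.

    Simplified block Jacobi-Davidson-style loop (illustrative only):
    the correction equation is approximated in one step with a diagonal
    preconditioner instead of an iterative inner solve.
    """
    d = A.diagonal()                       # diagonal of A for the preconditioner
    k = X0.shape[1]
    X, _ = np.linalg.qr(X0)                # orthonormal starting block
    AX = A @ X                             # blocked matrix-vector multiply
    theta, V = np.linalg.eigh(X.T @ AX)    # initial Rayleigh-Ritz inside span(X)
    X, AX = X @ V, AX @ V

    for _ in range(max_iter):
        R = AX - X * theta                 # residuals r_i = A x_i - theta_i x_i
        if np.all(np.linalg.norm(R, axis=0) < tol):
            break

        # Diagonal preconditioner M_i = diag(A) - theta_i, kept away from zero.
        M = d[:, None] - theta[None, :]
        M = np.where(np.abs(M) < guard, np.copysign(guard, M), M)

        # One-step correction with a single Ritz vector projection per index:
        # t_i = -M_i^{-1} r_i + eps_i M_i^{-1} x_i, with eps_i chosen so that
        # t_i is orthogonal to x_i.
        MR, MX = R / M, X / M
        eps = np.sum(X * MR, axis=0) / np.sum(X * MX, axis=0)
        T = -MR + MX * eps

        # Working subspace: current vectors plus latest corrections only.
        # Orthogonalization happens once here, outside the correction step.
        Q, _ = np.linalg.qr(np.hstack([X, T]))
        AQ = A @ Q                         # one blocked multiply per iteration
        w, V = np.linalg.eigh(Q.T @ AQ)    # Rayleigh-Ritz in the 2k-dim subspace
        theta = w[:k]
        X, AX = Q @ V[:, :k], AQ @ V[:, :k]

    return theta, X
```

The function accepts any symmetric `A` that supports `A @ X` and `A.diagonal()` (a NumPy array or a SciPy sparse matrix), together with an `n x k` starting block `X0`. Because the subspace never grows beyond `2k` columns, memory and orthogonalization cost stay fixed per iteration, which is the trade-off the abstract highlights; convergence speed in this sketch depends on how well the diagonal of `A` approximates the matrix.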