Performance of a parallel algebraic multilevel preconditioner for stabilized finite element semiconductor device modeling
Autor: | Raymond S. Tuminaro, Gary L. Hennigan, Robert J. Hoekstra, Paul Lin, John N. Shadid, Marzio Sala |
---|---|
Rok vydání: | 2009 |
Předmět: |
Numerical Analysis
Mathematical optimization Partial differential equation Physics and Astronomy (miscellaneous) Preconditioner Applied Mathematics MathematicsofComputing_NUMERICALANALYSIS Solver System of linear equations Computer Science::Numerical Analysis Generalized minimal residual method Finite element method Mathematics::Numerical Analysis Computer Science Applications Computational Mathematics Nonlinear system Multigrid method Modeling and Simulation ComputingMethodologies_SYMBOLICANDALGEBRAICMANIPULATION Applied mathematics Mathematics |
Zdroj: | Journal of Computational Physics. 228:6250-6267 |
ISSN: | 0021-9991 |
DOI: | 10.1016/j.jcp.2009.05.024 |
Popis: | In this study results are presented for the large-scale parallel performance of an algebraic multilevel preconditioner for solution of the drift-diffusion model for semiconductor devices. The preconditioner is the key numerical procedure determining the robustness, efficiency and scalability of the fully-coupled Newton-Krylov based, nonlinear solution method that is employed for this system of equations. The coupled system is comprised of a source term dominated Poisson equation for the electric potential, and two convection-diffusion-reaction type equations for the electron and hole concentration. The governing PDEs are discretized in space by a stabilized finite element method. Solution of the discrete system is obtained through a fully-implicit time integrator, a fully-coupled Newton-based nonlinear solver, and a restarted GMRES Krylov linear system solver. The algebraic multilevel preconditioner is based on an aggressive coarsening graph partitioning of the nonzero block structure of the Jacobian matrix. Representative performance results are presented for various choices of multigrid V-cycles and W-cycles and parameter variations for smoothers based on incomplete factorizations. Parallel scalability results are presented for solution of up to 10^8 unknowns on 4096 processors of a Cray XT3/4 and an IBM POWER eServer system. |
Databáze: | OpenAIRE |
Externí odkaz: |