Techniques for solving stiff chemical kinetics on GPUs

Authors: Wright-Patterson AFB, Roger L. Davis, Christopher Stone
Year of publication: 2013
Subject:
Source: 51st AIAA Aerospace Sciences Meeting including the New Horizons Forum and Aerospace Exposition.
DOI: 10.2514/6.2013-369
Description: numbers of ODEs with comparable solution accuracy. The GPU implementation of the DVODE solver achieved a maximum speed-up of 7.7x over the baseline CPU run-time. The performance impact of mapping one thread to each ODE was compared with that of mapping an entire CUDA thread-block to each ODE (i.e., multiple threads per ODE). The one-thread-per-ODE approach achieved greater overall speed-up than the one-block-per-ODE approach, but only when the number of ODEs was large: 1,000 ODEs were needed just to break even with the scalar CPU version, and over 50,000 ODEs were needed to reach maximum parallel efficiency. The performance difference is most pronounced with the RKF45 algorithm, where peak performance with the one-thread-per-ODE method was nearly 2x that of the one-block-per-ODE approach. The one-block-per-ODE implementations of RKF45 and DVODE both achieved lower peak speed-ups but outperformed the scalar CPU version with as few as 100 ODEs. The new GPU-enabled ODE solvers demonstrate a method to significantly reduce the computational cost of detailed finite-rate combustion simulations, with turn-around cost savings exceeding an order of magnitude.
Database: OpenAIRE
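
Note: the two work mappings compared in the description can be illustrated with a small index-arithmetic sketch. The following is an emulation in plain Python, not CUDA and not the authors' code. All function and parameter names (`one_thread_per_ode`, `one_block_per_ode`, `launch_size`, `n_eqs`, a block size of 128) are hypothetical, and the striding of threads over the equations of one system is an assumed layout, since the record does not say how threads within a block share the work.

```python
# Plain-Python emulation of two CUDA work mappings (illustrative only).
# Every name here is hypothetical; nothing below is taken from the paper.

def one_thread_per_ode(block_idx, thread_idx, block_dim, n_odes):
    """Return the ODE index owned by this thread, or None if the thread is idle."""
    ode = block_idx * block_dim + thread_idx  # standard global thread index
    return ode if ode < n_odes else None


def one_block_per_ode(block_idx, thread_idx, block_dim, n_odes, n_eqs):
    """Return (ODE index, equation indices) for this thread, or None if idle.

    Assumed layout: the whole block works on one ODE system and its threads
    stride over that system's n_eqs equations.
    """
    if block_idx >= n_odes:
        return None
    eqs = list(range(thread_idx, n_eqs, block_dim))
    return block_idx, eqs


def launch_size(n_odes, block_dim, mapping):
    """Return (number of blocks, total threads) launched under a mapping."""
    if mapping == "thread":
        blocks = (n_odes + block_dim - 1) // block_dim  # ceiling division
    elif mapping == "block":
        blocks = n_odes
    else:
        raise ValueError("mapping must be 'thread' or 'block'")
    return blocks, blocks * block_dim


if __name__ == "__main__":
    for n in (100, 1000, 50000):
        print(n, launch_size(n, 128, "thread"), launch_size(n, 128, "block"))
```

Under these assumptions, 100 ODEs occupy a single 128-thread block with one thread per ODE but 100 blocks with one block per ODE. This may be one reason the block mapping is reported to beat the scalar CPU version at small ODE counts, although the record itself does not state the cause.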