Showing 1 - 10 of 116 for search: '"Wittmann, Markus"'
Published in:
Computers & Fluids, 2018
Lattice Boltzmann methods (LBM) are an important part of current computational fluid dynamics (CFD). They allow easy implementations and boundary handling. However, competitive time to solution not only depends on the choice of a reasonable method, b
External link:
http://arxiv.org/abs/1711.11468
Author:
Hammer, Nicolay, Jamitzky, Ferdinand, Satzger, Helmut, Allalen, Momme, Block, Alexander, Karmakar, Anupam, Brehm, Matthias, Bader, Reinhold, Iapichino, Luigi, Ragagnin, Antonio, Karakasis, Vasilios, Kranzlmüller, Dieter, Bode, Arndt, Huber, Herbert, Kühn, Martin, Machado, Rui, Grünewald, Daniel, Edelmann, Philipp V. F., Röpke, Friedrich K., Wittmann, Markus, Zeiser, Thomas, Wellein, Gerhard, Mathias, Gerald, Schwörer, Magnus, Lorenzen, Konstantin, Federrath, Christoph, Klessen, Ralf, Bamberg, Karl-Ulrich, Ruhl, Hartmut, Schornbaum, Florian, Bauer, Martin, Nikhil, Anand, Qi, Jiaxing, Klimach, Harald, Stüben, Hinnerk, Deshmukh, Abhishek, Falkenstein, Tobias, Dolag, Klaus, Petkova, Margarita
Published in:
Advances in Parallel Computing, vol. 27: Parallel Computing: On the Road to Exascale, eds. G.R. Joubert et al., p. 827, 2016
In spring 2015, the Leibniz Supercomputing Centre (Leibniz-Rechenzentrum, LRZ), installed their new Peta-Scale System SuperMUC Phase2. Selected users were invited for a 28 day extreme scale-out block operation during which they were allowed to use th
External link:
http://arxiv.org/abs/1609.01507
Author:
Bauer, Simon, Mohr, Marcus, Rüde, Ulrich, Weismüller, Jens, Wittmann, Markus, Wohlmuth, Barbara
Matrix-free finite element implementations of massively parallel geometric multigrid save memory and are often significantly faster than implementations using classical sparse matrix techniques. They are especially well suited for hierarchical hybrid
External link:
http://arxiv.org/abs/1608.06473
Simple floating point operations like addition or multiplication on normalized floating point values can be computed by current AMD and Intel processors in three to five cycles. This is different for denormalized numbers, which appear when an underfl
External link:
http://arxiv.org/abs/1506.03997
Memory-bound algorithms show complex performance and energy consumption behavior on multicore processors. We choose the lattice-Boltzmann method (LBM) on an Intel Sandy Bridge cluster as a prototype scenario to investigate if and how single-chip perf
External link:
http://arxiv.org/abs/1304.7664
We present a simple library which equips MPI implementations with truly asynchronous non-blocking point-to-point operations, and which is independent of the underlying communication infrastructure. It utilizes the MPI profiling interface (PMPI) and t
External link:
http://arxiv.org/abs/1302.4280
Published in:
Computers & Fluids, Volume 80, Pages 283-289 (2013)
We present a simple, parallel and distributed algorithm for setting up and partitioning a sparse representation of a regular discretized simulation domain. This method is scalable for a large number of processes even for complex geometries and ensure
External link:
http://arxiv.org/abs/1111.1129
Published in:
Computers & Mathematics with Applications, Volume 65, Issue 6, Pages 924-935 (2013)
Several possibilities exist to implement the propagation step of the lattice Boltzmann method. This paper describes common implementations which are compared according to the number of memory transfer operations they require per lattice node update.
External link:
http://arxiv.org/abs/1111.0922
Author:
Wittmann, Markus, Hager, Georg
Task parallelism as employed by the OpenMP task construct or some Intel Threading Building Blocks (TBB) components, although ideal for tackling irregular problems or typical producer/consumer schemes, bears some potential for performance bottlenecks
External link:
http://arxiv.org/abs/1101.0093
Bandwidth-starved multicore chips have become ubiquitous. It is well known that the performance of stencil codes can be improved by temporal blocking, lessening the pressure on the memory interface. We introduce a new pipelined approach that makes ex
External link:
http://arxiv.org/abs/1006.3148