Search results - "van de Geijn, Robert"

Report

Deriving Algorithms for Triangular Tridiagonalization a (Skew-)Symmetric Matrix

Authors: van de Geijn, Robert; Myers, Maggie; Xu, RuQing G.; Matthews, Devin

We apply the FLAME methodology to derive algorithms hand in hand with their proofs of correctness for the computation of the $ L T L^T $ decomposition (with and without pivoting) of a skew-symmetric matrix. The approach yields known as well as new al

External link: http://arxiv.org/abs/2311.10700

Report

Formal Derivation of LU Factorization with Pivoting

Authors: van de Geijn, Robert; Myers, Maggie

The FLAME methodology for deriving linear algebra algorithms from specification, first introduced around 2000, has been successfully applied to a broad cross section of operations. An open question has been whether it can yield algorithms for the bes

External link: http://arxiv.org/abs/2304.03068

Report

Cascading GEMM: High Precision from Low Precision

Authors: Parikh, Devangi N.; van de Geijn, Robert A.; Henry, Greg M.

This paper lays out insights and opportunities for implementing higher-precision matrix-matrix multiplication (GEMM) from (in terms of) lower-precision high-performance GEMM. The driving case study approximates double-double precision (FP64x2) GEMM i

External link: http://arxiv.org/abs/2303.04353

Report

GEMMFIP: Unifying GEMM in BLIS

Authors: Xu, RuQing G.; Van Zee, Field G.; van de Geijn, Robert A.

Matrix libraries often focus on achieving high performance for problems considered to be either "small" or "large", as these two scenarios tend to respond best to different optimization strategies. We propose a unified technique for implementing matr

External link: http://arxiv.org/abs/2302.08417

Report

The MOMMS Family of Matrix Multiplication Algorithms

Authors: Smith, Tyler M.; van de Geijn, Robert A.

As the ratio between the rate of computation and rate with which data can be retrieved from various layers of memory continues to deteriorate, a question arises: Will the current best algorithms for computing matrix-matrix multiplication on future CP

External link: http://arxiv.org/abs/1904.05717

Report

Supporting mixed-datatype matrix multiplication within the BLIS framework

Authors: Van Zee, Field G.; Parikh, Devangi N.; van de Geijn, Robert A.

We approach the problem of implementing mixed-datatype support within the general matrix multiplication (GEMM) operation of the BLIS framework, whereby each matrix operand A, B, and C may be stored as single- or double-precision real or complex value

External link: http://arxiv.org/abs/1901.06015

Report

Implementing Strassen's Algorithm with CUTLASS on NVIDIA Volta GPUs

Authors: Huang, Jianyu; Yu, Chenhan D.; van de Geijn, Robert A.

Conventional GPU implementations of Strassen's algorithm (Strassen) typically rely on the existing high-performance matrix multiplication (GEMM), trading space for time. As a result, such approaches can only achieve practical speedup for relatively l

External link: http://arxiv.org/abs/1808.07984

Report

A Simple Methodology for Computing Families of Algorithms

Authors: Parikh, Devangi N.; Myers, Margaret E.; Vuduc, Richard; van de Geijn, Robert A.

Discovering "good" algorithms for an operation is often considered an art best left to experts. What if there is a simple methodology, an algorithm, for systematically deriving a family of algorithms as well as their cost analyses, so that the best a

External link: http://arxiv.org/abs/1808.07832

Report

Deriving Correct High-Performance Algorithms

Authors: Parikh, Devangi N.; Myers, Maggie E.; van de Geijn, Robert A.

Dijkstra observed that verifying correctness of a program is difficult and conjectured that derivation of a program hand-in-hand with its proof of correctness was the answer. We illustrate this goal-oriented approach by applying it to the domain of d

External link: http://arxiv.org/abs/1710.04286

Report

Strassen's Algorithm for Tensor Contraction

Authors: Huang, Jianyu; Matthews, Devin A.; van de Geijn, Robert A.

Tensor contraction (TC) is an important computational kernel widely used in numerous applications. It is a multi-dimensional generalization of matrix multiplication (GEMM). While Strassen's algorithm for GEMM is well studied in theory and practice, e

External link: http://arxiv.org/abs/1704.03092
