Floating-Point Computations on Reconfigurable Computers

Author:	G.R. Morris
Year of publication:	2007
Subject:	Coprocessor; Floating point; Kernel (image processing); Computer science; Computation; Pipeline (computing); Algorithm design; Parallel computing; Hardware_ARITHMETICANDLOGICSTRUCTURES; Field-programmable gate array; Reconfigurable computing; Sparse matrix
Source:	2007 DoD High Performance Computing Modernization Program Users Group Conference.
DOI:	10.1109/hpcmp-ugc.2007.35
Description:	Modern reconfigurable computers (RCs) combine general-purpose processors with field-programmable gate arrays (FPGAs). The FPGAs are, in effect, reconfigurable application-specific coprocessors. During one run, the FPGA might be a matrix-vector multiply coprocessor; during another run, it might be a linear equation solver. Several issues are associated with mapping floating-point computations onto RCs. One is determining what the author terms "the FPGA design boundary," i.e., the portion of the application that is mapped onto the FPGA. Furthermore, FPGA-based kernel performance is heavily dependent upon both pipelining and parallelism. The author has coined the phrase "the three p's" to encapsulate this important relationship. In this paper, important FPGA design boundary heuristics are described, and a toroidal architecture and partitioned loop algorithm are used to maximize both pipelining and parallelism for a double-precision floating-point sparse matrix conjugate gradient solver that is mapped onto a reconfigurable computer. Wall-clock run time comparisons show that the FPGA-augmented version runs more than two times faster than the software-only version.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::210b631669b181b138e3a5eb8142fbe1 https://doi.org/10.1109/hpcmp-ugc.2007.35
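
Note:	The description above refers to a double-precision sparse matrix conjugate gradient solver. As a reminder of what that kernel computes, here is a minimal software-only sketch in Python. It is not the author's FPGA design: the compressed sparse row (CSR) storage, the function names `csr_matvec`, `dot`, and `conjugate_gradient`, and the `tol` and `max_iter` parameters are all assumptions made for illustration. Python floats are IEEE 754 double precision in CPython, which matches the arithmetic width named in the abstract. The sparse matrix-vector product inside the loop is the kind of operation the abstract mentions as a candidate for an FPGA coprocessor.

```python
import math


def csr_matvec(values, col_idx, row_ptr, x):
    """Multiply a CSR-format sparse matrix by a dense vector x."""
    y = [0.0] * (len(row_ptr) - 1)
    for row in range(len(y)):
        acc = 0.0
        # Non-zeros of this row occupy values[row_ptr[row]:row_ptr[row + 1]].
        for k in range(row_ptr[row], row_ptr[row + 1]):
            acc += values[k] * x[col_idx[k]]
        y[row] = acc
    return y


def dot(a, b):
    """Dot product of two equal-length dense vectors."""
    return sum(ai * bi for ai, bi in zip(a, b))


def conjugate_gradient(values, col_idx, row_ptr, b, tol=1e-10, max_iter=1000):
    """Solve A x = b for a symmetric positive-definite A stored in CSR.

    tol and max_iter are illustrative defaults, not values from the paper.
    """
    x = [0.0] * len(b)
    r = list(b)            # residual b - A x, with the initial guess x = 0
    p = list(r)            # initial search direction
    rs_old = dot(r, r)
    for _ in range(max_iter):
        if math.sqrt(rs_old) < tol:
            break          # residual norm is small enough
        ap = csr_matvec(values, col_idx, row_ptr, p)
        alpha = rs_old / dot(p, ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, ap)]
        rs_new = dot(r, r)
        beta = rs_new / rs_old
        p = [ri + beta * pi for ri, pi in zip(r, p)]
        rs_old = rs_new
    return x


# Example system: A = [[4, 1], [1, 3]], b = [1, 2], stored in CSR form.
x = conjugate_gradient(
    values=[4.0, 1.0, 1.0, 3.0],
    col_idx=[0, 1, 0, 1],
    row_ptr=[0, 2, 4],
    b=[1.0, 2.0],
)
```

For this small system the exact solution is x = (1/11, 7/11), and the sketch should agree with it to within floating-point rounding.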