Zobrazeno 1 - 10
of 307
pro vyhledávání: '"Loop interchange"'
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
IEEE Access, Vol 8, Pp 172509-172523 (2020)
To effectively compute convolutional layers, a complex design space must exist (e.g., the dataflow techniques associated with the layer parameters, loop transformation techniques, and hardware parameters). For efficient design space exploration (DSE)
Publikováno v:
ACM Transactions on Architecture and Code Optimization. 16:1-26
Iteration Point Difference Analysis is a new static analysis framework that can be used to determine the memory coalescing characteristics of parallel loops that target GPU offloading and to ascertain safety and profitability of loop transformations
Autor:
Michael Wolfe
Publikováno v:
ICS
Between a problem statement and its solution as a computer simulation are several steps, from choosing a method, writing a program, compiling to machine code, making runtime decisions, and hardware execution. Here we will look at the middle three dec
Autor:
Martin Elsman, Cosmin E. Oancea, Troels Henriksen, Fritz Henglein, Niels Gustav Westphal Serup
Publikováno v:
PLDI
Futhark is a purely functional data-parallel array language that offers a machine-neutral programming model and an optimising compiler that generates OpenCL code for GPUs. This paper presents the design and implementation of three key features of Fut
Autor:
Sanyam Mehta, Pen-Chung Yew
Publikováno v:
ACM Transactions on Architecture and Code Optimization. 13:1-25
In the wake of the current trend of increasing the number of cores on a chip, compiler optimizations for improving the memory performance have assumed increased importance. Loop fusion is one such key optimization that can alleviate memory and bandwi
Publikováno v:
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 35:1092-1104
Nested loops represent a significant portion of application runtime in multimedia and DSP applications, an important domain of applications for coarse-grained reconfigurable architectures (CGRAs). While conventional approaches to mapping nested loops
Publikováno v:
PEPM
Code generation is the leading approach to making high-performance software reusable. Effects are indispensable in code generators, whether to report failures or to insert let-statements and if-guards. Extensive painful experience shows that unrestri
Publikováno v:
Fundamental Approaches to Software Engineering ISBN: 9783662466742
FASE
STARTPAGE=IPA;ENDPAGE=4;TITLE=ICT.OPEN 2015
Proceedings of the 18th International Conference on Fundamental Approaches to Software Engineering (FASE 2015), 202-217
STARTPAGE=202;ENDPAGE=217;TITLE=Proceedings of the 18th International Conference on Fundamental Approaches to Software Engineering (FASE 2015)
University of Twente Research Information (Pure Portal)
FASE
STARTPAGE=IPA;ENDPAGE=4;TITLE=ICT.OPEN 2015
Proceedings of the 18th International Conference on Fundamental Approaches to Software Engineering (FASE 2015), 202-217
STARTPAGE=202;ENDPAGE=217;TITLE=Proceedings of the 18th International Conference on Fundamental Approaches to Software Engineering (FASE 2015)
University of Twente Research Information (Pure Portal)
Writing correct parallel programs becomes more and more difficult as the complexity and heterogeneity of processors increase. This issue is addressed by parallelising compilers. Various compiler directives can be used to tell these compilers where to
Publikováno v:
AICCSA
We address the parallelization of the determination of all optimal solutions (DAOS) for the 1D array partitioning problem, an easy combinatorial optimization problem that may be solved by dynamic programming. It turns out that the designed DAOS algor