Zobrazeno 1 - 10
of 50
pro vyhledávání: '"Christophe Dubach"'
Autor:
Mai Jacob Peng, Christophe Dubach
Publikováno v:
Proceedings of the 32nd ACM SIGPLAN International Conference on Compiler Construction.
Publikováno v:
Schlaak, C, Juang, T-H & Dubach, C 2022, ' Memory-Aware Functional IR for Higher-Level Synthesis of Accelerators ', ACM Transactions on Architecture and Code Optimization, vol. 19, no. 2, 16 . https://doi.org/10.1145/3501768
Specialized accelerators deliver orders of a magnitude of higher performance than general-purpose processors. The ever-changing nature of modern workloads is pushing the adoption of Field Programmable Gate Arrays (FPGAs) as the substrate of choice. H
Autor:
Zhitao Lin, Christophe Dubach
Publikováno v:
Proceedings of the 8th ACM SIGPLAN International Workshop on Libraries, Languages and Compilers for Array Programming.
Publikováno v:
Mogers, N, Li, L, Radu, V & Dubach, C 2022, Mapping Parallelism in a Functional IR through Constraint Satisfaction : A Case Study on Convolution for Mobile GPUs . in B Egger & A Smith (eds), Proceedings of the 31st ACM SIGPLAN International Conference on Compiler Construction . pp. 218-230, ACM SIGPLAN 2022 International Conference on Compiler Construction, 2/04/22 . https://doi.org/10.1145/3497776.3517777
Graphics Processing Units (GPUs) are notoriously hard to optimize for manually. What is needed are good automatic code generators and optimizers. Accelerate, Futhark and Lift demonstrated that a functional approach is well suited for this challenge.
Publikováno v:
Euro-Par 2022: Parallel Processing ISBN: 9783031125966
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::10b3caf3c95001b275af2ec58177ab65
https://doi.org/10.1007/978-3-031-12597-3_21
https://doi.org/10.1007/978-3-031-12597-3_21
Autor:
Ronald G. Dreslinski, Siying Feng, Christophe Dubach, Michael O'Boyle, Subhankar Pal, Aporva Amarnath
Publikováno v:
MICRO
Pal, S, Amarnath, A, Feng, S, O'Boyle, M, Dreslinski, R & Dubach, C 2021, SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator . in The 54th Annual IEEE/ACM International Symposium on Microarchitecture Proceedings . pp. 1005-1021, 54th IEEE/ACM International Symposium on Microarchitecture, Athens, Greece, 18/10/21 . https://doi.org/10.1145/3466752.3480134
Pal, S, Amarnath, A, Feng, S, O'Boyle, M, Dreslinski, R & Dubach, C 2021, SparseAdapt: Runtime Control for Sparse Linear Algebra on a Reconfigurable Accelerator . in The 54th Annual IEEE/ACM International Symposium on Microarchitecture Proceedings . pp. 1005-1021, 54th IEEE/ACM International Symposium on Microarchitecture, Athens, Greece, 18/10/21 . https://doi.org/10.1145/3466752.3480134
Dynamic adaptation is a post-silicon optimization technique that adapts the hardware to workload phases. However, current adaptive approaches are oblivious to implicit phases that arise from operating on irregular data, such as sparse linear algebra
Publikováno v:
Software Testing, Verification and Reliability. 32
Publikováno v:
Stoltzfus, L, Hagedorn, B, Steuwer, M, Gorlatch, S & Dubach, C 2020, ' Tiling Optimizations for Stencil Computations Using Rewrite Rules in Lift ', ACM Transactions on Architecture and Code Optimization, vol. 16, no. 4, 52, pp. 52:2-52:25 . https://doi.org/10.1145/3368858
Stencil computations are a widely used type of algorithm, found in applications from physical simulations to machine learning. Stencils are embarrassingly parallel, therefore fit on modern hardware such as Graphic Processing Units perfectly. Although
Publikováno v:
IPDPS
The software and hardware landscape of high performance computing is expanding faster than computational scientists can take advantage of new frameworks and platforms. In an ideal world, simulation codes would be written once in a high-level manner a
Publikováno v:
EuroMLSys@EuroSys
The choice of convolutional routines (or primitives) for implementing the operations in a Convolutional Neural Network (CNN) has a tremendous impact over the inference time. To optimise the execution latency for a target system, a lengthy profiling s