Výsledky vyhledávání

OCC: An Automated End-to-End Machine Learning Optimizing Compiler for Computing-In-Memory

Autor: Adam Siemieniuk, Lorenzo Chelini, Andi Drebes, Henk Corporaal, Martin Kong, Asif Ali Khan, Tobias Grosser, Jeronimo Castrillon

Publikováno v: IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 41:1674-1686

Memristive devices promise an alternative approach toward non-Von Neumann architectures, where specific computational tasks are performed within the memory devices. In the Machine Learning (ML) domain, crossbar arrays of resistive devices have shown

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::07fd196b3bf59488a8bf079914306dbb
https://doi.org/10.1109/tcad.2021.3101464

Zobrazit plný text záznamu

Progressive Raising in Multi-level IR

Autor: Henk Corporaal, Albert Cohen, Nicolas Vasilache, Lorenzo Chelini, Tobias Grosser, Oleksandr Zinenko, Andi Drebes

Publikováno v: CGO 2021 : International Symposium on Code Generation and Optimization
CGO 2021 : International Symposium on Code Generation and Optimization, Feb 2021, Seoul / Virtual, South Korea
CGO

International audience; Multi-level intermediate representations (IR) show great promise for lowering the design costs for domain-specific compilers by providing a reusable, extensible, and non-opinionated framework for expressing domain-specific and

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::795d5bb4bf4ddca92237b0b721acca01
https://hal.inria.fr/hal-03139764

Zobrazit plný text záznamu

Beyond Polyhedral Analysis of OpenStream Programs

Autor: Nuno Miguel Nobre, Andi Drebes, Graham Riley, Antoniu Pop

Publikováno v: 9th International Workshop on Polyhedral Compilation Techniques
9th International Workshop on Polyhedral Compilation Techniques, Jan 2019, Valencia, Spain
University of Manchester-PURE

International audience; Polyhedral techniques are, when applicable, an effective instrument for automatic parallelization and data locality optimization of sequential programs. This paper motivates their adoption in OpenStream, a task-parallel stream

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::40f53f4a96cf2bbbf613e161d0ec6bfa
https://hal.inria.fr/hal-02370558/file/IMPACT_2019_paper_7.pdf

Zobrazit plný text záznamu

Leveraging Data-Flow Task Parallelism for Locality-Aware Dynamic Scheduling on Heterogeneous Platforms

Autor: Antoniu Pop, Andi Drebes, Osman Seckin Simsek

Publikováno v: IPDPS Workshops
2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)

Writing programs for heterogeneous platforms is challenging, since programmers must deal with multiple programming models, partition work for CPUs and accelerators with different compute capabilities, and manage memory in multiple distinct address sp

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0ce86f50f4c0746abe4bebd08c9a3b68
https://doi.org/10.1109/ipdpsw.2018.00093

Zobrazit plný text záznamu

Fuse

Autor: Richard Neill, Andi Drebes, Antoniu Pop

Publikováno v: ACM Transactions on Architecture and Code Optimization

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=r3c4b2081b22::d7a924ea08ed10bce16620a5f6f5583b

Zobrazit plný text záznamu

Accurate and Complete Hardware Profiling for OpenMP

Autor: Antoniu Pop, Richard Neill, Andi Drebes

Publikováno v: Scaling OpenMP for Exascale Performance and Portability ISBN: 9783319655772

Analyzing the behavior of OpenMP programs and their interaction with the hardware is essential for locating performance bottlenecks and identifying performance optimization opportunities. However, current architectures only provide a small number of

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::46eebea517cd006502dcf973cc0f80bb
https://doi.org/10.1007/978-3-319-65578-9_18

Zobrazit plný text záznamu

Scalable Task Parallelism for NUMA: A Uniform Abstraction for Coordinated Scheduling and Memory Management

Autor: Karine Heydemann, Nathalie Drach, Antoniu Pop, Albert Cohen, Andi Drebes

Publikováno v: PACT'16-ACM/IEEE Conference on Parallel Architectures and Compilation Techniques
PACT'16-ACM/IEEE Conference on Parallel Architectures and Compilation Techniques, Sep 2016, Haifa, Israel. pp.125-137, ⟨10.1145/2967938.2967946⟩
Drebes, A, Pop, A, Heydemann, K, Cohen, A & Drach, N 2016, Scalable Task Parallelism for NUMA: A Uniform Abstraction for Coordinated Scheduling and Memory Management . in International Conference on Parallel Architecture and Compilation Techniques . pp. 125-137, International Conference on Parallel Architecture and Compilation Techniques, Haifa, Israel, 11/09/16 . https://doi.org/10.1145/2967938.2967946
PACT
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation-PACT 16
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation-PACT '16

Dynamic task-parallel programming models are popular on shared-memory systems, promising enhanced scalability, load balancing and locality. These promises, however, are undermined by non-uniform memory access (NUMA). We show that using NUMA-aware tas

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1d346ceee07936ae0d91923ffa5de0a0
https://inria.hal.science/hal-01425743/document

Zobrazit plný text záznamu

Interactive visualization of cross-layer performance anomalies in dynamic task-parallel applications and systems

Autor: Antoniu Pop, Andi Drebes, Albert Cohen, Karine Heydemann

Publikováno v: IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Apr 2016, Uppsala, Sweden. pp.274-283, ⟨10.1109/ISPASS.2016.7482102⟩
2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
ISPASS

International audience; This paper studies the interactive visualization and post-mortem analysis of execution traces generated by task-parallel programs. We focus on the detection of performance anomalies inaccessible to state-of-the-art performance

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::99374c5f73347f6c694aea2592bf24e5
https://hal.inria.fr/hal-01425892/file/paper.pdf

Zobrazit plný text záznamu

NUMA-aware scheduling and memory allocation for data-flow task-parallel applications

Autor: Albert Cohen, Antoniu Pop, Karine Heydemann, Nathalie Drach, Andi Drebes

Publikováno v: ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Mar 2016, Barcelona, Spain. ACM New York, NY, USA, Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp.44:1-44:2, 2016, ⟨10.1145/2851141.2851193⟩
PPOPP

Dynamic task parallelism is a popular programming model on shared-memory systems. Compared to data parallel loop-based concurrency, it promises enhanced scalability, load balancing and locality. These promises, however, are undermined by non-uniform

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::771deef1bb21b18086f0290cb41de50b
https://hal.sorbonne-universite.fr/hal-01365746

Zobrazit plný text záznamu

Language-Centric Performance Analysis of OpenMP Programs with Aftermath

Autor: Karine Heydemann, Jean-Baptiste Bréjon, Albert Cohen, Antoniu Pop, Andi Drebes

Publikováno v: OpenMP: Memory, Devices, and Tasks ISBN: 9783319455495
IWOMP
IWOMP 2016-12th International Workshop on OpenMP
IWOMP 2016-12th International Workshop on OpenMP, Oct 2016, Nara, Japan. pp.237-250, ⟨10.1007/978-3-319-45550-1_17⟩
International Workshop on OpenMP, IWOMP16: OpenMP: Memory, Devices, and Tasks
Lecture Notes in Computer Science
Lecture Notes in Computer Science-OpenMP: Memory, Devices, and Tasks
Drebes, A, Bréjon, J-B, Pop, A, Heydemann, K & Cohen, A 2016, Language-Centric Performance Analysis of OpenMP Programs with Aftermath . in N Maruyama, B R De Supinski & M Wahib (eds), OpenMP : memory, devices, and tasks : 12th International Workshop on OpenMP, IWOMP 2016, Nara, Japan, October 5-7, 2016, proceedings . Lecture Notes in Computer Science, vol. 9903, Springer Nature, pp. 237-250, International Workshop on OpenMP, Nara, Japan, 5/10/16 . https://doi.org/10.1007/978-3-319-45550-1_17

International audience; We present a new set of tools for the language-centric performance analysis and debugging of OpenMP programs that allows programmers to relate dynamic information from parallel execution to OpenMP constructs. Users can visuali

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6149fdac7c4e6f17f1a4d7a2591f259b
https://doi.org/10.1007/978-3-319-45550-1_17

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání