Zobrazeno 1 - 10
of 11
pro vyhledávání: '"Andi Drebes"'
Autor:
Adam Siemieniuk, Lorenzo Chelini, Andi Drebes, Henk Corporaal, Martin Kong, Asif Ali Khan, Tobias Grosser, Jeronimo Castrillon
Publikováno v:
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 41:1674-1686
Memristive devices promise an alternative approach toward non-Von Neumann architectures, where specific computational tasks are performed within the memory devices. In the Machine Learning (ML) domain, crossbar arrays of resistive devices have shown
Autor:
Henk Corporaal, Albert Cohen, Nicolas Vasilache, Lorenzo Chelini, Tobias Grosser, Oleksandr Zinenko, Andi Drebes
Publikováno v:
CGO 2021 : International Symposium on Code Generation and Optimization
CGO 2021 : International Symposium on Code Generation and Optimization, Feb 2021, Seoul / Virtual, South Korea
CGO
CGO 2021 : International Symposium on Code Generation and Optimization, Feb 2021, Seoul / Virtual, South Korea
CGO
International audience; Multi-level intermediate representations (IR) show great promise for lowering the design costs for domain-specific compilers by providing a reusable, extensible, and non-opinionated framework for expressing domain-specific and
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::795d5bb4bf4ddca92237b0b721acca01
https://hal.inria.fr/hal-03139764
https://hal.inria.fr/hal-03139764
Publikováno v:
9th International Workshop on Polyhedral Compilation Techniques
9th International Workshop on Polyhedral Compilation Techniques, Jan 2019, Valencia, Spain
University of Manchester-PURE
9th International Workshop on Polyhedral Compilation Techniques, Jan 2019, Valencia, Spain
University of Manchester-PURE
International audience; Polyhedral techniques are, when applicable, an effective instrument for automatic parallelization and data locality optimization of sequential programs. This paper motivates their adoption in OpenStream, a task-parallel stream
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::40f53f4a96cf2bbbf613e161d0ec6bfa
https://hal.inria.fr/hal-02370558/file/IMPACT_2019_paper_7.pdf
https://hal.inria.fr/hal-02370558/file/IMPACT_2019_paper_7.pdf
Publikováno v:
IPDPS Workshops
2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
2018 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW)
Writing programs for heterogeneous platforms is challenging, since programmers must deal with multiple programming models, partition work for CPUs and accelerators with different compute capabilities, and manage memory in multiple distinct address sp
Publikováno v:
ACM Transactions on Architecture and Code Optimization
Publikováno v:
Scaling OpenMP for Exascale Performance and Portability ISBN: 9783319655772
Analyzing the behavior of OpenMP programs and their interaction with the hardware is essential for locating performance bottlenecks and identifying performance optimization opportunities. However, current architectures only provide a small number of
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::46eebea517cd006502dcf973cc0f80bb
https://doi.org/10.1007/978-3-319-65578-9_18
https://doi.org/10.1007/978-3-319-65578-9_18
Publikováno v:
PACT'16-ACM/IEEE Conference on Parallel Architectures and Compilation Techniques
PACT'16-ACM/IEEE Conference on Parallel Architectures and Compilation Techniques, Sep 2016, Haifa, Israel. pp.125-137, ⟨10.1145/2967938.2967946⟩
Drebes, A, Pop, A, Heydemann, K, Cohen, A & Drach, N 2016, Scalable Task Parallelism for NUMA: A Uniform Abstraction for Coordinated Scheduling and Memory Management . in International Conference on Parallel Architecture and Compilation Techniques . pp. 125-137, International Conference on Parallel Architecture and Compilation Techniques, Haifa, Israel, 11/09/16 . https://doi.org/10.1145/2967938.2967946
PACT
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation-PACT 16
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation-PACT '16
PACT'16-ACM/IEEE Conference on Parallel Architectures and Compilation Techniques, Sep 2016, Haifa, Israel. pp.125-137, ⟨10.1145/2967938.2967946⟩
Drebes, A, Pop, A, Heydemann, K, Cohen, A & Drach, N 2016, Scalable Task Parallelism for NUMA: A Uniform Abstraction for Coordinated Scheduling and Memory Management . in International Conference on Parallel Architecture and Compilation Techniques . pp. 125-137, International Conference on Parallel Architecture and Compilation Techniques, Haifa, Israel, 11/09/16 . https://doi.org/10.1145/2967938.2967946
PACT
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation-PACT 16
Proceedings of the 2016 International Conference on Parallel Architectures and Compilation-PACT '16
Dynamic task-parallel programming models are popular on shared-memory systems, promising enhanced scalability, load balancing and locality. These promises, however, are undermined by non-uniform memory access (NUMA). We show that using NUMA-aware tas
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1d346ceee07936ae0d91923ffa5de0a0
https://inria.hal.science/hal-01425743/document
https://inria.hal.science/hal-01425743/document
Publikováno v:
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Apr 2016, Uppsala, Sweden. pp.274-283, ⟨10.1109/ISPASS.2016.7482102⟩
2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
ISPASS
IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), Apr 2016, Uppsala, Sweden. pp.274-283, ⟨10.1109/ISPASS.2016.7482102⟩
2016 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
ISPASS
International audience; This paper studies the interactive visualization and post-mortem analysis of execution traces generated by task-parallel programs. We focus on the detection of performance anomalies inaccessible to state-of-the-art performance
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::99374c5f73347f6c694aea2592bf24e5
https://hal.inria.fr/hal-01425892/file/paper.pdf
https://hal.inria.fr/hal-01425892/file/paper.pdf
Publikováno v:
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Mar 2016, Barcelona, Spain. ACM New York, NY, USA, Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp.44:1-44:2, 2016, ⟨10.1145/2851141.2851193⟩
PPOPP
ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Mar 2016, Barcelona, Spain. ACM New York, NY, USA, Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp.44:1-44:2, 2016, ⟨10.1145/2851141.2851193⟩
PPOPP
Dynamic task parallelism is a popular programming model on shared-memory systems. Compared to data parallel loop-based concurrency, it promises enhanced scalability, load balancing and locality. These promises, however, are undermined by non-uniform
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::771deef1bb21b18086f0290cb41de50b
https://hal.sorbonne-universite.fr/hal-01365746
https://hal.sorbonne-universite.fr/hal-01365746
Publikováno v:
OpenMP: Memory, Devices, and Tasks ISBN: 9783319455495
IWOMP
IWOMP 2016-12th International Workshop on OpenMP
IWOMP 2016-12th International Workshop on OpenMP, Oct 2016, Nara, Japan. pp.237-250, ⟨10.1007/978-3-319-45550-1_17⟩
International Workshop on OpenMP, IWOMP16: OpenMP: Memory, Devices, and Tasks
Lecture Notes in Computer Science
Lecture Notes in Computer Science-OpenMP: Memory, Devices, and Tasks
Drebes, A, Bréjon, J-B, Pop, A, Heydemann, K & Cohen, A 2016, Language-Centric Performance Analysis of OpenMP Programs with Aftermath . in N Maruyama, B R De Supinski & M Wahib (eds), OpenMP : memory, devices, and tasks : 12th International Workshop on OpenMP, IWOMP 2016, Nara, Japan, October 5-7, 2016, proceedings . Lecture Notes in Computer Science, vol. 9903, Springer Nature, pp. 237-250, International Workshop on OpenMP, Nara, Japan, 5/10/16 . https://doi.org/10.1007/978-3-319-45550-1_17
IWOMP
IWOMP 2016-12th International Workshop on OpenMP
IWOMP 2016-12th International Workshop on OpenMP, Oct 2016, Nara, Japan. pp.237-250, ⟨10.1007/978-3-319-45550-1_17⟩
International Workshop on OpenMP, IWOMP16: OpenMP: Memory, Devices, and Tasks
Lecture Notes in Computer Science
Lecture Notes in Computer Science-OpenMP: Memory, Devices, and Tasks
Drebes, A, Bréjon, J-B, Pop, A, Heydemann, K & Cohen, A 2016, Language-Centric Performance Analysis of OpenMP Programs with Aftermath . in N Maruyama, B R De Supinski & M Wahib (eds), OpenMP : memory, devices, and tasks : 12th International Workshop on OpenMP, IWOMP 2016, Nara, Japan, October 5-7, 2016, proceedings . Lecture Notes in Computer Science, vol. 9903, Springer Nature, pp. 237-250, International Workshop on OpenMP, Nara, Japan, 5/10/16 . https://doi.org/10.1007/978-3-319-45550-1_17
International audience; We present a new set of tools for the language-centric performance analysis and debugging of OpenMP programs that allows programmers to relate dynamic information from parallel execution to OpenMP constructs. Users can visuali
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::6149fdac7c4e6f17f1a4d7a2591f259b
https://doi.org/10.1007/978-3-319-45550-1_17
https://doi.org/10.1007/978-3-319-45550-1_17