Search results - "Processament en paral·lel (Ordinadors)"

Mitigating the NUMA effect on task-based runtime systems

Authors: Maroñas Bravo, Marcos; Navarro Muñoz, Antoni; Ayguadé Parra, Eduard; Beltran Querol, Vicenç

Published in: The Journal of Supercomputing.

Processors with multiple sockets or chiplets are becoming more conventional. These kinds of processors usually expose a single shared address space. However, due to hardware restrictions, they adopt a NUMA approach, where each processor accesses loca

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::abd86186972399f62bedce017ddf0f2e
https://doi.org/10.1007/s11227-023-05164-9

DynAMO: Improving parallelism through dynamic placement of atomic memory operations

Authors: Soria Pardos, Víctor; Armejach Sanosa, Adrià; Mück, Tiago; Suárez Gracía, Dario; Joao, Jose A.; Rico, Alejandro; Moreto Planas, Miquel

With increasing core counts in modern multi-core designs, the overhead of synchronization jeopardizes the scalability and efficiency of parallel applications. To mitigate these overheads, modern cache-coherent protocols offer support for Atomic Memor

External link: https://explore.openaire.eu/search/publication?articleId=od______3484::c8887afabdf6bff87332c29de5fb139b
https://hdl.handle.net/2117/390752

A symbolic emulator for shuffle synthesis on the NVIDIA PTX code

Authors: Matsumura, Kazuaki; De Gonzalo, Simon Garcia; Peña, Antonio J.

Various kinds of applications take advantage of GPUs through automation tools that attempt to automatically exploit the available performance of the GPU's parallel architecture. Directive-based programming models, such as OpenACC, are one such method

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b28a27200c92d5a59b3fc0319f73ef95
https://hdl.handle.net/2117/384604

Task-Level Checkpointing System for Task-Based Parallel Workflows

Authors: Pere Vergés, Francesc Lordan, Jorge Ejarque, Rosa M. Badia

Published in: Euro-Par 2022: Parallel Processing Workshops, ISBN 9783031312083

Scientific applications are large and complex; task-based programming models are a popular approach to developing these applications due to their ease of programming and ability to handle complex workflows and distribute their workload across large i

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::793aa3c01f734f89df8fa4d25b4060ca
https://doi.org/10.1007/978-3-031-31209-0_19

Fast behavioural RTL simulation of 10B transistor SoC designs with Metro-MPI

Authors: López Paradís, Guillem; Li, Brian; Armejach Sanosa, Adrià; Wallentowitz, Stefan; Moreto Planas, Miquel; Balkind, Jonathan

Chips with tens of billions of transistors have become today's norm. These designs are straining our electronic design automation tools throughout the design process, requiring ever more computational resources. In many tools, parallelisation has imp

External link: https://explore.openaire.eu/search/publication?articleId=od______3484::1fc5ca8611102f71fe9d6afb6efcbc02
https://hdl.handle.net/2117/390396

Improving the performance of classical linear algebra iterative methods via hybrid parallelism

Authors: Pedro J. Martinez-Ferrer, Tufan Arslan, Vicenç Beltran

We propose fork-join and task-based hybrid implementations of four classical linear algebra iterative methods (Jacobi, Gauss-Seidel, conjugate gradient and biconjugate gradient stabilised) as well as variations of them. Algorithms are duly documented

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::eb9205c05f1c7c67ca0766908f115d3d

Automatic aggregation of subtask accesses for nested OpenMP-style tasks

Authors: Ali, Omar Shaaban Ibrahim; Aguilar Mena, Jimmy; Beltran Querol, Vicenç; Carpenter, Paul Matthew; Ayguadé Parra, Eduard; Labarta Mancho, Jesús José

Published in: 2022 IEEE 34th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD).

Task-based programming is a high performance and productive model to express parallelism. Tasks encapsulate work to be executed across multiple cores or offloaded to GPUs, FPGAs, other accelerators or other nodes. In order to maintain parallelism and

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dd232db6bcdc58f97165b4ca87314b1c
https://doi.org/10.1109/sbac-pad55451.2022.00042

The BioExcel methodology for developing dynamic, scalable, reliable and portable computational biomolecular workflows

Authors: Jorge Ejarque, Pau Andrio, Adam Hospital, Javier Conejero, Daniele Lezzi, Josep LL. Gelpi, Rosa M. Badia

Developing complex biomolecular workflows is not always straightforward. It requires tedious developments to enable the interoperability between the different biomolecular simulation and analysis tools. Moreover, the need to execute the pipelines on

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::0a36f17c561d526efb97bc98456a62a7
http://arxiv.org/abs/2208.14130

Seamless optimization of the GEMM kernel for task-based programming models

Authors: Lorenzon, Arthur F.; Marques, Sandro M. V. N.; Navarro Muñoz, Antoni; Beltran Querol, Vicenç

Published in: UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)

The general matrix-matrix multiplication (GEMM) kernel is a fundamental building block of many scientific applications. Many libraries such as Intel MKL and BLIS provide highly optimized sequential and parallel versions of this kernel. The parallel i

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f3d6d9a1d327b94ea65a21e179ee1840
https://doi.org/10.1145/3524059.3532385

XFeatur: Hardware Feature Extraction for DNN Auto-tuning

Authors: Sierra Acosta, Jorge; Diavastos, Andreas; González Colás, Antonio María

Published in: 2022 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS)
UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
Proceedings of the IEEE International Symposium on Performance Analysis of Systems and Software

In this work, we extend the auto-tuning process of the state-of-the-art TVM framework with XFeatur, a tool that extracts new meaningful hardware-related features that improve the quality of the representation of the search space and consequently impr

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::29fb21c854094d197a4c30f6fc4ed48e
https://doi.org/10.1109/ispass55109.2022.00013
