Showing 1 - 7 of 7 for search: '"Ian Masliah"'
Author:
Stanimire Tomov, Ian Masliah, Jack Dongarra, Ahmad Abdelfattah, Joel Falcou, Marc Baboulin, Azzam Haidar
Published in:
Parallel Computing. 81:1-21
Expressing scientific computations in terms of BLAS, and in particular the general dense matrix-matrix multiplication (GEMM), is of fundamental importance for obtaining high performance portability across architectures. However, GEMMs for small matri
Author:
Florian Lemaitre, Ian Masliah, Boris Gaillard, Thomas Romera, Lionel Lacassagne, Manuel Bouyer, Quentin L. Meunier, Andrea Petreto
Published in:
DASIP 2019-The Conference on Design and Architectures for Signal and Image Processing
DASIP 2019-The Conference on Design and Architectures for Signal and Image Processing, Oct 2019, Montréal, Canada
DASIP
International audience; Many embedded applications rely on video processing or on video visualization. Noisy video is thus a major issue for such applications. However, video denoising requires a lot of computational effort and most of the state-of-t
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b42db94e5ede51e5bab4b395e41f48e4
https://hal.archives-ouvertes.fr/hal-02343597/file/DASIP19_RTEVD.pdf
Published in:
WPMVP'19 Proceedings of the 5th Workshop on Programming Models for SIMD/Vector Processing
WPMVP'19-5th Workshop on Programming Models for SIMD/Vector Processing
WPMVP'19-5th Workshop on Programming Models for SIMD/Vector Processing, Feb 2019, Washington, United States. pp.4:1--4:8, ⟨10.1145/3303117.3306164⟩
International audience; Connected Component Labeling (CCL) is a fundamental algorithm in computer vision, and is often required for real-time applications. It consists in assigning a unique number to each connected component of a binary image. In rec
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3a4c8f8a95f6ce459a34d57c13730ae2
https://hal.archives-ouvertes.fr/hal-02049029
Published in:
The 2018 International Conference on High Performance Computing & Simulation (HPCS 2018)-HPCS 2018
The 2018 International Conference on High Performance Computing & Simulation (HPCS 2018)-HPCS 2018, Jul 2018, Orléans, France. pp.531-538, ⟨10.1109/HPCS.2018.00089⟩
HPCS
International audience; From a high-level point of view, developers define objects they manipulate in terms of structures or classes. For example, a pixel may be represented as a structure of three color components: red, green, blue and an image as
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::93e4c11925864e3f5774c9263f317329
https://hal.archives-ouvertes.fr/hal-01915529/file/datalayouts.pdf
Author:
Marc Baboulin, Azzam Haidar, Stanimire Tomov, Ian Masliah, Jack Dongarra, Ahmad Abdelfattah, Joel Falcou
Published in:
Lecture Notes in Computer Science
22nd International Conference on Parallel and Distributed Computing (Euro-Par 2016)
22nd International Conference on Parallel and Distributed Computing (Euro-Par 2016), Aug 2016, Grenoble, France. pp.659-671, ⟨10.1007/978-3-319-43659-3_48⟩
Euro-Par 2016: Parallel Processing ISBN: 9783319436586
Euro-Par
International audience; The use of the general dense matrix-matrix multiplication (GEMM) is fundamental for obtaining high performance in many scientific computing applications. GEMMs for small matrices (of sizes less than 32) however, are not suffic
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b0204cafa8fb413b9a7ac684950149c8
https://hal.archives-ouvertes.fr/hal-01409286/file/main.pdf
Published in:
[Research Report] RR-8780, Inria Saclay Ile de France; Paris-Sud XI. 2015
MCSoC
International audience; GPGPUs and other accelerators are becoming a mainstream asset for high-performance computing. Raising the programmability of such hardware is essential to enable users to discover, master and subsequently use accelerators in d
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::dee9f1e40f861c28be0d04ff5731420f
https://hal.inria.fr/hal-01204661/file/RR-8780.pdf
Author:
Tzanio V. Kolev, Jack Dongarra, Marc Baboulin, Ian Masliah, Ahmad Abdelfattah, Ian Karlin, Veselin Dobrev, Stanimire Tomov, Azzam Haidar, Joel Falcou, Christopher Earl
Published in:
ICCS
Procedia Computer Science
International Conference on Computational Science 2016 (ICCS 2016)
International Conference on Computational Science 2016 (ICCS 2016), Jun 2016, San Diego, CA, United States. pp.108-118, ⟨10.1016/j.procs.2016.05.302⟩
Abdelfattah, A, Baboulin, M, Dobrev, V, Dongarra, J, Earl, C, Falcou, J, Haidar, A, Karlin, I, Kolev, T, Masliah, I & Tomov, S 2016, ' High-performance tensor contractions for GPUs ', Procedia Computer Science, vol. 80, pp. 108-118 . https://doi.org/10.1016/j.procs.2016.05.302
International audience; We present a computational framework for high-performance tensor contractions on GPUs. High-performance is difficult to obtain using existing libraries, especially for many independent contractions where each contraction is ve