Výsledky vyhledávání

Function/Kernel Vectorization via Loop Vectorizer

Autor: Matt Masten, Eric N. Garcia, Evgeniy Tyurin, Hideki Saito, Konstantina Mitropoulou

Publikováno v: 2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC).

Currently, there are three vectorizers in the LLVM trunk: Loop Vectorizer, SLP Vectorizer, and Load-Store Vectorizer. There is a need for vectorizing functions/kernels: 1) Function calls are an integral part of programming real world application code

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::dca2180748213d504da3c76b71f06900
https://doi.org/10.1109/llvm-hpc.2018.8639483

Zobrazit plný text záznamu

LLVM Framework and IR Extensions for Parallelization, SIMD Vectorization and Offloading

Autor: Xinmin Tian, Hideki Saito, Ernesto Su, Abhinav Gaba, Matt Masten, Eric Garcia, Ayal Zaks

Publikováno v: 2016 Third Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC).

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::08c37a5cfb8c29811dd40d2cde6d001e
https://doi.org/10.1109/llvm-hpc.2016.008

Zobrazit plný text záznamu

Effective SIMD Vectorization for Intel Xeon Phi Coprocessors

Autor: Xinmin Tian, Serguei V. Preis, Sergey S. Kozhukhov, Aleksei G. Cherkasov, Matt Masten, Nikolay Panchenko, Hideki Saito, Eric N. Garcia

Publikováno v: Scientific Programming, Vol 2015 (2015)

Efficiently exploiting SIMD vector units is one of the most important aspects in achieving high performance of the application code running on Intel Xeon Phi coprocessors. In this paper, we present several effective SIMD vectorization techniques such

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cc00bbf38ea2b88060f5223429e9eb23

Zobrazit plný text záznamu

Practical SIMD Vectorization Techniques for Intel® Xeon Phi Coprocessors

Autor: Serguei V. Preis, Xinmin Tian, Matt Masten, Hideki Saito, Eric N. Garcia, Aleksei G. Cherkasov, Sergey S. Kozhukhov, Nikolay Panchenko

Publikováno v: IPDPS Workshops

Intel® Xeon Phi coprocessor is based on the Intel® Many Integrated Core (Intel® MIC) architecture, which is an innovative new processor architecture that combines abundant thread parallelism with long SIMD vector units. Efficiently exploiting SIMD

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::b4e7ee3442187df996ec4456889ae1c8
https://doi.org/10.1109/ipdpsw.2013.245

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání