Zobrazeno 1 - 4
of 4
pro vyhledávání: '"Matt Masten"'
Publikováno v:
2018 IEEE/ACM 5th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC).
Currently, there are three vectorizers in the LLVM trunk: Loop Vectorizer, SLP Vectorizer, and Load-Store Vectorizer. There is a need for vectorizing functions/kernels: 1) Function calls are an integral part of programming real world application code
Publikováno v:
2016 Third Workshop on the LLVM Compiler Infrastructure in HPC (LLVM-HPC).
Autor:
Xinmin Tian, Serguei V. Preis, Sergey S. Kozhukhov, Aleksei G. Cherkasov, Matt Masten, Nikolay Panchenko, Hideki Saito, Eric N. Garcia
Publikováno v:
Scientific Programming, Vol 2015 (2015)
Efficiently exploiting SIMD vector units is one of the most important aspects in achieving high performance of the application code running on Intel Xeon Phi coprocessors. In this paper, we present several effective SIMD vectorization techniques such
Autor:
Serguei V. Preis, Xinmin Tian, Matt Masten, Hideki Saito, Eric N. Garcia, Aleksei G. Cherkasov, Sergey S. Kozhukhov, Nikolay Panchenko
Publikováno v:
IPDPS Workshops
Intel® Xeon Phi coprocessor is based on the Intel® Many Integrated Core (Intel® MIC) architecture, which is an innovative new processor architecture that combines abundant thread parallelism with long SIMD vector units. Efficiently exploiting SIMD