Showing 1 - 10 of 22 for search: '"Milind B. Girkar"'
Author:
Mikhail Smelyanskiy, Jatin Chhugani, Changkyu Kim, Nadathur Satish, Hideki Saito, Pradeep Dubey, Milind B. Girkar, Rakesh Krishnaiyer
Published in:
ISCA
Current processor trends of integrating more cores with wider SIMD units, along with a deeper and complex memory hierarchy, have made it increasingly more challenging to extract performance from applications. It is believed by some that traditional a
Author:
Arun Kejariwal, Xinmin Tian, Alexandru Nicolau, Alexander V. Veidenbaum, Hideki Saito, Milind B. Girkar
Published in:
ACM Transactions on Embedded Computing Systems. 8:1-34
Advances in the silicon technology have enabled increasing support for hardware parallelism in embedded processors. Vector units, multiple processors/cores, multithreading, special-purpose accelerators such as DSPs or cryptographic engines, or a comb
Author:
Hong Jiang, Gautham N. Chinya, Guei-Yuan Lueh, Jamison D. Collins, Nick Y. Yang, Hong Wang, Perry Wang, Xinmin Tian, Milind B. Girkar
Published in:
PLDI
Future mainstream microprocessors will likely integrate specialized accelerators, such as GPUs, onto a single die to achieve better performance and power efficiency. However, it remains a keen challenge to program such a heterogeneous multicore platf
Published in:
Concurrency and Computation: Practice and Experience. 18:997-1007
Finding the minimum or maximum value in an array forms an important step in a variety of applications. This paper discusses vectorization schemes that take advantage of the streaming-SIMD-extensions in commonly used floating-point MIN and MAX reducti
Published in:
Parallel Computing. 31:960-983
This paper presents the design and implementation of a parallelization framework and OpenMP runtime support in Intel C++ & Fortran compilers for exploiting nested parallelism in applications using OpenMP pragmas or directives. We conduct the performa
Published in:
The Computer Journal. 48:588-601
State-of-the-art multiprocessor systems pose several difficulties: (i) the user has to parallelize the existing serial code; (ii) explicitly threaded programs using a thread library are not portable; (iii) writing efficient multi-threaded programs re
Published in:
International Journal of Parallel Programming. 30:65-98
Recent extensions to the Intel® Architecture feature the SIMD technique to enhance the performance of computational intensive applications that perform the same operation on different elements in a data set. To date, much of the code that exploits t
Author:
Xinmin Tian, Clark Nelson, Nikolay Panchenko, Serguei V. Preis, Sergey S. Kozhukhov, Aleksei G. Cherkasov, Robert Geva, Hideki Saito, Milind B. Girkar
Published in:
IPDPS Workshops
SIMD vectorization has received significant attention in the past decade as an important method to accelerate scientific applications, media and embedded applications on SIMD architectures such as Intel® SSE, AVX, and IBM* AltiVec. However, most of
Author:
Arun Kejariwal, Utpal Banerjee, Constantine D. Polychronopoulos, Alexander V. Veidenbaum, Milind B. Girkar, Xinmin Tian, Hideki Saito, Alexandru Nicolau
Published in:
Conf. Computing Frontiers
Multi-cores such as the Intel Core 2 Duo, AMD Barcelona and IBM POWER6 are becoming ubiquitous. The number of cores and the resulting hardware parallelism is poised to increase rapidly in the foreseeable future. Nested thread-level speculative parall
Author:
Arun Kejariwal, Alexander V. Veidenbaum, Milind B. Girkar, Xinmin Tian, Hideki Saito, Utpal Banerjee, Constantine D. Polychronopoulos, Alexandru Nicolau
Published in:
WOSP/SIPEW
Thread-level speculation (TLS) has been proposed as a means to parallelize difficult-to-analyze sequential codes. In this paper, we present a realistic measure of the performance potential of call-graph level TLS, using the SPEC CPU2006 benchmark sui