Zobrazeno 1 - 10
of 69
pro vyhledávání: '"Kevin J. Barker"'
Publikováno v:
IEEE Internet Computing. 27:7-14
Autor:
Cheng Tan, Nicolas Bohm Agostini, Tong Geng, Chenhao Xie, Jiajia Li, Ang Li, Kevin J. Barker, Antonino Tumeo
Publikováno v:
2022 IEEE International Symposium on High-Performance Computer Architecture (HPCA).
Autor:
Shuaiwen Leon Song, Kevin J. Barker, Jiajia Li, Nathan R. Tallent, Xu Liu, Jieyang Chen, Ang Li
Publikováno v:
IEEE Transactions on Parallel and Distributed Systems. 31:94-110
High performance multi-GPU computing becomes an inevitable trend due to the ever-increasing demand on computation capability in emerging domains such as deep learning, big data and planet-scale simulations. However, the lack of deep understanding on
Autor:
Jieyang Chen, Mark Raugas, Jesun Sahariar Firoz, Ang Li, Shuaiwen Leon Song, Chenhao Xie, Kevin J. Barker, Jiajia Li
Publikováno v:
ICPP
Designing efficient and scalable sparse linear algebra kernels on modern multi-GPU based HPC systems is a challenging task due to significant irregular memory references and workload imbalance across GPUs. These challenges are particularly compounded
Autor:
Marco Minutoli, Cheng Tan, Nicolas Bohm Agostini, Chenhao Xie, Ang Li, Tong Geng, Vito Giovanni Castellana, Antonino Tumeo, Jeff L. Zhang, Kevin J. Barker
Publikováno v:
ASAP
Reconfigurable architectures are today experiencing a renewed interest for their ability to provide specialization without sacrificing the capability to adapt to disparate workloads. Coarse-grained reconfigurable arrays (CGRAs) provide higher flexibi
Publikováno v:
DATE
Coarse-grained reconfigurable arrays (CGRAs), loosely defined as arrays of functional units interconnected through a network-on-chip (NoC), provide higher flexibility than domain-specific ASIC accelerators while offering increased hardware efficiency
Autor:
Mahesh Lakshminarasimhan, Xiaolong Wu, Catherine Olschanowsky, Kevin J. Barker, Jiajia Li, Ang Li
Publikováno v:
IISWC
Tensor computations present significant performance challenges that impact a wide spectrum of applications ranging from machine learning, healthcare analytics, social network analysis, data mining to quantum chemistry and signal processing. Efforts t
Publikováno v:
ICCD
Coarse-grained reconfigurable arrays (CGRAs), loosely defined as arrays of functional units (e.g., adder, subtractor, multiplier, divider, or larger multi-operation units, but smaller than a general-purpose core) interconnected through a Network-on-C
Publikováno v:
HPEC
Today's data-driven analytics and machine learning workload have been largely driven by the General-Purpose Graphics Processing Units (GPGPUs). To accelerate dense matrix multiplications on the GPUs, Tensor Core Units (TCUs) have been introduced in r
Publikováno v:
ICPP
This paper presents a workload classification framework that accurately discriminates illicit computation from authorized workloads on GPU-accelerated HPC systems at runtime. As such systems become increasingly powerful and widely-adopted, attackers