Zobrazeno 1 - 10
of 13
pro vyhledávání: '"Shanyuan Gao"'
Publikováno v:
International Journal of Reconfigurable Computing, Vol 2012 (2012)
As the number of cores per discrete integrated circuit (IC) device grows, the importance of the network on chip (NoC) increases. However, the body of research in this area has focused on discrete IC devices alone which may or may not serve the high-p
Externí odkaz:
https://doaj.org/article/a0a401da9da0404f90c699e132e82df1
Autor:
Qianyuan Ran, Xiaowei Jiang, Yingya Zhang, Shanyuan Gao, Pengcheng Li, Heng Pan, Lingbo Tang, Liuyihan Song, Jie Zhang, Pan Pan, Fei Feng, Hao Li, Yong Li, Shaochuang Wang, Zhisheng Xia, Guohui Wang, Jianbo Dong, Xin Long, Zheng Cao, Yiqun Guo
Publikováno v:
IEEE Micro. 41:85-92
Distributed systems have been widely adopted for deep neural networks model training. However, the scalability of distributed training systems is largely bounded by the communication cost. We design a highly efficient collective communication library
Autor:
Sen Ma, Shanyuan Gao
Publikováno v:
ReConFig
Heterogeneous computing systems have been the focus of major efforts in pursuit of Exascale computing over the past few years. These heterogeneous computing systems, often contain various accelerators, can significantly improve the computation perfor
Publikováno v:
ReConFig
In this paper, we introduce a new design flow and architecture that lets programmers replace synthesis with compilation to create custom accelerators within data center and warehouse scale computers that include reconfigurable many core architectures
Autor:
Jeremy Chritz, Shanyuan Gao
Publikováno v:
ReConFig
The recent release of Altera's SDK for OpenCL has greatly eased the development of FPGA-based systems. Research have shown performance improvements brought by OpenCL using a single FPGA device. However, to meet the objectives of high performance comp
Publikováno v:
FCCM
This paper designed a MPI-like Message Passing Engine (MPE) as part of the on-chip network, providing point-to-point and collective communication primitives in hardware. On one hand, the MPE offloads the communication workload from the general proces
Publikováno v:
International Journal of Reconfigurable Computing, Vol 2012 (2012)
As the number of cores per discrete integrated circuit (IC) device grows, the importance of the network on chip (NoC) increases. However, the body of research in this area has focused on discrete IC devices alone which may or may not serve the high-p
Publikováno v:
FCCM
Traditional approaches to evaluating a system's vulnerability to Single Event Upsets (SEUs) require elaborate and costly radiation beam testing or time-consuming simulation. While beam testing represents definitive evidence of a processor's susceptib
Publikováno v:
FPT
This paper demonstrates the benefits and pit-falls of implementing the collective communication operation reduce in the reconfigurable resources of an FPGA device across a cluster of all-FPGA compute nodes. Specifically, the communication and computa
Publikováno v:
2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines (FCCM); 2011, p154-161, 8p