Zobrazeno 1 - 10
of 11
pro vyhledávání: '"Benjamin Klenk"'
Autor:
George Michelogiannakis, Benjamin Klenk, Brandon Cook, Min Yee Teh, Madeleine Glick, Larry Dennison, Keren Bergman, John Shalf
Publikováno v:
ACM Transactions on Architecture and Code Optimization, vol 19, iss 2
The expected halt of traditional technology scaling is motivating increased heterogeneity in high-performance computing (HPC) systems with the emergence of numerous specialized accelerators. As heterogeneity increases, so does the risk of underutiliz
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c0653e28787b2d03eb6080694bf9a148
https://escholarship.org/uc/item/73x617x8
https://escholarship.org/uc/item/73x617x8
Autor:
Benjamin Klenk, Madeleine Glick, Eiman Ebrahimi, Mehrdad Khani, Manya Ghobadi, Ziyi Zhu, Mohammad Alizadeh, Keren Bergman, Amin Vahdat
Publikováno v:
SIGCOMM
This paper proposes optical network interconnects as a key enabler for building high-bandwidth ML training clusters with strong scaling properties. Our design, called SiP-ML, accelerates the training time of popular DNN models using silicon photonics
Publikováno v:
ISCA
The slowdown of single-chip performance scaling combined with the growing demands of computing ever larger problems efficiently has led to a renewed interest in distributed architectures and specialized hardware. Dedicated accelerators for common or
Autor:
Benjamin Klenk, Larry R. Dennison
Publikováno v:
OFC
Training deep neural networks demands vast amounts of computation, provided by large distributed systems. The increasing demand for bandwidth will exceed the limits of electrical and non-integrated optical signaling and will require integrated optics
Publikováno v:
IPDPS
Accelerators, such as GPUs, have proven to be highly successful in reducing execution time and power consumption of compute-intensive applications. Even though they are already used pervasively, they are typically supervised by general-purpose CPUs,
Autor:
Holger Fröning, Benjamin Klenk
Publikováno v:
Lecture Notes in Computer Science ISBN: 9783319586663
ISC
ISC
The scale of applications and computing systems is tremendously increasing and needs to increase even more to realize exascale systems. As the number of nodes keeps growing, communication has become key to high performance.
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::245ec13492ede026d3661c6ad5bcb246
https://doi.org/10.1007/978-3-319-58667-0_12
https://doi.org/10.1007/978-3-319-58667-0_12
Intra-GPU synchronization is a problem for GPU controlled communication.Options, based on dynamic parallelism provide on-device synchronization.GPU controlled communication have a lower performance than CPU assisted approaches.Relieving the CPU from
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::fe878a034c07a6763fc42353aa487402
https://publica.fraunhofer.de/handle/publica/245304
https://publica.fraunhofer.de/handle/publica/245304
Publikováno v:
ISPASS
Accelerated computing has become pervasive for increasing the computational power and energy efficiency in terms of GFLOPs/Watt. For application areas with highest demands, for instance high performance computing, data warehousing and high performanc
Publikováno v:
E2SC@SC
GPUs are widely used in high performance computing, due to their high computational power and high performance per Watt. Still, one of the main bottlenecks of GPU-accelerated cluster computing is the data transfer between distributed GPUs. This not o
Publikováno v:
CCGRID
GPUs gain high popularity in High Performance Computing, due to their massive parallelism and high performance per Watt. Despite their popularity, data transfer between multiple GPUs in a cluster remains a problem. Most communication models require t