Zobrazeno 1 - 10
of 28
pro vyhledávání: '"Kazuaki Ishizaki"'
Autor:
Michael J. Klaiber, George D. Gristede, Shih-Hsien Lo, Hiroshi Inoue, Leland Chang, Christos Vezyrtzis, Jungwook Choi, Gary W. Maier, Fanchieh Yee, Shubham Jain, Brian W. Curran, Jintao Zhang, Mingu Kang, Howard M. Haynie, Mauricio J. Serrano, Pong-Fei Lu, Silvia Melitta Mueller, Matthew M. Ziegler, Bruce M. Fleischer, Kazuaki Ishizaki, Kailash Gopalakrishnan, Michael R. Scheuermann, Ankur Agarwal, Xiao Sun, Sunil Shukla, Thomas W. Fox, Vijayalakshmi Srinivasan, Tina Babinsky, Swagath Venkataramani, Michael A. Guillorn, Ching Zhou, Nianzheng Cao, Eri Ogawa, Naigang Wang, Moriyoshi Ohara, Joel Abraham Silberman, Jinwook Oh, Marcel Schaal, Chia-Yu Chen, Wei Wang
Publikováno v:
Proceedings of the IEEE. 108:2232-2250
Advances in deep neural networks (DNNs) and the availability of massive real-world data have enabled superhuman levels of accuracy on many AI tasks and ushered the explosive growth of AI workloads across the spectrum of computing devices. However, th
Autor:
Scot H. Rider, Martin Lutz, Moriyoshi Ohara, Pong-Fei Lu, Monodeep Kar, Xiao Sun, Kailash Gopalakrishnan, Jie Yang, Hoang Tran, Wei Wang, Michael A. Guillorn, Marcel Schaal, Ankur Agrawal, Xin Zhang, Joel Abraham Silberman, Sunil Shukla, Nianzheng Cao, James Bonano, Zhibin Ren, Sanchari Sen, Siyu Koswatta, Kyu-hyoun Kim, Mingu Kang, Swagath Venkataramani, Eri Ogawa, Vijayalakshmi Srinivasan, Hiroshi Inoue, Matt Ziegler, Howard M. Haynie, Shubham Jain, Vinay Velji Shah, Allison Allain, Jintao Zhang, Matthew Cohen, Jungwook Choi, Kerstin Schelm, Jinwook Oh, Li Yulong, Chia-Yu Chen, Ching Zhou, Naigang Wang, Jinwook Jung, Sae Kyu Lee, Silvia Melitta Mueller, Kazuaki Ishizaki, Bruce M. Fleischer, Michael R. Scheuermann, Vidhi Zalani, Brian W. Curran, Leland Chang, Mauricio J. Serrano, Ashish Ranjan, Alberto Mannari, Robert Casatuta
Publikováno v:
ISCA
The growing prevalence and computational demands of Artificial Intelligence (AI) workloads has led to widespread use of hardware accelerators in their execution. Scaling the performance of AI accelerators across generations is pivotal to their succes
Autor:
Leland Chang, Marcel Schaal, Mauricio J. Serrano, Eri Ogawa, Vijayalakshmi Srinivasan, Jintao Zhang, Moriyoshi Ohara, Kailash Gopalakrishnan, Swagath Venkataramani, Jungwook Choi, Wei Wang, Kazuaki Ishizaki, Hiroshi Inoue
Publikováno v:
IEEE Micro. 39:102-111
The ubiquitous adoption of systems specialized for AI requires bridging two seemingly conflicting challenges—the need to deliver extreme processing efficiencies while employing familiar programming interfaces, making them compelling even for non-ex
Autor:
Kazuaki Ishizaki
Publikováno v:
ICPE
Big data processing frameworks have received attention because of the importance of high performance computation. They are expected to quickly process a huge amount of data in memory with a simple programming model in a cluster. Apache Spark is becom
Autor:
Kazuaki Ishizaki, Wei Wang, Moriyoshi Ohara, Vijayalakshmi Srinivasan, Jungwook Choi, Eri Ogawa, Hiroshi Inoue, Kailash Gopalakrishnan, Swagath Venkataramani
Publikováno v:
COOL CHIPS
This paper presents the design and implementation of a compiler for a deep neural network accelerator that provides high performance and energy efficiency. The compiler allows deep learning frameworks, such as TensorFlow, to exploit the accelerator h
Publikováno v:
IPDPS
Apache Spark is a framework for distributed computing that supports the map-reduce programming model. The SQL module of Spark contains Datasets, i.e., distributed collections of records stored in a serialized low-level format in a manually managed ch
Autor:
Kazuaki Ishizaki, Eduard Ayguadé, Moriyoshi Ohara, Ahsan Javed Awan, Mats Brorsson, Vladimir Vlassov
Publikováno v:
MEMSYS
While cluster computing frameworks are continuously evolving to provide real-time data analysis capabilities, Apache Spark has managed to be at the forefront of big data analytics for being a unified framework for both, batch and stream data processi
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c9ace3b9ddf3f3de20226911217db15b
http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-211727
http://urn.kb.se/resolve?urn=urn:nbn:se:kth:diva-211727
Autor:
Kazuaki Ishizaki
Publikováno v:
SEM4HPC@HPDC
Modern emerging workloads such as analytics, graph, and deep learning, rapidly appear. These are written by non-Ninja programmers. Modern hardware platforms are becoming complex due to deployments of hardware accelerators such as GPGPU and FPGA. It i
Publikováno v:
PACT
GPUs can enable significant performance improvements for certain classes of data parallel applications and are widely used in recent computer systems. However, GPU execution currently requires explicit low-level operations such as 1) managing memory
Publikováno v:
PPPJ
High-level languages such as Java increase both productivity and portability with productive language features such as managed runtime, type safety, and precise exception semantics. Additionally, Java 8 provides parallel stream APIs with lambda expre