Zobrazeno 1 - 8
of 8
pro vyhledávání: '"TaiYu Cheng"'
Publikováno v:
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences. (3):497-508
Reducing power consumption is a crucial factor making industrial designs, such as mobile SoCs, competitive. Voltage scaling (VS) is the classical yet most effective technique that contributes to quadratic power reduction. A recent design technique ca
Autor:
Tohru Ishihara, TaiYu Cheng, Jun Nagayama, Masanori Hashimoto, Yoichi Momiyama, Yutaka Masuda
Publikováno v:
IEICE Transactions on Fundamentals of Electronics, Communications and Computer Sciences. (3):509-517
This work proposes a design methodology that saves the power dissipation under voltage over-scaling (VOS) operation. The key idea of the proposed design methodology is to combine critical path isolation (CPI) and bit-width scaling (BWS) under the con
Critical Path Isolation and Bit-Width Scaling Are Highly Compatible for Voltage Over-Scalable Design
Autor:
TaiYu Cheng, Jun Nagayama, Yutaka Masuda, Yoichi Momiyama, Masanori Hashimoto, Tohru Ishihara
Publikováno v:
DATE
This work proposes a design methodology that saves the power under voltage over-scaling (VOS) operation. The key idea of the proposed design methodology is to combine critical path isolation (CPI) and bit-width scaling (BWS) under the constraint of c
Autor:
Bobo Zhan, Dong Wang, Xingquan Huang, Wenhan Wu, Taiyu Cheng, Ce Xu, Guoming Ma, Wei Wang, Xin Weifeng
Publikováno v:
2021 IEEE Electrical Insulation Conference (EIC).
Autor:
Masanori Hashimoto, TaiYu Cheng
Publikováno v:
ISCAS
Training DNN mostly relies on GPUs with FP32 format. While FP16 is acknowledged for its advantage of high computation and memory efficiencies for training DNN, the training must be accompanied with techniques dedicated for a particular dataset. There
Publikováno v:
ASP-DAC
This paper proposes a design optimization methodology that can achieve a mode-wise voltage scalable (MWVS) design with applying the activation-aware slack assignment (ASA). Originally, ASA allocates the timing margin of critical paths with a stochast
Recently, emerging “edge computing” moves data and services from the cloud to nearby edge servers to achieve short latency and wide bandwidth, and solve privacy concerns. However, edge servers, often embedded with GPU processors, highly demand a
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1282c91905518b3eb330f6403fc81901
http://t2r2.star.titech.ac.jp/cgi-bin/publicationinfo.cgi?q_publication_content_number=CTT100827089
http://t2r2.star.titech.ac.jp/cgi-bin/publicationinfo.cgi?q_publication_content_number=CTT100827089
Publikováno v:
PATMOS
This paper proposes to adopt logarithm-approximate multiplier (LAM) for multiply-accumulate (MAC) computation in neural network (NN) training engine, where LAM approximates a floating-point multiplication as an addition resulting in smaller delay, fe