Zobrazeno 1 - 10
of 15
pro vyhledávání: '"DONGUP KWON"'
Autor:
Suyeon Hur, Seongmin Na, Dongup Kwon, Joonsung Kim, Andrew Boutros, Eriko Nurvitadhi, Jangwoo Kim
Publikováno v:
ACM Transactions on Architecture and Code Optimization. 20:1-24
Deep neural networks (DNNs) have become key solutions in the natural language processing (NLP) domain. However, the existing accelerators customized for their narrow target models cannot support diverse NLP models. Therefore, naively running complex
SmartFVM: A Fast, Flexible, and Scalable Hardware-based Virtualization for Commodity Storage Devices
Publikováno v:
ACM Transactions on Storage. 18:1-27
A computational storage device incorporating a computation unit inside or near its storage unit is a highly promising technology to maximize a storage server’s performance. However, to apply such computational storage devices and take their full po
Publikováno v:
IEEE Symposium on Security and Privacy
Security bugs in CPUs have critical security impacts to all the computation related hardware and software components as it is the core of the computation. In spite of the fact that architecture and security communities have explored a vast number of
Publikováno v:
DAC
The increasing size of recurrent neural networks (RNNs) makes it hard to meet the growing demand for real-time AI services. For low-latency RNN serving, FPGA-based accelerators can leverage specialized architectures with optimized dataflow. However,
Publikováno v:
ISCA
A cost-effective multi-tenant neural network execution is becoming one of the most important design goals for modern neural network accelerators. For example, as emerging AI services consist of many heterogeneous neural network executions, a cloud pr
Publikováno v:
IEEE Computer Architecture Letters. 17:47-50
SSD arrays are becoming popular in modern storage servers as a primary storage, and they aim to reduce the high cost of the devices by performing inline deduplications. Unfortunately, existing software-based inline deduplications cannot achieve the d
Autor:
Dongup Kwon, Ali Jafari, Abirami Prabhakaran, Pranavi Appana, Prerna Budhkar, Mishali Naik, Andrew Boutros, Eriko Nurvitadhi, Karthik Gururaj, Sheffield David B
Publikováno v:
FPT
We present a CPU server with multiple FPGAs that is purely software-programmable by a unified framework to enable flexible implementation of modern real-life complex AI that scales to large model size (100M+ parameters), while delivering real-time in
Autor:
Bogdan Pasca, Dongup Kwon, Sergey Gribok, Gregory K. Chen, Eriko Nurvitadhi, Jaewoong Sim, Knag Phil, Martin Langhammer, Ram Krishnamurthy, Phillip Tomson, Debbie Marr, Aravind Dasu, Sumbul Huseyin Ekin, Ali Jafari, Raghavan Kumar, Andrew Boutros
Publikováno v:
FCCM
Interactive intelligent services, such as smart web search, are important datacenter workloads. They rely on dataintensive deep learning (DL) algorithms with strict latency constraints and thus require balancing both data movement and compute capabil
Autor:
Eriko Nurvitadhi, Raghavan Kumar, Martin Langhammer, Ali Jafari, Gregory K. Chen, Jaewoong Sim, Phillip Tomson, Sergey Gribok, Debbie Marr, Ram Krishnamurthy, Aravind Dasu, Knag Phil, Andrew Boutros, Bogdan Pasca, Dongup Kwon, Sumbul Huseyin Ekin
Publikováno v:
FPGA
Interactive intelligent services (e.g., smart web search) are becoming essential datacenter workloads. They rely on data-intensive artificial intelligence (AI) algorithms that do not use batch computation due to their tight latency constraints. Since
Publikováno v:
HPCA