Search results - "Kwon, Yongkee"

Report

Darwin: A DRAM-based Multi-level Processing-in-Memory Architecture for Data Analytics

Authors: Kim, Donghyuk; Kim, Jae-Young; Han, Wontak; Won, Jongsoon; Choi, Haerang; Kwon, Yongkee; Kim, Joo-Young

Processing-in-memory (PIM) architecture is an inherent match for data analytics application, but we observe major challenges to address when accelerating it using PIM. In this paper, we propose Darwin, a practical LRDIMM-based multi-level PIM archite

External link: http://arxiv.org/abs/2305.13970

Report

Near Data Acceleration with Concurrent Host Access

Authors: Cho, Benjamin Y.; Kwon, Yongkee; Lym, Sangkug; Erez, Mattan

Near-data accelerators (NDAs) that are integrated with main memory have the potential for significant power and performance benefits. Fully realizing these benefits requires the large available memory capacity to be shared between the host and the ND

External link: http://arxiv.org/abs/1908.06362

Report

Mini-batch Serialization: CNN Training with Inter-layer Data Reuse

Authors: Lym, Sangkug; Behroozi, Armand; Wen, Wei; Li, Ge; Kwon, Yongkee; Erez, Mattan

Training convolutional neural networks (CNNs) requires intense computations and high memory bandwidth. We find that bandwidth today is over-provisioned because most memory accesses in CNN training can be eliminated by rearranging computation to bette

External link: http://arxiv.org/abs/1810.00307

Software prefetching for memory-level parallelism

Author: Kwon, Yongkee

In computer systems, latency tolerance is the use of concurrency to achieve high performance in spite of high latency. Existing techniques to tolerate long memory latencies include data prefetching, out-of-order instruction execution, and multithread

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::aab8a40d9d495fdeeaa09c2c5f58a660

Periodical

A 1ynm 1.25V 8Gb 16Gb/s/Pin GDDR6-Based Accelerator-in-Memory Supporting 1TFLOPS MAC Operation and Various Activation Functions for Deep Learning Application

Published in: IEEE Journal of Solid-State Circuits; January 2023, Vol. 58, Issue 1, pp. 291-302 (12 pages)
