Výsledky vyhledávání - "Evgeny Bolotin"

HMG: Extending Cache Coherence Protocols Across Modern Hierarchical Multi-GPU Systems

Autor: Daniel Lustig, Evgeny Bolotin, Aamer Jaleel, Xiaowei Ren, Oreste Villa, David Nellans

Publikováno v: HPCA

Prior work on GPU cache coherence has shown that simple hardware-or software-based protocols can be more than sufficient. However, in recent years, features such as multi-chip modules have added deeper hierarchy and non-uniformity into GPU memory sys

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::3ca358ba512c4d28366288df97c89806
https://doi.org/10.1109/hpca47549.2020.00054

Zobrazit plný text záznamu

Understanding the Future of Energy Efficiency in Multi-Module GPUs

Autor: Evgeny Bolotin, David Nellans, Carole-Jean Wu, Akhil Arunkumar

Publikováno v: HPCA

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::e632854c44f9a0cdae2d4f8c534d56ed
https://doi.org/10.1109/hpca.2019.00063

Zobrazit plný text záznamu

Combining HW/SW Mechanisms to Improve NUMA Performance of Multi-GPU Systems

Autor: Eiman Ebrahimi, Oreste Villa, Evgeny Bolotin, Aamer Jaleel, Vinson Young, David Nellans

Publikováno v: MICRO

Historically, improvement in GPU performance has been tightly coupled with transistor scaling. As Moore's Law slows down, performance of single GPUs may ultimately plateau. To continue GPU performance scaling, multiple GPUs can be connected using sys

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::ea3fd042e610d5acac011c3433b99ec6
https://doi.org/10.1109/micro.2018.00035

Zobrazit plný text záznamu

Designing Efficient Heterogeneous Memory Architectures

Autor: Oreste Villa, Alex Ramirez, Mike O'Connor, Stephen W. Keckler, Evgeny Bolotin, David Nellans

Publikováno v: IEEE Micro. 35:60-68

Recent packaging technologies that enable DRAM chips to be stacked inside the processor package or on top of the processor chip can lower DRAM energy-per-bit costs, provide wider interfaces, and offer higher bandwidth. However, these technologies are

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::1f4587bce6a08fc010b228da591f1f6a
https://doi.org/10.1109/mm.2015.72

Zobrazit plný text záznamu

MCM-GPU

Autor: Evgeny Bolotin, Oreste Villa, Eiman Ebrahimi, David Nellans, Aamer Jaleel, Akhil Arunkumar, Ugljesa Milic, Carole-Jean Wu, Benjamin Cho

Publikováno v: ISCA

Historically, improvements in GPU-based high performance computing have been tightly coupled to transistor scaling. As Moore's law slows down, and the number of transistors per die no longer grows at historical rates, the performance curve of single

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::19091114e8a5f3a9456f633ede009d2d
https://doi.org/10.1145/3079856.3080231

Zobrazit plný text záznamu

Beyond the socket: NUMA-aware GPUs

Autor: Aamer Jaleel, Oreste Villa, Akhil Arunkumar, Ugljesa Milic, Alex Ramirez, Eiman Ebrahimi, Evgeny Bolotin, David Nellans

Publikováno v: UPCommons. Portal del coneixement obert de la UPC
Universitat Politècnica de Catalunya (UPC)
MICRO
Recercat. Dipósit de la Recerca de Catalunya
instname

GPUs achieve high throughput and power efficiency by employing many small single instruction multiple thread (SIMT) cores. To minimize scheduling logic and performance variance they utilize a uniform memory system and leverage strong data parallelism

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9eb7db3e25c87a542b4146cf5f044686

Zobrazit plný text záznamu

CLARA

Autor: Stephen W. Keckler, Evgeny Bolotin, Joel Emer, Mike O'Connor, Niladrish Chatterjee, Aditya Agrawal

Publikováno v: MEMSYS

With increasing DRAM densities, the performance and energy overheads of refresh operations are increasingly significant. When the system is active, refresh commands render DRAM banks unavailable for increasing periods of time. These refresh operation

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::6647bc4185f164085eb319057aa95a52
https://doi.org/10.1145/2989081.2989084

Zobrazit plný text záznamu

A case for toggle-aware compression for GPU systems

Autor: Evgeny Bolotin, Gennady Pekhimenko, Todd C. Mowry, Onur Mutlu, Stephen W. Keckler, Nandita Vijaykumar

Publikováno v: HPCA

Data compression can be an effective method to achieve higher system performance and energy efficiency in modern data-intensive applications by exploiting redundancy and data similarity. Prior works have studied a variety of data compression techniqu

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::5b20a3ec8b60a9d66a5eb7d54431f480
https://doi.org/10.1109/hpca.2016.7446064

Zobrazit plný text záznamu

Exploring the limits of GPGPU scheduling in control flow bound applications

Autor: Roman Malits, Avinoam Kolodny, Evgeny Bolotin, Avi Mendelson

Publikováno v: ACM Transactions on Architecture and Code Optimization

GPGPUs are optimized for graphics, for that reason the hardware is optimized for massively data parallel applications characterized by predictable memory access patterns and little control flow. For such applications' e.g., matrix multiplication, GPG

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::82ac49c14a611eda55c1e6d508cc08f6
https://doi.org/10.1145/2086696.2086708

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání