SSDStreamer: Specializing I/O Stack for Large-Scale Machine Learning

Autor: Jeonghun Gong, Shine Kim, Jonghyun Bae, Wenjing Jin, Hakbeom Jang, Jae W. Lee, Tae Jun Ham, Jaeyoung Jang, Jinkyu Jeong
Rok vydání: 2019
Předmět:
Zdroj: IEEE Micro. 39:73-81
ISSN: 1937-4143
0272-1732
DOI: 10.1109/mm.2019.2930497
Popis: This article presents SSDStreamer, an SSD-based caching system for large-scale machine learning. By using DRAM as stream buffer, instead of an upper-level cache, SSDStreamer significantly outperforms state-of-the-art multilevel caching systems on Apache Spark, while requiring much less DRAM capacity.
Databáze: OpenAIRE