In Situ Storage Layout Optimization for AMR Spatio-temporal Read Accesses

Autor: Bin Dong, Dharshi Devendran, Xiaocheng Zou, Kesheng Wu, Houjun Tang, Scott Klasky, Steve Harenberg, Daniel F. Martin, Nagiza F. Samatova, Wenzhao Zhang, Suren Byna, David Trebotich
Rok vydání: 2016
Předmět:
Zdroj: ICPP
DOI: 10.1109/icpp.2016.53
Popis: Analyses of large simulation data often concentrate on regions in space and in time that contain important information. As simulations adopt Adaptive Mesh Refinement (AMR), the data records from a region of interest could be widely scattered on storage devices and accessing interesting regions results in significantly reduced I/O performance. In this work, we study the organization of block-structured AMR data on storage to improve performance of spatio-temporal data accesses. AMR has a complex hierarchical multi-resolution data structure that does not fit easily with the existing approaches that focus on uniform mesh data. To enable efficient AMR read accesses, we develop an in situ data layout optimization framework. Our framework automatically selects from a set of candidate layouts based on a performance model, and reorganizes the data before writing to storage. We evaluate this framework with three AMR datasets and access patterns derived from scientific applications. Our performance model is able to identify the best layout scheme and yields up to a 3X read performance improvement compared to the original layout. Though it is not possible to turn all read accesses into contiguous reads, we are able to achieve 90% of contiguous read throughput with the optimized layouts on average.
Databáze: OpenAIRE