Best Practices in Accessing Tape-Resident Data in HPSS

Author: Yu David, Che Guangwei, Chou Tim, Novakov Ognian
Language: English
Year of publication: 2019
Subject:
Source: EPJ Web of Conferences, Vol 214, p 04022 (2019)
Document type: article
ISSN: 2100-014X
DOI: 10.1051/epjconf/201921404022
Description: Tape is an excellent choice for archival storage because of its capacity, cost per GB, and long retention intervals, but its main drawback is slow access time, a consequence of its being a sequential medium. Modern enterprise tape drives now support Recommended Access Ordering (RAO), which is designed to reduce data recall times. BNL SDCC's mass storage system currently holds more than 100 PB of data on tape, managed by HPSS. Starting with HPSS version 7.5.1, a new feature called “Tape Order Recall” (TOR) has been introduced. It supports both RAO and non-RAO drives. File access performance can be increased by 30% to 60% over random file access. Prior to HPSS 7.5.1, we had been using an in-house scheduling software called ERADAT, which accesses files in order of their logical position. It has demonstrated strong performance over the past decade of use at BNL. In this paper we present a series of test results and compare the performance of TOR and ERADAT under different configurations, to show how effective TOR (RAO) and ERADAT are and which is the best solution for recalling data from SDCC's tape storage.
Database: Directory of Open Access Journals