HDFS Optimization Strategy Based On Hierarchical Storage of Hot and Cold Data

Autor: Leixiao Li, Zhiqiang Ma, Yuxin Guan
Rok vydání: 2019
Předmět:
Zdroj: Procedia CIRP. 83:415-418
ISSN: 2212-8271
Popis: HDFS has been the core storage service of Hadoop big data platform for many years, with the continuous development of deep learning, artificial intelligence and other technologies, the amount of data continues to grow, the initial batch processing offline scenario did not fit the user’s needs, the data’s access heat appears big difference. Therefore, the HDFS storage system needs to be optimized according to the degree of heat and cold of the data. The heterogeneous storage and erasure-correction code technology of HDFS can improve the storage efficiency to some extent, but users need to specify the behavior of the storage for that particular data. Therefore, it will cause a waste of cluster resources. SSM systems can follow a predetermined set of rules, according to the degree of heat and cold, it uses different storage strategies to optimize the data, so as to improve the efficiency of the whole storage system.
Databáze: OpenAIRE