Impact of fault prediction on checkpointing strategies

Authors: Aupy, Guillaume; Robert, Yves; Vivien, Frédéric; Zaidouni, Dounia
Year of publication: 2012
Subject:
Document type: Working Paper
Description: This paper deals with the impact of fault prediction techniques on checkpointing strategies. We extend the classical analysis of Young and Daly to account for a fault prediction system, which is characterized by its recall and its precision, and which provides either exact or window-based time predictions. We succeed in deriving the optimal value of the checkpointing period (thereby minimizing the waste of resource usage due to checkpoint overhead) in all scenarios. These results make it possible to analytically assess the key parameters that impact the performance of fault predictors at very large scale. In addition, the results of this analytical evaluation are corroborated by a comprehensive set of simulations, thereby demonstrating the validity of the model and the accuracy of the results.
Comment: 20 pages
Database: arXiv
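
Note: The description refers to the classical analysis of Young and Daly and to a predictor characterized by its recall and precision. As a rough illustration of those baseline quantities only, the sketch below computes Young's first-order period, sqrt(2 * C * MTBF), under a simplified waste model that ignores downtime and recovery, together with the standard definitions of recall and precision. It does not reproduce the paper's prediction-aware formulas, and every function and parameter name here is hypothetical, chosen for illustration.

```python
import math


def young_period(checkpoint_cost: float, mtbf: float) -> float:
    """Young's first-order approximation of the checkpointing period.

    checkpoint_cost: time to write one checkpoint (C), in seconds.
    mtbf: platform mean time between failures, in seconds.
    """
    if checkpoint_cost <= 0 or mtbf <= 0:
        raise ValueError("checkpoint_cost and mtbf must be positive")
    return math.sqrt(2.0 * checkpoint_cost * mtbf)


def first_order_waste(period: float, checkpoint_cost: float, mtbf: float) -> float:
    """Simplified first-order waste: C/T + T/(2 * MTBF).

    Assumption: downtime and recovery costs are ignored, so this is only
    the textbook baseline, not the model used in the paper.
    """
    if period <= 0:
        raise ValueError("period must be positive")
    return checkpoint_cost / period + period / (2.0 * mtbf)


def recall(true_positives: int, false_negatives: int) -> float:
    """Fraction of actual faults that the predictor announced."""
    return true_positives / (true_positives + false_negatives)


def precision(true_positives: int, false_positives: int) -> float:
    """Fraction of announced predictions that were actual faults."""
    return true_positives / (true_positives + false_positives)


if __name__ == "__main__":
    C, MTBF = 600.0, 86400.0  # hypothetical values: 10 min checkpoint, 1 day MTBF
    T = young_period(C, MTBF)
    print(f"period: {T:.1f} s, waste: {first_order_waste(T, C, MTBF):.4f}")
```

Under this simplified model the period returned by young_period minimizes first_order_waste, which is easy to check numerically by evaluating the waste at shorter and longer periods.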