Showing 1 - 10 of 32 for search: '"application-level checkpointing"'
Author:
Keita Teranishi, Patricia González, George Bosilca, Aurelien Bouteiller, Nuria Losada, María Martín
Published in:
RUC: Repositorio da Universidade da Coruña
Universidade da Coruña (UDC)
[Abstract] The growth in the number of computational resources used by high-performance computing (HPC) systems leads to an increase in failure rates. Fault-tolerant techniques will become essential for long-running applications executing in future e
Published in:
RUC: Repositorio da Universidade da Coruña
Universidade da Coruña (UDC)
[Abstract] The resilience approach generally used in high-performance computing (HPC) relies on coordinated checkpoint/restart, a global rollback of all the processes that are running the application. However, in many instances, the failure has a mor
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::011b4fcf909698e01b3ce01e9af17ca5
http://hdl.handle.net/2183/27584
Author:
Kurt, Mehmet Can
Fault-tolerance on parallel systems has always been a big challenge for High Performance Computing (HPC), and hence it has drawn a lot of attention of the community. This pursuit in fault-tolerant systems is now important more than ever due to the re
External link:
http://rave.ohiolink.edu/etdc/view?acc_num=osu1437390499
Author:
Medeiros, B., João Sobral
Published in:
COMPUTING AND INFORMATICS; Vol 31, No 1 (2012): Computing and Informatics; 89-101
Scopus-Elsevier
CIÊNCIAVITAE
Migrating traditional scientific applications to computational Grids requires programming tools that can help programmers update application behaviour to this kind of platforms. Computational Grids are particularly suited for long running scientific
Published in:
Repositório Científico de Acesso Aberto de Portugal
Repositório Científico de Acesso Aberto de Portugal (RCAAP)
Migrating traditional scientific applications to computational Grids requires programming tools that can help programmers update application behaviour to this kind of platforms. Computational Grids are particularly suited for long running scientific
External link:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::f124f2d67dca7254ee786d25658b44f5
Published in:
ICPP
Enabling applications for computational Grids requires new approaches to develop applications that can effectively cope with resource volatility. Applications must be resilient to resource faults, adapting the behaviour to available resources. This p
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e46582f564ffe20faab24141d2f9daaa
https://hdl.handle.net/1822/15597
Published in:
The 11th International Conference on Parallel and Distributed Computing, Applications and Technologies-PDCAT 2010
The 11th International Conference on Parallel and Distributed Computing, Applications and Technologies-PDCAT 2010, Dec 2010, Wuhan, China. ⟨10.1109/PDCAT.2010.89⟩
Distributing applications over PC clusters to speed-up or size-up the execution is now commonplace. Yet efficiently tolerating faults of these systems is a major issue. To ease the addition of checkpoint-based fault tolerance
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3c2418d2f85692f0385076c6a323aabf
https://hal.inria.fr/inria-00548953