AgentTeamwork: Coordinating grid-computing jobs with mobile agents
Autor: | Koichi Kashiwagi, Shinya Kobayashi, Munehiro Fukuda |
---|---|
Rok vydání: | 2006 |
Předmět: | |
Zdroj: | Applied Intelligence. 25:181-198 |
ISSN: | 1573-7497 0924-669X |
DOI: | 10.1007/s10489-006-9653-6 |
Popis: | AgentTeamwork is a grid-computing middleware system that dispatches a collection of mobile agents to coordinate a user job over remote computing nodes in a decentralized manner. Its utmost focus is to maintain high availability and dynamic balancing of distributed computing resources to a parallel-computing job. For this purpose, a mobile agent is assigned to each process engaged in the same job, monitors its execution at a different machine, takes its periodical execution snapshot, moves it to a lighter-loaded machine, and resumes it from the latest snapshot upon an accidental crash. The system also restores broken inter-process communication involved in the same job using its error-recoverable socket and mpiJava libraries in collaboration among mobile agents. We have implemented the first version of our middleware including a mobile agent execution platform, error-recoverable socket and mpiJava API libraries, a job wrapper program, and several types of mobile agents such as commander, resource, sentinel, and bookkeeper agents, each orchestrating, allocating resources to, monitoring and maintaining snapshots of a user process respectively. This paper presents AgentTeamwork's execution model, its implementation techniques, and our performance evaluation using the Java Grande benchmark test programs. |
Databáze: | OpenAIRE |
Externí odkaz: |