A Learning Architecture for Scheduling Workflow Applications in the Cloud

Autor: Enda Barrett, Jim Duggan, Enda Howley
Přispěvatelé: ~
Rok vydání: 2011
Předmět:
Workflow management software
Computer science
Computational costs
Distributed computing
Processor scheduling
Cloud computing
Storage sites
Quality of service requirements
Optimal scheduling
Workflow engine
Workflow technology
Scheduling (computing)
Quality of service
Pay-per-use fashion
Precedence constraints
Computational resources
Cloud workflow scheduling
Utility-type market model
Schedules
Scheduling
Workflow scheduling
business.industry
Markov processes
Computational processing
Software architecture
Storage management
Environmental state
Biological cells
Data transmission costs
Genetic algorithms
Scheduling workflow applications
Workflow execution process
Bayesian model learning
Workflow
Genetic algorithm
Scientific workflows
Workflow based applications dependency
Workflow schedulers
Minimisation
Markov decision process
Learning architecture
business
Workflow management system
Workflow tasks
Zdroj: ECOWS
DOI: 10.1109/ecows.2011.27
Popis: Conference paper The scheduling of workflow applications involves the mapping of individual workflow tasks to computational resources, based on a range of functional and non-functional quality of service requirements. Workflow applications such as scientific workflows often require extensive computational processing and generate significant amounts of experimental data. The emergence of cloud computing has introduced a utility-type market model, where computational resources of varying capacities can be procured on demand, in a pay-per-use fashion. In workflow based applications dependencies exist amongst tasks which requires the generation of schedules in accordance with defined precedence constraints. These constraints pose a difficult planning problem, where tasks must be scheduled for execution only once all their parent tasks have completed. In general the two most important objectives of workflow schedulers are the minimisation of both cost and make span. The cost of workflow execution consists of both computational costs incurred from processing individual tasks, and data transmission costs. With scientific workflows potentially large amounts of data must be transferred between compute and storage sites. This paper proposes a novel cloud workflow scheduling approach which employs a Markov Decision Process to optimally guide the workflow execution process depending on environmental state. In addition the system employs a genetic algorithm to evolve workflow schedules. The overall architecture is presented, and initial results indicate the potential of this approach for developing viable workflow schedules on the Cloud. Science Foundation Ireland non-peer-reviewed
Databáze: OpenAIRE