Understanding collaborative studies through interoperable workflow provenance

Autor: Altintas, I., Anand, M.K., Crawl, D., Bowers, S., Belloum, A., Missier, P., Ludäscher, B., Goble, C.A., Sloot, P.M.A., McGuinness, D.L., Michaelis, J.R., Moreau, L.
Přispěvatelé: Computational Science Lab (IVI, FNWI), System and Network Engineering (IVI, FNWI)
Jazyk: angličtina
Rok vydání: 2010
Předmět:
Zdroj: Provenance and Annotation of Data and Processes: third International Provenance and Annotation Workshop, IPAW 2010, Troy, NY, USA, June 15-16, 2010 : revised selected papers, 42-58
STARTPAGE=42;ENDPAGE=58;TITLE=Provenance and Annotation of Data and Processes
Lecture Notes in Computer Science ISBN: 9783642178184
IPAW
Popis: The provenance of a data product contains information about how the product was derived, and is crucial for enabling scientists to easily understand, reproduce, and verify scientific results. Currently, most provenance models are designed to capture the provenance related to a single run, and mostly executed by a single user. However, a scientific discovery is often the result of methodical execution of many scientific workflows with many datasets produced at different times by one or more users. Further, to promote and facilitate exchange of information between multiple workflow systems supporting provenance, the Open Provenance Model (OPM) has been proposed by the scientific workflow community. In this paper, we describe a new query model that captures implicit user collaborations. We show how this model maps to OPM and helps to answer collaborative queries, e.g., identifying combined workflows and contributions of users collaborating on a project based on the records of previous workflow executions. We also adopt and extend the high-level Query Language for Provenance (QLP) with additional constructs, and show how these extensions allow non-expert users to express collaborative provenance queries against this model easily and concisely. Furthermore, we adopt the Provenance Challenge 3 (PC3) workflows as a collaborative and interoperable usecase scenario, where different stages of the workflow are executed in three different workflow environments - Kepler, Taverna, and WSVLAM. Through this usecase, we demonstrate how we can establish and understand collaborative studies through interoperable workflow provenance.
Databáze: OpenAIRE