HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Découvrabilité et réutilisation de données produites par des workflows : un cas d’usage en génomique

Abstract : Workflow systems have played an important role in facilitating the reproducibility of scientific experiments, yet, little work has been devoted to enhance the reuse of produced data. We argue that these intermediate data should be considered as first-order objects, which are worthy of preservation and publication. Not only will this save computational resources, but more importantly it will ease and accelerate the evaluation of new hypotheses. To help scientists annotate such produced data, we exploit multiple sources of information : i) provenance information captured during the execution of workflows, and ii) domain annotations provided by semantic catalogs of tools, such as Bio.Tools. Finally, we show, on a real bioinformatics scenario, how provenance graphs can be transformed and synthesized, for human and machine use.
Complete list of metadata

Contributor : Florent Breuil Connect in order to contact the contributor
Submitted on : Tuesday, June 15, 2021 - 9:35:55 AM
Last modification on : Wednesday, April 27, 2022 - 4:39:28 AM
Long-term archiving on: : Thursday, September 16, 2021 - 6:14:05 PM


Files produced by the author(s)


  • HAL Id : emse-03260542, version 1


Alban Gaignard, Hala Skaf-Molli, Khalid Belhajjame. Découvrabilité et réutilisation de données produites par des workflows : un cas d’usage en génomique. Journées Francophones d'Ingénierie des Connaissances (IC) Plate-Forme Intelligence Artificielle (PFIA'21), Jun 2021, Bordeaux, France. pp 73-80. ⟨emse-03260542⟩



Record views


Files downloads