Skip to Main content Skip to Navigation
Conference papers

Découvrabilité et réutilisation de données produites par des workflows : un cas d’usage en génomique

Abstract : Workflow systems have played an important role in facilitating the reproducibility of scientific experiments, yet, little work has been devoted to enhance the reuse of produced data. We argue that these intermediate data should be considered as first-order objects, which are worthy of preservation and publication. Not only will this save computational resources, but more importantly it will ease and accelerate the evaluation of new hypotheses. To help scientists annotate such produced data, we exploit multiple sources of information : i) provenance information captured during the execution of workflows, and ii) domain annotations provided by semantic catalogs of tools, such as Bio.Tools. Finally, we show, on a real bioinformatics scenario, how provenance graphs can be transformed and synthesized, for human and machine use.
Complete list of metadata

https://hal-emse.ccsd.cnrs.fr/emse-03260542
Contributor : Florent Breuil <>
Submitted on : Tuesday, June 15, 2021 - 9:35:55 AM
Last modification on : Tuesday, July 13, 2021 - 3:36:22 PM

File

actes_IC_CH_PFIA2021_73-80.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : emse-03260542, version 1

Citation

Alban Gaignard, Hala Skaf-Molli, Khalid Belhajjame. Découvrabilité et réutilisation de données produites par des workflows : un cas d’usage en génomique. Journées Francophones d'Ingénierie des Connaissances (IC) Plate-Forme Intelligence Artificielle (PFIA'21), Jun 2021, Bordeaux, France. pp 73-80. ⟨emse-03260542⟩

Share

Metrics

Record views

44

Files downloads

6