Título: | PROVENANCE FOR BIOINFORMATICS WORKFLOWS | ||||||||||||||||||||||||||||||||||||
Autor: |
LUCIANA DA SILVA ALMENDRA GOMES |
||||||||||||||||||||||||||||||||||||
Colaborador(es): |
EDWARD HERMANN HAEUSLER - Orientador |
||||||||||||||||||||||||||||||||||||
Catalogação: | 25/OUT/2011 | Língua(s): | PORTUGUESE - BRAZIL |
||||||||||||||||||||||||||||||||||
Tipo: | TEXT | Subtipo: | THESIS | ||||||||||||||||||||||||||||||||||
Notas: |
[pt] Todos os dados constantes dos documentos são de inteira responsabilidade de seus autores. Os dados utilizados nas descrições dos documentos estão em conformidade com os sistemas da administração da PUC-Rio. [en] All data contained in the documents are the sole responsibility of the authors. The data used in the descriptions of the documents are in conformity with the systems of the administration of PUC-Rio. |
||||||||||||||||||||||||||||||||||||
Referência(s): |
[pt] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/ETDs/consultas/conteudo.php?strSecao=resultado&nrSeq=18566&idi=1 [en] https://www.maxwell.vrac.puc-rio.br/projetosEspeciais/ETDs/consultas/conteudo.php?strSecao=resultado&nrSeq=18566&idi=2 |
||||||||||||||||||||||||||||||||||||
DOI: | https://doi.org/10.17771/PUCRio.acad.18566 | ||||||||||||||||||||||||||||||||||||
Resumo: | |||||||||||||||||||||||||||||||||||||
Many scientific experiments are designed as computational workflows,
which can be implemented using traditional programming languages. In the
Bioinformatics domain ad-hoc scripts are often used to build workflows. Scientific
Workflow Management Systems (SWMS) have emerged as an alternative to
those scripts. One particular SWMS feature that has received much attention by
the scientific community is the automatic capture of provenance data. These
allow users to track which resources and parameters were used to obtain the
results, among many other required information to validate and publish an
experiment. In the present work we have elicited some data provenance
challenges in the SWMS context, such as (i) the heterogeneity of data
representation schemes that hinders the understanding and interoperability; (ii)
the storage of consumed and produced data and (iii) the reproducibility of a
specific execution. These challenges have motivated the proposal of a data
provenance conceptual scheme for workflow representation. We have
implemented an extension of a particular SWMS system (Bioside) to include
provenance data and store them using the proposed conceptual scheme. We
have focused on some requirements commonly found in bioinformatics
workflows.
|
|||||||||||||||||||||||||||||||||||||
|