Toward a common standard for data and specimen provenance in life sciences


Open and practical exchange, dissemination, and reuse of specimens and data have become a fundamental requirement for life sciences research. The quality of the data obtained and thus the findings and knowledge derived is thus significantly influenced by the quality of the samples, the experimental methods, and the data analysis. Therefore, a comprehensive and precise documentation of the pre-analytical conditions, the analytical procedures, and the data processing are essential to be able to assess the validity of the research results. With the increasing importance of the exchange, reuse, and sharing of data and samples, procedures are required that enable cross-organizational documentation, traceability, and non-repudiation. At present, this information on the provenance of samples and data is mostly either sparse, incomplete, or incoherent. Since there is no uniform framework, this information is usually only provided within the organization and not interoperably. At the same time, the collection and sharing of biological and environmental specimens increasingly require definition and documentation of benefit sharing and compliance to regulatory requirements rather than consideration of pure scientific needs. In this publication, we present an ongoing standardization effort to provide trustworthy machine-actionable documentation of the data lineage and specimens. We would like to invite experts from the biotechnology and biomedical fields to further contribute to the standard.


DOI: 10.1002/lrh2.10365

Research Groups: Scientific Databases and Visualisation

Publication type: Journal

Journal: Learning Health Systems

Citation: Learning Health Systems,e10365

Date Published: 18th Apr 2023

Registered Mode: by DOI

Authors: Rudolf Wittner, Petr Holub, Cecilia Mascia, Francesca Frexia, Heimo Müller, Markus Plass, Clare Allocca, Fay Betsou, Tony Burdett, Ibon Cancio, Adriane Chapman, Martin Chapman, Mélanie Courtot, Vasa Curcin, Johann Eder, Mark Elliot, Katrina Exter, Carole Goble, Martin Golebiewski, Bron Kisler, Andreas Kremer, Simone Leo, Sheng Lin‐Gibson, Anna Marsano, Marco Mattavelli, Josh Moore, Hiroki Nakae, Isabelle Perseil, Ayat Salman, James Sluka, Stian Soiland‐Reyes, Caterina Strambio‐De‐Castillia, Michael Sussman, Jason R. Swedlow, Kurt Zatloukal, Jörg Geiger

help Submitter
Wittner, R., Holub, P., Mascia, C., Frexia, F., Müller, H., Plass, M., Allocca, C., Betsou, F., Burdett, T., Cancio, I., Chapman, A., Chapman, M., Courtot, M., Curcin, V., Eder, J., Elliot, M., Exter, K., Goble, C., Golebiewski, M., … Geiger, J. (2023). Toward a common standard for data and specimen provenance in life sciences. In Learning Health Systems (Vol. 8, Issue 1). Wiley.

Views: 1761

Created: 20th Apr 2023 at 14:12

Last updated: 5th Mar 2024 at 21:25

help Attributions


Powered by
Copyright © 2008 - 2023 The University of Manchester and HITS gGmbH