Dr Jens Klump1, Dr Mingfang Wu2, Dr Lesley Wyborn3
1CSIRO Mineral Resources, Kensington, Australia, 2ARDC, Melbourne, Australia, 3Australian National University, Canberra, Australia
The Research Data Alliance (RDA) Data Versioning WG/IG developed a set of principles for the versioning of data and terminology for describing processes and data artefacts. The data versioning principles and its terminology can be used to address questions around the use of persistent identifiers for data, data citation and attribution, among others. In particular, they are designed to support transparent and reproducible science so the source of the exact version of a dataset used in scientific research can be identified, and the creator, publisher, and funder of that dataset be identified and given attribution.
The Versioning Principles are based on the Functional Requirements for Bibliographic Records (FRBR), which is a conceptual model developed by librarians to facilitate retrieval and access to resources, from a user’s perspective, in online library catalogues. FRBR can also be applied to digital data as follows:
- Work (distinct intellectual creation, e.g. research study);
- Expression (specific form in which the work is realised, e.g. data product);
- Manifestation (format in which the work is embodied, e.g. file format); and
- Item (online location of the resource).
Identification of each level is critical to scientific replication and attribution.
This BoF will start with an introduction to the data versioning principles, followed by presentations of three use cases on how data versioning can enable citation with confidence and give proper credit to those who are involved in data versioning editorial process. We will then invite participants to discuss challenges from their own data versioning use cases or practices. The goal is to develop a better understanding of the needs of data versioning in Australian research and key actionable recommendations for supporting data infrastructures.
Biography:
Jens Klump is a geochemist by training and leads the Exploration Through Cover Research Group in CSIRO Mineral Resources based in Perth, Western Australia. In his work on data infrastructures, Jens covers the entire chain of digital value creation from data acquisition to data analysis with a focus on data in minerals exploration. This includes automated data and metadata capture, sensor data integration, both in the field and in the laboratory, data processing workflows, and data provenance, but also data analysis by statistical methods, machine learning and numerical modelling.
Jens earned degrees in geology and in oceanography from the University of Cape Town (UCT) and received his PhD in marine geology from the University of Bremen, Germany.