A Methodology for Progressively Developing and Harmonising FAIR Vocabularies for Global Interoperability of Geochemical Data

A Methodology for Progressively Developing and Harmonising FAIR Vocabularies for Global Interoperability of Geochemical Data

Lesley Wyborn1, Marthe Klöcking2, Alexander Prent3, Lucia Profeta4, Angus Nixon5, Kirsten Elger6, Manja Luzi-Helbing6, Rowan Browlee7, Steven Richard8, Kerstin Lehnert4, Rebecca Farrington9, Dominik Hezel10

1Australian National University, Canberra, ACT, Australia
2Georg-August-Universität, Göttingen Germany
3AuScope Ltd, Utrecht The Netherlands
4Columbia University, Pallisades, New York, United States
5 The University of Adelaide, Adelaide, South Australia, Australia
7Australian Research Data Commons, Canberra, ACT, Australia
8University of Arizona, Tuscon, Arizona, United States of America
8University of Arizona, Tuscon, Arizona, United States of America
9AuScope Limited, Melbourne , Victoria, Australia
10Goethe-Universität Frankfurt, Frankfurt Germany

Abstract

OneGeochemistry is an international initiative formed to enable global sharing of geochemical data. Unfortunately geochemical datasets are notoriously heterogeneous and are collected by thousands of researchers/research groups on a diversity of samples (rocks, minerals, meteorites, fluids, gasses, etc) using hundreds of analytical techniques across multiple geoscience disciplines. Hence, achieving international consensus on key concepts and definitions requires considerable time and effort.

To ensure compliance with FAIR Interoperability Principle I2 (viz. (meta)data to use controlled vocabularies that also follow FAIR principles), multiple local vocabularies are emerging online that often replicate similar concepts.

To achieve a balance between meeting current demands for FAIR-compliant vocabularies versus time required to reach international agreement, OneGeochemistry has developed a three-tiered approach (local, community, international) towards making semantic resources FAIR and machine actionable:
1) We encourage data providers with locally defined vocabularies or other resources to make them FAIR, available online from a reputable vocabulary service and ensure each term has a persistent identifier (SKOS/RDF);
2) We encourage groups with similar topics to begin harmonising on concepts/definitions and publish these as community resources;
3) We raise awareness of groups harmonising and making semantic resources FAIR-compliant at an international level, particularly those with endorsement from International authoritative groups (e.g., Scientific Unions/Associations/Societies/Commissions).

As convergence takes place towards internationally-agreed terms, the size of the community able to share (meta)data in machine-to-machine environments grows. Existing URIs initially used at either a local or community level could be redirected towards internationally endorsed definitions and concepts as these become available.

Biography

Lesley Wyborn is an Honorary Professor at ANU at the NCI and at the Research School of Earth Sciences. She also works part time for ARDC. She had 42 years’ experience in Geoscience Australia (GA) in mineral systems and geochemistry research, as well as in data science/data management. Since leaving GA in 2014 has continued her research into many aspects of Data Science as applied to geochemistry, geophysics, data quality, versioning of datasets, as well as the development of transparent high-performance national-scale datasets for use in HPC environments.

Categories