A Methodology for Progressively Developing and Harmonising FAIR Vocabularies for Global Interoperability of Geochemical Data
Lesley Wyborn1, Marthe Klöcking2, Alexander Prent3, Lucia Profeta4, Angus Nixon5, Kirsten Elger6, Manja Luzi-Helbing6, Rowan Browlee7, Steven Richard8, Kerstin Lehnert4, Rebecca Farrington9, Dominik Hezel10 1Australian National University, Canberra, ACT, Australia2Georg-August-Universität, Göttingen Germany3AuScope Ltd, Utrecht The Netherlands4Columbia University, Pallisades, New York, United States5 The University of Adelaide, Adelaide, South Australia, Australia7Australian Research Data Commons, Canberra, ACT, Australia8University of Arizona, Tuscon, Arizona, United States of America8University of Arizona, Tuscon, Arizona, United States of America9AuScope Limited, Melbourne , Victoria, Australia10Goethe-Universität Frankfurt, Frankfurt Germany
Abstract
OneGeochemistry is an international initiative formed to enable global sharing of geochemical data. Unfortunately geochemical datasets are notoriously heterogeneous and are collected by thousands of researchers/research groups on a diversity of samples (rocks, minerals, meteorites, fluids, gasses, etc) using hundreds of analytical techniques across multiple geoscience disciplines. Hence, achieving international consensus on key concepts and definitions requires considerable time and effort.
To ensure compliance with FAIR Interoperability Principle I2 (viz. (meta)data to use controlled vocabularies that also follow FAIR principles), multiple local vocabularies are emerging online that often replicate similar concepts.
To achieve a balance between meeting current demands for FAIR-compliant vocabularies versus time required to reach international agreement, OneGeochemistry has developed a three-tiered approach (local, community, international) towards making semantic resources FAIR and machine actionable:
1) We encourage data providers with locally defined vocabularies or other resources to make them FAIR, available online from a reputable vocabulary service and ensure each term has a persistent identifier (SKOS/RDF);
2) We encourage groups with similar topics to begin harmonising on concepts/definitions and publish these as community resources;
3) We raise awareness of groups harmonising and making semantic resources FAIR-compliant at an international level, particularly those with endorsement from International authoritative groups (e.g., Scientific Unions/Associations/Societies/Commissions).
As convergence takes place towards internationally-agreed terms, the size of the community able to share (meta)data in machine-to-machine environments grows. Existing URIs initially used at either a local or community level could be redirected towards internationally endorsed definitions and concepts as these become available.
Biography
Lesley Wyborn is an Honorary Professor at ANU at the NCI and at the Research School of Earth Sciences. She also works part time for ARDC. She had 42 years’ experience in Geoscience Australia (GA) in mineral systems and geochemistry research, as well as in data science/data management. Since leaving GA in 2014 has continued her research into many aspects of Data Science as applied to geochemistry, geophysics, data quality, versioning of datasets, as well as the development of transparent high-performance national-scale datasets for use in HPC environments.