Publishing research data with Frictionless data packages for reproducibility

Mr. James Wilmot¹, Ms. Varvara Efremova¹

¹UNSW Sydney, Australia

Biography:

James is a research software engineer at UNSW Sydney with a background in physics. He has an interest in research data standardisation, schemas and specifications as well as metadata and provenance management. He has extensive experience in web application technologies, containerisation, data structures and metadata. When not coding he enjoys running, cycling and being outdoors.

James is part of the team developing opendata.studio – a declarative platform for reproducible data analysis and publication.

Abstract:

This talk introduces researchers and data users to Frictionless Data as a standard for FAIR packaging and publication of research data. Frictionless is an uncomplicated, incrementally adoptable and scalable standard for describing tabular datasets and their metadata using a portable JSON file format.

This talk will first provide an overview of the Frictionless data specification and its key components – Data Packages and Resources. We will then walk through some model examples showcasing how Data Packages can be used to package scientific datasets alongside their metadata.
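As a rough illustration of the two components named above, the following minimal sketch builds a Data Package descriptor containing a single tabular Resource and serialises it to JSON, which would conventionally be saved as datapackage.json. The dataset name, file path and field names are hypothetical placeholders chosen for this example, not taken from the talk, and only the Python standard library is used.

```python
import json


def build_descriptor():
    """Return a minimal Data Package descriptor as a plain dict.

    All names, paths and fields below are hypothetical examples.
    """
    return {
        # Package-level metadata describing the dataset as a whole
        "name": "example-measurements",
        "title": "Example laboratory measurements",
        "licenses": [{"name": "CC-BY-4.0"}],
        # Each Resource describes one data file in the package
        "resources": [
            {
                "name": "measurements",
                "path": "data/measurements.csv",
                "format": "csv",
                # Table Schema: the columns of the file and their types
                "schema": {
                    "fields": [
                        {"name": "sample_id", "type": "string"},
                        {"name": "temperature_k", "type": "number"},
                        {"name": "recorded_on", "type": "date"},
                    ]
                },
            }
        ],
    }


descriptor = build_descriptor()

# Serialise to the portable JSON form that travels alongside the data files
descriptor_json = json.dumps(descriptor, indent=2)
print(descriptor_json)
```

Because the descriptor is plain JSON, it can be adopted incrementally: a package can start with only a resource name and path, with schema and licence details added later.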

The Frictionless data specification is open source and publicly available on GitHub. Frictionless is developed and maintained by the creators of the CKAN data management system.
