Developing Community Guidelines for Consistently Curating and Representing Dataset Quality Information

Dr Ivana Ivánová1, Dr Lesley Wyborn2, Dr Mingfang Wu3, Mr Miles  Nicholls5, Ms Irina Bastrakova4

1Curtin University, Perth, Australia
2Australian National University, Canberra, Australia
3Australian Research Data Commons, Melbourne, Australia
4Geoscience Australia, Canberra, Australia
5Atlas of Living Australia, Canberra, Australia

 

Abstract:

The wide adoption of open research data in the past decade, means many datasets from the research sector, government agencies, museums and business sectors etc. are now available and FAIR (Findable, Accessible, Interoperable and Reusable). But FAIR is not enough: being able to define dataset quality is critical to ensuring trustworthiness and reusability.

To understand how Australia and New Zealand data communities define data quality, the ARDC Australian Data Quality Interest Group held a workshop to compare requirements for data quality strategies, community practices and implementation plans across multiple research domains. Common issues identified include a lack of organisational strategy for evaluating dataset quality, and a need for ‘common quality language’ and guidelines on managing data provenance.

The pathway forward is to develop community guidelines for conveying dataset quality information, in partnership with international initiatives including the Information Quality Cluster of Earth Science Information Partnership (ESIP) (who commenced their work towards quality information guidelines); data quality standard bodies (ISO and OGC); and relevant groups from Research Data Alliance).  Participating in international collaborations will both ensure our communities’ needs are taken into consideration and that we are in line with international best practice.

The proposed BoF session will start with lighting talks (15 mins) including high level perspective of current community practice and case studies; followed by group discussion of what would be expected from the guidelines (35 minutes); and conclude with future actions (10 mins).

Here is the website from the Australian Data Quality Interest Group

AGENDA:

  1. Introduction to session [3 minutes]
  2. Setting the scene & lightning talks [15min]: 
    • Ivana Ivánová (Curtin University): Introduction to AU/NZ Data Quality Interest Group and providing context for this BoF [5min]
    • Miles Nichols (Atlas of Living Australia): Data Profiles – a practical use for data quality information [5min]
    • Irina Bastrakova (Geoscience Australia): Standards for defining and curating quality – the ANZLIC perspective [5min]
  1. Breakout Sessions [35 minutes]
  2. Debrief and Next Steps [7 minutes]

Biography:

Ivana Ivánová is a Senior Lecturer in Spatial Sciences at the School of Earth and Planetary Science of the Curtin University and FrontierSI Research Fellow in Spatial Information Infrastructures. Her research interests and expertise are in spatial data quality, provenance of spatial resources and spatial knowledge infrastructures. She is actively involved in national and international standardisation at ISO (she co-leads the revision of ISO 19157-1 on geographic information quality) and OGC (where she co-chairs the Data Quality Domain Working Group).

Lesley Wyborn is an Adjunct Fellow at the National Computational Infrastructure at ANU and works part-time for the Australian Research Data Commons. She had 42 years’ experience in Geoscience Australia in scientific research and in geoscientific data management. She is currently Chair of the Australian Academy of Science ‘National Data in Science Committee’ and is on the American Geophysical Union Data Management Advisory Board and the Earth Science Information Partners Executive Board. She was awarded the Public Service Medal in 2014, the 2015 Geological Society of America Career Achievement Award in Geoinformatics and the 2019 US ESIP Martha Maiden Award.

Mingfang Wu is a senior research data specialist at ARDC.  https://orcid.org/0000-0003-1206-3431

Miles Nicholls is currently leading the Data Quality Project for the Atlas of Living Australia (ALA) having previously held the position of Data Manager at the ALA since 2009.  Prior to this Miles worked for the Federal government in Business intelligence and Data warehousing.

Irina Bastrakova is a Director, Spatial Data Architecture at Geoscience Australia. Irina has been actively involved with international and national geoinformatics communities for more than 19 years. Irina focuses her work on maximising potential of multidisciplinary data assets by leveraging common information models, data patterns and vocabularies. Irina is the Chair of the Australian Metadata Working Group and a member of many international and national geoinformatics and standards committees.

About the conference

eResearch Australasia provides opportunities for delegates to engage, connect, and share their ideas and exemplars concerning new information centric research capabilities, and how information and communication technologies help researchers to collaborate, collect, manage, share, process, analyse, store, find, understand and re-use information.

Conference Managers

Please contact the team at Conference Design with any questions regarding the conference.

© 2018 - 2020 Conference Design Pty Ltd