Cloud-Native Data Infrastructure for Ecosystem Science and Innovation

Dr. Siddeswara Guru1, Mr. Gerhard Weis1

1University Of Queensland, Indooroopilly, Australia

Biography:

Siddeswara Guru is the program lead for the TERN Data Services and Analytics Platform.

Gerhard Weis is the solution architect and technical lead in the TERN Data Services and Analytics Platform.

Abstract:

Terrestrial Ecosystem Research Network (TERN) is an NCRIS-funded land-based observatory that measures the environment from continental to site scale with different observation methods, including remote-sensing, in-situ and human. The significant data collection demands innovative data management approaches to meet the needs of scalability, flexibility and near-real-time processing.  The paper explores developing and managing cloud-native data infrastructure tailored for ecosystem science. By leveraging the ARDC NeCTAR cloud, the data infrastructure enables effective management of large volumes of data, containerised applications to submit and discover data at the collection level, multiple data dashboards to find and access data at the observation level and The Virtual desktop environment for analysis and collaboration. In addition, Apache airflow and Kubernetes are used for automated data processing and onboarding with overarching solid data governance and controlled vocabularies to describe all data-related artefacts. The cloud-native infrastructure aims to accelerate scientific data discovery, foster interdisciplinary research, and drive innovation in ecosystem management and conservation efforts.

 

Categories