Solutions and Best Practices: Workflows in Data Management and Analytics

Dr Siddeswara Guru1, Dr Anusuriya Devaraju1, Claire Trenham, Amir Aryani, Gerhard Weis, Wilma Karsdorp, Sergio Pintaldi, Ivan Hanigan

1University Of Queensland, Indooroopilly , Australia

Workflows are widely used to create, schedule, and orchestrate a set of tasks that needs repeated execution. Scientific workflow systems provide a reusable and interactive environment for researchers to design, test and deploy computational steps during various stages of scientific research, i.e.,  data analysis and management. Several workflow management tools such as Apache Airflow, Apache Oozie, and Apache NIFI are used to build data flow pipelines and scientific workflow systems such as Galaxy, VisTrails, Kepler, and KNIME are used in data analysis.  The BoF aims to bring together the data service providers, practitioners, and users to discuss best practices and practical solutions to build and use workflows in scientific data processing and analytics.

The BoF will have short presentations on the applications where workflows are used in scientific disciplines (e.g., social, atmospheric and earth and environmental science), and an open discussion on building a community around the use of workflows and more coordination in developing tools for everyday use. Further feedback from the participants on the topic will be solicited through an interactive presentation with Mentimeter at the end of the session.


Siddeswara Guru is the program lead for the Terrestrial Ecosystem Research Network

(TERN) Data Services and Analytics Platform. He initiates, coordinates and manages data

e-infrastructure and analytics projects apart from overseeing the data and information

management activities across TERN.


Oct 11 2021


3:50 pm - 4:50 pm

Local Time

  • Timezone: America/New_York
  • Date: Oct 11 2021
  • Time: 12:50 am - 1:50 am