AI-Powered Analysis of ORCID and RLA Data for Technology Landscaping

Mr Luhan Cheng1, A/Prof. Amir Aryani1, Dr. Stephen Petrie1, Dr. Hui Yin1

1Swinburne University of Technology, Hawthorn, Australia

Biography:

Luhan Cheng is a PhD candidate with the ARC Center of Information Resilience (CIRES) at Swinburne University of Technology. He completed his Bachelor of Computer Science (Honours) degree at Monash University on the topic of multi-agent reinforcement learning and game theory. Luhan worked at Monash eResearch Centre as a high-performance computing consultant while working at a teaching assistant at the Faculty of IT. His research interest lies in applying computational approach to scientific problems that have significant impacts on our society. His PhD project “Automated Horizon Scanning for Scientific Papers and Patents” aims to create a pathway for automated knowledge mining for scientific articles and invention patents. The project is supervised by Dr. Stephen Petrie and A/Prof. Amir Aryani.

Abstract:

ORCID profiles provide a substantial amount of information about research communities. ORCID is considered a key data source for providing valuable and open-access data about researchers and their research works. However, ORCID also has potential to inform Technology Landscaping regarding researchers and their associated research networks, with potential applications including identifying technical expertise, emerging research communities, and investment opportunities.

This presentation features a case study leveraging over 100,000 Australian ORCID researcher profiles and their research outcome collected from ARDC Research Link platform. We demonstrate how we have collected, connected, and analysed this dataset to observe the time-series evolution of the Australian research communities. The resulting animation will illustrate five years of historical developments within these communities. This will also establish a robust foundation for projecting future technological trends in Australia.

Aligning with the technical aspect of this presentation, we demonstrate an AI pipeline using LLM model to analyse our data and achieve the visualisation results. This makes the solution presented in our work accessible and interoperable with many research infrastructure solutions in universities. We will discuss the technical challenges and opportunities observed while developing and deploying this solution in Nectar Cloud. We believe the lessons learned from our work can help identify potential new use cases for application of AI pipelines to analyse ORCID and other open scholarly data.

 

Categories