Tools and workflows for metadata enrichment

Dr. Simon Musgrave1, Dr. Ben Foley1, Ms. Rosanna Smith1, Mr. Jianyao Xu1

1Language Data Commons of Australia, The University of Queensland, Australia

Biography:

0000-0003-3237-9943

Simon Musgrave is a linguist who has worked in various areas including syntax, language typology, Austronesian languages and Australian English. Throughout his career, he has been fascinated by the ways new technologies can be applied to research, and this interest led him to take part first in the Australian National Corpus project and now in the Language Data Commons of Australia.

Abstract:

The FAIR criteria of Accessibility and Reusability depend on well-described data but, in many cases, existing collections of data are not as well-described as they might be. In the case of Indigenous data, even when collections are considered to be well-described, the description is often based on non-Indigenous knowledge. The Language Data Commons of Australia (LDaCA) is developing methods to enrich existing descriptions of data in general, but with the particular aim of moving towards Indigenous knowledges as the basis for the description of Indigenous data. This presentation will introduce two tools being used and developed in the LDaCA project that support this aim: Nyingarn and Crate-O.

Nyingarn is a platform for turning physical documents into searchable digital objects; it also allows documents to be described using the Text Encoding Initiative guidelines. Crate-O is a metadata editor which helps create rich descriptions in the format used with the RO-Crate storage standard. Discipline- or project-specific schemas can be loaded into Crate-O to provide the structure needed in different use cases.
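
To give a sense of the kind of description Crate-O helps to produce, the following is a minimal sketch of an RO-Crate 1.1 metadata document built in Python. It is not output from Crate-O itself: the function name `minimal_ro_crate` and the example name and description are assumptions made for illustration, and a real description would carry many more properties drawn from the schema loaded for the project.

```python
import json


def minimal_ro_crate(name, description):
    """Return a minimal RO-Crate 1.1 metadata document as a dict.

    Hypothetical helper for illustration only; not part of Crate-O.
    """
    return {
        "@context": "https://w3id.org/ro/crate/1.1/context",
        "@graph": [
            {
                # Metadata descriptor: identifies this file and the
                # version of the specification it conforms to.
                "@id": "ro-crate-metadata.json",
                "@type": "CreativeWork",
                "conformsTo": {"@id": "https://w3id.org/ro/crate/1.1"},
                "about": {"@id": "./"},
            },
            {
                # Root dataset: the collection being described.
                "@id": "./",
                "@type": "Dataset",
                "name": name,
                "description": description,
            },
        ],
    }


# Placeholder values, not taken from any LDaCA collection.
crate = minimal_ro_crate("Example collection", "A placeholder description.")
print(json.dumps(crate, indent=2))
```

The document is a flat JSON-LD graph in which entities refer to each other by `@id`, so richer descriptions are built by adding further entities rather than by nesting.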

Examples of workflows incorporating these tools will be given:

– Manuscript materials produced by a Swedish linguist who visited Australia in the 1960s and 1970s are being digitised and annotated using Nyingarn.

– The metadata for an existing corpus of materials for the language Gurindji Kriol has been reformatted using Crate-O.

– Information about contributors to materials held by The University of Queensland Library has been added to spreadsheets downloaded from the library, with the results reformatted using Crate-O.
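
The third workflow, moving from an enriched spreadsheet to linked descriptions, might be sketched as follows. This is an illustration under stated assumptions and not the LDaCA procedure: the column names `item_id`, `title` and `contributor`, the function `contributors_to_entities` and the sample rows are all hypothetical, and in the project itself the reformatting is done with Crate-O.

```python
import csv
import io


def contributors_to_entities(csv_text):
    """Turn spreadsheet rows into RO-Crate style entities.

    Each row becomes an item entity; each distinct contributor name
    becomes one Person entity that items point to by @id.
    Column names are assumptions made for this sketch.
    """
    people = {}
    items = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        item = {
            "@id": row["item_id"],
            "@type": "CreativeWork",
            "name": row["title"],
        }
        name = row["contributor"].strip()
        if name:
            # Derive a local identifier so repeated names share one entity.
            person_id = "#" + name.lower().replace(" ", "-")
            people.setdefault(
                person_id,
                {"@id": person_id, "@type": "Person", "name": name},
            )
            item["contributor"] = {"@id": person_id}
        items.append(item)
    return items + list(people.values())


# Invented sample data standing in for a downloaded spreadsheet.
sample = (
    "item_id,title,contributor\n"
    "item-1,Recording A,Jane Example\n"
    "item-2,Recording B,Jane Example\n"
    "item-3,Notes C,\n"
)
entities = contributors_to_entities(sample)
```

Representing each contributor once and linking to that entity from every item is what allows information added about a person to enrich all of the materials they contributed to.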

 

 
