2–6 Dec 2024
University of Applied Sciences of the Grisons
Europe/Zurich timezone
Registration is open from Thursday 26th September 2024

The ONTOLISST project on DDI metadata, vocabularies and NLP

3 Dec 2024, 14:15
10m
University of Applied Sciences of the Grisons (Chur, Switzerland)

University of Applied Sciences of the Grisons

Chur, Switzerland

Speakers

Alina DANCIU (Sciences Po, Center for Socio-Political Data (CDSP)) Judit Gárdos (Hungarian Research Network Centre for Social Sciences, Research Documentation Centre (KDK)) Mari Kleemola (Finnish Social Science Data Archive, Tampere University)

Description

The talk introduces the new 2-year ONTOLISST project starting in December 2024, funded by the first OSCARS Cascading grant call. The project will develop a simplified multilingual ontology (LiSST) to describe social science research data, create a corpus of social science metadata, and research whether and how NLP tools can help with (semi)automated (meta)data curation. The aim is to better understand how social science archives assign thematic metadata to their datasets in order to describe their contents and how data curation practices shape social scientific understanding. ONTOLISST will build on metadata in DDI format in different languages from various sources and using different CVs. The presentation outlines the project tasks, expected outputs and relationships with existing standards and tools. It also discusses how AI could help to accelerate the tedious, resource-intensive but important work of metadata and data curation and improve (meta)data interoperability across languages and disciplinary barriers.

Primary authors

Alina DANCIU (Sciences Po, Center for Socio-Political Data (CDSP)) Judit Gárdos (Hungarian Research Network Centre for Social Sciences, Research Documentation Centre (KDK)) Mari Kleemola (Finnish Social Science Data Archive, Tampere University)

Presentation materials

There are no materials yet.