2–6 Dec 2024
University of Applied Sciences of the Grisons
Europe/Zurich timezone
Registration is open from Thursday 26th September 2024

Metacurate-ML: Conceptual Comparison

3 Dec 2024, 11:05
20m
University of Applied Sciences of the Grisons (Chur, Switzerland)

University of Applied Sciences of the Grisons

Chur, Switzerland
Regular Presentation Metacurate-ML

Speaker

Suparna De (University of Surrey)

Description

Questions from the CLOSER DDI-Lifecycle repository will be used to assist in training a model that is capable of using questions and response domains from the metadata extraction workstream to create conceptually equivalent items from which data variables can be concorded. Approaches such as fine-tuned large language model (LLM)-based relevance scores model and vector retrieval-LLM reordering will be presented.
The session will present initial results in question concept tagging that feed into the conceptual comparison task, addressing challenges of long-tail distribution of the data, model memorisation and human annotation bias in the dataset. Higher-level machine learning (ML) limitations of identifying indeterminate tags and the notion of probability in model outputs will be explored.

Primary authors

Suparna De (University of Surrey) Zeqiang Wang (University of Surrey)

Co-authors

Dr Wing Yan (Justina) Li (University of Surrey) Deirdre Lungley Paul Bradshaw (Scottish Centre for Social Research (ScotCen)) Jon Johnson (CLOSER, UCL)

Presentation materials

There are no materials yet.