2–6 Dec 2024
University of Applied Sciences of the Grisons
Europe/Zurich timezone
Registration is open from Thursday 26th September 2024

Metacurate-ML: Metadata Extraction from CAI

3 Dec 2024, 10:45
20m
University of Applied Sciences of the Grisons (Chur, Switzerland)

Regular Presentation Metacurate-ML

Speaker

Suparna De (University of Surrey)

Description

This talk extends the results of our work on pre-trained language models with recent developments in text-layout models and zero-shot techniques. Because textual information alone makes it difficult to classify and extract metadata accurately, we will explore combining textual content with visual layout cues, incorporating vision transformers with optimisation techniques.
This will allow us to extract specific items within questionnaires, such as question texts, responses and routing, to create a rich source of metadata that traces provenance from the data collection methodology to the resultant data and that can be transformed into DDI-Lifecycle. We will investigate the feasibility of multimodal document-understanding models that employ masked language modelling techniques and present the resulting challenges.

Primary authors

Suparna De (University of Surrey)
Jon Johnson (CLOSER, UCL)

Co-authors

Mr Zeqiang Wang (University of Surrey)
Dr Chandresh Pravin (University of Surrey)
Deirdre Lungley
Mr Paul Bradshaw (Scottish Centre for Social Research (ScotCen))
