The background for this workshop are recent calls for proposals where participants had limited understanding of the scope of metadata holdings, local computer science expertise and knowledge of current work being done at institutions to be able to develop a focused and convincing proposal for funding.
The development of the European Question Bank, and the European Language Social Science...
The DDI specifications cover a wide range of needs in data production, management, reuse, and dissemination. Deciding on which specification is best depends on the organisational challenges faced, and specific requirements. This tutorial is for an audience which is unfamiliar with the DDI standards. It introduces the different specifications, and considers the uses to which standard,...
Iztok Prezel, Dean of the Faculty of Social Sciences
Miro Pušnik the Director of the Central Technical Library at the In 2021, Slovenia signed up to the Pact for Research and Innovation, a commitment that establishes common values and principles for research and innovation in the European Research Area (ERA). Slovenia has actively responded to the need to harmonise research and innovation activities by adopting comprehensive legislation and...
Panel survey data is often repeated to allow comparisons over time. However, questionnaires may be slightly adjusted over data collection waves - and therefore the datasets variables. Impacting the comparability, data harmonization may be required to maintain the panel data “mission”.
Based on the ELIPSS panel use case, this talk will show opportunities of a centralized metadata management...
Recognizing the significance of DDI for use in data archives and research data management, this presentation aims to introduce the [Dataverse Data Curation Tool][1], an open and integrated DDI application that supports editing dataset variable metadata in Dataverse. An overview of the current challenges and limitations of integration of DDI into Dataverse, including issues related to metadata...
Data collections at GESIS - Leibniz Institute for the Social Sciences are currently managed with a common metadata database to support the search and re-use of research data. This involves a variety of tools, e.g. for study-level documentation, PID registration, data file management, and question and variable documentation. Colectica is currently added to the available metadata management...
CLOSER Discovery is the UK’s most comprehensive research tool for longitudinal population studies, containing questionnaire and dataset metadata for 11 leading UK studies.
Creating questionnaire metadata can be a time-consuming and challenging task. Historically, CLOSER’s Metadata Assistants (MAs) entered the questionnaire metadata into our in-house developed DDI questionnaire editor –...
The Slovenian Social Science Data Archives (ADP) have developed an XSLT (eXtensible Stylesheet Language Transformations) stylesheet to align the data, variable, and question descriptions within a selection of current ADP DDI 2.5 codebook instances with the established CESSDA European Question Bank (EQB) DDI 2.5 harvesting profile. This effort was undertaken as part of the Social Sciences &...
Colectica is delighted to launch Colectica 7.3 at EDDI. Colectica is software for creating, publishing, centralizing, and managing DDI metadata within and across organizations. It is used by national statistical organizations, university research groups, and data collection agencies to provide well-documented data to researchers and the public. Colectica is built on open standards like DDI and...
The present paper is focused on the work of an expert group on ‘Facilitating the use of new data sources for official statistics’ created by Eurostat during March 2021 – May 2022. The main purpose of this expert group was to reflect on new opportunities riced once with our rapidly changing and increasingly data-driven society and make recommendations to enhance the reuse of private sector data...
The [domain agnostic metrics][1] adopted by FAIR data assessment tools tend to penalise metadata aggregators, like the CESSDA Data Catalogue (CDC). This became apparent during the work done for the ‘[Bulk FAIR assessment of the CESSDA Data Catalogue using the F-UJI API][2]’ (as presented at EDDI2022).
Building on that work, FAIR scores were generated by the F-UJI and FAIR EVA tools for...
The Generations and Gender Programme (GGP) is a cross-national longitudinal panel survey on life-course and family dynamics launched in the year 2000. It comprises two rounds of data collection: Generations and Gender Survey-I (GGS-I), covering 19 countries, and GGS-II, initiated in 2017 and currently ongoing, with data available so far for 10 countries. The GGP is on the European Strategy...
Making historical analog data Re-useable:
a successful outcome of Data Rescue/Conservation Discipline at Bangladesh Bureau of Statistics.
Short Title: Historical Data Conversion and Archiving.
Chandra Shekhar Roy1
Alamgir Hossain2
1Senior Maintenance Engineer-IT, Bangladesh Bureau of Statistics, Statistics & Informatics Division,
Ministry of Planning, E27/A, Agargaon, Dhaka-1207,...
The QualidataNetwork (QualidataNet) links several research data centers that archive and provide access to sensitive qualitative research data. It is part of the Consortium for Social, Behavioural, Educational and Economic Sciences (KonsortSWD) at the National Research Data Infrastructure (NFDI) in Germany.
In 2022, we presented the project, our goals and how we plan to move forward. Many...
In Social Sciences surveys, the dynamic relationship among survey instruments and study entities like questionnaires, variables, questions, and response formats evolve. When reusing variables, researchers may need to modify variable attributes such as labels or names, question-wording, or response scales. Therefore, explaining these relations across different waves and studies is necessary to...
The Swiss Virtual Educational Observatory (VEO) project is funded by the Swiss National Science Foundation. It has the goal to link and to visual sources of research data about education and learning. While concentrating on research data itself data documentation and sources of open research data are relevant as well.
The first step to find research data is looking for open data and data...
The French Center for Socio-Political Data is jointly operated and financed by Sciences Po, one of the leading SSH French universities, and the French Center for National Research (CNRS). One of the center’s main missions has been to serve the French and international SSH communities by facilitating the reuse of surveys and data, both quantitative and qualitative, in the fields of sociology...
CESSDA has launched version 3 of its Vocabulary Service (CVS), a significant upgrade from version 2. Accessible at https://vocabularies.cessda.eu, CVS offers users the capability to explore and download multilingual controlled vocabularies in formats like SKOS, HTML, and PDF. The Editor component empowers authorized individuals to manage and translate vocabularies. Many vocabularies have been...
Join the chairs of the DDI Executive Board and the DDI Scientific Board, as well as the DDI Executive Director, to discuss Alliance priorities and plans, especially with an eye toward user and member needs. Engage in thoughtful conversations and Q&A, especially as we envision DDI's strategic role in the exciting future of metadata.
FAIR Digital Objects (FDOs) are datasets, publications, software, services, workflows, lab notebooks, and other digital results of research that are easily discoverable by humans and machines. FDOs are accessible to anyone with permission, interoperable with other digital objects regardless of their format or software, and reusable without modification for the same or different purposes. The...
Official statistics increasingly rely on external sources, particularly administrative data, to produce statistics. This requires further industrialisation of the data integration before the downstream steps leading to dissemination.
In 2021, INSEE has launched a project named Resil with the objective of centralising administrative data ingestion for further processing of social statistics....
This presentation describes the model-driven approach of DDI-CDI. In this way, it is possible to generate related syntax representations (such as XML Schema and RDF) and field-level documentation of the UML model. A subset mechanism allows targeted generation for specific use cases such as the process description. Experimental work will also be shown using visualization and sonification...
An investigation was conducted to examine the extent to which metadata in different Data Documentation Initiative (DDI) standards is openly available and which elements of these standards are used. DDI is a set of international standards for describing and documenting data used in social, behavioural, economic, and health sciences research.
To identify the online repositories, where DDI...
Recent developments in Machine Learning (ML) show robust performance in the area of Natural Language Processing (NLP) tasks, such as sentiment analysis and document classification. Our ML task is one of short text classification, specifically we are endeavouring to annotate variables using the variable name, label, question text and representation. Our task is one of multi-class...
In this presentation, we plan to describe NACDA’s efforts over the last year to develop a multi-series comparison of longitudinal, nationally representative, National Institute on Aging funded data collections using DDI-Lifecycle, as well as share some lessons learned.
The National Archive of Computerized Data on Aging (NACDA, part of ICPSR) began working with DDI-Lifecycle in 2018. Since...
DDI-Lifecycle utilises the variable cascade to organise and describe data from conception to collection.
The organisation of conceptual variables and conceptual variable groups allows comparison of data at different time points, universes, representations and many other dimensions through concordance tables and is well suited to iterative data collected as panels, cohorts, repeated surveys...
A value domain is the description of the values a variable is allowed to take. The idea originated with the ISO/IEC 11179 series of standards. DDI-CDI makes use of them explicitly, and DDI-L uses the idea as well. However, metadata reuse depends on which metadata are assigned to a variable, and which are assigned to a value domain. We address this here.
We can illustrate the problem with an...
The National Archive of Computerized Data on Aging (NACDA) and CLOSER - the interdisciplinary partnership of leading UK social and biomedical longitudinal population studies (LPS), have been discussing ways to collaborate and create common conceptual groups across their social science data collections in their Colectica Portals.
Although NACDA and CLOSER have approached metadata...
Any DDI dataset, ancillary, or supporting file is a candidate for systematic, persistent identification with ARK identifiers. End users, especially researchers, rely on ARKs for long term access to the global scientific and cultural record. Since 2001 some 8.2 billion ARKs have been created by over 1100 organizations — libraries, data centers, archives, museums, publishers, government...
Karstology is a multidisciplinary science that encompasses a wide range of earth and life sciences: hydrology, geology, biology, geomorphology, ecology, microbiology, speleology, and history of karst science. The Karst Research Institute ZRC SAZU reinforced its long tradition of excellent research by becoming the national coordinator and headquarters of three major environmental European RIs...
The objective of the ‘Climate Neutral and Smart Cities’ Science Project of EOSC Future is to demonstrate that relevant environmental data and data on citizens' values, attitudes, behavior and involvement can be combined in a meaningful way for social, political and scientific analysis.
The Science Project rests on three pillars: Indicator production and integration of data from three...
One of the primary objectives of the BeYond-COVID project was the integration of social sciences & humanities studies’ metadata into the COVID-19 Data Portal. This presentation focuses on the technical work done to achieve this for studies found in CESSDA Data Catalogue (CDC).
Focus of the presentation is the implementation of XML transformation from DDI-Codebook 2.5 to extended OmicsDI...
The "Climate Neutral and Smart Cities” Science Project contributes significantly to the European Open Science Cloud (EOSC) Future Project by showcasing cross-domain data integration using the new DDI-CDI metadata specification. This presentation demonstrates the use of the DDI-CDI Process Model, which offers a standardized approach to comprehensively describe data lineage and integration...
The Technical Committee meet face-to-face on a yearly basis to review progress and plan work over the year, in addition to resolving issues which need focused discussion which is more suited to a face-to-face environments
The DDI Training Group (TG) expects to have several members attending the EDDI 2023 conference in person. With this in mind we plan to discuss notable EDDI presentations, debrief on the training workshop and discuss training plans for 2024, and possible target groups (e.g. researchers). We are considering opening the meeting to non-members of the training group.
This will be a half-day meeting.
High performance computing (HPC) is key to Europe’s future prosperity, digital transformation and resilience. This has been acknowledged by the EU strategy and investments. One recent initiative in this context is the establishment of the Italian National Centre for HPC, Big Data and Quantum Computing. This centre, which is funded under the National Recovery and Resilience Plan...