{"id":32423,"date":"2025-11-27T07:21:45","date_gmt":"2025-11-27T06:21:45","guid":{"rendered":"https:\/\/capitularia.uni-koeln.de\/?p=32423"},"modified":"2025-12-01T16:59:45","modified_gmt":"2025-12-01T15:59:45","slug":"aus-dem-maschinenraum-faire-forschungsdaten","status":"publish","type":"post","link":"https:\/\/capitularia.uni-koeln.de\/en\/blog\/aus-dem-maschinenraum-faire-forschungsdaten\/","title":{"rendered":"From the Engine Room #2: FAIR research data retrospectively? FAIRification as a challenge in ongoing projects"},"content":{"rendered":"<p style=\"padding: 10px; background-color: #d1d1d1; font-size: small;\">The blog series \u2018From the Engine Room\u2019 is dedicated to the technical aspects and challenges of the \u2018Edition der fr\u00e4nkischen Herrschererlasse\u2019. Unlike our previous scientific posts on findings and editorial insights, here we focus on the infrastructural, methodological and technological dimensions of a long-term digital project. Since 2014, we have been working on a new edition of the capitularies. Designed as a hybrid edition, the project poses particular challenges: How can we ensure the long-term availability of our research data? How can we retroactively integrate measures for implementing the FAIR principles into a work plan that was designed before they were established? How can we network effectively with other projects and infrastructures? 
Or what role can AI play in the project? In shorter articles, we examine these questions from different perspectives: from research data management and networking strategies to technical infrastructure and the use of new technologies. In doing so, we share not only possible solutions, but also open questions and desiderata.<\/p>\n<h5>Introduction<\/h5>\n<p>This article examines the challenges of implementing the FAIR principles retrospectively in long-term projects that began before 2016. When the \u2018Edition der fr\u00e4nkischen Herrschererlasse\u2019 project was started in 2014, the FAIR Data Principles (<em><strong>F<\/strong>indable, <strong>A<\/strong>ccessible, <strong>I<\/strong>nteroperable, <strong>R<\/strong>eusable<\/em>) had not yet been formulated. It was not until 2016 that the <a href=\"https:\/\/force11.org\/\" target=\"_blank\" rel=\"noopener\">FORCE11<\/a> community published its groundbreaking guidelines for handling research data (Wilkinson, M. D. et al. 2016), which have since also been adopted in funding programmes. Today, almost a decade later, we are faced with the challenge of retroactively adapting our project to standards that did not yet exist in this form when it was initially conceptualised.<\/p>\n<h5>The Dilemma of Anteriority<\/h5>\n<p>While newly proposed projects must submit a data management plan (DMP) and thus clarify the storage of their research data from the outset, Capitularia has not yet done so. The original work plan did not allocate resources for FAIR research data management (RDM) \u2013 simply because these requirements were not yet standard at the time of application. However, the subsequent integration of RDM measures is not only a conceptual issue, but above all a question of resources: Who will do the work? Where will the funds (and time) come from? 
And how can these tasks be prioritised against the core objectives of the project \u2013 the editorial work itself?<\/p>\n<p>In this context, finding a suitable repository for the long-term provision of our research data is particularly challenging, and the decision is not trivial. Only a few (certified) repositories are suitable for complex, TEI-XML-encoded edition data and are also relevant to, responsible for, or even accessible to a project like ours. Inclusion in such a repository involves considerable effort. Not only must the data be prepared in accordance with the specifications of the repository in question, but there is also the issue of the long-term costs of data storage and permanent provision. Who bears these costs after the end of the project? What costs should be expected? Can they be foreseen? Moreover, we are concerned not only with providing the data layer itself: ideally, it should be maintained together with a presentation layer, since without this context the data is less useful or even incomprehensible to the general public.<\/p>\n<p>For the latter problem, no satisfactory answers have yet been found \u2013 beyond statification and the reduction of functionalities \u2013 but for the other levels (e.g. bit layer and data layer), at least technical solutions exist. Established repositories such as Heidelberg Open Research Data (<a href=\"https:\/\/heidata.uni-heidelberg.de\/dataverse\/root\" target=\"_blank\" rel=\"noopener\">heiData<\/a>) for Heidelberg Digital Editions (<a href=\"https:\/\/www.ub.uni-heidelberg.de\/publikationsdienste\/digitale_editionen.html\">heiEDITIONS<\/a>) and the G\u00f6ttingen TextGrid Repository (<a href=\"https:\/\/textgridrep.org\/\">TextGridRep<\/a>) contain edition data and offer various advantages and disadvantages compared to discipline-specific solutions. 
<a href=\"https:\/\/radar.products.fiz-karlsruhe.de\/de\/radarabout\/radar4memory\" target=\"_blank\" rel=\"noopener\">RADAR4Memory<\/a> is a new repository for historically oriented humanities research, operated by FIZ Karlsruhe as a partner institution of the NFDI consortium <a href=\"https:\/\/4memory.de\/\">NFDI4Memory<\/a>.<\/p>\n<p>Where available and appropriate, repositories provided by a project\u2019s own institution may also be suitable. The University of Cologne has such a repository in the form of the Data Centre for the Humanities (<a href=\"https:\/\/dch.phil-fak.uni-koeln.de\/\">DCH<\/a>), which also works closely with the CCeH, our technical partner, and acts as the data centre for the Digital Humanities Coordination Office of the North Rhine-Westphalian Academy of Sciences, Humanities and the Arts. However, its focus is more on AV data, so it is unlikely that anyone would search for \u2018our\u2019 research data there. Furthermore, the DCH did not yet exist in its current form and with its broad portfolio of services when the Capitularies Project began, so no direct collaboration could be planned in advance. If a similar project were to be launched today, it would be a matter of course to involve the DCH and other service providers from the outset, to seek advice and to clarify a prospective data deposit in advance.<\/p>\n<h5>Zenodo as a pragmatic interim solution<\/h5>\n<p>In order to defuse the situation and postpone a final decision on a repository, we have set up the Zenodo community \u2018<a href=\"https:\/\/zenodo.org\/communities\/capitularia\/\">Capitularia<\/a>\u2019. This open \u2018repository\u2019 is generally accessible and is also used and recommended by the DCH as an external service. The (additional) storage of data sets and publications on Zenodo has emerged as a best practice in the field of digital humanities and beyond. 
There, we deposit transcription files (in addition to the download option on our own website), presentations and scientific blog posts accompanying the project, which are thus assigned DOIs and become citable. This solution offers several advantages, even if the fundamental question of long-term data storage remains unresolved:<\/p>\n<ul>\n<li>Manageable effort and thus easy integration into existing workflows<\/li>\n<li>Independence from other services or individuals<\/li>\n<li>Possibility of versioning<\/li>\n<li>Free, sustainable storage by <a href=\"https:\/\/home.cern\/\">CERN<\/a><\/li>\n<li>Automatic DOI assignment for citability<\/li>\n<li>Visibility through linking to <a href=\"https:\/\/www.openaire.eu\/\">OpenAIRE<\/a> (Open Access Infrastructure for Research in Europe)<\/li>\n<\/ul>\n<h5>Capitularia in Context<\/h5>\n<p>Through the CCeH as technical partner, and currently also through a staff member employed in both structures, Capitularia is closely linked to the NFDI consortium <a href=\"https:\/\/text-plus.org\/en\/\" target=\"_blank\" rel=\"noopener\">Text+<\/a>, which focuses on text- and language-based data and, in the data domain of editions, is explicitly dedicated to the issues and challenges already mentioned here. Text+ offers a comprehensive range of <a href=\"https:\/\/text-plus.org\/en\/daten-dienste\/consulting\/\" target=\"_blank\" rel=\"noopener\">consulting<\/a> services, including on research data management and standards, develops <a href=\"https:\/\/textplus.pages.gwdg.de\/textplus-editions\/guidelines_sde\/\">guidelines<\/a> on best practices in these areas, and organises workshops and training courses. 
The <a href=\"https:\/\/registry.text-plus.org\/default\/landing?lang=en\" target=\"_blank\" rel=\"noopener\">Text+ Registry<\/a> also enables structured <a href=\"https:\/\/registry.text-plus.org\/doc\/edition\/2841381d-6a8f-409e-8af2-e6a20d3152d8\" target=\"_blank\" rel=\"noopener\">traceability of the project<\/a> and networking with other projects at the metadata level. Repository providers also participate in Text+, so the relevant contacts are at least known. Several academies are likewise involved, giving rise to the hope that this connection will enable the aforementioned problems to be tackled jointly with other academy projects, thereby creating synergies.<\/p>\n<h5>Initial Findings and Open Questions<\/h5>\n<p>The experiments and experiences to date indicate that retroactive or ongoing FAIRification is possible to a limited extent, but also requires pragmatic solutions. Incremental or partial improvements seem more realistic than a complete ad hoc conversion to a supposedly perfect, by-the-book RDM, which could hardly be sustained alongside the regular work and might fail for lack of resources. Documentation and transparency of decisions are essential here.<\/p>\n<p>Articulating the difficulties we face and our own position, for example in this article, in discussions at conferences or with other projects, helps us to reflect on our approaches and to define individual, feasible work packages. In general, however, questions remain unanswered: if certain aspects or parameters could not be factored into the project planning from the outset, the structural question arises as to how such (long-term) projects can be supported in integrating and implementing measures retrospectively so that they meet current standards. 
What sustainable financing models are available for data preservation and data provision after the end of a project, and what options should be available (for legacy projects)? The discussion about FAIR research data management in digital editions is still in its early stages \u2013 with pieces like this one, we hope to contribute a small part to the debate on this complex topic.<\/p>\n<p style=\"text-align: right;\"><em>Daniela Schulz<\/em><\/p>\n<h5>References and Links:<\/h5>\n<ul>\n<li style=\"text-align: left;\">HeiData: <a href=\"https:\/\/heidata.uni-heidelberg.de\/dataverse\/root\" target=\"_blank\" rel=\"noopener\">https:\/\/heidata.uni-heidelberg.de\/dataverse\/root<\/a><\/li>\n<li style=\"text-align: left;\">HeiEDITIONS: <a href=\"https:\/\/www.ub.uni-heidelberg.de\/publikationsdienste\/digitale_editionen.html\" target=\"_blank\" rel=\"noopener\">https:\/\/www.ub.uni-heidelberg.de\/publikationsdienste\/digitale_editionen.html<\/a><\/li>\n<li style=\"text-align: left;\">HeiEDITIONS Documentation: <a href=\"https:\/\/heieditions.github.io\/guidelines\/toc.html\" target=\"_blank\" rel=\"noopener\">https:\/\/heieditions.github.io\/guidelines\/toc.html<\/a><\/li>\n<li style=\"text-align: left;\">Sandra K\u00f6nig et al. (2024): FAIRes FDM f\u00fcr digitale Editionen: Konzept f\u00fcr einen Workshop im World Caf\u00e9-Format. Zenodo. <a href=\"https:\/\/doi.org\/10.5281\/zenodo.11618480\" target=\"_blank\" rel=\"noopener\">https:\/\/doi.org\/10.5281\/zenodo.11618480<\/a><\/li>\n<li style=\"text-align: left;\">Karoline Lemke et al.: Empfehlung zur Erstellung, Bearbeitung und Publikation FAIRer Forschungsdaten in der Datendom\u00e4ne Editionen. 
<a href=\"https:\/\/textplus.pages.gwdg.de\/textplus-editions\/guidelines_sde\/\" target=\"_blank\" rel=\"noopener\">https:\/\/textplus.pages.gwdg.de\/textplus-editions\/guidelines_sde\/<\/a><\/li>\n<li style=\"text-align: left;\">RADAR4Memory: <a href=\"https:\/\/radar.products.fiz-karlsruhe.de\/de\/radarabout\/radar4memory\" target=\"_blank\" rel=\"noopener\">https:\/\/radar.products.fiz-karlsruhe.de\/de\/radarabout\/radar4memory<\/a><\/li>\n<li style=\"text-align: left;\">Melanie Seltmann \/ Sandra K\u00f6nig (2024): Text+ @ FORGE \u2013 FAIRes FDM f\u00fcr digitale Editionen. In: Text+ Blog. <a href=\"https:\/\/doi.org\/10.58079\/vfb4\" target=\"_blank\" rel=\"noopener\">https:\/\/doi.org\/10.58079\/vfb4<\/a><\/li>\n<li style=\"text-align: left;\">TextGrid Repository: <a href=\"https:\/\/textgridrep.org\/\" target=\"_blank\" rel=\"noopener\">https:\/\/textgridrep.org\/<\/a><\/li>\n<li style=\"text-align: left;\">Text+: Research Data Management. <a href=\"https:\/\/text-plus.org\/themen-dokumentation\/forschungsdatenmanagement\/\" target=\"_blank\" rel=\"noopener\">https:\/\/text-plus.org\/themen-dokumentation\/forschungsdatenmanagement\/<\/a><\/li>\n<li style=\"text-align: left;\">Wilkinson, M. D. et al. (2016): The FAIR Guiding Principles for scientific data management and stewardship. In: Scientific Data 3, 160018. <a href=\"https:\/\/doi.org\/10.1038\/sdata.2016.18\" target=\"_blank\" rel=\"noopener\">https:\/\/doi.org\/10.1038\/sdata.2016.18<\/a><\/li>\n<li style=\"text-align: left;\">Zenodo Community Capitularia: <a href=\"https:\/\/zenodo.org\/communities\/capitularia\" target=\"_blank\" rel=\"noopener\">https:\/\/zenodo.org\/communities\/capitularia<\/a><\/li>\n<\/ul>\n       <div class=\"cite_as\">\n         <h5>How to cite<\/h5>\n         <div>\n           <span class=\"author\">Daniela Schulz<\/span>,\n           <span class=\"title\">From the Engine Room #2: FAIR research data retrospectively? 
FAIRification as a challenge in ongoing projects<\/span>,\n           in: Capitularia. Edition of the Frankish Capitularies, ed. by\n           Karl Ubl and collaborators, Cologne 2014 ff.\n           \n           URL: https:\/\/capitularia.uni-koeln.de\/en\/blog\/aus-dem-maschinenraum-faire-forschungsdaten\/ (accessed on 07\/26\/2026)\n         <\/div>\n       <\/div>\n<p><\/p>\n<p>&nbsp;<\/p>","protected":false},"excerpt":{"rendered":"<p>The blog series \u2018From the Engine Room\u2019 is dedicated to the technical aspects and challenges of the \u2018Edition der fr\u00e4nkischen Herrschererlasse\u2019. Unlike our previous scientific posts on findings and editorial insights, here we focus on the infrastructural, methodological and technological dimensions of a long-term digital project. Since 2014, we have been working on a new [&hellip;]<\/p>\n","protected":false},"author":44,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"inline_featured_image":false,"footnotes":""},"categories":[186],"tags":[188,189,190,134,200,194,192,187,195,193],"class_list":["post-32423","post","type-post","status-publish","format-standard","hentry","category-from-the-engine-room","tag-fair","tag-forschungsdaten","tag-forschungsdatenmanagement","tag-infrastrukturen","tag-lza","tag-nfdi","tag-repositorien","tag-technologien","tag-text","tag-zenodo"],"_links":{"self":[{"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/posts\/32423","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/users\/44"}],"replies":[{"embeddable":true,"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/comments?post=32423"}],"version-history":[{"count":30,"href":"https:\/\/c
apitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/posts\/32423\/revisions"}],"predecessor-version":[{"id":32555,"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/posts\/32423\/revisions\/32555"}],"wp:attachment":[{"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/media?parent=32423"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/categories?post=32423"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/capitularia.uni-koeln.de\/en\/wp-json\/wp\/v2\/tags?post=32423"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}