Collaborations at the 4th BioHackathon Germany
The 4th BioHackathon Germany took place from 1-5 December 2025 in Walsrode, Germany, organised by de.NBI and ELIXIR Germany. Life scientists, data managers, software developers and project leaders attended the hybrid event to work together on open source code, standards and infrastructure to advance research data practices and tools.

Improving RO-Crate support in federated storage systems
One of the hackathon projects was titled Enhancing FAIR (Meta-)Data Practices in Life Science by Improving RO-Crates Support in Federated Storage Systems. It was led by Sebastian Beyvers and Jannis Schlegel (University of Giessen).
During their participation in the Biohackathon Germany 2025, their working group engaged in extensive discussions focused on the practical and technical challenges of integrating RO-Crates in federated storage systems for life science data. The group produced many outputs including extensions to ro-crate-py Python library and ro-crate-rs Rust library, an RO-Crate indexing tool, and the RO-Crate Explorer application to parse, visualise and traverse RO-Crates.
Making FAIRer access to training registries and learning paths across domains
The hackathon project titled On the path to machine-actionable training materials was led by Nick Juty (University of Manchester) and Petra Steiner (Technical University of Darmstadt). The lead author of the project report was Phil Reed (University of Manchester).
The project operated across three interrelated streams: metadata interoperability, material analysis, and the definition and representation of learning paths in a machine readable manner.
- Content federation was demonstrated via the mTeSS-X platform, enabling cross-instance exchange and preparing for future integration with the EOSC federation. To enhance interoperability, relevant ontologies and crosswalks were curated between established metadata models, specifically MoDALIA and Schema.org/Bioschemas. These mappings were implemented within the open-source OERbservatory Python package, providing a facility for exchanging data between platforms such as DALIA and TeSS.
- For material analysis, Large Language Models (LLMs) were utilised and vectorisation techniques were explored to calculate similarity, allowing for the identification of related materials and the potential for future deduplication of records across registries.
- To address the lack of machine-actionable trajectories across related or sequential materials, new Bioschemas profiles were proposed, specifically for learning paths. This model was validated using SPARQL queries on knowledge graphs derived from real-world examples like the Galaxy Training Network.
Such advancements provide a foundation for automated path generation and improved discoverability within training catalogues, and serves as a use case and strategy with broader applicability beyond those materials.
