Combining cloud and Git tools in a research data management strategy for team science

Julien Colomb, Robert Mies

In project management of collaborative research projects, there is an increasing de-mand for an information and communication technology (ICT) infrastructure becausepeople are distributed in time and space, including a safe and legal data sharing in-frastructure. Data managers want to foster the production of FAIR data by implement-ing best research data management (RDM) practices. In practice, the use of a cloudservice is the easiest to implement, but shared folders tend to become huge and unor-ganised, which greatly limits the findability and reuse of the shared data. In order fordata managers to regularly monitor activities on the shared folder, they need a betterversion control system than what cloud systems provide.Here we present a strategy where the data manager uses the power of Git on a lo-cal copy of the shared folder. By spotting new and modified files, they can intervenevery early and pro-actively to keep files organised, produce useful metadata, or publishdata on behalf of the researchers. In particular, the data manager can move large filesoutside of the data synchronised on the researchers’ computers. This strategy wassuccessfully implemented in two projects that collect relatively small datasets. Be-cause most of the collected data is available and organised, publication and archivalof the whole data can be performed by the data manager, potentially making this dataFAIR during and after the project.

Bausteine FDM. 1;1-10 (2026)

Keywords

cloudcollaborationdata stewardshipgit
Share the article

Participating Institutions