Skip to main content


The new RDA web platform is still being rolled out. Existing RDA members PLEASE REACTIVATE YOUR ACCOUNT using this link: Please report bugs, broken links and provide your feedback using the UserSnap tool on the bottom right corner of each page. Stay updated about the web site milestones at

WG Data Versioning – RDA 13th Plenary Meet

  • Creator
  • #134393

    Jens Klump

    Meeting title 
    Data Versioning WG Working Meeting (Remote Access Instructions)
    Room Location: Commonwealth A2
    Collaborative session notes:
    Short introduction describing the scope of the group and if any previous activities 
    The demand for reproducibility of research results is growing, Therefore it will become increasingly important for a researcher to be able to cite the exact extract of the data set that was used to underpin their research publication. However, systematic data versioning practices are currently not available.
    Versioning procedures and best practices are well established for scientific software and can be used to enable reproducibility of scientific results. The codebase of large software projects does bear some semblance to large dynamic datasets. Are therefore versioning practices for code also suitable for data sets or do we need a separate suite of practices for data versioning? How can we apply our knowledge of versioning code to improve data versioning practices?
    Over the past year, we have collected use cases of data versioning practices and extracted data versioning patterns. A draft of the Working Group’s report and recommendations for data versioning practices will be presented in this session. We invite data scientists, operators of data repositories, and anyone who is interested in moving data versioning forward, to attend.
    Additional links to informative material related to the group 

    Working Group Page:
    Case Statement:
    Data Versioning Use Cases:
    Mapping of use cases to W3C Data Exchange WG use cases:…
    Draft of white paper on data versioning:

    Notes and presentations from past plenaries:

    Notes and presentation from P12 Gaborone:
    Notes from P11 Berlin:
    Presentation from P11 Berlin:
    Notes from P10 Montreal:
    Presentation from P10 Montreal:
    Notes from Denver Plenary BoF meeting:

    Meeting objectives 
    The objective of this session is to establish a work plan for this RDA Working Group on developing agreed practices for Data Versioning to finalise the outcomes and recommendations. This includes:

    Identifying areas where versioning is required and/or other use cases:

    Identifying groups in RDA and planning of how to engage 
    Identifying external groups 
    Overview of collected use cases

    Present the outline of a white paper on recommendations for data versioning:

    Spectrum of data types to be included (files, databases, unstructured data, model runs, etc.), 
    How to align these with the practices for the assignment of persistent identifiers.
    Identify other topics that should be included

    Meeting agenda

    Data Versioning WG retrospective
    Presentation of report draft and recommendations on data versioning practices
    Adoption of WG recommendations through W3C
    Engagement with other RDA and external groups
    Work plan for final six months of RDA Data Versioning WG
    Scheduling of online meetings up to Plenary 14

    Group chair serving as contact person
    Jens Klump
    Type of meeting
    Working meeting
    Target Audience

    Members of the Working Group.
    Data scientists and operators of data repositories
    Data producers and users
    Publishers who want to be sure that the correct version of a data set is cited in a publication
    Anyone who is interested in moving data versioning forward.


Log in to reply.