WG Data Versioning - RDA 12th Plenary Meeting

You are here

Meeting title
Data Versioning WG Working Meeting (Remote Access Instructions) 

Meeting room: Okavango 2

Collaborative session notes

https://docs.google.com/document/d/1LJDTY2pjIBlcuQPEPAC9-3pL6FwJbfwrOTY7ZGm8Afo/edit?usp=sharing

Short introduction

The demand for reproducibility of research results is growing, Therefore it will become increasingly important for a researcher to be able to cite the exact extract of the data set that was used to underpin their research publication. However, systematic data versioning practices are currently not available.

Versioning procedures and best practices are well established for scientific software and can be used enable reproducibility of scientific results. The codebase of large software projects does bear some semblance to large dynamic datasets. Are therefore versioning practices for code also suitable for data sets or do we need a separate suite of practices for data versioning? How can we apply our knowledge of versioning code to improve data versioning practices?

We invite data scientists, operators of data repositories, and anyone who is interested in moving data versioning forward, to attend.

Additional links

Working Group Page: https://rd-alliance.org/groups/data-versioning-wg

Case Statement: https://rd-alliance.org/group/data-versioning-wg/case-statement/data-ver...

Data Versioning Use Cases: https://rd-alliance.org/data-versioning-use-cases

Notes from P11 Berlin: https://www.rd-alliance.org/data-versioning-wg-notes-p11-berlin

Presentation from P11 Berlin: https://www.rd-alliance.org/data-versioning-wg-presentation-p11

Notes from P10 Montreal: https://rd-alliance.org/notes-data-versioning-session-p10-montreal

Presentation from P10 Montreal: https://rd-alliance.org/data-versioning-presentation-rda-p10

Notes from Denver Plenary BoF meeting: https://www.rd-alliance.org/data-versioning-rda-8th-plenary-bof-meeting

Meeting objectives

The objective of this session is to establish a work plan for this RDA Working Group on developing agreed practices for Data Versioning. This includes:

1. Identifying other areas where versioning is required and/or other use cases:
- Identifying other groups in RDA and planning of how to engage
- Identifying other external groups
- Overview of already collected use cases
- Seeking further documented cases where groups and organisations are undertaking data versioning.

2. Discussing ways to categorise use cases

3. Develop the outline of a white paper on recommendations for versioning:
- Spectrum of data types to be included (files, databases, unstructured data, model runs, etc.),
- How to align these with the practices for the assignment of persistent identifiers.
- Identify other topics that should be included

Meeting agenda

1. Introduction

2. Recap of Why, How and What of Data Versioning

3. Review of use cases, including the W3C Dataset Exchange Use Cases and Requirements (https://docs.google.com/document/d/1TfBPlfjTVg0YcFxuw0UszAXPYrRmyZ6PCxtx...
Use case examples

4. Review of preliminary results from use cases documentation project

5. Work plan for RDA Data Versioning WG

6. Engagement with other RDA and external groups

7. Outline of white paper on data versioning practices

8. Scheduling of online meetings up to Plenary 13

Audience

Target Audience:

- Members of the Working Group.
- Data scientists and operators of data repositories
- Data producers and users
- Anyone who is interested in moving data versioning forward.

Suggested Preparation:

- Review already collected use cases
- Contribute new use cases

Group chair serving as contact person
Jens Klump

Type of meeting
Working meeting


Remote Access Instructions:

Please join my meeting from your computer, tablet or smartphone. 
https://global.gotomeeting.com/join/954242053 

You can also dial in using your phone. 
United States: +1 (786) 535-3219 

Access Code: 954-242-053 

More phone numbers 
Australia: +61 2 8355 1050 
Austria: +43 7 2081 5427 
Belgium: +32 28 93 7018 
Canada: +1 (647) 497-9353 
Denmark: +45 32 72 03 82 
Finland: +358 923 17 0568 
France: +33 170 950 594 
Germany: +49 692 5736 7317 
Ireland: +353 15 360 728 
Italy: +39 0 230 57 81 42 
Netherlands: +31 207 941 377 
New Zealand: +64 9 280 6302 
Norway: +47 21 93 37 51 
Spain: +34 932 75 2004 
Sweden: +46 853 527 836 
Switzerland: +41 225 4599 78 
United Kingdom: +44 20 3713 5028 

First GoToMeeting? Let's do a quick system check: 
https://link.gotomeeting.com/system-check