status: Recognised & Endorsed

Chair (s): Jens Klump, Lesley Wyborn, Mingfang Wu, Kirsten Elger

Group Email: [group_email]

Secretariat Liaison: Stefanie Kethers


The Data Versioning WG has transitioned to the Data Versioning IG as of July 2021. The email address and group space have remained the same. 


The demand for reproducibility of research results is growing, Therefore it will become increasingly important for a researcher to be able to cite the exact extract of the data set that was used to underpin their research publication. The capacity of computational hardware infrastructures have grown it is now common to have online petabyte data stores, This has encouraged the development of concatenated seamless data sets where users can use web services to select subsets based on spatial and time queries. Further, the growth in computer power has meant that higher level pre-processed data products can be generated in really short time frames.

This means that data sets and data products are needing some form of systematized way of being able to reference the exact version of the data that was used to underpin the research findings, and/or was used to generate higher level products. This was recognised by the RDA Working Group on Data Citation, whose final report recognises the need for Data Versioning. However, there were no specifics on best practice for data versioning, particularly for large volume multi-terabyte and even petabyte scale data sets. A BoF meeting held at the RDA Plenary in September 2016 in Denver highlighted the fact that there are no recognised best practices for versioning of data.

Versioning procedures and best practices are well established for scientific software and can be used enable reproducibility of scientific results. The codebase of very large software projects does bare some semblance to large dynamic datasets. Are these suitable for data sets or do we need a separate suite of practices for data versioning?

Ultimately versioning concepts developed for research data will need to be brought in line with versioning concepts used in persistent identifier systems.


The BoF initially emerged at Plenary 8 in Denver through the discussion available here:  https://www.rd-alliance.org/data-versioning-rda-8th-plenary-bof-meeting

Posts

08
September
2021

RDA Virtual Plenary 18 - Notification of Acceptance

by Secretariat Group Account

Dear Chairs of the Data Versioning IG, The Technical Advisory Board (TAB) has completed its review of session applications for the RDA Virtual Plenary 18 (VP18) and has accepted your application titled Advancing Data Versioning: From Principles to Actionable Recommendations. Please consider this your official notification of acceptance. Congratulations!
0 | Add new comment
01
September
2021

Sept 16 webinar ‘Navigating Data Sharing in International Research Collaborations (iN2N)'

by Stephanie Hagstrom

Hello RDA WG and IG members - You are invited to attend the webinar ‘Navigating Data Sharing in International Research Collaborations (iN2N)’ hosted by the RDA-US on 16 September at 14:00. This webinar will address the most challenging issues for data sharing in international research collaborations and will present a framework to address these challenges developed by the attendees of The International Network-of-Networks (iN2N) Global Expert workshop as part of the NSF funded iN2N project. You may read more about the webinar and register here.
0 | Add new comment
12
August
2021

RDA VP18 Session Proposal Submission

by Secretariat Group Account

Dear Data Versioning IG members, Thank you for your session proposal for Virtual Plenary 18 titled “Advancing Data Versioning: From Principles to Actionable Recommendations”. A review of all submitted proposals is now underway by the RDA Technical Advisory Board, with notifications of acceptance planned to be sent by 10 September. Please feel free to contact the RDA Secretariat at ***@***.***-alliance.org with any questions or concerns you have regarding your submission. Thank you. Regards, RDA Secretariat
0 | Add new comment
10
August
2021

Data Intelligence Journal CfP: Special Issue on "Metadata as Data Intelligence"

by Mingfang Wu

Greetings All: Please accept our apologies for multiple postings Please forward the CfP to interested colleagues Most of all Please consider submitting a paper Thanks!! ------------------ *Call for papers*: Data Intelligence , an MIT Press Direct open access journal, seeks contributions for a special issue on Metadata as Data Intelligence . *Important dates* - Submission Deadline: Oct. 31 2021 - Notification of acceptance: Dec. 31, 2021 - Revised/final manuscripts due: Feb. 10, 2022
0 | Add new comment
18
May
2021

Data Versioning WG Webinar Promotion Help

by Stephanie Hagstrom

Hello Group Members.  Jens is presenting a webinar next week showcasing the Data Versioning Working Group outputs. Please help promote this webinar through your social networking channels. There is detailed information on the webinar listed below and a list of easy ways for you to help.  See the section below titled "RDA Webinar Promotional Resources" where you can click on the links provided and select "like” or “share” or “retweet”. Webinar Information
0 | Add new comment
07
May
2021

New Data Versioning IG Charter Draft - Open period for comments closes today

by Jens Klump

Dear Members of the RDA Data Versioning WG/IG The period for comments on the new Data Versioning IG Charter is ending today. Thank you to those who sent us comments on the IG charter draft. Your comments have been very helpful. We will now publish the IG charter for review by the RDA community. After the community review, the IG charter will go to the RDA TAB for approval. IG Charter: https://docs.google.com/document/d/1ylfAmzKJvKICSaR_aL1I2OEEh9Oq-pUv3eox...
0 | Add new comment
21
April
2021

New Data Versioning IG Charter Draft

by Jens Klump

Dear Members of the Data Versioning WG/IG After the Data Versioning WG has come to a close. At the last meeting at RDA VP16 Costa Rica, we decided to continue as a new Data Versioning IG. The current procedures at RDA require us to submit a new charter for the Data Versioning IG for approval by the TAB.
2 | Add new comment
23
February
2021

RDA Virtual Plenary 17 - Notification of Acceptance

by Secretariat Group Account

Dear Chairs of the Data Versioning WG, Congratulations! Your RDA Virtual Plenary 17 (VP17) session application titled “Advancing Data Versioning: From Principles to Actionable Recommendations” has been approved by the Technical Advisory Board (TAB). Please consider this your official notification of acceptance.
0 | Add new comment
29
January
2021

RDA VP17 Session proposal submission

by Alexandra Delipalta

Dear Data Versioning WG members,    Thank you for your session proposal for Virtual Plenary 17 titled ‘Advancing Data Versioning: From Principles to Actionable Recommendations’. A review of all submitted proposals is now underway by the RDA Technical Advisory Board, with notifications of acceptance planned to be sent by 26 February.    Please feel free to contact the RDA Secretariat at enquiries@rd-alliance.org with any questions or concerns you have regarding your submission. Thank you.   
0 | Add new comment
23
October
2020

RDA Virtual Plenary 16 (VP16) - Deadline for poster submissions, registration and programme

by Alexandra Delipalta

Dear Data Versioning WG members,  With RDA VP16 fast approaching, we would like to bring the following news and key dates to your attention: Call for posters closing on 25 October 2020
0 | Add new comment

Pages