DMP Common Standards WG

WG

Group details

Secretariat Liaison: 
Lynn Yarmey
TAB Liaison: 
Wenbo Chu
WGs Getting started (~0-6 months after RDA endorsement)
 

The need for establishing this working group was articulated during the 9th plenary meeting in Barcelona during the Active DMPs IG session.  The discussion was framed by a white paper by Simms et al. on machine-actionable data management plans (DMPs). The white paper is based on outputs from the IDCC workshop held in Edinburgh in 2017 that gathered almost 50 participants from Africa, America, Australia, and Europe. It describes eight community use cases which articulate consensus about the need for a common standard for machine-actionable DMPs (where machine actionable is defined as “information that is structured in a consistent way so that machines, or computers, can be programmed against the structure”)

 

The specific focus of this working group is on developing common information model and specifying access mechanisms that make DMPs machine-actionable. The outputs of this working group will help in making systems interoperable and will allow for automatic exchange, integration, and validation of information provided in DMPs, for example, by checking whether a provided PID links to an existing dataset, if hashes of files match to their provenance traces, or whether a license was specified. The common information models are NOT intended to be prescriptive templates or questionnaires, but to provide re-usable ways of representing machine-actionable information on themes covered by DMPs.

 

The vision that this working group will work to realise is one where DMPs are developed and maintained in such a way that they are fully integrated into the systems and workflows of the wider research data management environment. To achieve this vision we will develop a common data model with a core set of elements. Its modular design will allow customisations and extensions using existing standards and vocabularies to follow best practices developed in various research communities. We will provide reference implementations of the data model using popular formats, such as JSON, XML, RDF, etc.  This will enable tools and systems involved in processing research data to read and write information to/from DMPs. For example, a workflow engine can add provenance information to the DMP, a file format characterization tool can supplement it with identified file formats, and a repository system can automatically pick suitable content types for submission and later automatically identify applicable preservation strategies.

 

The deliverables will be publicly available under CC0 license and will consist of models, software, and documentation. The documentation will describe functionality and semantics of terms used, rationale, standard compliant ways for customisation, and requirements for supporting systems to fully utilise the capabilities of the developed model.

 

The working group will be open to everyone and will involve all stakeholders representing the whole spectrum of entities involved in research data management, such as: researchers, tool providers, infrastructure operators, repository staff and managers, software developers, funders, policy makers, and research facilitators. We will take into account requirements of each group.This will likely speed up and increase adoption of the working group outcomes.

 

The group will predominantly collaborate online, but will use any possibility to meet in person during RDA plenaries, conferences, workshops, hackathons or other events in which their members participate. All meetings in which decisions are made will be documented and their summaries will be circulated using the RDA website.

 

The work will be performed iteratively and incrementally following the best practices from system and software engineering. We will evaluate preliminary drafts of the model with community to receive early feedback and to ensure that the developed common model is interoperable and exchangeable across implementations. We will also express existing DMPs using the developed common model and will investigate how to support modification of machine actionable DMPs by various tools involved in data management process, while ensuring that proper provenance and versioning information is stored with. Finally, we will build prototypes to investigate possible system integrations and to evaluate to which degree the information contained in the DMPs can be automatically validated and which actions or alerts depending on a DMP state can be triggered, e.g. by sending notifications to repositories or funder systems.

 

During our work we will monitor parallel efforts and engage with various research communities to find candidates for pilot studies and to transfer the acquired know-how. Towards the end of the lifetime of the working group we will launch pilot projects in which the model will be customised to suit the needs of the identified interested communities. Pilot studies will use the models to integrate systems and demonstrate how machine-actionable DMPs can work.

 

We believe that the outcomes delivered by this group will contribute to improving the quality of research data and research reproducibility, while at the same time reducing the administrative burden for researchers and systems administrators.


Recent Activity

17 Feb 2018

Recent developments and the next call

Dear group members!
We would like to invite you to a call in which we would like to discuss our latest developments and discuss the next steps:
* Results of the user story collection, labelling, grouping and visualizing. Some spoilers below:
o https://bl.ocks.org/peterneish/f6dad14e46327011f0ccf15d49dd27fb
o https://github.com/RDA-DMP-Common/user-stories/projects/2

19 Jan 2018

Publishing DMPs

Thanks for valuable input to this issue. I also would like to add a correction: The recommendations in the openAIRE report I referred to was not directed to the ERC, but to EC. Thanks to Dagmar Meyer for pointing this out to me.

Best regards,
Philipp

 

16 Jan 2018

Publishing DMPs

It is becoming increasingly common to make data management plans (DMPs) public. The Norwegian Research Council have recently updated their policy adding a section where they encourage Norwegian research institutions to publish the DMPs of their researchers (cf. – in Norwegian –  https://www.forskningsradet.no/no/Nyheter/Datahandteringsplaner_sikrer_gjenbruk_av_data/1254032409350/p1174467583739).

12 Dec 2017

P11 session proposal Berlin

Dear all,
We will be submitting a session proposal for the 11th plenary meeting in
Berlin - please find the draft here:
https://docs.google.com/document/d/1EPdFgodWpV9U6Zo7reuhfbemNK9zS6zhJdJu...
u3A/edit?usp=sharing
Please give your feedback directly in the document or during one of the two
calls that we are organising this week.
During the calls we will discuss the current status of cleaning up and
labelling our user stories that we perform here:

12 Dec 2017

RDA DMP Common Standards WG call (Europe/North America) scheduled for Thursday, 14th December

Dear RDA DMP Common Standards WG,
I have now closed the ‘Doodle poll’ for this meeting. I’m afraid we couldn’t find a time which worked for everyone, but the clear favourite time is:
Thursday, 14th December, 15:00 - 16:00 (GMT)
If you go to this link, you should be able to see this time converted into your own local timezone.
https://doodle.com/poll/et32nr968xc7kwnq

11 Dec 2017

Re: [dmp-common] DMP Common Standards Update Call (APAC timezones)

Thank you to those who indicated their availability for an update call (Asia Pacific timezones).
Please find details of the call below.
Topic: DMP Common Standards APAC call
Time: Dec 14, 2017 11:00 AM Australia/Melbourne
Join from PC, Mac, iOS or Android: https://unimelb.zoom.us/j/543403163
Or join by phone:
Dial: +61 2 8015 2088
Meeting ID: 543 403 163

08 Dec 2017

DMP Common Standards Update Call (2)

Dear RDA DMP Common Standard WG members,
you will have seen that one of my co-chairs, Peter Neish, is arranging a call to update members on recent and planned activities. His call is aimed at people in the Asia Pacific region timezone.
I am arranging a similar call for people in the European and East-Coast North American timezones. If you are interested in participating, please add your name to the Doodle poll here, indicating your availability:
https://doodle.com/poll/et32nr968xc7kwnq

06 Dec 2017

DMP Common Standards Update Call

Dear all,
This is to let you know that we are having a couple of calls across different time zones next week to update on progress for the DMP common standards Working Group.
Please indicate your availability at the following doodle poll, which is aimed at the Asia Pacific region. If you do not find any times, the second call (details to come) may be better suited to your time zone.
https://doodle.com/poll/7ty8rhazgexkqs9c

28 Nov 2017

New user stories added but we need more!

Dear all,
I am writing to share good news with you: we have 88 user stories in our open consultation on GitHub!
There are new user stories coming from a workshop that we organised in Vienna. Together with Kevin Ashley from the DCC, we managed to gather people from different institutions having different roles to discuss and write up user stories. We have analysed research data lifecycle, as well as the DMP themes. You can find the workshop report here:
https://doi.org/10.5281/zenodo.1067753