- Re-visiting the recommendations
- Discussion of the HDSR Review Paper on adoptions
- Presenting new adopters' approaches, solutions and implementations
- Discussing feedback from adopters
- Discussion of novel application domains such as versioning on-line learning/AI systems, cross-links to reproducibility in science, EU-AI regulation, etc.
- Identifying the next steps forward, adaptations required, etc.
Collaborative meeting notes: https://docs.google.com/document/d/1xTj0Xim9T_jQY9IjmeoZXm5lw6SRKZnNHTKi...
Draft Program for the WG meeting
1. Short description of the WG recommendations (for newcomers)
2. Brief reports on use cases
3. Report on HDSR survey paper on adoptions
4. Discussion of novel application domains such as versioning on-line learning/AI systems, cross-links to reproducibility in science, EU-AI regulation
5. Other issues, next steps
- Data Center Operators wishing to provide precise identification and citation services
- Researchers wanting to encourage their data center operators to provide data identification and citation services
The RDA Working Group on Data Citation (WG-DC) brings together experts addressing the issues, requirements, advantages and shortcomings of existing approaches for efficiently identifying and citing arbitrary subsets of (potentially highly dynamic) data. It's recommendations are based upon on (1) timestamping and versioning changes to evolving data and (2) identifying arbitrary subsets by assigning PIDs to the queries selecting the according subsets and are applicable across all types of data, such as e.g. collections of files, relational databases, multidimensional data cubes or regions in images..
The WGDC Recommendations in the short form of a 2-page flyer are available at:
An extended Description of Recommendations is available at: Bulletin of the IEEE Technical Committee on Digital Libraries, 12:1, 2016. (https://zenodo.org/record/4048304)
Webinar recordings as well as slide sets and supporting papers by adopters presenting their experience in implementing the recommendations are collected at the RDA WGDC webinar page at
A comprehensive review of the recommendations, the wide range of reference implementations as well as a survey of all adoptions reported over the years has recently been published in the Harvard Data Science Review: Rauber, A., Gößwein, B., Zwölf, C. M., Schubert, C., Wörister, F., Duncan, J., … Parsons, M. A. (2021). Precisely and Persistently Identifying and Citing Arbitrary Subsets of Dynamic Data. Harvard Data Science Review, 3(4). https://doi.org/10.1162/99608f92.be565013
The Data Citation WG has delivered its outputs and is now primarily focusing on supporting adoption by maintaining these outputs, assisting institutions in implementing the recommendations and sharing the lessons learned.