Data Discovery Paradigms IG


Group details

Anita de Waard, Siri Jodha Singh Khalsa
Secretariat Liaison: Kathy Fontaine
TAB Liaison: Andrew Treloar

RDA Interest Group Charter

Name of Interest Group:  Data Discovery Paradigms IG


An emerging principle for research data is that it should be FAIR: “Findable, Accessible, Interoperable and Reusable”. To comply with the first of these criteria, being Findable, we need a data infrastructure that supports users in discovering research data regardless of its location or the manner in which it is stored, described and exposed. This is a significant and growing challenge, as the number of research data repositories and the need for cross-disciplinary data discovery both increase. This interest group aims to explore the common elements and issues shared by those who search for data and those who build systems that enable data search.



The objective of this interest group is to provide a forum where representatives from across the spectrum of stakeholders and roles pertaining to data search can discuss issues related to improving data discovery. The goal is to identify concrete deliverables such as a registry of data search engines, common test datasets, usage metrics, and a collection of data search use cases and competency questions.


Key questions the IG wishes to address:

At RDA P8, we identified a long list of topics pertaining to data discovery, which group members then narrowed by vote to a shortlist of 10 topics. The top 5 of these have been selected as the key Task Forces on which the group is focusing; three of them have already started working.

For more details on the full list of potential task forces and the process followed in selecting them, please see the page on Task Forces.


  • Sept 2016 – We held a very well-attended kickoff meeting at RDA Plenary 8, where topics were identified
  • Sept - Dec 2016 – Through an online voting mechanism, the top 5 topics for Task Forces were determined, and chairs for these Task Forces were identified
  • Jan - April 2017 – Task Forces hold meetings to determine scope, deliverables and timelines
  • April 5, 2017 – Next meeting of the IG at RDA Plenary 9 to discuss Task Forces' scope, plans and deliverables
  • April - Sept 2017 – Activities proceed for Task Forces
  • Sept 2017 – RDA Plenary 10; IG meets, Task Forces report back, and the group decides whether to close each Task Force or turn it into a Working Group, and whether to start new Task Forces.


Related Activities:

  • NASA’s WG on Search Relevancy – focus is on improving search result relevance for EOSDIS data
  • ESIP’s Information Quality Cluster and NASA’s WG on Data Quality are both addressing ways of capturing and conveying quality information
  • W3C’s Best Practices for Spatial Data on the Web aims to improve discoverability and accessibility of geodata


Other RDA IGs whose activities are of interest and with whom we will interact:

  • Metadata
  • Registries
  • Brokering
  • PIDs
  • Research data collections



The Data Discovery Paradigms Interest Group is open to all members and encourages active participation through the Task Force mechanism. Task Forces hold phone conferences on a regular basis. To become active in the Task Forces or to propose other activities for the IG, please contact the Chairs.



Recent Activity

05 Apr 2017

Remote Access for the Data Discovery Paradigms IG session in P9

Dear IG members,
This is just a gentle reminder of our session later this afternoon, "IG Data
Discovery Paradigms: Reports from Task Forces and Way Ahead", in room MR14.
For those who have not been able to join us in sunny(ish) Barcelona, there
is also the option of connecting remotely using the following information:
GTM Link:

29 Mar 2017

Re: [datadiscovery] Re: [datadiscovery] Joint meeting of the three task forces is...

Thanks SJ
Dr. Lewis John McGibbney Ph.D., B.Sc.

28 Mar 2017

Re: [datadiscovery] Joint meeting of the three task forces is on Wed. 29 March,...

Hi Folks,
I’ve been trying to juggle my diary, and there is no excuse for me not to be free at 4AM, right ;)
Anyways, my question here: are these telecons weekly? I’m just trying to keep my calendar up-to-date.
Thank you
Dr. Lewis John McGibbney Ph.D., B.Sc.

24 Mar 2017

Joint meeting of the three task forces is on Wed. 29 March, at UTC 11:00am

Dear All,
Thanks to those who have participated in the poll. The following time slot
has the most votes.
In this meeting, we will discuss and coordinate our presentations and
activities for the group's session at RDA P9.
Hope to see or hear from more of you.
Anita, Fotis, Siri and Ming
Location | Local Time | Time Zone | UTC Offset
- Victoria) | Wednesday, 29 March 2017 at 10:00:00 pm | AEDT | UTC+11 hours
New York (USA - New

23 Mar 2017

Draft outputs for scrutiny, and please join us at RDAP9!

Dear Data Discovery Interest Group members,
We are excited to share with you the draft outputs created by three of the Task Force teams of the Research Data Alliance Data Discovery Paradigms Interest Group. We want to thank all those who contributed to these reports, and especially the chairs of the groups who worked tirelessly to get these outputs done before our next Plenary.
We would greatly appreciate your thoughts and feedback on the three interim documents from the respective Task Forces:

15 Mar 2017

Doodle Poll for a joint meeting of the three RDA DDP task forces

Dear All,
We plan to have a joint meeting of the three task forces (Best Practices
for Data Discovery, Use Cases, and Relevancy Ranking) before RDA P9. We
will discuss how to present our work and get feedback at RDA P9.
Please indicate your availability at this Doodle Poll:
Thanks and regards,
Dr Mingfang Wu
Senior Business Analyst | Australian National Data Service

15 Mar 2017

Notes from 14 March meeting for the Best Practices task force

Dear All,

Here are the notes from the 4th meeting on 14 March for the best practices task
force. Any feedback and/or suggestions on the following three documents will be
much appreciated.
- Best Practice for Data Providers
- Best Practice for Data Repositories
- Best Practice for Data Seekers