Data Discovery Paradigms IG

IG

Group details

Secretariat Liaison: 
Kathy Fontaine
TAB Liaison: 
Andrew Treloar
Case Statement: 
IG Established
 

RDA Interest Group Charter

Name of Interest Group:  Data Discovery Paradigms IG


Introduction:

An emerging statement on research data is that it should be FAIR: “Findable, Accessible, Interpretable and Reusable”. To comply with the first of these criteria, being Findable, we need a data infrastructure that supports users in discovering research data regardless of its location or the manner in which it is stored, described and exposed. This is a significant and growing challenge, as the number of research data repositories, and the need for cross-disciplinary data discovery, increases. This interest group aims to explore common elements and shared issues that those who search for data, and who build systems that enable data search, share.

 

Objectives:

The objectives for this interest group are to provide a forum where representatives from across the spectrum of stakeholders and roles pertaining to data search can discuss issues related to improving data discovery. The goal is to identify concrete deliverables such as a registry of data search engines, common test datasets, usage metrics, and a collection of data search use cases and competency questions.

 

Key questions the IG wishes to address:

At RDAP8, we identified a long list of topics pertaining to data discovery, which were then voted by the group members to a shortlist of 10 topics. The top 5 of these have been selected as the key Task Forces which the group is focusing on (linked to the wiki page for each Task Force); three of them have started working already: 

For more details on the full list of potential task forces and the process followed in selecting them, please see the page on Task Forces.

Timeline:

Related Activities:

  • NASA’s WG on Search Relevancy – focus is on improving search result relevance for EOSDIS data
  • ESIP’s Information Quality Cluster and NASA’s WG on Data Quality are both addressing ways of capturing and conveying quality information
  • W3C’s Best Practices for Spatial Data on the Web aims to improve discoverability and accessibility of geodata

Other RDA IGs whose activities are of interest and who we will interact with:

  • Metadata
  • Registries
  • Brokering
  • PIDs
  • Research data collections

 

Participation:

The Data Discovery Paradigms Interest Group is open to all members and encourages active participation through the Task Force mechanism. Task Forces have phone coniferences on a regular basis. To become active in either the Task Forces or propose other activities for the IG,, please contact the Chairs. 

 

 


Recent Activity

12 Dec 2017

RE: [datadiscovery] The RDA Data Discovery interest Group: Looking back and...

Dear Kerstin,
Thank you very much for your interest and enthusiasm; indeed, discussing granularity and domain-specific / cross-domain issues is an important aspect of Data Discovery and a great concept for a Task Force.
Would you be interested in leading such a Task Force? If so, what would be your general plan of action (i.e. the particular objectives and expected outcomes) for the TF?
Regards,
Fotis

01 Dec 2017

The RDA Data Discovery interest Group: Looking back and looking ahead

Greetings, members of the Data Discovery Paradigms Interest Group!
We want to make sure we get this message out to you as the end of 2017 comes into view, to let you know of the developments we’ve been working on, as well as new opportunities for 2018.
In particular, we are interested in your thoughts on forming and joining a new set of Task Forces (see point 3), before the end of the year. But first, here’s what we’ve been up to!
1. Meetings.
1. RDA P10:

03 Nov 2017

Re: [datadiscovery] UTC 8pm, Tuesday 2 Nov.: A kickoff discussion with Dr....

Dear Ming,
Thanks so much for recording and posting the presentation.
Helping repositories make their datasets more discoverable and
useful has been a primary goal of the DDPIG since its inception,
and use of schema.org markup is mentioned in Recommendation 9 of
our Best Practices for Data Repositories: "Make records easily
indexed and searchable by major web search engines".
I interpreted Natasha's suggestion regarding the focus a new
DDPIG group that it should work with the schema.org community on

27 Oct 2017

Re: [datadiscovery] UTC 8pm, Tuesday 2 Nov.: A kickoff discussion with Dr....

Hi Jennie,
Please go ahead to share the meeting ID with your colleagues.
To All: Sorry, I didn't make it clear about the meeting agenda in my last
email. Here it is:
* Natasha: Data search at Google and Google's guideline for data
repositories (~15-20 mins)
* All: Q&A and group discussion (~30 mins)
* All: Discussion of starting a new task force within the RDA Data
Discovery Paradigm IG (~10 mins)
Regards,
Ming

26 Oct 2017

UTC 8pm, Tuesday 2 Nov.: A kickoff discussion with Dr. Natasha Noy from Google on making data discoverable by web search engines

Dear All,
I am glad to inform you that Dr. Natasha Noy from Research at Google has
accepted our invitation to introduce us the effort Google has put into data
discovery and Google's guidelines for data repositories to make data more
easily discoverable by web search engines.
You may recall that our Best Practices Task Force has drafted a white paper

04 Sep 2017

Reminder: Relevancy Ranking TF meeting: 5 Sept. at 11am UTC

Dear All,
The next relevancy ranking TF meeting is on 5th Sept. at 11am UTC.
We sent a survey to collect information about current practices of data
repositories in setting up data search services. As of on 1st Sept., we
got 114 responses, 20 out of 114 are incomplete ones. A preliminary summary
of 94 complete responses is available here
.
More analyses on possible correlations will be discussed.
Regards,
Ming
-------------------------------
Here is the information for joining the meeting.

16 Aug 2017

Re: [datadiscovery] Interesting Article on Data Discovery

This might be an interesting read for this group:
https://link.springer.com/article/10.1007/s00799-017-0227-5
International Journal on Digital Libraries, pp 1–16
Extracting discourse elements and annotating scientific documents using the SciAnnotDoc model: a use case in gender documents
Authors Hélène de Ribaupierre, Gilles Falquet
Abstract

14 Aug 2017

Meeting Reminder: 11am UTC, 15 August, for the RDA DDPIG Relevancy Ranking TF

Dear All,
This is a gentle reminder that our next meeting for the Relevancy Ranking
TF is on 15 August, 11am UTC.
So far, we have 92 responses to the survey
.
In this meeting, we will look at ways to summarise and report survey
responses. Hope you can join us. You are also welcome to table any
relevancy ranking related issues for discussion.
Regards,
Ming
PS: Information for joining the meeting:

05 Aug 2017

Fwd: For RDA Data Discovery IG

Members of the Data Discovery Paradigms IG will most likely be
interested in this white paper, "Searching Data: A Review of
Observational Data Retrieval Practices", by Gregory et al. In
particular, this could add quite a few use cases - see the "User
Actions" and "Systems" sections.

sjs

-------- Forwarded Message --------

cellpadding="0">

Subject:

For RDA Data Discovery IG

Date:
Sat, 29 Jul 2017 20:11:34 -0600

01 Aug 2017

DDPIG Task Force Use Cases minutes

Dear RDA Data Discovery IG members,
Please find attached the minutes of our call earlier today for the RDA Data
Discovery Topic E: Use cases, prototyping tools and test collections (you
can find the same information on the task force wiki page as well:
https://www.rd-alliance.org/group/data-discovery-paradigms-ig/wiki/notes...
-cases-teleconferences-dec-2016-aug-2017).
Based on the discussion on the work done so far, the action identified is