Skip to main content


The new RDA web platform is still being rolled out. Existing RDA members PLEASE REACTIVATE YOUR ACCOUNT using this link: Please report bugs, broken links and provide your feedback using the UserSnap tool on the bottom right corner of each page. Stay updated about the web site milestones at

Searching for connections: Data search, web search and literature search

  • Creator
  • #134041

    Kathleen Gregory

    Collaborative session notes:…

    Introduction of the group (5 minutes) – Kathleen Gregory

    Four flash presentations of invited speakers on the topic of data search studies (12′ each : ~50 minutes). The goal of the presentations is to identify connections/gaps in existing work. We sent an email soliciting presentations on 21 Feb, with the aim of recruiting new perspectives.  This list includes confirmed speakers. 

    Brigitte Mathiak: Web Search and Literature Search During Data Discovery
    Tim Suber: A User Model for User-Adaptive Searching in Data Portals

    Abstract: Data portals host a large amount of data sets. Finding the right data set can be challenging. The goal of this work is to guide the user to the correct data set. This is implemented by creating a user model to make the search user-adaptive. This work focuses on supporting the user in using search filters by predicting future interactions. For this purpose, a model of partially-ordered sequential rules is trained based on previous user sessions. The model was tested on an existing data set, containing records of a geospatial portal. It was able to predict subsequently chosen search filters with an accuracy of 51%, which is a significant improvement over a random prediction under the same conditions with 11%. The results show that the development of a user model is a suitable process to predict search filters and can improve user experience in data portals. 

    Graham Parton: Levelling up data discovery at a UK domain repository: How the UK’s CEDA Archive uses a range of tools to aid users discover what they need… and the gaps that still remain.
    Mingfang Wu: Overall user study design for a national data discovery portal / log survey or interview analysis from a national portal

    Discussion on the following topic statement (to be adjusted based on the discussion) (20 minutes)

    Topic statement: How can we enhance data discovery through the connection/integration of multiple resources, e.g.  web search,  literature search, data repository search, and what would be the corresponding best practices / recommendations that could be offered to repositories, aiming for user studies?


    Scope and goals

    Roadmap towards an output

           4. Next steps and wrap up (10 minutes)

    Additional links to informative material
    The group has delivered the following three supporting outputs: 

    Eleven quick tips for finding research data

    Data discovery paradigms: user requirements and recommendations for data repositories

    A survey of current practices in data search services

    Slides from previous plenary sessions: 

    November 2021 – RDA Virtual Plenary 18:

    Mapping the road ahead for the data discovery paradigms IG (Group session, slides)

    January 2021 – RDA Virtual Plenary 17:

    Investigating data discovery across domains (Group session, slides)

    November 2020 – RDA Virtual Plenary 16:

    What information about data do users desire for discovery? (Group session, slides)

    April 2020 – RDA Virtual Plenary 15:

    Inferring data searchers’ intent and their interaction with data discovery systems (group session)

    Data Granularity BoF; user perspectives, data citation and data versioning (BoF session)

    Oct. 2019 (P14) – Data Discovery Paradigms IG: Reports from Task Forces and Way Ahead (slides)

    April 2019 (P13): Data Discovery Paradigms IG: Reports from Task Forces and Way Ahead (slides)

    Oct. 2018 – RDA Plenary 12; IG meets, Task Forces report back

    March 2018 — RDA Plenary 11; IG meets, Task Forces report back

    Slides from earlier plenaries are available from the group page.

    Are you willing to hold your session at multiple times to accommodate various time zones?

    Avoid conflict with the following group (1)
    Research Metadata Schemas WG

    Brief introduction describing the activities and scope of the group
    The objective of this IG is to provide a forum where representatives from across the spectrum of stakeholders and roles pertaining to data discovery can work together to identify, study and make recommendations concerning issues related to improving data discovery. The goal is to produce concrete deliverables that will be recognised and valued by the research and data communities.
    This group was officially endorsed at RDA P9. The group has worked on the following task forces, namely:

    User studies in data discovery (ongoing)

    Data/Metadata granularity (ongoing, a BoF has been submitted)

    Using for research dataset discovery (This task force has spun off to the Research Metadata Schemas Working Group, which was endorsed in Sept. 2019).

    Initial four task forces from the group:

    Relevancy ranking (completed)

    Use cases, prototyping tools and test collections (completed)

    Best practice for making data findable (completed)

    Metadata enrichment (closed)

    Group chair serving as contact person
    Kathleen Gregory

    Meeting objectives

    To update the group progress

    To update on latest studies of data search 

    To identify a work plan towards a best practices output for the topic

    Please indicate the breakout slot (s) that would suit your meeting
    Breakout 1, Breakout 3, Breakout 5

    Privacy Policy

Log in to reply.