Skip to main content


The new RDA web platform is still being rolled out. Existing RDA members PLEASE REACTIVATE YOUR ACCOUNT using this link: Please report bugs, broken links and provide your feedback using the UserSnap tool on the bottom right corner of each page. Stay updated about the web site milestones at

From Discoverability to Chatbots: New trends in Data Discovery

  • Creator
  • #133880

    Brigitte Mathiak

    Introduction of the group (10 minutes) 

    Introduction: Ten rules to improve data discoverability (15 minutes)

    Discussion and feedback on the ten rules (20 minutes) 

    Presentations on AI/ML to improve data quality and data discoverability

    Lizhou Fan (UMich) and Sara Lafia (NORC at the University of Chicago): DataChat: Prototyping a Conversational Agent for Dataset Search and Visualization (15 min) 

    Tianying Chen (GESIS) on a ChatGPT discovery study (15 min) 

    Discussion of this new topic (10 min)

    Wrap up and steps forward (5 minutes)

    Additional links to informative material
    The group has delivered the following three supporting outputs: 

    Eleven quick tips for finding research data

    Data discovery paradigms: user requirements and recommendations for data repositories

    A survey of current practices in data search services

    Slides from previous plenary sessions: 

    October 2023 – RDA hybrid plenary 21:

    Ten recommendations for data repositories/catalogues to improve data discoverability


    March 2023 – RDA Virtual Plenary 20:

    Recommendations for data repositories to improve data discoverability

    June 2022 – RDA Virtual Plenary 19:

    Searching for connections: Data search, web search and literature search (Group session, slides)

    November 2021 – RDA Virtual Plenary 18:

    Mapping the road ahead for the data discovery paradigms IG (Group session, slides)

    January 2021 – RDA Virtual Plenary 17:

    Investigating data discovery across domains (Group session, slides)

    November 2020 – RDA Virtual Plenary 16:

    What information about data do users desire for discovery? (Group session, slides)


    April 2020 – RDA Virtual Plenary 15:

    Inferring data searchers’ intent and their interaction with data discovery systems (group session)

    Data Granularity BoF; user perspectives, data citation and data versioning (BoF session)

    Oct. 2019 (P14) – Data Discovery Paradigms IG: Reports from Task Forces and Way Ahead (slides)

    April 2019 (P13): Data Discovery Paradigms IG: Reports from Task Forces and Way Ahead (slides)

    Oct. 2018 – RDA Plenary 12; IG meets, Task Forces report back

    March 2018 — RDA Plenary 11; IG meets, Task Forces report back

    Slides from earlier plenaries are available from the group page.

    Applicable Pathways
    Data Infrastructures – Organisational to Environments, Data Lifecycles – Versioning, Provenance, and Reward

    Avoid conflict with the following group (1)
    Complex Citations Working Group

    Brief introduction describing the activities and scope of the group
    The objective of this IG is to provide a forum where representatives from across the spectrum of stakeholders and roles pertaining to data discovery can work together to identify, study and make recommendations concerning issues related to improving data discovery. The goal is to produce concrete deliverables that will be recognised and valued by the research and data communities.
    This group was officially endorsed at RDA P9. The group has worked on the following task forces, namely:

    User studies in data discovery (ongoing)

    Data/Metadata granularity (started the Data Granularity Working Group)

    Using for research dataset discovery (This task force has spun off to the Research Metadata Schemas Working Group, which was endorsed in Sept. 2019. The group is now in maintainance mode).

    Task forces from the group:

    Relevancy ranking (completed)

    Use cases, prototyping tools and test collections (completed)

    Best practice for making data findable (completed)

    Metadata enrichment (closed)

    Data granularity (became a WG, in progress)

    Publish structured metadata (became the Research Data Schemas WG, completed)

    User study of data discovery context and search behaviour (in progress)

    Group chair serving as contact person
    Brigitte Mathiak

    I Understand a Chair Must be Present at the Event to Hold the Breakout Session

    Meeting objectives

    To update the group progress

    To wrap up Ten rules to improve data discoverability

    To discuss new trends surrounding chatbots in data discovery

    Please indicate at least (3) three breakout slots that would suit your meeting.
    Breakout 11, Breakout 13, Breakout 14

    Privacy Policy

Log in to reply.