status: Recognised & Endorsed

Chair(s): Mingfang Wu, Sarala Wimalaratne, Adam Shepherd, Leyla Garcia

Group Email: [group_email]

Secretariat Liaison: Stefanie Kethers


Introduction:

 

The wide use of schema.org to add structured metadata to web pages for commercial search engines has attracted the attention of the data management community as a possible way to leverage robust commercial search engines such as Google, Yahoo and Bing to facilitate discovery of and access to scientific data. Various projects have been exploring this approach, including the US NSF EarthCube p418 project, Google's Dataset Recommendations, BioSchemas, Force11 DCIP, Research Data Australia, DataCite, Harvard Dataverse, NASA’s Distributed Active Archive Center (DAAC) Infrastructure and EOSCpilot, among others. Because schema.org has largely been driven by commercial use cases and by a loosely governed process for adding and defining resource types, properties and vocabularies for research domains, there are gaps and deficiencies that make its application to research data problematic.

 

 

Since P11, the RDA Data Discovery Paradigms IG has run the task force "Using schema.org for research data discovery". The group has organised sessions at RDA plenaries and online calls to discuss how we, as a research community, can come together to embrace the advantages of discovering data via web search engines while addressing the gaps and deficiencies. There is a proposal to form an RDA Working Group with a focused scope and a set of well-defined priorities and objectives.

 

The objectives of this working group are twofold:

  1. to identify and bridge gaps in existing schemas commonly used for research data, by bringing together communities who are working with such vocabularies to document research data and related resources;
     
  2. to provide guidelines for those communities whose needs are not addressed by existing metadata schemas such as schema.org, including guidelines on proposing extensions.

To align with the above objectives, we conducted a survey on current practices in using schemas to describe research datasets. The survey is still open, and your participation is more than welcome. (The survey was developed by the DDP IG TF, which led to the formation of this WG.)

 

The planned outputs will include:

  1. A generic ‘conceptual data model’ with essential types and properties for research data discovery over the web. The model will be built on bioschemas.org, science-on-schema.org, schema.org, DCAT, DDI-DISCO and SSN, on schemas from some representative research domains, and on data discovery use cases. A research domain can map its schema to the conceptual model when it publishes data to the web or exchanges metadata between data portals/repositories.
     

  2. A guideline, illustrated with common patterns, for publishing metadata landing pages with structured data markup; and a guideline, with examples, on how to customise the research schemas for target domains.
     

  3. Tooling to make implementation easier, if resources are available. This could include collecting and cataloguing tools that generate, validate and parse schema.org and DCAT markup.
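
To make "structured data markup on a landing page" concrete, here is a minimal Python sketch that builds a schema.org Dataset description and wraps it in the JSON-LD script element a page would embed. It is an illustration only, not a WG output: the function names and the example dataset, URL and keywords are hypothetical, and the choice of properties is an assumption rather than a recommended profile.

```python
import json


def dataset_jsonld(name, description, url, keywords=None, license_url=None):
    """Build a minimal schema.org Dataset description as a plain dict."""
    doc = {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "url": url,
    }
    # Optional properties are only added when a value is supplied.
    if keywords:
        doc["keywords"] = list(keywords)
    if license_url:
        doc["license"] = license_url
    return doc


def to_script_tag(doc):
    """Serialise the dict as JSON-LD inside a <script> element."""
    body = json.dumps(doc, indent=2, ensure_ascii=False)
    # Escape "</" so a value cannot close the script element early;
    # "\/" is a valid JSON escape for "/".
    body = body.replace("</", "<\\/")
    return '<script type="application/ld+json">\n' + body + "\n</script>"


# Hypothetical example values, for illustration only.
example = dataset_jsonld(
    name="Example ocean temperature records",
    description="Monthly sea surface temperature readings from example buoys.",
    url="https://example.org/datasets/ocean-temperature",
    keywords=["oceanography", "temperature"],
    license_url="https://creativecommons.org/licenses/by/4.0/",
)
tag = to_script_tag(example)
print(tag)
```

The resulting element would be placed in the HTML of the dataset's landing page, where a crawler that understands JSON-LD can read it.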

Living documents:

Research Metadata Schemas WG (This is the directory for all documents prepared by this WG, including meeting notes.)

In particular, the following three documents are for the WG outputs:

  • Crosswalks (As of 15 May 2020, there are 12 crosswalks from 12 schemas to schema.org.)
  • Best practice guides
  • Tooling (collecting tools that support the structured data publishing process, e.g. tools that generate, validate, crawl and search structured data)

Members are more than welcome to contribute to the above three ongoing documents. This output folder has all work-in-progress documents for the outputs.
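
As a rough idea of what a crosswalk to schema.org does in practice, the Python sketch below renames fields of a source record to schema.org properties and reports any field it cannot map. The mapping table is a small illustrative selection of DCAT and Dublin Core terms, not one of the WG's crosswalks; the function name, the sample record and the `ex:instrumentCalibration` field are hypothetical.

```python
# Illustrative mapping only; NOT taken from the WG's crosswalk documents.
DCAT_TO_SCHEMA_ORG = {
    "dct:title": "name",
    "dct:description": "description",
    "dcat:keyword": "keywords",
    "dct:issued": "datePublished",
    "dct:license": "license",
    "dcat:landingPage": "url",
}


def crosswalk(record, mapping=DCAT_TO_SCHEMA_ORG):
    """Map a source record to a schema.org Dataset dict.

    Returns the mapped dict and a list of source fields with no mapping,
    so that gaps in the crosswalk are visible rather than silently dropped.
    """
    target = {"@context": "https://schema.org/", "@type": "Dataset"}
    unmapped = []
    for key, value in record.items():
        if key in mapping:
            target[mapping[key]] = value
        else:
            unmapped.append(key)
    return target, unmapped


# Hypothetical source record; "ex:instrumentCalibration" is an invented
# domain-specific field used to show how a gap is reported.
source = {
    "dct:title": "Example soil moisture survey",
    "dcat:keyword": ["soil", "moisture"],
    "dct:issued": "2020-05-15",
    "ex:instrumentCalibration": "factory default",
}
mapped, gaps = crosswalk(source)
print(mapped)
print("Unmapped fields:", gaps)
```

Fields that end up in the unmapped list are the kind of gap that the extension guidelines are meant to address.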

 

The WG's Wiki Index

 

Meeting schedule:

In 2022, the group meets on the third Wednesday of even-numbered months. A meeting reminder is sent to the group about two days before each meeting.

The next meeting will be: 20:00 UTC, Wednesday 20 April. You can check your local time here.

If you are unable to join, you can still add your thoughts to this document with rolling agendas and notes.

 

 

Posts

19 April 2022

The next group call is on Wednesday 20th April, starting 20:00 UTC

by Mingfang Wu

Greetings All, This is a reminder that our next group call is on Wednesday 20th April, starting 20:00 UTC. (You can check your local time
14 April 2022

RDA Plenary 19 Draft Programme Now Available

by Secretariat Group Account

Dear Group Chairs, The RDA 19th Plenary draft programme is now available: https://www.rd-alliance.org/rdas-19th-plenary-programme-0. Please note that the Plenary programme is a part of the International Data Week programme that can be accessed at https://www.rd-alliance.org/international-data-week-2022-programme.
29 March 2022

Webinar: How To Get Attention For Research Project Outputs Part 2

by Yolanda Meleco

Hello, just a brief reminder that Part 2 of our webinar titled How To Get Attention For Research Project Outputs is this Thursday, 31 March, at 16:00 UTC. We hope to see you there! If you haven’t yet done so, register at https://www.rd-alliance.org/how-get-attention-webinar-31-03-2022. All the best,
25 March 2022

RDA Plenary 19 as a part of IDW 2022 - Notification of Acceptance

by Secretariat Group Account

Dear Chairs of the Research Metadata Schemas WG, Congratulations! Your RDA Plenary 19 (P19) session application titled Guidelines and toolings for publishing structured metadata on the Web has been approved. Please consider this your official notification of acceptance. We have some feedback and requests for you to take into consideration while preparing your session: The agenda structure is clear, leaving ample time for discussion and interaction.
04 March 2022

P19 session proposals submissions

by Irina Hope

Dear group members,  Thank you for submitting your session proposal for the Plenary 19 titled ‘Guidelines and toolings for publishing structured metadata on the Web’.  A review of all submitted proposals is now underway. Notifications of sessions acceptance will be sent out by Thursday, 24th March.  Please contact the RDA Secretariat at enquiries@rd-alliance.org with any questions or concerns you may have regarding your submission.   Kind regards,  RDA Secretariat 
16 February 2022

Next group call on Wednesday 16 February, 20:00 UTC

by Mingfang Wu

Dear All:   Our next meeting will be: 20:00 UTC, Wednesday 16 February. You can check your local time here.   Here is the meeting agenda.   A major agenda item is to discuss the group's session proposal to the RDA Plenary 19.  
24 January 2022

RDA RMS WG: Doodle for the group's next meeting (Plenary session proposal)

by Mingfang Wu

Dear Members of the RDA Research Metadata Schemas WG: We are inviting you to participate in this Doodle poll for the group's next meeting in February. The poll will close on 31st January 2022. The RDA Plenary 19 session proposal is due on 26 February. In this meeting, we will discuss the group's session proposal. We are very keen to hear from you how you would like the plenary session to be organised and what you would like to get out of the session, should you consider attending.
25 October 2021

Reminder: the next monthly call is on Tuesday 26 October, starting 1pm UTC

by Mingfang Wu

Greetings All, This is a reminder of our next group call on Tuesday 26 Oct., at 1pm UTC. (You can check your local time by following this link
