status: Recognised & Endorsed

Chair(s): Mingfang Wu, Sarala Wimalaratne, Adam Shepherd, Leyla Garcia

Group Email: [group_email]

Secretariat Liaison: Stefanie Kethers


Introduction:

 

The wide use of schema.org to add structured metadata to web pages for commercial search engines has attracted the attention of the data management community as a possible mechanism for leveraging robust search engines such as Google, Yahoo and Bing to facilitate discovery of, and access to, scientific data. Various projects have been exploring this approach, including the US NSF EarthCube p418 project, Google's Dataset Recommendations, BioSchemas, Force11 DCIP, Research Data Australia, DataCite, Harvard Dataverse, NASA’s Distributed Active Archive Center (DAAC) Infrastructure and EOSCpilot. Because schema.org has largely been driven by commercial use cases, with a loosely governed process for adding and defining resource types, properties and vocabularies for research domains, there are gaps and deficiencies that make its application to research data problematic.

 

 

Since P11, the RDA Data Discovery Paradigms IG has run the task force "Using schema.org for research data discovery". The group has organised sessions at RDA plenaries and online calls to discuss how the research community can come together to embrace the advantages of discovering data via web search engines while addressing the gaps and deficiencies. There is a proposal to form an RDA Working Group with a focused scope and a set of well-defined priorities and objectives.

 

The objectives of this working group are twofold:

  1. to identify and bridge gaps in existing schemas commonly used for research data, by bringing together communities who are working with such vocabularies to document research data and related resources;
     
  2. to provide guidelines for those communities whose needs are not addressed by existing metadata schemas such as schema.org, including guidance on proposing extensions.

To align with the above objectives, we ran a survey on current practices in using schemas to describe research datasets. The survey is still open, and your participation is more than welcome. (The survey was developed by the DDP IG TF, which led to the formation of this WG.)

 

The planned outputs will include:

  1. A generic ‘conceptual data model’ with essential types and properties for research data discovery over the web. The model will be built on bioschemas.org, science-on-schema.org, schema.org, DCAT, DDI-DISCO and SSN schemas from some representative research domains, and data discovery use cases. A research domain can map its schema to the conceptual model when it publishes data to the web or exchanges metadata between data portals/repositories.
     

  2. A guideline, illustrated with common patterns, for publishing metadata landing pages with structured data markup; and a guideline on how to customise the research schemas for target domains, with examples.
     

  3. Tooling to make implementation easier, if resources are available. This could include collecting and cataloguing tools that generate, validate and parse schema.org and DCAT markup.
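
As an illustration of the second and third outputs, the sketch below builds schema.org Dataset markup as JSON-LD, wraps it in the script element that is typically embedded in a landing page, and reports missing properties. It is a minimal sketch, not the WG's guideline: the function names, the example values (including the example.org URL and the DOI-like identifier) and the choice of `name` and `description` as the properties to check are all assumptions made for illustration.

```python
import json


def dataset_jsonld(name, description, url, identifier, creators):
    """Build a schema.org Dataset description as a JSON-LD dictionary."""
    return {
        "@context": "https://schema.org/",
        "@type": "Dataset",
        "name": name,
        "description": description,
        "url": url,
        "identifier": identifier,
        # Creators are modelled as Person here; an Organization is also possible.
        "creator": [{"@type": "Person", "name": c} for c in creators],
    }


def to_script_tag(doc):
    """Wrap the JSON-LD in the script element used to embed it in HTML."""
    body = json.dumps(doc, indent=2)
    return '<script type="application/ld+json">\n' + body + "\n</script>"


def missing_properties(doc, required=("name", "description")):
    """Return the properties from `required` that are absent or empty.

    The default tuple is an assumption for this sketch, not an official list.
    """
    return [prop for prop in required if not doc.get(prop)]


# Hypothetical example record; none of these values refer to a real dataset.
doc = dataset_jsonld(
    name="Example ocean temperature profiles",
    description="Illustrative record used to demonstrate the markup.",
    url="https://example.org/datasets/ocean-temperature",
    identifier="doi:10.0000/example",
    creators=["A. Researcher"],
)
tag = to_script_tag(doc)
problems = missing_properties(doc)
print(tag)
print("Missing properties:", problems)
```

Keeping the markup as a plain dictionary until the last step means the same record can be checked, serialised for a landing page, or exchanged between repositories without re-parsing HTML.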

Living documents:

Research Metadata Schemas WG (This is the directory for all documents prepared by this WG, including meeting notes.)

In particular, the following three documents are for the WG outputs:

  • Crosswalks (As of 15 May 2020, there are 12 crosswalks from 12 schemas to schema.org.)
  • Best practice guides
  • Tooling (a collection of tools that support the structured data publishing process, e.g. tools that generate, validate, crawl or search structured data)

Members are more than welcome to contribute to the above three ongoing documents. This output folder has all work-in-progress documents for the outputs.
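
To make the idea of a crosswalk concrete, the sketch below applies a field-to-property mapping to a source record and reports the fields that have no schema.org counterpart, which is where gaps show up. It is a hedged illustration only: the source field names, the mapping pairs and the sample record are hypothetical and are not taken from the WG's crosswalk document.

```python
# Illustrative crosswalk: source field name -> schema.org property.
# These pairs are assumptions for this sketch, not the WG's published crosswalks.
CROSSWALK = {
    "title": "name",
    "abstract": "description",
    "doi": "identifier",
    "keywords": "keywords",
    "issued": "datePublished",
}


def apply_crosswalk(record, crosswalk=CROSSWALK):
    """Map a source record to schema.org Dataset properties.

    Returns the mapped JSON-LD dictionary and a dictionary of source
    fields that the crosswalk does not cover.
    """
    mapped = {"@context": "https://schema.org/", "@type": "Dataset"}
    unmapped = {}
    for field, value in record.items():
        target = crosswalk.get(field)
        if target is None:
            unmapped[field] = value  # candidate gap or extension
        else:
            mapped[target] = value
    return mapped, unmapped


# Hypothetical source record from a domain repository.
record = {
    "title": "Example soil moisture observations",
    "abstract": "Illustrative record used to demonstrate a crosswalk.",
    "issued": "2020-05-15",
    "instrument": "Example probe",
}
mapped, unmapped = apply_crosswalk(record)
print(mapped)
print("Fields without a mapping:", sorted(unmapped))
```

Returning the unmapped fields separately, rather than silently dropping them, keeps a record of which domain concepts would need an extension or a different target schema.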

 

The WG's Wiki Index

 

Meeting schedule:

 

This group meets regularly on the fourth Thursday of each month, starting at 8pm UTC. A meeting reminder with the Zoom ID will be emailed to the group ahead of each meeting.

 

 

Posts

30 April 2021

Info: Update on the group's plenary session, a Doodle poll for a new meeting time

by Mingfang Wu

Greetings All, As the subject heading indicates, this email serves two purposes: update on the group's plenary session and participate in a Doodle poll for the group's future meeting time. 1. Update on the group's plenary session: The session focused on the group output "Guidelines for publishing structured data on the Web". Leyla (Garcia) gave a presentation on the guidelines with nine recommendations, followed by Julia (Collins) and Baptiste (Cecconi) who provided feedback on the guidelines from the
16 April 2021

Invitation to comment on the group's output (and about group's plenary session)

by Mingfang Wu

Greetings All, We would like to invite you all to review and comment on the group's output: "Guidelines for publishing structured data on the web ". Our group session at the next week's RDA Plenary will focus on the discussion of this output. Please join our session if possible and provide your feedback. The group session is scheduled in the Breakout session 8 - Thursday 22 April, 12:45-14:15 UTC. It would be nice if you have time to read the output document before the session. If you don't, you can get the
24 March 2021

Reminder: RDA Research Metadata Schemas WG monthly call this Thursday (25th), 8pm UTC

by Mingfang Wu

Greetings All, Our next scheduled call is on Thursday 25 March, 8pm UTC. Here is the link for checking your local time. Here are the two key agenda items for this call: 1) Feedback to the group's recommendation "Guidelines for publishing structured metadata over the web " ( https://www.rd-alliance.org/guidelines-publishing-structured-metadata-we... ) 2) The group session
24 February 2021

Reminder: RDA Research Metadata Schemas WG monthly call this Thursday (25th), 8pm UTC

by Mingfang Wu

Greetings all, Reminder that we have a scheduled monthly call on Thursday, 25th February, 8pm UTC ( you can check your local time here
23 February 2021

RDA Virtual Plenary 17 - Notification of Acceptance

by Secretariat Group Account

Dear Chairs of the Research Metadata Schemas WG, Congratulations! Your RDA Virtual Plenary 17 (VP17) session application titled “Guidelines for publishing structured metadata over the Web” has been approved by the Technical Advisory Board (TAB). Please consider this your official notification of acceptance.
15 February 2021

Draft final report "Guidelines for publishing structured metadata on the Web"

by Mingfang Wu

Dear Members of the RDA Research Metadata Schemas WG: We are sharing with you the first version of the group's final report "Guidelines for publishing structured metadata on the Web ". We would like to thank those members who have been contributing to the report. We welcome comments & suggestions to improve the final report. You can add your comment to this post , email your comment to one of the co-chairs, or join the next scheduled group call on Thursday, 25th February 8pm UTC (You can check your local time here).
29 January 2021

RDA VP17 Session proposal submission

by Alexandra Delipalta

Dear Research Metadata Schemas WG members,    Thank you for your session proposal for Virtual Plenary 17 titled ‘Guidelines for publishing structured metadata over the Web’. A review of all submitted proposals is now underway by the RDA Technical Advisory Board, with notifications of acceptance planned to be sent by 26 February.    Please feel free to contact the RDA Secretariat at enquiries@rd-alliance.org with any questions or concerns you have regarding your submission. Thank you.    Regards, 
22 January 2021

RDA's 17th Plenary - Submission deadlines coming up!

by Jamie Lupo-Petta

Dear Research Metadata Schemas WG Members,   With RDA VP17 fast approaching, we would like to bring the following news and key dates to your attention:   Call for Sessions closing on 26 January 2021 at 23:59 UTC. The deadline for submitting your session proposal is almost here! To submit your request for a breakout session, please complete the form below which corresponds to your meeting type. 
