The proposed session has the following objectives:
- Introduce this new working group
- Practices of structured data markup implementation
- Implementation of structured data markup from DataCite
- Customising and extending Schema.org for life sciences
- Crawl and index structured data markup for discovery
- Group discussion
- Connections with other RDA WG/IG
- Feedback on the proposed work and schedule
- Relevant projects in progress or in plan
- Become an adopter
- Summary and actions
Collaborative session notes: https://docs.google.com/document/d/1UshlHlUPmV2FsLIOez8wLYnqYHCyqfyS-gnGIXLvcIw/edit?usp=sharing
- Introduce the WG (objects, outputs, etc)
- Identify key objects and/or properties across research domains
- Demonstrate tools for validating and harvesting landing pages with structured data
- Discuss connections with other RDA groups
- Seek participants' feedback on proposed work
- Actions on future activities
Anyone who is interested and would like to know more about how to get their data discoverable via web architecture
Anyone who works on research data metadata schema would like to be informed of what this WG plans to do and how they can contribute to the proposed work.
This proposed working group will provide a platform to complement, build on and extend efforts from bioschema.org, science-on-schema.org and the like communities in applying and extending the core schema.org vocabulary for describing research datasets and related resources. The objectives of this work group are twofold:
to identify and bridge gaps in existing schemas commonly used for research data, by bringing together communities who are working with such vocabularies to document research data and related resources;
to provide guidelines for those communities whose needs are not addressed by existing metadata schema such as schema.org, and provide guidelines on proposing extensions.
The planned outputs will include:
A generic ‘conceptual data model’ with essential types and properties for research data discovery over the web. The model will be built on bioschemas.org, science-on-schema.org, schema.org, DCAT, DDI-DISCO and SSN schemas from some representative research domains, and data discovery use cases. A research domain can map their schema to the conceptual model when they publish data to the web or exchange metadata between data portals/repositories.
A guidelines of common patterns for publishing metadata landing pages with structured data markups, and guidelines of how to customise the research schemas for target domains with examples.
Toolings for making the implementation easier if resources are available. This could include collecting and cataloguing tools that generate, validate and parse schema.org & DCAT markup, etc..
The proposed working group has been spun off from a task force "Using schema.org for Research Dataset Discovery" of the RDA Data Discovery Paradigms IG. We will work with the RDA community to have the case statement endorsed, formally start off the group at P14.
The task force has organised:
- A BoF session at the P13 (slides).
- A un-conferencing session at P12 (slides).
- Discussed starting a task force in DDP IG session at P11 (slides).
- Proposal for the task force.