Syntactic and Semantic Conversion of Metadata

You are here

26 Jan 2021
Group(s) submitting the application: 
Meeting objectives: 

The main objective of this session is to discuss the challenges of interconverting metadata by those who have tried to map between schemes at one point or another. The second objective of this session is to create a list of best practices.  The third objective is to compare the conversions discussed with the RDA Metadata Standards Catalog (MSC; under the MSCWG), which has a growing list of conversion tools and mappings.

There will be short presentations by those who do metadata conversions, sharing their methodology as well as the challenges. The presentations will cover a variety of domains and both semantic and syntactic conversion. The breakout sessions during the meeting are intended to discuss the challenges as well as mitigation strategies. We aim for consensus on a way forward.

Meeting agenda: 

Collaborative session notes: http://bit.ly/MIG_P17-Notes

  1. Welcome and Meeting Objectives ( 5 min)

  2. Review definitions of syntactic and semantic conversion (Keith Jeffery) (5 min)

  3. Short Presentations on existing syntactic and semantic conversion methods (35 min)

    • SSHOC  (Mari Kleemola) 

    • SEMAF (Daan Broeder)  

    • 3M/X3ML toolkit (Maria Theodoridou) 

    • CEDAR (John Graybeal) 

    • B2Find (Anna-Lena Flügel)

    • Metadata Standards Catalog converter (Alex Ball) 

  4. Breakouts to address the following items or others that result from presentations (35 min)

    • Challenges re: converter technologies

    • Best practices when converting

  5. Report back and Discussion (10 min)

 

Target Audience: 

Anyone interested in interoperability

Group chair serving as contact person: 
Brief introduction describing the activities and scope of the group: 

The Metadata Interest Group(MIG) is supported by individuals and organizations involved in the development, implementation, and use of metadata for scientific data.  MIG’s chief objective is to identify and endorse metadata solutions for addressing data management challenges. Metadata is crucial for the discovery, access, preservation, exchange, manipulation, and use/re-use of research data.  Metadata discussions frequently focus on community-specific issues such as discipline-specific standards; however, there are dimensions of metadata that are of general interest, cross disciplines, and are amenable to broad community input.  MIG endorses metadata solutions that balance domain-specific and interdisciplinary needs.

MIG has proposed metadata ‘packages’ for the commonly required purposes; a ‘package’ consists of elements (attributes or data structures) the values of which relate to the unique identity of the dataset or record (instance).  To assure machine processing a formal syntax and declared semantics are required, and the packages must exhibit referential and functional integrity as well as human readability. This Metadata Element Set intended to become an RDA recommendation.

At previous plenaries, MIG has reached out to other RDA Interest Groups and Working Groups where metadata is important to the goals of their group. The element set has been shared with and reviewed by many of the RDA domain groups.

The Metadata Standards Catalog WG comes under the ‘umbrella’ of MIG.

Short Group Status: 

The Metadata Interest Group was established and endorsed in 2013. MIG serves as an umbrella group for all aspects of metadata for research data. In particular, it coordinates the efforts of the Working Groups concerned with metadata to provide a coherent approach to metadata. The Metadata Interest Group has spun off 2 working groups, the Metadata Standards Directory WG and the Metadata Standards Catalog.  Both of those groups have been completed and are in maintenance mode. New content continues to be added to the Metadata Standards Directory and a new version of the Metadata Catalog was released in 2020.

Metadata discussions frequently focus on community-specific issues such as discipline-specific standards; however, there are dimensions of metadata that are of general interest, cross disciplines, and are amenable to broad community input.  MIG endorses metadata solutions that balance domain-specific and interdisciplinary needs.

Estimate of the required room capacity (Hybrid plenary): 
Santa Fe
Type of Meeting: 
Working meeting
Avoid conflict with the following group (1): 
Are you willing to hold your session at multiple times to accommodate various time zones?: 
No
Please indicate the breakout slot (s) that would suit your meeting: 
Breakout 2
Breakout 4
Breakout 5
Breakout 8
Meeting presenters: 
Alex Ball, Keith Jeffery, Rebecca Koskela, John Graybeal, Maria Theodoridou, Daan Broeder, Mari Kleemola, Anna-Lena Flügel
How do you prefer to hold the virtual component of your session: 
live
Do any of the session speakers plan to present from the venue?: 
Remote presentations only
Contact for group (email):