Notes from the January 2017 DFT meeting

31 Jan 2017

Topics

1. P9 Status

Two sessions are planned for RDA P9: the regular DFT IG meeting and another BoF on Semantics for Domain Vocabularies.

There is no joint meeting planned with VSIG, although some work on this continues with RDA Fellow Ahmed Eleish. Gary has talked with Adam Shepard about the slowdown with vocabulary services. Perhaps Simon Cox has some supported work.

2. Update on terminologies

There have been a modest number of additions to the vocabulary on the Term Tool (TeD-T; see http://smw-rda.esc.rzg.mpg.de/index.php/Main_Page).

Some of these come from the OAIS: A Framework to Enable Interpretation of the Data (J. Steven Hughes).

Broad categories include: data preparation, data reduction, data analysis.

Examples of data preparation include:

Aggregation

Aggregation Anomaly Detection

Bias Correction

Bayesian Techniques

Bivariate Regression

Classification & Clustering

Hierarchical Clustering

Constrained Variational Analysis

Coordinate Transformation

Fusion

We need definitions for these.

3. Enhancements to the Term Tool and plans for additions

There may be progress with the SKOS version of the vocabulary, but we are waiting for Ahmed to return from his winter break. Progress may depend on what he has time for in the new semester at RPI.

Some new functionality for vocabularies may be available from ongoing work with a tool called Reptor. Reptor is a PHP application which turns a web server into a data repository. It demonstrates the functionality of a modern data repository using standards such as Dublin Core, in line with the recommendations of the Research Data Alliance (RDA) for persistent identifiers and minimal metadata. For information on the current installation see: http://dft-rda.esc.rzg.mpg.de/reptor/
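
As a rough illustration of what "minimal metadata" with a persistent identifier could look like, here is a small Python sketch that builds a Dublin Core record. It is not taken from Reptor (which is a PHP application) and does not reflect its actual data model. The wrapper element name, the function name, the choice of fields and the example identifier are all assumptions made for illustration; only the Dublin Core element namespace is standard.

```python
import xml.etree.ElementTree as ET

# Standard namespace of the Dublin Core element set.
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_minimal_record(pid, title, creator, date):
    """Build a minimal Dublin Core record as an XML element.

    Hypothetical helper: the field selection is an assumption,
    not an RDA-recommended or Reptor-defined profile.
    """
    ET.register_namespace("dc", DC_NS)
    record = ET.Element("record")  # hypothetical wrapper element
    fields = {
        "identifier": pid,  # the persistent identifier
        "title": title,
        "creator": creator,
        "date": date,
    }
    for name, value in fields.items():
        child = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        child.text = value
    return record


record = build_minimal_record(
    pid="hdl:00000/example-0001",  # placeholder, not a real handle
    title="Example data object",
    creator="Example Creator",
    date="2017-01-31",
)
xml_text = ET.tostring(record, encoding="unicode")
print(xml_text)
```

A record like this could serve as a starting point when trying out metadata profile ideas; a real profile would define which elements are required.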

4. Metadata Discussions

Rebecca Koskela expressed an interest in putting in some of the RDA metadata work, such as fleshing out the profile and metadata components. Much of the session was spent going over TeD-T capabilities and the meaning of such things as “scope”, along with tool documentation.

Currently the tool includes a number of terms related to metadata such as:

Administrative metadata

Annotation

Archival Description

Canonical Metadata Packages

Concept

Contextual Metadata

Data Discovery

Data Quality

Detailed Metadata

Discovery Metadata

Key Metadata

Metadata

Metadata Element

Metadata Profile

Metadata Registry

Metadata Standards

Objective Metadata

Payload Metadata

Provenance metadata

Rich Metadata

Rebecca noted that we have MD Catalogue but not Metadata Directory, which was worked out as part of the WG. Gary noted that the group had completed their definitions but had not entered or shared them. More publicizing of the URL for the tool might help.

Rebecca also noted that some of the definitions are brief, which reflects in part the fact that they are often placeholders for terms that have been discussed in meetings but for which groups have not finalized their discussion.

It was agreed that we should do some joint preparation for P9. A session in March would be good, as well as some continued discussion at P9. There is joint interest with the Data Discovery Paradigms IG, and perhaps more could be discussed at the TAB session and afterwards.

Rebecca suggested that it may be good to do some follow-up at the June Chairs meeting. In addition, it seemed that the Reptor tool, when ready, would be useful to test out some of the RDA metadata profile ideas.

Next Meeting - tentatively set for Monday Feb. 13 at 11:00 Eastern US time.

Attendees

Gary Berg-Cross

Rebecca Koskela

Raphael Ritz

Thomas Zastrow

Appendix on BoF Domain Vocabulary Activities

The BoF produced some useful sharing of ideas. This has been followed up by:

2 Ontolog sessions organized by Gary Berg-Cross on semantics for Domain Vocabularies. See http://ontologforum.org/index.php/DomainVocabularies

Part of this was a follow-up by Mark Fox (University of Toronto) on: An Ontology Design Pattern for Global City Indicators

An Ontological Engineering briefing and foundation grounding for domain work, produced by Mark Fox and Mark Mussen.

VoCamp sessions organized by Gary Berg-Cross for Chem Safety (continuing) and materials science.

See http://vocamp.org/wiki/GeoVoCampDC2016

Additional vocabulary work for the Chemistry Research Interest Group has a focus on the IUPAC "color books".

Follow-up with Leah McEwen has produced Chem Safety use cases and a start on a safety profile.

Ten pattern areas have been identified, along with terms and some definitions.

Modular pattern work continues, and RDA Fellow Ahmed Eleish is looking at using some text mining to help both Chem Research topics.