DFT Terminology and Tool Updates for Harmonizing Vocabularies
A short introduction describing the activities and the scope of the group:
Recent RDA meetings continue to show that some agreed upon meaning of terms used in RDA data landscape discussion, remains an important element. It is useful for RDA communication and group work as well as outreach & branding success. This will be especially important as RDA groups mature, common interests emerge and RDA messaging is accelerated.
The DFT IG mission remains to continue the discussion of an understandable data vision initiated by the RDA DFT WG and to support continuing RDA efforts to elaborate the basic data concepts within a useful framework while documenting data vocabularies. A key aspect of the work is to support broader model and vocabulary agreements within and across RDA groups along with their representative communities and stakeholders. Another aspect is contact and coordination with groups outside of RDA working on data vocabularies.
To support this virtual meetings are held for 2 or 3 months before plenaries along with contact with RDA groups on candidate terms for population in the DFT term tool called TeD-T.
Additional links to informative material related to the group i.e. group page, Case statement, working documents etc:
Some slides available - see the DFT site for P9 overviews and updates by Gary Berg-Cross
See also Data-driven Vocabulary Development by RDA Fellow Ahmed Eleish.
This session will present a stable version 1.0 of the vocabulary. This will include updates from the P9 and Gothenburg chairs discussion. The intent is to support continued synchronization of RDA conceptualization and enable better understanding within and between RDA groups. In addition it will provide updates on the term tool operation, functionality and use by groups.
The session will also allow newer groups to present their vocabularies and issues to the group and discuss relations to other groups and their definitions. We expect, for example, several terms around data collections to be completed along with progress on metadata profiles. Some additional ideas for the metadata profiles has been developed as part of DFT-IG discussions.
Improvements in the contextual depth of definitions will be discussed to support synchronized conceptualization as well as to enable better understanding within and between communities.
One potential area of interest is terminological services, which is once again becoming an active group (https://www.rd-alliance.org/ig-vocabulary-services-rda-10th-plenary-meeting). DFT remains interested in various vocabulary services including mapping between vocabularies and finding similar terms and will therefore interact with the renamed -Vocabulary and Semantic Services Interest Group – VSSIG.
Issues under discussion:
* Update on defining relations and useful links between and among terms.
* Technique to assign PIDs to every individual definition so that linking between different vocabularies is possible.
* How to handle data definition mutability as concepts change over time, including marking definitions or terms as deprecated and versioning of stable snapshots of the DFT vocabulary set. We have implemented an initial version approach around Plenaries.
* Interest in handling similarity between terms including those as part of other data vocabulary development:
Whether we need translate terms and their definitions to other languages then English.
* How to add more rich, contextual semantics to metadata as discussed at the recent Chairs meeting in Gottenburg.
The following is the working agenda for the DFT Breakout session at P10
An Overview of DFT IG history & the Breakout Session Agenda- (2 page handout as background) 10 minutes
Explaining latest additions to the DFT vocabulary for metadata and collections 10 minutes
Overview of the Ted-T tool to capture vocabularies and progress in sharing vocabularies 10 minutes
Discussion of new vocabularies (e.g. rich metadata, Portal, blockchain, raw data,..) 20 minutes
Group discussion of mapping to and leveraging other vocabularies including: 35 minutes
*W3C DXWG (Data Exchange WG) and a landscape report on existing efforts to develop data definitions
* Solicitation from other RDA IGs and WGs about current/near term candidate vocabulary items. These may include:
MIG and related RDA work
Data Fabric and Data Collections
The session will conclude with a summary of the results and next steps 5 minutes
Target Audience includes RDA groups (as well as external groups) that may have vocabularies to mature or an interest in leveraging existing ones :
- Data in Context
- Data Fabric IG
- Vocabulary and Semantic Services Interest Group – VSSIG
- Data Discovery Paradigms IG
- Global Water Information IGMike Fortun
- Big Data IG
- Research Data Repository Interoperability WG
- Data policy standardisation and implementation IG
- Mapping the Landscape IG
- AgSemantics WG
Group chair serving as contact person: Gary Berg-Cross
Type of meeting: Informative meeting
Session Room: International 2
Session Time: Thursday 21 September, Breakout 7, 09:00 - 10:30
Collaborative session notes:
Session slides and materials:
Remote Instructions (Gotomeeting):
Access Code: 314-498-885
Australia: +61 2 8355 1050
Austria: +43 1 2530 22520
Belgium: +32 28 93 7018
Canada: +1 (647) 497-9410
Denmark: +45 32 72 03 82
Finland: +358 923 17 0568
France: +33 170 950 594
Germany: +49 692 5736 7317
Ireland: +353 15 360 728
Italy: +39 0 230 57 81 42
Netherlands: +31 207 941 377
New Zealand: +64 9 280 6302
Norway: +47 21 93 37 51
Spain: +34 932 75 2004
Sweden: +46 853 527 836
Switzerland: +41 225 4599 78
United Kingdom: +44 20 3713 5028
United States: +1 (224) 501-3216