status: Recognised & Endorsed

Chair (s): Jianhui Li, Rainer Stotzka, Robert Quick

Group Email: [group_email]

Secretariat Liaison:


The Data Fabric IG (DFIG) identified that working with data in the many scientific labs and most probably also in other areas such as industry and governance is highly inefficient and too costly. Excellent scientists working on date intensive science tasks are forced to spend about 75% of their time to manage, find, combine and curate data. What a waste of time and capacity. The DFIG is therefore looking at the data creation and consumption cycle to identify opportunities to optimize the work with data, to place current RDA activities in the overall landscape, to look what other rcommunities are doing in this area and to foster testing and adoption of RDA outputs. The goal of DFIG finally is to identify common components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios such as replicating data in federations, developing virtual research environments, and automating regular data management tasks. Much important work is being done on data publishing and citation, but DFIG believes that we need to start at early moments in the "Data Fabrics" in the labs to organize, document and manage data professionally if we want to meet the requirements of the coming decades.

  

DFIG is focusing on the data creation and consumption cycle as it happens daily in the scientific and industrial labs and on the identification of ways to make this work more efficiently and thus more cost-effective.

DFIG's goal is to identify common components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios.

Throughout its existence, DFIG has shepherded multiple spin-off groups into existence, dealing with specific aspects of the cycle and components involved, particularly regarding Persistent Idenfiers (PIDs), their relevance and applicability to address data referencing and management issues. These efforts have brought forth a new understanding which is summarized in an overview document here.

The group is currently reassessing the overall landscape in trying to identify the next challenges, components or other work areas of interest. An overview is contained in The Future Trends for the Data Fabric.

 

An essential topic of the IG are FAIR Digital Objects, see Wiki page including regular meetings and information material here: 
RDA IG Data Fabric: FAIR Digital Objects 

File Repository

14
April
2017
12
April
2017
24
September
2015

RDA Terminology: Data Management and Data Fabric

by Gary Berg-Cross

These are the slides that were available for discussion at session 1 of DF regaring terminology used in the Paris Data Management document.


AttachmentSize
File Terminology for DF.pptx627.46 KB
23
September
2015

Briefing on Data Fabric Position Paper - Interoperability & Virtuality

by Gary Berg-Cross

The attached are the slides briefed at the first DF Session at P6 regarding the Position paper on the DF concept which should include Interoperability and Virtuality.


AttachmentSize
File df Position paper-BC-KJ-RM-shorter final 9-21-15.pptx681.52 KB
04
September
2015

Climate data analytics use case

by Stephan Kindermann

This use case describes the climate (model) data analytics scenario to support the climate model community as well as the climate impact community in their climate data processing activities.


AttachmentSize
File RDA-climate-analytics_use_case.docx92.97 KB
09
April
2015

AData Fabric Position Paper: Interoperability is Important

by Gary Berg-Cross

The Data Fabric Interest Group (DFIG) has gotten off to a rapid start with a draft white paper, acquisition of use cases, and presentations at Plenaries.


AttachmentSize
File DF-position-paper-final-20150409.docx25.87 KB
10
March
2015

Use case for: Review Process for a Removal of Data

by Ulrich Schwardmann

The amount of research data is growing faster than Moore's Law with the additional side remark that it is not even clear, that the data storage devices are still underlying the Moore's La


AttachmentSize
File UseCaseDateLifeTime_v150408.docx541.35 KB
19
February
2015

ENES data federation use case

by Stephan Kindermann

The ENES data federation builds on European data nodes and portals which are seemlessly integrated into the worldwide ESGF data federation.


AttachmentSize
File RDA-ENES-use_case.docx790.6 KB
03
February
2015

Use Case "Open Reference Data Repository for Nanoscopy" - Version 2

by Thomas Jejkal

This document contains the use case description for the "Open Reference Data Repository for Nanoscopy" in the template version 2.


AttachmentSize
File ORDA-DF-UseCase.docx93.9 KB

Pages