status: Recognised & Endorsed
Chair (s): Xin Chen, Rainer Stotzka, Robert Quick, Maggie Hellström
Group Email: [group_email]
Secretariat Liaison:
The Data Fabric IG (DFIG) identified that working with data in the many scientific labs and most probably also in other areas such as industry and governance is highly inefficient and too costly. Excellent scientists working on date intensive science tasks are forced to spend about 75% of their time to manage, find, combine and curate data. What a waste of time and capacity. The DFIG is therefore looking at the data creation and consumption cycle to identify opportunities to optimize the work with data, to place current RDA activities in the overall landscape, to look what other rcommunities are doing in this area and to foster testing and adoption of RDA outputs. The goal of DFIG finally is to identify common components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios such as replicating data in federations, developing virtual research environments, and automating regular data management tasks. Much important work is being done on data publishing and citation, but DFIG believes that we need to start at early moments in the "Data Fabrics" in the labs to organize, document and manage data professionally if we want to meet the requirements of the coming decades.
|
DFIG is focusing on the data creation and consumption cycle as it happens daily in the scientific and industrial labs and on the identification of ways to make this work more efficiently and thus more cost-effective. DFIG's goal is to identify common components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios. |
Throughout its existence, DFIG has shepherded multiple spin-off groups into existence, dealing with specific aspects of the cycle and components involved, particularly regarding Persistent Idenfiers (PIDs), their relevance and applicability to address data referencing and management issues. These efforts have brought forth a new understanding which is summarized in an overview document here.
The group is currently reassessing the overall landscape in trying to identify the next challenges, components or other work areas of interest. An overview is contained in The Future Trends for the Data Fabric.
An essential topic of the IG are FAIR Digital Objects, see Wiki page including regular meetings and information material here:
RDA IG Data Fabric: FAIR Digital Objects
File Repository
RDA Terminology: Data Management and Data Fabric
These are the slides that were available for discussion at session 1 of DF regaring terminology used in the Paris Data Management document.
Attachment | Size |
---|---|
![]() | 627.46 KB |
Briefing on Data Fabric Position Paper - Interoperability & Virtuality
The attached are the slides briefed at the first DF Session at P6 regarding the Position paper on the DF concept which should include Interoperability and Virtuality.
Attachment | Size |
---|---|
![]() | 681.52 KB |
Climate data analytics use case
This use case describes the climate (model) data analytics scenario to support the climate model community as well as the climate impact community in their climate data processing activities.
Attachment | Size |
---|---|
![]() | 92.97 KB |
AData Fabric Position Paper: Interoperability is Important
The Data Fabric Interest Group (DFIG) has gotten off to a rapid start with a draft white paper, acquisition of use cases, and presentations at Plenaries.
Attachment | Size |
---|---|
![]() | 25.87 KB |
Use case for: Review Process for a Removal of Data
The amount of research data is growing faster than Moore's Law with the additional side remark that it is not even clear, that the data storage devices are still underlying the Moore's La
Attachment | Size |
---|---|
![]() | 541.35 KB |
ENES data federation use case
The ENES data federation builds on European data nodes and portals which are seemlessly integrated into the worldwide ESGF data federation.
Attachment | Size |
---|---|
![]() | 790.6 KB |
Use Case "Open Reference Data Repository for Nanoscopy" - Version 2
This document contains the use case description for the "Open Reference Data Repository for Nanoscopy" in the template version 2.
Attachment | Size |
---|---|
![]() | 93.9 KB |