Data Fabric IG


Group details

Case Statement: 
IG Established

The Data Fabric IG (DFIG) identified that working with data in the many scientific labs and most probably also in other areas such as industry and governance is highly inefficient and too costly. Excellent scientists working on date intensive science tasks are forced to spend about 75% of their time to manage, find, combine and curate data. What a waste of time and capacity. The DFIG is therefore looking at the data creation and consumption cycle to identify opportunities to optimize the work with data, to place current RDA activities in the overall landscape, to look what other rcommunities are doing in this area and to foster testing and adoption of RDA outputs. The goal of DFIG finally is to identify common components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios such as replicating data in federations, developing virtual research environments, and automating regular data management tasks. Much important work is being done on data publishing and citation, but DFIG believes that we need to start at early moments in the "Data Fabrics" in the labs to organize, document and manage data professionally if we want to meet the requirements of the coming decades.


DFIG is focusing on the data creation and consumption cycle as it happens daily in the scientific and industrial labs and on the identification of ways to make this work more efficiently and thus more cost-effective.

DFIG's goal is to identify common components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios.

Throughout its existence, DFIG has shepherded multiple spin-off groups into existence, dealing with specific aspects of the cycle and components involved, particularly regarding Persistent Idenfiers (PIDs), their relevance and applicability to address data referencing and management issues. These efforts have brought forth a new understanding which is summarized in an overview document here.

The group is currently reassessing the overall landscape in trying to identify the next challenges, components or other work areas of interest. An overview is contained in The Future Trends for the Data Fabric.

File Attachment: 

Recent Activity

24 Jun 2019

Plenary 14 Session Submission Deadline Is Thursday!

With Plenary 14 (P14) scheduled to occur from October 23-25, the deadline for session submission is this Thursday, 27 June at 16:00 UTC.

Submissions for meeting sessions are open to working groups, interest groups, joint groups and birds of feather (BoF) meetings.

Please note, however, submissions are accepted from group chairs only. If you submit a session request for a group, please notify the other chairs of that group.

14 Jun 2019

Follow-up WG on Data Type Registries

Dear colleagues,
governance and interoperability between DTRs  is a still open issue in a
digital object ecosystem and in the DTR session at RDA P13 there was an
agreement that these topics should be the main concern of a third round
of the DTR WG in RDA.
Larry Lannom and myself have drafted therefore a case statement, which
we would like to submit to RDA to establish a follow-up WG.
We would like to encourage you to comment on the case statement and join
as supporter.

05 Jun 2019

Follow-up WG on PID Kernel Information Profiles

Dear colleagues,
the discussions at the PID Kernel Information session at the RDA P13
indicated that there is more work to be done to further define PID
Kernel Information profiles, mechanisms to manage and govern them, and
further build the technical connections to related essential components,
notably a profile registry that can be built potentially on top of the
Data Type Registry.
Therefore, we (myself, Beth Plale and Jens Klump) have drafted a new
case statement, which we would like to submit to RDA to establish a

25 May 2019

Documents of GEDE DO Web-Workshop

Dear all,
We had the Web-Workshop on Adapting Repositories to DOIP last week with about 34 participants. More of this sort will hopefully follow soon and a side meeting in Helsinki is planned. All documents of the Web-Workshop are now all available via this Github address

16 May 2019

Plenary 14: Call for Sessions, Collocated Events, Posters and Registration Now Open!

Taking place from 23-25 October 2019, the 14th RDA Plenary will take place in Dipoli, the nature-immersed building of Aalto University, Helsinki, in Finland, “one of the happiest countries in the world”, states Per Öster, CSC-IT Center For Science Director and Co-Chair of the P14 Programme Committee.


10 May 2019

Webworkshop on Adaptation of Repositories to DOIP on 22.5

Dear RDA Data Fabric Colleagues,
after having held two webinars about the basics of DO and DO Interface Protocol, we will now start with Web-Workshops (Workinars) to discuss concrete implementation matters. A first such online working meeting will be organised with repository managers and DOIP experts using two concrete examples. More such online working meetings and a side workshop at the RDA Plenary in Helsinki are intended. Therefore, please, join the coming meeting where you can get involved in the discussions. More information will be made available at the GEDE DO web site.

24 Mar 2019


Dear GEDE, C2CAMP and Data Fabric colleagues,
here is an updated version of our use case study which we will take as basis for our discussions at the P13 DO side meeting. We received 3 more uses cases which have been integrated in this version. There is still time to present new use cases by other initiatives to be included in the paper on a scientific view on Digital Objects which we are writing.
best regards
Peter Wittenburg
Max Planck Computing & Data Facility

23 Mar 2019

GEDE Webinar on FAIRness & Certification - Slides and Recording are online

Dear colleagues,
During this webinar we had up to 101 participants which indicates how interesting this topic is. And the webinar showed that still much development is ahead of us. However, CoreTrustSeal is operation and we can only suggest that repositories will participate in quality assessments. As indicated by Edit Herczog, there will be a session at the RDA P13 plenary begin of April devoted to this topic and feel free to participate (remotely).

20 Mar 2019

RE: 22.3 - 15.00 CET: Webinar on Maturity Indicators for FAIRness and Certification of Repositories

Dear all,
this is a reminder about the
GEDE Webinar on Maturity Indicators for FAIRness and Certification of Repositories 22. March 2019, 15.00 CET
Use this link to join the meeting:
Please, inform your colleagues about this webinar where key people (Speakers: Erik Schultes (GO FAIR), Mark Wilkinson (U Madrid), Ingrid Dillo (DANS/FAIRsFAIR), Edit Herczog (RDA)) will present their views on future developments in measuring FAIRness and repository quality.