Data Fabric IG

Group details

Case Statement: 
IG Established
 

The Data Fabric IG (DFIG) identified that working with data in many scientific labs, and most probably also in other areas such as industry and governance, is highly inefficient and too costly. Excellent scientists working on data-intensive science tasks are forced to spend about 75% of their time managing, finding, combining and curating data. What a waste of time and capacity. The DFIG is therefore looking at the data creation and consumption cycle to identify opportunities to optimize the work with data, to place current RDA activities in the overall landscape, to see what other communities are doing in this area, and to foster testing and adoption of RDA outputs. Ultimately, the goal of DFIG is to identify common components and define their characteristics and services so that they can be used across boundaries and combined to solve a variety of data scenarios, such as replicating data in federations, developing virtual research environments, and automating regular data management tasks. Much important work is being done on data publishing and citation, but DFIG believes that we need to start early in the "Data Fabrics" in the labs to organize, document and manage data professionally if we want to meet the requirements of the coming decades.

  

DFIG focuses on the data creation and consumption cycle as it happens daily in scientific and industrial labs, and on identifying ways to make this work more efficient and thus more cost-effective.

DFIG's goal is to identify common components and define their characteristics and services so that they can be used across boundaries and combined to solve a variety of data scenarios.

Throughout its existence, DFIG has shepherded multiple spin-off groups into existence, each dealing with specific aspects of the cycle and the components involved, particularly Persistent Identifiers (PIDs) and their relevance and applicability to data referencing and management issues. These efforts have brought forth a new understanding, which is summarized in an overview document here.

The group is currently reassessing the overall landscape to identify the next challenges, components or other work areas of interest. An overview is contained in The Future Trends for the Data Fabric.


Recent Activity

24 Jun 2019

Plenary 14 Session Submission Deadline Is Thursday!

With Plenary 14 (P14) scheduled to occur from October 23-25, the deadline for session submission is this Thursday, 27 June at 16:00 UTC.

Submissions for meeting sessions are open to working groups, interest groups, joint groups and birds of a feather (BoF) meetings.

Please note, however, submissions are accepted from group chairs only. If you submit a session request for a group, please notify the other chairs of that group.

14 Jun 2019

Follow-up WG on Data Type Registries

Dear colleagues,
governance of and interoperability between DTRs are still open issues in a
digital object ecosystem, and in the DTR session at RDA P13 there was
agreement that these topics should be the main concern of a third round
of the DTR WG in RDA.
Larry Lannom and I have therefore drafted a case statement, which
we would like to submit to RDA to establish a follow-up WG.
We would like to encourage you to comment on the case statement and join
as a supporter.

05 Jun 2019

Follow-up WG on PID Kernel Information Profiles

Dear colleagues,
the discussions at the PID Kernel Information session at the RDA P13
indicated that there is more work to be done to further define PID
Kernel Information profiles, mechanisms to manage and govern them, and
further build the technical connections to related essential components,
notably a profile registry that can be built potentially on top of the
Data Type Registry.
Therefore, we (myself, Beth Plale and Jens Klump) have drafted a new
case statement, which we would like to submit to RDA to establish a

25 May 2019

Documents of GEDE DO Web-Workshop

Dear all,
We held the Web-Workshop on Adapting Repositories to DOIP last week with about 34 participants. More workshops of this sort will hopefully follow soon, and a side meeting in Helsinki is planned. All documents of the Web-Workshop are now available at this GitHub address:
https://github.com/GEDE-RDA-Europe/GEDE/tree/master/Digital-Objects/DO-W...

16 May 2019

Plenary 14: Call for Sessions, Collocated Events, Posters and Registration Now Open!

Scheduled for 23-25 October 2019, the 14th RDA Plenary will take place in Dipoli, the nature-immersed building of Aalto University, Helsinki, in Finland, “one of the happiest countries in the world”, states Per Öster, CSC-IT Center For Science Director and Co-Chair of the P14 Programme Committee.

 

10 May 2019

Webworkshop on Adaptation of Repositories to DOIP on 22.5

Dear RDA Data Fabric Colleagues,
after having held two webinars about the basics of DO and the DO Interface Protocol, we will now start with Web-Workshops (Workinars) to discuss concrete implementation matters. A first such online working meeting will be organised with repository managers and DOIP experts using two concrete examples. More such online working meetings and a side workshop at the RDA Plenary in Helsinki are planned. Please join the coming meeting, where you can get involved in the discussions. More information will be made available at the GEDE DO web site.

24 Mar 2019

DO-usecase-analysis-v4

Dear GEDE, C2CAMP and Data Fabric colleagues,
here is an updated version of our use case study, which we will take as the basis for our discussions at the P13 DO side meeting. We received three more use cases, which have been integrated into this version. There is still time for other initiatives to present new use cases to be included in the paper we are writing on a scientific view on Digital Objects.
best regards
Peter
Peter Wittenburg
Max Planck Computing & Data Facility