Data Fabric IG

Group details

Chair(s): 
Bridget Almas, Jianhui Li, Tobias Weigel
Case Statement: 
 

The Data Fabric IG (DFIG) identified that working with data in many scientific labs, and most probably also in other areas such as industry and governance, is highly inefficient and too costly. Excellent scientists working on data-intensive science tasks are forced to spend about 75% of their time managing, finding, combining and curating data. What a waste of time and capacity. The DFIG is therefore looking at the data creation and consumption cycle to identify opportunities to optimize the work with data, to place current RDA activities in the overall landscape, to look at what other communities are doing in this area and to foster testing and adoption of RDA outputs. Ultimately, the goal of DFIG is to identify so-called Common Components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios such as replicating data in federations, developing virtual research environments, etc. Much important work is being done on data publishing and citation, but DFIG believes that we need to start at early moments in the "Data Fabrics" in the labs to organize, document and manage data professionally if we want to meet the requirements of the coming decades.

  

DFIG is focusing on the data creation and consumption cycle as it happens daily in scientific and industrial labs, and on identifying ways to make this work more efficient and thus more cost-effective.

DFIG's goal is to identify so-called Common Components and define their characteristics and services that can be used across boundaries in such a way that they can be combined to solve a variety of data scenarios.

DFIG's discussions have produced various spin-offs, such as work on a Repository Registry, the acceleration of testing activities, the huge terminology problems we have, self-registration of Common Components (CoCos), etc. Some of these will appear on this site, while others will fork into new RDA groups or be dealt with elsewhere.

Current Core Group Activities

  1. Use Cases (description of use cases that describe concrete "data fabrics" in the various labs)
  2. Composition Building - Finding Minimal Metadata for PIDs
  3. Composition Building - Towards the Global Digital Object Cloud
  4. Recommendations for Implementing a Virtual Layer for Management of the Complete Life Cycle of Scientific Data
  5. Broker-Driven Core Component Workflows


 


Recent Activity

26 Sep 2017

Data Fabric P10 Montréal meeting

The Data Fabric IG met at P10 and held a successful session with two contributions from new members with fresh perspectives, followed by a structured discussion.

The presentations were given by Mike Kearney (CCSDS) and Pascal Lesage (BONSAI).


19 Sep 2017

Topic for discussion at tomorrow's Plenary session: Supporting output on virtual layer recommendations and GDOC

Hello all,
One of the topics for tomorrow's P10 session of the Data Fabric Interest
Group will be discussion of the attached document, "Recommendations for
Implementing a Virtual Layer for Management of the Complete Life Cycle
of Scientific Data," as an official output of the group.
This is a revised version of a white paper that was published earlier this
year. The original version of the paper can be found here:

11 Sep 2017

CoreTrustSeal Certification Launched

Dear Colleagues,
(apologies in advance for multiple postings)
Please read below an announcement about core-level certification of
trustworthy data repositories. Do not hesitate to circulate widely.
Very best wishes.
Mustapha Mokrane
--
*Mustapha Mokrane*, PhD. | Executive Director
World Data System-International Programme Office
c/o NICT Europe Centre, 28 rue de Berri, 75008 Paris, France
Mob: +33 6 1201 2912 ORCID ID:
0000-0002-0925-7983

12 Jun 2017

Web Conference to Discuss Possible Group Output

Hello all,

The chairs of the DFIG would like to invite the members of the group to participate in a web conference to discuss the possibility of submitting a revised version of the "Recommendations for Implementing a Virtual Layer for Management of the Complete Life Cycle of Scientific Data" (http://doi.org/10.23728/b2share.a921cfe6422544ec96302f60dece7393) as an official supporting output of the DFIG.

01 Jun 2017

Meeting Notes Chairs Meeting 1 June 2017

Feel free to extend as needed.

Action TW (days): contact Jens Klump on Versioning IG status and possible joint session
Action TW (days): Send draft slides for Gothenburg around
Action TW (weeks): Take a shot at the 2 documents
Action all: Prepare various session drafts
Action Bridget: Edit the 2 docs into one recommendation doc; possible VC with DF members last week of June/first week of July

 

P10 Session planning - until June 16!:


29 Mar 2017

Document for recommendation session of DFIG

Dear Data Fabric IG members,
attached is a document which resulted from the many different contributions to the PID topic, in particular also from the intensive interactions in GEDE. In our view it indicates a lot of consensus about core assertions. We would like to discuss this at the coming Recommendations session of the Data Fabric IG at P9.
The intention is to
- discuss this document and its assertions,
- identify views which are perhaps missing,
- identify persons who want to be in a focus group (charrette) to further discuss this topic,