Pasquale Pagano is Senior Researcher at “Istituto di Scienza e Tecnologie della Informazione A. Faedo” (ISTI) of the Italian National Research Council (CNR). His research is focussed on the study and experimentation of models, methodologies and techniques for the design and development of distributed virtual research environments (VREs) which require the handling of heterogeneous computational and storage resources, provided by private and public Cloud based e-Infrastructures, for the management of heterogenous data sources. He has strong background on distributed architectures. He has participated to the design of the most relevant distributed systems and e-Infrastructure enabling middleware developed by ISTI - CNR. He is currently the Technical Director of D4Science and chief manager of gCube software, the open-source platform for the management and operation of scientific data infrastructures. He is also serving as Technical Director in the BlueBRIDGE European Project, as Service Operation Manager in the Parthenos European Project and as Infrastructure Manager of the SoBigData Research Infrastructure.
When: Day 2 - 15th November, Session 10: The Italian Way to Data Management Plan, 14:00 - 16:00
D4Science Data infrastructure: a facilitator for a FAIR data management
Abstract. D4Science.org is a Hybrid Data Infrastructure conceived to provide team of scientists with working environments making available the data and the services suitable for their own research investigation/collaboration. These environments are conceived to hide the complexity related with (a) the heterogeneity (multiple typologies, multiple formats) and the distribution of datasets of potential interest and (b) the technical issues required to exploit a service or to perform a computation-intensive task. In order to be effective these environments must support emerging practices and principles falling under the “open science” umbrella, including the FAIR ones. Findability is enabled by assigning to each of the D4Science managed “resources” a unique identifier and rich and extensible metadata (including attribution, provenance and licence information) and by publishing resources like datasets, methods/algorithms and research objects in a catalogue that supports keyword, faceted and geospatial discovery. Accessibility is obtained by making “published” resources available through multiple protocols in order to maximise the set of potential exploitation cases. These protocols provide also for Authentication and Authorization, whenever the published resource requires it. Interoperability is facilitated by enriching the “published” resources with metadata in multiple formats (including ISO 19115, Darwin Core, Dublin Core, DCAT and application profiles). Finally, Reusability is promoted by systematically endowing “published” resources with a clear licence governing their use/re-use as well as with a citation statement. The richness and plurality of metadata accompanying each resource and their extendibility make it possible to continuously enrich “published” resources with further metadata.