Towards common criteria for data fitness
A short introduction describing the activities and the scope of the group:
The increasing availability of research data and its evolving role as a first class scientific output in the scholarly communication requires a better understanding of and the possibility to assess data fitness for use. The concept of data fitness is multifaceted and covers various aspects related to a dataset (e.g., findability, accessibility, interoperability, metadata, usability), but also levels of peer review or curation. Moreover, the reliability of a data service providing datasets - for example the level of certification of the data repository - could also serve as a useful proxy. Currently, criteria for assessing the reusability of a dataset are not made transparent to users. Firstly, a concept of data fitness requires assessment criteria as well as the weighing of each of those criteria. This preferably leads to a corresponding metric. Secondly, we want to find effective ways to expose and communicate such a metric, e.g. using a labeling or tagging system.
Additional links to informative material related to the group i.e. group page, Case statement, working documents etc:
There are currently many initiatives to define and operationalize fitness/quality criteria at the level of the individual dataset. In particular, several groups are working on metrics related to the FAIR principles. In the session, we would like to bring some of these initiatives together to ensure the working group has a good overview of these different initiatives. Following the different presentations, we will discuss how we can use this work within the context of the working group, and whether a white paper outlining the current state of the field would be appropriate.
Introduction and definitions - Helena Cousijn (10 min)
Fitness for use: Users of the U.S. Geological Survey Science Data Catalog - Wade Bishop (10 min)
Results FAIR assessment pilot - Lisa de Leeuw (10 min)
Discussion on study outcomes (15 min)
FAIR-ness metrics - Luiz Olavo Bonino (15 min)
Completeness and reusability - Jon Petters (5 min)
Discussion on criteria (15 min)
Next steps (10 min)
We would like to invite representatives from data repositories, funding agencies, publishers and research institutes to come to the meeting and share their views on and experiences with suitable criteria to assess data fitness.
Group chair serving as contact person: Helena Cousijn
Type of meeting: Working meeting
Session Room: Mansfield 5
Session Time: Thursday 21 September, Breakout 8, 11:00 - 12:30
Collaborative session notes:
Session slides and materials: