Steps towards defining FAIR principles for Machine Learning (ML)

28 Jul 2021

Submitted by Fotis Psomopoulos


Meeting objectives: 

The main focus of the meeting is to push forward the discussion on the FAIR principles and how they could be applied in a Machine Learning context. Specifically, the discussion will centre on the following two main questions:

  1. How do you think FAIR should be applied to ML? What changes/definitions are most needed? What questions need most discussion? 

  2. Should this address only ML models, and/or also processes, and/or also platforms, etc.?

To facilitate the discussion, people from relevant activities, initiatives and projects will be invited to offer their perspective. Such initiatives include the Pistoia Alliance, ELIXIR, the CLAIRE network and FAIR4HEP, among others. Given the online format of the Plenary, some of these perspectives may be delivered as pre-recorded videos.

Ultimately, the intended outcome of this session is to build upon the effort so far and identify concrete next steps.

Meeting agenda: 

Collaborative session notes: https://docs.google.com/document/d/1gs5AYPcQFk4YRbwRf8fyFmqmK2x9Em5jCYA-7gjW6PI/edit?usp=sharing

 

Introduction(s) (10 min): 

  • What is FAIR?

  • Intro to FAIR4ML and the road so far

 

Talks (5 + 2 min each), focusing on FAIR for ML in different contexts. Tentative list:

  • Pistoia Alliance

  • ELIXIR

  • FAIR4HEP

  • CLAIRE

 

Discussion & next steps (30 min)

  • Should FAIR address only ML models, and/or also processes, and/or also platforms?

  • How do you think FAIR should be applied to ML? Reuse FAIR for data and software, or aim towards a redefinition?

  • What changes/definitions are most needed? What questions need most discussion? 

  • Should we propose a formal structure under RDA (IG, WG, CoP)?

Type of Meeting: 
Informative meeting
Short introduction describing any previous activities: 

A large body of FAIR work exists, both in RDA and elsewhere. It initially focused on data and now extends to software and other research products, but generally not to ML models. Some of the speakers in this session are involved in projects where FAIR for ML models is a topic of discussion. Additionally, we presented poster 31b (FAIR principles for ML models - https://doi.org/10.5281/zenodo.4271995) at RDA VP16 to start the discussion at RDA, followed by a dedicated BoF session at RDA VP17 (https://www.rd-alliance.org/defining-fair-machine-learning-ml) that aimed to capture the overall perspective on the topic.

The discussion around FAIR for Machine Learning continued at further events across different domains. During the FAIR Festival, the efforts of FAIR4ML were presented together with similar initiatives for Software and Workflows. At ESIP 2021 in July 2021, the initiative was presented in the context of “Best Practices for Reusability of Machine Learning Models: Guideline and Specification”, and a first informal Community Call took place in the same month. With a critical mass of interested parties already engaged, the main purpose of the VP18 BoF session will be to identify concrete next steps.

Please indicate the breakout slot(s) that would suit your meeting: 
Breakout 2
Breakout 5
Are you willing to host a live second session to accommodate a different time zone? 
Yes
Meeting presenters: 
Daniel S. Katz (University of Illinois at Urbana-Champaign), Fotis E. Psomopoulos (Institute of Applied Biosciences, Centre for Research and Technology Hellas), additional speakers TBC
How do you prefer to hold the virtual component of your session: 
Other
Other: 
A mix of live talks and recorded talks (depending on breakout slot), followed by live discussion.
Applicable Pathways: 
The FAIR Agenda
Please select breakout slot(s) for your (optional) second live session: 
Breakout 9
Breakout 12