What are you looking for?
Birds of a Feather (BoF) Session June 7, 2023

2nd BOF: Trusted / Secure Research Environments for Sensitive / Confidential Data: FAIRness for “Closed” Data and Processes

Plenary: RDA 21st Plenary Meeting [part of International Data Week 2023], Salzburg, Austria

Submitted by

Meeting objectives

 

Click here for the collaborative session notes

The conflicting goals of protecting and maintaining control over sensitive or confidential data while also thriving to give third parties access to the data pose a significant challenge. Trusted Research Environments (TREs) / Secure Research Environments (SREs) have been established in the last decade that, if properly set-up and operated, help ease this problem through providing high security guarantees of a highly controlled and monitored environment with trust.

In many settings, both academia and industry have a need to safeguard access to highly sensitive / confidential data that is commonly referred to as closed data. This sensitivity/confidentiality requirements can arise from e.g. commercial value or privacy requirements and warrants careful data management through support of technical-, organizational- and legal measures intertwined within a TRE.

The FAIR Guiding Principles for scientific data management and stewardship addresses infrastructures supporting reuse of scholarly data specifically targeted at machine-readability and –actionability of this data. While the guiding principles cover the management of all research data and aid in identifying necessary steps towards FAIR research data, they do not provide a best practice guideline / template or design decisions to make closed data FAIR per se.

We found that many TREs/SREs are similar in architecture design and technical implementation. There is however a lack of openly available guidelines and design decision explanation for setting up and running TREs.

We thus aim at establishing a WG that will

  • identify and publish a blue-print/reference architecture for the technical architecture, roles and processes commonly found in such trusted research environments based on the evaluation of existing solutions
  • make it easier for institutions to set up data infrastructures that allow researchers to gain access to sensitive data (irrespective of whether that sensitivity stems from privacy/GDPR reasons or is due to the commercial/IPR sensitivity of the data). It will demonstrate that such data, in spite of not being freely share-able can still be FAIR, made available for research.
  • demonstrate how results obtained on such closed data can still be made reproducible and transparent to the degree permitted by data sensitivity, establishing a clear public metadata record on the research performed as well as supporting findability of the data, linked to clear access request/permission processes and public verification of access by specific trusted parties.
  • increase interoperability between such environments on a technical, legal and organizational level, hopefully enabling easier set-up of ad-hoc joint TREs in settings where specific data sources need to be joined but may not be passed on to a third party for hosting.
  • make it easier to set-up data visiting platforms where trusted code can be executed, with monitoring and result inspection processes clearing results for return to a researcher so that, in some cases, even a completely shielded interaction with sensitive data may be possible.

The goal is to document, abstract from and establish best practices for balancing these differing requirements for access limitations and flexibility of interaction / analysis, understanding associated risks, with the goal of making data accessible and usable to research that otherwise would not be possible.

Based on an extremely well-attended BOF Session held at P21 in Gothenburg (>100 participants, more than 80 participants registering their name in the collaborative meeting notes), we are currently preparing a concrete Charter document to establish a Working Group. While discussion and consolidation is happening in the run-up to the plenary meeting, we will use the BOF session at P21 to finalize the document, get broad agreement on the activities to be performed, discuss the feedback received in order to submit it for approval after the plenary.

 

Meeting presenters

Andreas Rauber (TU Wien), Martin Weise (TU Wien), Bob Hanisch (NIST), Ville Tenhunen (EGI Foundation), Rob Baxter (DARE UK), Steve McEachern (Australian Data Archive), Lucas van der Meer, (ODISSEI), Jorik van Kemenade, SURF

Meeting agenda

09:00 – 09:15: Short summary of BOF session in Gothenburg (Andreas Rauber)

09:15 – 09:30: Presentation of planned WG activities, draft Charter document and feedback received so far
                         (Andreas Rauber, Lucas van der Meer, Ville Tenhunen, Bob Hanisch, Rob Baxter)

                          https://docs.google.com/document/d/1877OtQyZ46QCHVZ8_1QqRgZJPXyImZdW1qYVyKItMSQ

09:30 – 10:00: Discussion:
                         Refining activity plan, “signing up” volunteers for coordinating specific activities
                         Sanity check on feasibility given time (18 months) and ressource (no dedicated additional funding) constraints

10:00 – 10:30 Finalizing Charter for submission

 

Have you presented a session on the same topic at any previous plenaries?

Yes

Previous presentation details

A number of bilateral meetings and video calls with several TREs have been held over the past year, discussing ismilarities and difference sin set-up. However, these discoveries were isolated – bringin them together on a more formal basis, clarifaing the advantags and disadvantages of specific design decisions and – most importantly – sharing them with the community of institutins intereste din setting up such an infrastructure has been missing.

We would like to get toegther the groups operating or planing to operate such infrastrutures in a more sturctured way, documenting the experience made so far, explaining design decisions and options, pointing out best-practices.

A BOF session was held during P20 in Gothenburg atracting a huge audience. A wide range of potential activities for the WG were identified. These are currently being condensed into a set of core activities, mapped to RDA goals, evolving into a draft charter document.

Working notes of the TRESD BOF Session held at P20 in Gothenburg on March 23 2023.
https://docs.google.com/document/d/1pQheT3ZIp1SUU4iYc0Z4IO7PVOT6_-N7QjdZTWS-sIk

Additional links to informative material

Working notes of the TRESD BOF Session held at P20 in Gothenburg on March 23 2023.
https://docs.google.com/document/d/1pQheT3ZIp1SUU4iYc0Z4IO7PVOT6_-N7QjdZTWS-sIk

Desai, T, Ritchie, F and Welpton, R. 2016. Five Safes: Designing data access for research [Online]. Economics Working Paper Series 1601. DOI: https://doi.org/10.13140/RG.2.1.3661.1604

Palmgren, J, Rasmussen, T, Bengtström, M, Kahri, P, Ebbing, M, Henrichsen, B, Nilsson, M and Høst, G. 2019. A vision of a Nordic secure digital infrastructure for health data: The Nordic Commons. (Technical report). Oslo: Nordic Council of Ministers, NordForsk.

United Kingdom Health Data Research Alliance. 2020. Trusted Research Environments [Online]. URL: https://ukhealthdata.org/projects/aligning-approach-to-trusted-research-environments/. Accessed September 2020. Version 2.0.

Weise, M., Kovacevic, F., Popper, N. and Rauber, A., 2022. OSSDIP: Open Source Secure Data Infrastructure and Processes Supporting Data Visiting. Data Science Journal, 21(1), p.4. DOI: https://doi.org/10.5334/dsj-2022-004

Applicable Pathways

FAIR, CARE, TRUST - Evaluation and Policy

Avoid conflict with the following groups