Achieving anonymity and correcting bias with synthetic data through generative AI
-
Discussion
-
Collaborative meeting notes:
https://docs.google.com/document/d/1vnjU3h5bF1814uSFMh48P_aZOBFwkJGKk0Fn…Brief introduction to the HDIG
Presentations on synthetic data critical issues and perspectives
Q&A and discussion on topics presented
Next stepsAdditional links to informative material
• Group page: https://rd-alliance.org/groups/health-data.html
• Case statement: https://www.rd-alliance.org/group/health-data/case-statement/health-data.html
• HDIG Sessions Presentations @RDA Plenaries (HDIG File Repository): https://www.rd-alliance.org/node/50708/repositoryAre you willing to hold your session at multiple times to accommodate various time zones?
NoAvoid conflict with the following group (1)
Blockchain Applications in Health WGAvoid conflict with the following group (3)
Life Science Data Infrastructures IGContact for group (email)
l.durst@lynkeus.comGroup chair serving as contact person
Ludovica DurstMeeting objectives
Former HDIG discussions have highlighted the hurdles, such as silo fragmentation and privacy concerns, which are usually encountered when striving to access significant amounts of health data, and how this comparative shortage of Big Data in health is hindering artificial intelligence and knowledge discovery in medicine.
The forthcoming HDIG session in Edinburgh aims at addressing the issue of effective anonymisation through synthetic data generation in combination with advanced privacy-enhancing technologies. Federated approaches based on Secure Multiparty Computation and Differential Privacy can enable the creation of artificial data through Generative Adversarial Networks and allow scaling up experimentation with non-re-identifiable health data and effective training of clinical decision-support tools. At the same time, large-scale generation and use of synthetic data sets is raising renewed interest in their legal and ethical applications, as well as in their validation.
This session proposes to highlight several critical issues related to synthetic data: 1) their still insufficiently acknowledged legal status as anonymous data; 2) the deep learning methodologies currently used to generate synthetic data and synthetic imaging; 3) their capacity to correct biased databases; 4) their potential for augmenting specific patient cohorts and for creating virtual cohorts; 5) the trust synthetic data inspire and their validation tests; 6) their combined usage with other privacy enhancing technologies.Please indicate the breakout slot (s) that would suit your meeting
Breakout 2, Breakout 4, Breakout 5, Breakout 8Privacy Policy
1
Log in to reply.