Some of you were involved in developing a community letter to NIH regarding data licensing impediments to data reuse. Here, I am including additional folks who I think might like to join this conversation. Please see the letter to NIH here and sign the letter here if you are supportive (we will leave the signatories open). Thanks to all who helped get this letter written and sent in the first place. Since then, we’ve been working to better understand the licensing barriers to data reuse and redistribution, and have also had some meetings with NIH and the Technology Transfer community.
We have been evaluating data sources (specifically relevant to the Monarch Initiative and the NCATS Data Translator, but this is just a starting place) against a new evaluation rubric. You can see the results of our efforts at http://reusabledata.org/. We debuted this effort at the recent Research Data Alliance Legal-Interoperability group meeting in Montreal; there are many other non-biomedical communities that have the same issues and I was very grateful to find them and have their assistance. The slides from RDA are posted here.
We would really appreciate the review of the rubric and the addition of new data sources - the rubric is meant to evolve. You can make tickets or pull requests for your own sources of interest in the github repo. The goal is to help other downstream data re-users understand what barriers they might expect in redistributing data from these sources. Note that very few sources actually fair well according to the rubric - especially those of us with large, integrated data sets where we don’t have the legal authority to redistribute. This is a community level problem and does not intend to call out any individual source out as being good or bad.
While there have been various efforts to understand how to evaluate data sources for “FAIRness,” and an NIH RFI on data repository evaluation (responses here), we believe that data integration, reuse, and redistribution require a deep level of understanding of not only content, interoperability, and access issues, but especially the licensing ones. Our response to the RFI is at: Metrics to Assess Value of Biomedical Digital Repositories, our evaluation of the open science prizes, and some related slides presented to the Biocuration Society are also available.
The 2018 Annual Association of University Technology Managers meeting is held between Feb. 18-21 in Phoenix, Arizona. https://www.autm.net/events-courses/annual-meeting/2018-annual-meeting/ Here, we hope to hold a round table and would welcome participants and attendees, especially those with legal, business, and technology transfer expertise. Please let us know if you are interested.
Special thanks to Seth Carbon, Julie McMurry, Robin Champieux, Letisha Wyatt and Lilly Winfree for the hard work necessary to launch reusabledata.org.
Please post to the brand new list serve or the github if you have questions, concerns, ideas, fears, etc. We will be writing a manuscript on this soon and would very much like your input and feedback. We need all of your help to make our publicly funded data resources fundamentally more reusable!
Melissa Haendel, PhD
Library & Dept. of Medical Informatics and Clinical Epidemiology
Appointments: Shanez De Silva
community collaboration on licensing for data reuse
You are here