Health Data Brokerage and Reproducibility
Collaborative session notes
(link to be added)
Brief introduction describing the activities and scope of the group(s)
Clinical data are a valuable resource for research and other secondary use purposes. However, due to its nature, clinical data resides in distributed data repositories in various forms (e.g., written reports, structured data, semi-structured data such as genomic tests, imaging). Finding, selecting and integrating the research data for a given research question requires a set of data curation activities including data access, query, extraction, transformation, cleaning, aggregation, and sharing. Each of these steps in the data lifecycle impacts the scope and coverage of the resulting curated data set. For reproducible research, research data curation services should be clearly documented, if possible in a machine interpretable way, and should be accessible beyond the lifetime of data curation process.
This meeting objectives include:
- Inform the attendees of current work of mapping the clinical data services
- Identify gaps in the work, with a specific interest in understanding the service needs from geographical areas beyond the original working group
- Augment the current work to have a more informed WG output
- Brief introduction to the group
- This interactive session will utilize meeting chairs to work with session attendees in small groups to review the high level workflow descriptions of one or more data curation activities (data access, query, extraction, transformation, cleaning, aggregation, and sharing).
- The moderator will present the current status of work in this area and a description of the small group hands-on session, including tasks for each group to complete.
- Meeting chairs will break into small groups of attendees to complete the tasks and prepare a short presentation of the groups progress.
- The attendees will reconvene to a single group where results from the breakout session are reported for each group.
- The final portion will be a group discussion of the small group results with the focus being interdependence of the curation activities and how the individual services could be integrated into generalized model for reproducible data creation.
Case statement (to be approved): https://docs.google.com/document/d/1wrpxYnIdvJHKN21J70esdFhVEabqizjaNXJe...
Working Documents: https://docs.google.com/document/d/1qQD-E8D-Qno8y_XrSOQ2O6sZEaT2_2r3pIMj...
Clinical Service Components: https://docs.google.com/spreadsheets/d/1-uSocVpju4_fBcMDBgW2LxG3EpvBNRed...
Please specify who is your target audience and how they should prepare for the meeting
For this open session, we invite Biomedical researchers using data-driven analytical techniques in their research life-cycle; Healthcare Data Scientists dealing with data mining, machine learning, physiological modelling and image processing technologies and the data these produce; Policymakers for Healthcare; Health bioinformatics legal experts; Healthcare and Health Maintenance Organisation administrators; Pharmaceutical industry researchers and manufacturers; and, participants form other related WG/IG.
Type of meeting