Type:
webinar, related to RDA Policy engagement
Date/Time:
6 March 2017, 15:00 CET, 14:00 UTC
Description:
Sensor, image, timeseries, simulation, and statistics data - all over in the sciences and engineering we find sampled or generated data that can be represented as "datacubes", technically speaking: multi-dimensional arrays. For example, 2-D satellilte imagery, 2-D x/y/t image timeseries and x/y/z geophysical voxel data, and 4-D x/y/z/t climate data contribute to today's data deluge in the Earth sciences. Virtual observatories in the Space sciences routinely generate Petabytes of such data. Life sciences deal with microarray data, confocal microscopy, human brain data, which all fall into the same category.
Array databases, pioneered by the rasdaman technology, have set out to provide flexible, scalable services on massive datacubes. The concept of datacube query languages meantime has proven successful on hundreds of Terabytes and through parallelizing single queries over 1,000+ cloud nodes. Meantime, many activities exist: Data center federations are being on the radar, such as under work in the international EarthServer initiative. ISO has set out on SQL/MDA (Multi-Dimensional Arrays), a specification extending SQL with n-D arrays in a flexible, domain-neutral way. This heralds a new generation of services with new quality parameters, such as flexibility, ease of access, embedding into well-known user tools, and scalability mechanisms that remain completely transparent to users. Likewise, RDA is contributing substantially to this wave through its Array Database Assessment Working Group, affiliated with the Big Data Interest Group. Hence, alongside with set-oriented techniques like relational databases an Spark and graph technology, array databases constitute an enabling common building block for next-generation data infrastructures.
In this webinar we present the concept of datacubes and services around them, based on practical largescale science applications and several "Big Datacube" standards initiated and edited by the presenter, so as to proliferate knowledge and stimulate discussion.
Audience:
Data experts, data repository managers, policy makers, research administrators, project coordinators and researchers will all benefit from participating.
Presenter:
Dr. Peter Baumann, Jacobs University, Rasdaman GmbH
Link to the webinar: https://youtu.be/d4RWCLRa6F4
Do you have any questions or suggestions?Contact the webinar team and we will forward the question to the speaker
|