At the 16th plenary, this group held its 1st BoF, (https://www.rd-alliance.org/data-movement-what-infrastructure-fabrics-are-required-0) to investigate whether there was sufficient overlap between structured big-data movement activities at eInfrastructure operators (such as EGI, GEANT etc) and the RDA-centred communities that concern themselves with scalable data description, big data handling in the repository context, etc.
The agenda there was kept broad, intending to cover the spectrum of issues and potential collaboration opportunities. Turnout was encouraging (30 virtual participants registered on the attendee log), and of the various agenda items, two were identified as warranting further study. These were:
1) Informative -- Continuing a liaison between data movement / data orchestration as-a-service initiatives currently being set up at EGI, and per the ESCAPE project, and initiatives among GEANT constituent NRENs on the one hand, and science disciplines represented at RDA who were about to embark on the structured, distributed big data part of their digital journey on the other hand.
2) Targeted project -- investigate the integration of fast data movement tools and expertise present at NRENs with data ingest mechanisms in state of the art repository platforms. The intent being to embark on a PoC between repository builders and operators and NRENs whereby the latter offer scalable, sped-up repository ingest and inter-repository asset replication as a service, obviating the need for repository builders to invest time in the maintenance of bespoke ingest/distribution tools for the petabyte era. Interest was shown both from Zenodo and Fedora.