Each subset of the data, and each data source, can be processed in parallel with the others. Dask can be used to run this work in parallel.
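As a minimal sketch of how this might look, the example below builds one Dask task per (source, subset) pair with `dask.delayed` and runs them together with `dask.compute`. The `SOURCES` dictionary, the `process_subset` function, and the summing step are hypothetical placeholders, not part of the original pipeline; real code would load and process actual data instead.

```python
import dask
from dask import delayed

# Hypothetical in-memory stand-ins for real data sources; each source
# holds several subsets that do not depend on one another.
SOURCES = {
    "source_a": [[1, 2, 3], [4, 5]],
    "source_b": [[10, 20], [30]],
}


def process_subset(source_name, subset):
    """Placeholder per-subset work: here, just sum the values."""
    return source_name, sum(subset)


# Build one lazy task per (source, subset) pair. Nothing runs yet.
tasks = [
    delayed(process_subset)(name, subset)
    for name, subsets in SOURCES.items()
    for subset in subsets
]

# Execute all tasks; the threaded scheduler can run them concurrently.
results = dask.compute(*tasks, scheduler="threads")

# Combine the per-subset results into one total per source.
totals = {}
for name, value in results:
    totals[name] = totals.get(name, 0) + value
```

The threaded scheduler is used here only to keep the sketch self-contained. For CPU-bound pure-Python work, a process-based or distributed scheduler is likely a better fit.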