Several of these are covered as Core Data Packaged datasets at: https://github.com/datasets
I know you are already pretty knowledgeable about Data Packages ;-) so I was wondering if we could converge / reuse here e.g. you could build this package by pulling from core datasets and merging or similar ...?