The training data path in the code configuration is the author's local path, not a public URL.
Line 4 of starling/configs/dataloader/dataloader.yaml points to:
/work/bnovak/projects/sequence2ensemble/lammps_data/combined_data/data
How can I obtain the full training dataset?