Currently, this repo hardcodes links to datasets, necessitating downstream changes whenever the dataset that we correlate with a given standard simulation changes (necessitating PRs like #2669). We should develop a better process for changing datasets more generally, and more specifically, designating a given one or another as an "official" dataset used by default for a given simulation setup.