The dataset used for the attached paper [CAT_Random_Forest.pdf](https://github.com/pangeo-data/mldata/files/5809394/CAT_Random_Forest.pdf)