Releases · srlearn/datasets

drug_interactions: Devendra Singh Dhami's relational version of the drug-drug interactions dataset (#12)
toy_machines: Toy multiclass-classification dataset based on one distributed with the ACE data mining system (#15)

Fixes and Other Changes

Add "Julia" section to README
Fix link to GitHub tags in README
Fix typo in README paage -> page (#13)
Move boston_housing background to correct location, previously it was incorrectly added to the boston_housing/ instead of boston_housing/boston_housing (#14)

Assets 14

06 Aug 18:28

hayesall

v0.0.4

b74bda1

Data Standardization and Validation

Standardized Data Formatting

All datasets are now validated with the grammar defined in srlearn/linter

Datasets

Four more datasets are included in this release:

financial_nlp_small
nell_sports
boston_housing
icml

Other Changes

RELEASE_VERSION is now appended to the end of zipfiles. So instead of releasing toy_cancer.zip, this and future versions will have a version (e.g. toy_cancer_v0.0.4.zip) as part of the file name.
Add general usage instructions to main project README.md
Add a hash_datasets.sh script. This is not used at the moment, but can be used to get a hash value for all files in a dataset. This could be helpful for tracking whether two versions of a dataset are exactly the same, even when the zipped contents are different.
Add lint_datasets.sh script for testing dataset content
CI build: on pull requests and pushes to the main branch, the lint_datasets.sh script runs on all datasets under srlearn/

Assets 12

13 Jul 20:08

hayesall

v0.0.3

83d21f6

4 more datasets

Datasets:

✨ Add uwcse
✨ Add cora
✨ Add webkb
✨ Add citeseer

Other Changes

📄 Add MIT License for code in this repository
✨ Add Makefile to assist with builds
🔥 Delete ~13.8 Megabytes of unnecessary comments
📝 Add overview to README and srlearn/README
🔥 Drop Gifs/ and Images/ directories

Assets 8

12 Jul 21:27

hayesall

v0.0.2

1952389

Hotfix patch for deploying artifacts

Fix typo users -> uses

Assets 4

12 Jul 21:24

hayesall

v0.0.1

f9af187

Release Test with Two Datasets

Add toy_cancer benchmark dataset
Add toy_father benchmark dataset

Assets 2

Releases: srlearn/datasets

v0.0.6 - California Housing, RoofWorld20, Deprecate Boston Housing

What's Changed

Contributors

Uh oh!

Drug Interactions and Toy Machines

Datasets

Fixes and Other Changes

Uh oh!

Data Standardization and Validation

Standardized Data Formatting

Datasets

Other Changes

Uh oh!

4 more datasets

Uh oh!

Hotfix patch for deploying artifacts

Uh oh!

Release Test with Two Datasets

Uh oh!