From 024711fc0d0fe114ae0facbe3b5636b41266e1d7 Mon Sep 17 00:00:00 2001 From: Erik Altman Date: Sat, 12 Aug 2023 14:31:46 -0400 Subject: [PATCH] Update README.md Added link to Kaggle site with credit card transaction data. --- README.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/README.md b/README.md index bb95410..702109d 100644 --- a/README.md +++ b/README.md @@ -26,7 +26,7 @@ conda env create -f setup.yml ### Credit Card Transaction Dataset -The synthetic credit card transaction dataset is provided in [./data/credit_card](/data/credit_card/). There are 24M records with 12 fields. +The synthetic credit card transaction dataset is provided in [./data/credit_card](/data/credit_card/) and also on [Kaggle](https://www.kaggle.com/datasets/ealtman2019/credit-card-transactions). There are 24M records with 12 fields. You would need git-lfs to access the data. If you are facing issue related to LFS bandwidth, you can use this [direct link](https://ibm.box.com/v/tabformer-data) to access the data. You can then ignore git-lfs files by prefixing `GIT_LFS_SKIP_SMUDGE=1` to the `git clone ..` command. ![figure](./misc/cc_trans_dataset.png)