I downloaded a sample from this piece of research by JetBrains link, it will be useful for testing.
It's a public S3 bucket, so it was as easy as aws s3 sync s3://github-notebooks-update1/ data/ (need to Control-C it, there's a lot of data)
Originally posted by @kokes in #48 (comment)