Data leakage during training on most publicly available datasets #2
Closed
drmehultyagi
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Computed embeddings for SCIN dataset work well, but for datasets like Fitz 17k, DDI - generated embeddings lead to zero validation metrics.
Thanks and regards
I found the answer in the notebook. Thanks.
Beta Was this translation helpful? Give feedback.
All reactions