Skip to content

Conversation

@JJ-Pineda
Copy link
Contributor

@JJ-Pineda JJ-Pineda commented Jan 7, 2025

Resolves #132 by implementing a fingerprint batch size in the batch_doc_creation method. This way we limit the size of memory spikes from tensorflow to <2gb on average during compound indexing.

@JJ-Pineda JJ-Pineda marked this pull request as ready for review January 7, 2025 23:16
@JJ-Pineda JJ-Pineda merged commit 133331b into main Jan 7, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Batched fingerprint embedding can cause memory spikes during indexing

2 participants