feat: implement ChemTEB by kjappelbaum · Pull Request #3 · lamalab-org/chempile

kjappelbaum · 2025-07-26T13:35:10Z

Summary by Sourcery

Add a Jupyter notebook implementing the ChemTEB baseline for chemical QA retrieval using OpenAI embeddings and evaluate performance with NDCG@10.

New Features:

Introduce a notebook for the ChemTEB pipeline that loads the ChemHotpot QA dataset and generates text embeddings via the OpenAI API
Implement a retrieval workflow using cosine similarity between query and corpus embeddings
Define DCG@k and NDCG@k functions and compute mean NDCG@10 for performance evaluation
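The DCG@k and NDCG@k definitions mentioned above can be sketched as follows. This is a minimal illustration, not the notebook's code: the function names match those listed in the reviewer's guide, but the signatures, the `n_relevant` parameter, and the binary-relevance ideal ranking are assumptions.

```python
import math
from typing import Sequence


def compute_dcg_at_k(relevances: Sequence[float], k: int) -> float:
    """Discounted cumulative gain over the first k entries of a ranked list."""
    # Rank 0 is discounted by log2(2) = 1, rank 1 by log2(3), and so on.
    return sum(rel / math.log2(rank + 2) for rank, rel in enumerate(relevances[:k]))


def compute_ndcg_at_k(ranked_relevances: Sequence[float], n_relevant: int, k: int) -> float:
    """DCG@k divided by the DCG@k of an ideal ranking (binary relevance assumed).

    n_relevant (hypothetical parameter) is the number of relevant documents
    for the query in the whole corpus, not only among the retrieved ones.
    """
    ideal = [1.0] * min(n_relevant, k)  # all relevant documents ranked first
    ideal_dcg = compute_dcg_at_k(ideal, k)
    if ideal_dcg == 0:
        return 0.0  # no relevant documents: define the score as 0
    return compute_dcg_at_k(ranked_relevances, k) / ideal_dcg


# Toy query: relevant documents retrieved at ranks 1 and 3, two relevant overall.
score = compute_ndcg_at_k([1.0, 0.0, 1.0], n_relevant=2, k=10)
```

Normalizing against all relevant documents, rather than only the retrieved ones, keeps a query with missed relevant documents from scoring a perfect 1.0.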

review-notebook-app · 2025-07-26T13:35:15Z

Check out this pull request on ReviewNB to see visual diffs and provide feedback on Jupyter Notebooks.

sourcery-ai · 2025-07-26T13:35:15Z

Reviewer's Guide

Introduces a new Jupyter notebook implementing the ChemTEB retrieval and evaluation pipeline: it loads the ChemHotpotQARetrieval dataset, generates and persists embeddings via the OpenAI API, defines DCG/NDCG metrics, computes cosine-similarity–based retrieval, and reports mean NDCG@10.

File-Level Changes

Change	Details	Files
Notebook scaffolding and dependency setup	Add pip install step for scikit-learn; import datasets, OpenAI client, dotenv, numpy, pandas, sklearn metrics	`chemretrieval-bench/test.ipynb`
Load ChemHotpotQARetrieval dataset	Load 'default', 'corpus', and 'queries' splits via load_dataset; cache dataset locally for reuse	`chemretrieval-bench/test.ipynb`
Embed texts using OpenAI embeddings API	Define get_embedding wrapper for text-embedding-3-small; generate and collect embeddings for corpus and queries	`chemretrieval-bench/test.ipynb`
Persist and reload embeddings	Implement save_embeddings with numpy.save; save corpus_embeddings.npy and queries_embeddings.npy; load embeddings via numpy.load for evaluation	`chemretrieval-bench/test.ipynb`
Implement DCG and NDCG evaluation metrics	Add compute_dcg_at_k computing discounted cumulative gain; add compute_ndcg_at_k normalizing against ideal DCG	`chemretrieval-bench/test.ipynb`
Compute retrieval performance and aggregate results	Convert embeddings to numpy arrays for vectorized operations; compute cosine similarities per query–corpus pair; select top-k results, assign binary relevances, compute NDCG@10; collect results in pandas DataFrame and compute mean score	`chemretrieval-bench/test.ipynb`
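The retrieval step in the last row of the table (cosine similarity, top-k selection, binary relevances) could look roughly like the sketch below. The helper names `top_k_by_cosine` and `binary_relevances` and the toy vectors are hypothetical and only illustrate the technique; the notebook may structure this differently.

```python
import numpy as np


def top_k_by_cosine(query_emb: np.ndarray, corpus_emb: np.ndarray, k: int = 10) -> np.ndarray:
    """Return, for each query row, the indices of the k most similar corpus rows."""
    # L2-normalize rows so that a plain dot product equals cosine similarity.
    # Assumes no all-zero embedding rows (those would divide by zero).
    q = query_emb / np.linalg.norm(query_emb, axis=1, keepdims=True)
    c = corpus_emb / np.linalg.norm(corpus_emb, axis=1, keepdims=True)
    sims = q @ c.T  # shape: (n_queries, n_docs)
    # Sort each row by descending similarity and keep the first k columns.
    return np.argsort(-sims, axis=1)[:, :k]


def binary_relevances(ranked_doc_ids, relevant_ids):
    """1.0 for retrieved documents that are judged relevant, else 0.0."""
    return [1.0 if doc_id in relevant_ids else 0.0 for doc_id in ranked_doc_ids]


# Toy data: three 2-d "document embeddings" and one "query embedding".
corpus = np.array([[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]])
queries = np.array([[2.0, 0.1]])

ranking = top_k_by_cosine(queries, corpus, k=2)
relevances = binary_relevances(ranking[0].tolist(), relevant_ids={2})
```

A full sort is used here for clarity; for a large corpus, `np.argpartition` followed by sorting only the top k entries would avoid sorting every similarity.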


sourcery-ai

Hey @kjappelbaum - I've reviewed your changes - here's some feedback:

Strip the notebook outputs and migrate the core retrieval logic into reusable Python modules or scripts for better maintainability and reviewability.
Use batched embedding requests and vectorized numpy or sklearn cosine-similarity computations instead of per-item loops to improve performance and avoid API rate limits.
Fix the inconsistent filename when loading saved embeddings (e.g. ‘cqueries_embeddings.npy’) and consider parameterizing file paths rather than hard-coding them.
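The batching and file-path suggestions above could be addressed along these lines. This is a sketch under stated assumptions: `batched`, `embed_in_batches`, `EmbeddingPaths`, and the `embed_batch` callable are hypothetical names, and the stand-in embedder below replaces the real API call so the example runs offline. In the notebook, `embed_batch` would wrap one embeddings request that sends a list of inputs.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Callable, Iterator, List, Sequence


def batched(items: Sequence[str], batch_size: int) -> Iterator[Sequence[str]]:
    """Yield consecutive slices of at most batch_size items."""
    for start in range(0, len(items), batch_size):
        yield items[start:start + batch_size]


def embed_in_batches(
    texts: Sequence[str],
    embed_batch: Callable[[List[str]], List[List[float]]],
    batch_size: int = 64,
) -> List[List[float]]:
    """Call embed_batch once per batch instead of once per text."""
    embeddings: List[List[float]] = []
    for batch in batched(texts, batch_size):
        embeddings.extend(embed_batch(list(batch)))
    return embeddings


@dataclass(frozen=True)
class EmbeddingPaths:
    """Derive both file names from one root so save and load cannot diverge."""
    root: Path

    @property
    def corpus(self) -> Path:
        return self.root / "corpus_embeddings.npy"

    @property
    def queries(self) -> Path:
        return self.root / "queries_embeddings.npy"


# Stand-in embedder (assumption): records batch sizes, returns 1-d vectors.
calls: List[int] = []


def fake_embed_batch(batch: List[str]) -> List[List[float]]:
    calls.append(len(batch))
    return [[float(len(text))] for text in batch]


vectors = embed_in_batches(["a", "bb", "ccc", "dddd", "eeeee"], fake_embed_batch, batch_size=2)
paths = EmbeddingPaths(Path("runs"))
```

Passing `paths.queries` to both the save and the load call removes the chance of a typo such as the one flagged above, since the file name is written in exactly one place.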


kjappelbaum added 4 commits July 25, 2025 23:20

feat: implement ChemTEB

320c9bb

queries

b3ecc01

trying to reproduce BASF results

e3a30d7

slow, but ..

1ab5258

sourcery-ai bot reviewed Jul 26, 2025


kjappelbaum and others added 2 commits July 26, 2025 15:39

vectorized

6e9c507

feat: add embedding runs

ae62c02

kjappelbaum wants to merge 6 commits into main from embedding_metrics


2 participants
