benchmark utility #460

josephdviviano · 2025-12-19T07:47:14Z

I've read the .github/CONTRIBUTING.md file
My code follows the typing guidelines
I've added appropriate tests
I've run pre-commit hooks locally

Description

A utility for benchmarking torchgfn against gflownet and gfnx.

Copilot

Pull request overview

This PR introduces a comprehensive benchmarking utility for comparing torchgfn against gflownet and gfnx libraries across multiple environments (hypergrid, ising, box, bitseq). The implementation includes library-specific runners, configuration management, result aggregation, and detailed documentation.

Key changes:

Benchmarking framework with abstract base classes and environment-specific runners for three GFlowNet libraries
Support for multiple environments with library compatibility checking
Comprehensive timing, memory tracking, and result aggregation capabilities

Reviewed changes

Copilot reviewed 13 out of 13 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
pyproject.toml	Added pyright configuration to disable optional type checks
benchmark/sanity_check.py	Sanity check script for comparing JAX and PyTorch matrix multiplication performance
benchmark/lib_runners/base.py	Base classes defining the benchmarking interface and data structures
benchmark/lib_runners/torchgfn_runner.py	TorchGFN runner implementation supporting hypergrid, ising, box, and bitseq environments
benchmark/lib_runners/gfnx_runner.py	JAX-based GFNX runner with JIT compilation support
benchmark/lib_runners/gflownet_runner.py	GFlowNet runner using Hydra configuration system
benchmark/lib_runners/init.py	Module initialization exposing runner classes
benchmark/benchmark_libraries.py	Main benchmarking script with CLI interface
benchmark/README.md	Comprehensive documentation for the benchmark utility
benchmark/dependencies.sh	Dependency installation script
benchmark/gfnx	Git submodule reference for gfnx library
benchmark/gflownet	Git submodule reference for gflownet library
.gitmodules	Git submodule configuration

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2025-12-19T07:49:24Z

benchmark/lib_runners/gflownet_runner.py

+        # See gflownet.py line 594 vs 601.
+        # This fix applies to ALL environments (hypergrid, ising, ccube).


The line number reference (594 vs 601) in the comment may become outdated as the external library evolves. Consider referencing the method name or a more stable identifier instead.

Suggested change

# See gflownet.py line 594 vs 601.

# This fix applies to ALL environments (hypergrid, ising, ccube).

# See the implementation of sample_batch in gflownet.py in the external

# gflownet library.

codecov · 2025-12-19T08:14:51Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 74.38%. Comparing base (a47bf73) to head (d8c0223).
⚠️ Report is 31 commits behind head on master.

Additional details and impacted files

@@             Coverage Diff             @@
##           master     #460       +/-   ##
===========================================
+ Coverage    0.55%   74.38%   +73.83%     
===========================================
  Files          48       47        -1     
  Lines        6845     6891       +46     
  Branches      802      825       +23     
===========================================
+ Hits           38     5126     +5088     
+ Misses       6806     1454     -5352     
- Partials        1      311      +310

Flag	Coverage Δ
unittests	`?`

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

josephdviviano added 2 commits December 18, 2025 00:06

added info on benchmarking utils

0a84e3d

sketch of the inter-library benchmark

d8c0223

josephdviviano requested a review from Copilot December 19, 2025 07:47

josephdviviano self-assigned this Dec 19, 2025

Copilot AI reviewed Dec 19, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

benchmark utility #460

benchmark utility #460

Uh oh!

josephdviviano commented Dec 19, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Dec 19, 2025

Uh oh!

codecov bot commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		# See gflownet.py line 594 vs 601.
		# This fix applies to ALL environments (hypergrid, ising, ccube).

benchmark utility #460

Are you sure you want to change the base?

benchmark utility #460

Uh oh!

Conversation

josephdviviano commented Dec 19, 2025

Description

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Dec 19, 2025

Choose a reason for hiding this comment

Uh oh!

codecov bot commented Dec 19, 2025

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants