Chunking #409

josephdviviano · 2025-10-08T17:54:32Z

I've read the .github/CONTRIBUTING.md file
My code follows the typing guidelines
I've added appropriate tests
I've run pre-commit hooks locally

Description

This is a DRAFT (no working example yet) which adds the core functionality implemented in https://github.com/GFNOrg/Chunk-GFN

I will update this with details when I have a working implementation.

Please see testing/test_chunking.py for working examples of core functionality.

…or logic, recurrent estimators, recurrent modules

…com:GFNOrg/torchgfn into generalize_samplers

…amplers

… generalize_samplers

… chunking

…nments

…tion paths. simplified the API of adapters.

…adapters

… chunking

tweak of how default preprocessor is defined

…tion, added some useful safeguard assertions, bugfix related to saving estimator_outputs path

Make adapters logic part of estimators via `PolicyMixin`

… chunking

jaggbow · 2025-10-15T18:21:34Z

src/gfn/chunking/adapters.py

+        device = states_active.device
+        mask = torch.zeros(B, N, dtype=torch.bool, device=device)
+
+        for b in range(B):


I wonder if it's possible to parallelize this.

jaggbow · 2025-10-15T18:21:59Z

src/gfn/chunking/chunkers.py

+        non_exit_ids = [i for i in range(env.n_actions) if i != env.exit_token_id]
+        seen = set(env.vocab)
+        out: set[Hashable] = set()
+        while len(out) < n_tokens_to_add and len(out) < 10_000:


any reason the 10000 is hard coded here?

josephdviviano added 9 commits September 29, 2025 01:40

added policy adaptors, factorized samplers to allow for modular adapt…

d3148f7

…or logic, recurrent estimators, recurrent modules

fixed _SeqStates

93a654a

Merge branch 'identity_preprocessor_remove_shape_checking' of github.…

33f96b1

…com:GFNOrg/torchgfn into generalize_samplers

Update input_dim to use preprocessor output_dim

14b110c

Update input_dim to use preprocessor's output_dim

c7a3d8c

Merge branch 'master' of github.com:GFNOrg/torchgfn into generalize_s…

788ff96

…amplers

Merge branch 'generalize_samplers' of github.com:GFNOrg/torchgfn into…

8eb98a5

… generalize_samplers

Merge branch 'generalize_samplers' of github.com:GFNOrg/torchgfn into…

b3ed2bb

… chunking

added draft of chunking logic -- need to test on some discrete enviro…

a4fc53a

…nments

josephdviviano changed the base branch from master to generalize_samplers October 8, 2025 17:54

josephdviviano changed the base branch from generalize_samplers to master October 8, 2025 17:55

josephdviviano marked this pull request as draft October 8, 2025 17:58

josephdviviano added 18 commits October 8, 2025 23:34

added dtype casting to preprocessors

02856ee

added vectorized and non-vectorized adapter-based probability calcula…

1224bc0

…tion paths. simplified the API of adapters.

added documentation

34202a8

removed strange change to documentation

d1db3bd

removed strange change to documentation

08bf6eb

added basic recurrent bitsequence algorithm

baa50e4

added working bitsequence example for recurrent estimators and their …

e8d3fc2

…adapters

fixed test

e0dd464

Merge branch 'generalize_samplers' of github.com:GFNOrg/torchgfn into…

9d857ca

… chunking

Merge branch 'master' into chunking

496df98

Update estimators.py

bca3df6

tweak of how default preprocessor is defined

black / isort

fc6cb7a

simplification of the contex, adapter logic, compression of documenta…

e2dc289

…tion, added some useful safeguard assertions, bugfix related to saving estimator_outputs path

streamlined adapters under their own module

3c2862f

typing

4a23ea0

removed strict type ceck

e2755e6

shrank docs

ba6f0bd

added notes

db36953

josephdviviano added 8 commits October 13, 2025 13:11

removed finalize

3226660

removed check_cond_forward

4c2c1df

removed record step

d066c97

lint errors

e638be9

autoflake

1ee6a8f

minor formatting

6026008

Merge pull request #413 from GFNOrg/make_adapters_part_of_estimators

aeec438

Make adapters logic part of estimators via `PolicyMixin`

Merge branch 'generalize_samplers' of github.com:GFNOrg/torchgfn into…

f1e51c2

… chunking

jaggbow reviewed Oct 15, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Chunking #409

Chunking #409

Uh oh!

josephdviviano commented Oct 8, 2025

Uh oh!

jaggbow Oct 15, 2025

Uh oh!

jaggbow Oct 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Chunking #409

Are you sure you want to change the base?

Chunking #409

Uh oh!

Conversation

josephdviviano commented Oct 8, 2025

Description

Uh oh!

jaggbow Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

jaggbow Oct 15, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants