
Conversation

@daniellepintz (Contributor) commented Jan 4, 2026

Concatenate tensors into one blob of bytes for sending across the transport (RDMA, Gloo, etc.) instead of sending them one by one. In theory, batching should be faster because it amortizes the per-message overhead of the transport buffers.
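The core idea can be sketched with simple length-prefixed framing: serialize each tensor, prefix each buffer with its size, and join everything into one blob so the transport sees a single message. This is a minimal illustration, not the PR's actual wire format; the function names and the 8-byte header are assumptions for the sketch.

```python
import struct

def pack_blobs(buffers):
    """Concatenate serialized tensor buffers into a single blob.

    Each buffer is prefixed with an 8-byte little-endian length
    header so the receiver can split the blob back apart.
    (Hypothetical helper; the real PR may carry tensor metadata
    such as dtype and shape alongside the raw bytes.)
    """
    parts = []
    for buf in buffers:
        parts.append(struct.pack("<Q", len(buf)))  # length header
        parts.append(buf)
    return b"".join(parts)

def unpack_blobs(blob):
    """Inverse of pack_blobs: split one blob back into buffers."""
    bufs, offset = [], 0
    while offset < len(blob):
        (length,) = struct.unpack_from("<Q", blob, offset)
        offset += 8
        bufs.append(blob[offset:offset + length])
        offset += length
    return bufs
```

With this framing, N tensors cost one transport send instead of N, at the price of one extra copy into the joined blob.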

@meta-cla bot added the label: CLA Signed (managed by the Meta Open Source bot) — Jan 4, 2026
@daniellepintz daniellepintz changed the title regular tensor working TorchStoreStateDict Jan 4, 2026
@LucasLLC (Contributor) left a comment

This is awesome! Impressed to see so much progress in a short time span.

Some recommended next steps:

  • Let's profile the current implementation and see what kind of speedup we're getting on batch put vs. non-batch put. It could be helpful to add some fine-grained logging (e.g. check out latency_tracker)

  • Next solid step would be to unpack the state dict within the storage volume

  • Once this is done, we can take a look at what it would take to "fetch in batch" as well
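For the profiling step above, the fine-grained logging could look like a small timing context manager. This is only a sketch of the idea; `track_latency` is a hypothetical name, not the project's actual `latency_tracker` API.

```python
import time
from contextlib import contextmanager

@contextmanager
def track_latency(label, log=print):
    """Log wall-clock time spent inside the block under `label`.

    Hypothetical stand-in for a latency_tracker-style utility:
    wrap the batch put and the per-tensor put separately and
    compare the reported durations.
    """
    start = time.perf_counter()
    try:
        yield
    finally:
        elapsed_ms = (time.perf_counter() - start) * 1000
        log(f"{label}: {elapsed_ms:.2f} ms")

# Usage sketch:
# with track_latency("batch_put"):
#     store.put(batched_blob)
# with track_latency("per_tensor_put"):
#     for t in tensors:
#         store.put(t)
```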

MODEL_LINER_LENGTH = 10


def _setup_process_group():

I wonder if it's worth moving this function into a helper in tests/utils, since it's used in multiple places?

https://github.com/meta-pytorch/torchstore/blob/main/tests/utils.py#L105

@daniellepintz (Contributor, Author) commented Jan 11, 2026

@LucasLLC I moved the get changes to a new PR: #97

