[refactor] Convert BatchMeta to columnar layout; enable zero-copy serialization by default #39
Conversation
CLA Signature Pass: mpb159753, thanks for your pull request. All authors of the commits have signed the CLA. 👍
…ialization by default

- Restructure BatchMeta from row-based (FieldMeta/SampleMeta) to columnar storage
- Add _SampleView for lazy read-only row access into columnar BatchMeta
- Enable zero-copy (msgpack) serialization by default; auto-fallback to pickle
- Remove TQ_ZERO_COPY_SERIALIZATION env var toggle
- Update all related tests and managers to the new columnar BatchMeta API
- Add serial_utils.py with shared serialization helpers

Signed-off-by: 看我72遍 <m.pb@msn.com>
…st layout

- Change BatchMeta.custom_meta and _custom_backend_meta from dict[int, dict] to list[dict], aligned positionally with global_indexes
- Update __post_init__ to validate length and initialize empty lists
- Adapt get/update/clear_custom_meta, select_samples, select_fields, concat, reorder, empty, to_dict, from_dict methods
- Add dict→list bridge in controller._build_batch_metadata
- Update simple_backend_manager and base.py to access by position index
- Adapt test_metadata.py and test_kv_storage_manager.py accordingly

Signed-off-by: 看我72遍 <m.pb@msn.com>
Add BatchMeta.with_data_fields() method that returns a new BatchMeta with the given field list, allowing field names not yet present in the current field_schema. This lets callers request newly added fields on a known sample range without triggering poll_for_meta, which may return samples outside the intended region.

Fix test_cross_shard_complex_update, which incorrectly used poll_for_meta(force_fetch) to verify new fields — this could fetch samples (indices 0-9 or 30-39) that do not have the new_extra_* fields, causing a RuntimeError. Use meta_update.with_data_fields() instead to target exactly the 10-29 update region.

Signed-off-by: 看我72遍 <m.pb@msn.com>
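A minimal self-contained sketch of how a `with_data_fields()` method as described in the commit message could behave. The class and field layout below are simplified assumptions for illustration, not the real TransferQueue API:

```python
from dataclasses import dataclass, field, replace

# Hypothetical simplified BatchMeta; only the two columns needed for the demo.
@dataclass
class BatchMeta:
    global_indexes: list
    field_schema: dict = field(default_factory=dict)

    def with_data_fields(self, fields: list) -> "BatchMeta":
        # Allow field names not yet present in field_schema: unknown names get
        # an empty placeholder entry instead of raising, so callers can target
        # newly added fields on a known sample range without poll_for_meta.
        schema = {name: self.field_schema.get(name, {}) for name in fields}
        return replace(self, field_schema=schema)

meta = BatchMeta(global_indexes=[10, 11], field_schema={"input_ids": {"dtype": "int64"}})
narrowed = meta.with_data_fields(["input_ids", "new_extra_score"])
assert set(narrowed.field_schema) == {"input_ids", "new_extra_score"}
assert narrowed.global_indexes == [10, 11]  # sample range is preserved
```

The key property is that the sample range is fixed up front, so verification code can target exactly the updated region.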
Force-pushed from efdcc43 to 0a17163
- Add CUSTOM_TYPE_NUMPY = 6 extension type
- _encode_numpy(): direct memoryview extraction, no torch intermediary
- _decode_numpy(): reconstruct np.ndarray from raw bytes
- Update enc_hook to use _encode_numpy with exclusion-based dtype check
- Register CUSTOM_TYPE_NUMPY in ext_hook
- Update test_numpy_numeric_arrays_zero_copy to assert np.ndarray (not Tensor)
- Update test_zmq_msg_serialization with correct TensorDict numpy comment
- Add TestNumpyNativeSerialization with 22 new parametrized/edge-case tests

Fixes: numpy arrays decoded as torch.Tensor (type information lost), and the torch.from_numpy + flatten() path could trigger extra copies.

Signed-off-by: 看我72遍 <m.pb@msn.com>
Force-pushed from 89b43bb to ed4a66f
…eview

- metadata.py: fix BatchMeta.empty().to_dict() crash when production_status is None
- metadata.py: explicitly serialize dtype as string in to_dict(); add _parse_dtype() helper for from_dict() to reconstruct torch/numpy dtypes without implicit pickle
- controller.py: add field_schema_cache to DataPartitionStatus for O(F) get_field_schema() (replaces O(G*F) double-scan of field_dtypes/field_shapes per-sample maps)
- zmq_utils.py: downgrade pickle-fallback log from WARNING to INFO; it is a normal degradation path (e.g. body contains torch.dtype objects), not an error
- simple_backend_manager.py: move defaultdict import to top level; remove two redundant local imports in get_data() and clear_data()
- tutorial/03_metadata_concepts.py: update tutorial for columnar BatchMeta API
- docs/plans: add serial_utils warning background doc for future discussion

Signed-off-by: 看我72遍 <m.pb@msn.com>
Force-pushed from ed4a66f to 83a3ee7
| """ | ||
| Demonstrate FieldMeta - specific data fields of each training sample. | ||
| Demonstrate BatchMeta field_schema - field-level metadata for the batch. | ||
| After the columnar refactoring, field metadata is stored once at the batch level |
There was a problem hiding this comment.
We can directly introduce the BatchMeta as it is now, don't need to compare with previous design
|
|
||
| print("FieldMeta represents a single field in ONE sample:") | ||
| print("- name: Field identifier ('Prompt', 'Response', etc.)") | ||
| print("field_schema stores metadata for each field ONCE per batch (not per sample):") |
| print(f" For global_index: use batch.global_indexes[0] = {ready_batch.global_indexes[0]}") | ||
|
|
||
|
|
||
```
def demonstrate_batch_meta_construction():
```

The core logic here is not demonstrating the construction, but the usage.

What is the relationship between this demo and demonstrate_batch_meta()?
| """ | ||
| Demonstrate SampleMeta - describes a single data sample. | ||
| Demonstrate how to construct BatchMeta directly and operate on it | ||
| (analogous to old SampleMeta operations: add_fields, select_fields, union). |
|
|
||
|
|
||
```
def demonstrate_field_meta():
def demonstrate_batch_meta_schema():
```

I believe we don't need to deliberately demonstrate field_schema here. We can simplify it by showing how to construct a BatchMeta manually.
```
dtype=torch.int64,
shape=(512,),  # Sequence length for ONE sample
production_status=ProductionStatus.NOT_PRODUCED,
# Example 2: Create a field schema entry for attention_mask
```

Functionally repeated.
```
return dtype_str
```
```
class _SampleView:
return self.fields.get(name)
def fields(self) -> dict:
    """Read-only access to field_schema: batch.samples[i].fields['a'] -> field meta dict."""
    return self._batch.field_schema
```

Should we only return the field_schema of THE selected sample?
```
)
partition_id = "demo_partition"
batch_meta = tq_client.put(data=data_batch, partition_id=partition_id)
```

I have tried this code, but the returned batch_meta only has one field_schema entry. Is that because the field has a uniform shape? What if it's nested?

```
BatchMeta(global_indexes=[8, 9, 10, 11, 12, 13, 14, 15], partition_ids=['demo_partition', 'demo_partition', 'demo_partition', 'demo_partition', 'demo_partition', 'demo_partition', 'demo_partition', 'demo_partition'], field_schema={'input_ids': {'dtype': torch.int64, 'shape': torch.Size([512]), 'is_nested': False, 'is_non_tensor': False}, 'attention_mask': {'dtype': torch.float32, 'shape': torch.Size([512]), 'is_nested': False, 'is_non_tensor': False}, 'nested': {'dtype': torch.float32, 'shape': None, 'is_nested': True, 'is_non_tensor': False, 'per_sample_shapes': [(4, 3), (2, 4), (4, 4), (2, 3), (4, 5), (2, 2), (4, 6), (2, 1)]}}, extra_info={}, custom_meta=[{}, {}, {}, {}, {}, {}, {}, {}], _custom_backend_meta=[{}, {}, {}, {}, {}, {}, {}, {}])
```
I have found a bug:

```python
input_ids = torch.randint(0, 1000, (8, 512))
attention_mask = torch.ones(8, 512)
nested = torch.nested.as_nested_tensor(
    [torch.randn(4, 3).int(), torch.randn(2, 4), torch.randn(4, 4), torch.randn(2, 3),
     torch.randn(4, 5), torch.randn(2, 2), torch.randn(4, 6), torch.randn(2, 1)],
    layout=torch.strided,
)
data_batch = TensorDict(
    {
        "input_ids": input_ids,
        "attention_mask": attention_mask,
        "nested": nested,
    },
    batch_size=8,
)
partition_id = "demo_partition"
batch_meta = tq_client.put(data=data_batch, partition_id=partition_id)
```

The dtype handling does not support per_sample_dtypes and directly uses the first element's dtype.

Please check whether the current design fits the backend's requirements. @dpj135 @tianyi-ge
```
Extract the expected shape, dtype, and custom_backend_meta for each field-sample pair in metadata.
The order matches the key/value order: sorted by field name, then by global index.
O(F) optimized version that uses field_schema instead of per-sample metadata.
```

No need to compare with the old version; it will confuse users.
```
Serializes the input tensors, stores them using the storage client,
extracts per-sample dtype and shape information, and sends a notification
to the controller that new data is available.
O(F) optimized version that extracts field-level schema instead of per-sample metadata.
for field_name, field_data in data.items():
    first_item = field_data[0] if len(field_data) > 0 else None
    is_nested = isinstance(field_data, torch.Tensor) and field_data.is_nested
    field_dtype = getattr(first_item, "dtype", type(first_item) if first_item is not None else None)
```

For a nested tensor, we cannot assume the first item's dtype equals every other item's dtype.
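The fix the reviewer asks for — collecting dtype per item rather than assuming the first item is representative — can be sketched like this. Plain numpy arrays stand in for the unbound nested-tensor components; all names are illustrative:

```python
import numpy as np

def field_dtypes(items):
    """Return (shared_dtype, None) when all items agree, else (None, per_sample_dtypes)."""
    dtypes = [item.dtype for item in items]
    if all(dt == dtypes[0] for dt in dtypes):
        return dtypes[0], None  # uniform: one dtype entry suffices
    return None, dtypes  # heterogeneous: record per-sample dtypes, like per_sample_shapes

uniform = [np.zeros(3, dtype=np.float32), np.zeros(2, dtype=np.float32)]
mixed = [np.zeros(3, dtype=np.int32), np.zeros(2, dtype=np.float32)]
assert field_dtypes(uniform) == (np.float32, None)
shared, per_sample = field_dtypes(mixed)
assert shared is None and per_sample == [np.int32, np.float32]
```

This mirrors the existing per_sample_shapes pattern: collapse to one entry only when every item agrees.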
```
per_field_dtypes[global_idx][field_name] = field_dtype
if is_nested:
    assert unbound is not None  # is_nested=True implies unbind() was called
    per_field_shapes[global_idx][field_name] = tuple(unbound[i].shape)
```

Lacks the per-field dtype logic.
```
# Pre-compute unbind once to avoid O(B²) repeated calls inside the loop
unbound = field_data.unbind() if is_nested else None
for i, global_idx in enumerate(metadata.global_indexes):
```

Should we only enter this loop when the field is nested, to speed things up?
```
def __post_init__(self):
    """Initialize all computed properties during initialization"""
    self.samples = copy.deepcopy(self.samples)
    self.global_indexes = copy.deepcopy(self.global_indexes)
```

We'd better remove these deepcopy calls from __post_init__, since they put much higher pressure on the controller -- the controller is now invoked for every single PUT/GET operation.

I suggest providing a dedicated method that deepcopies these variables, and actively calling that method for operations like chunk/union/reorder...
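The reviewer's suggestion — an explicit copy method instead of an implicit deepcopy in `__post_init__` — could look like this sketch. Class and method names (`detach`, the two-column layout) are illustrative assumptions, not the real API:

```python
import copy
from dataclasses import dataclass, field

@dataclass
class BatchMeta:
    # No deepcopy in __post_init__: plain PUT/GET construction stays cheap.
    global_indexes: list = field(default_factory=list)
    custom_meta: list = field(default_factory=list)

    def detach(self) -> "BatchMeta":
        """Explicitly copy mutable state; called only by mutating operations."""
        return BatchMeta(
            global_indexes=list(self.global_indexes),
            custom_meta=copy.deepcopy(self.custom_meta),
        )

    def reorder(self, order: list) -> "BatchMeta":
        # Mutating operations build a detached copy in the new order,
        # so the source BatchMeta is never aliased or modified.
        return BatchMeta(
            global_indexes=[self.global_indexes[i] for i in order],
            custom_meta=[copy.deepcopy(self.custom_meta[i]) for i in order],
        )

src = BatchMeta(global_indexes=[3, 1, 2], custom_meta=[{"a": 1}, {}, {}])
dst = src.reorder([1, 2, 0])
dst.custom_meta[2]["a"] = 99
assert src.custom_meta[0]["a"] == 1  # source is untouched
assert dst.global_indexes == [1, 2, 3]
```

The trade-off: construction on the controller's hot path does no copying, and only chunk/union/reorder pay the deepcopy cost.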
```
Args:
    fields: Field names used for getting data.
    local_indexes: Local indexes used for getting data.
    local_keys: Global indexes used as dict keys.
```

Do we still need local keys now?

They can directly be the global_indexes.
```
# global_index is used directly as dict key in storage, no local_index conversion needed
global_indexes_slice = metadata.global_indexes[start:end]
local_indexes = list(global_indexes_slice)
```

Unnecessary code and comments here.
```
self._prepare_and_send_to_unit(
    unit_idx=unit_idx,
    storage_id=storage_id,
    chunk_size=chunk_size,
    batch_size=batch_size,
    start_offset=0,  # fixed; no cross-batch rotation
    num_units=num_units,
    data=data,
    metadata=metadata,
```

There are too many input params; many of them can be simplified, e.g. chunk_size and num_units.

And I suggest providing a mapping function as in the earlier version, as it helps make the dispatching logic clearer.
| """ | ||
| Retrieve data from remote StorageUnit based on metadata. | ||
|
|
||
| Routes to each SU using the _su_id recorded in metadata._custom_backend_meta |
There was a problem hiding this comment.
Why? We don't need _custom_backend_meta to do routing
```
groups: dict[str, list[int]] = defaultdict(list)
for i, gi in enumerate(metadata.global_indexes):
    backend_meta = metadata._custom_backend_meta[i]
    if not backend_meta or "_su_id" not in backend_meta:
        raise RuntimeError(
            f"get_data: missing _su_id for global_index {gi} in _custom_backend_meta. "
            f"Make sure put_data was called before get_data."
        )
    groups[backend_meta["_su_id"]].append(gi)
```

This can be simplified by providing a mapping function.
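The "mapping function" the reviewer suggests could be a single helper shared by get_data and clear_data, replacing the duplicated grouping loops. A sketch under assumed names:

```python
from collections import defaultdict

def group_by_storage_unit(global_indexes, su_id_of):
    """Group global indexes by storage-unit id; su_id_of maps gi -> su_id or None."""
    groups = defaultdict(list)
    for gi in global_indexes:
        su_id = su_id_of(gi)
        if su_id is None:
            raise RuntimeError(f"missing _su_id for global_index {gi}")
        groups[su_id].append(gi)
    return dict(groups)

# Illustrative mapping: in the real code this would read _custom_backend_meta.
backend_meta = {0: "su-a", 1: "su-b", 2: "su-a"}
groups = group_by_storage_unit([0, 1, 2], backend_meta.get)
assert groups == {"su-a": [0, 2], "su-b": [1]}
```

Both get_data and clear_data would then call this one helper with their own error context, instead of each carrying its own loop.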
```
async def _get_from_single_storage_unit(
    self, storage_meta_group: StorageMetaGroup, target_storage_unit: str, socket: zmq.Socket = None
    self,
    gi_list: list[int],
```

Spell it out fully as global_indexes. Use gi only in cases like [gi for gi in global_indexes].
```
groups: dict[str, list[int]] = defaultdict(list)
for i, gi in enumerate(metadata.global_indexes):
    backend_meta = metadata._custom_backend_meta[i]
    if not backend_meta or "_su_id" not in backend_meta:
        raise RuntimeError(
            f"clear_data: missing _su_id for global_index {gi} in _custom_backend_meta. "
            f"Make sure put_data was called before clear_data."
        )
    groups[backend_meta["_su_id"]].append(gi)
```
```
CUSTOM_TYPE_CLOUDPICKLE = 2
CUSTOM_TYPE_TENSOR = 3  # For tensor with buffer reference
CUSTOM_TYPE_NESTED_TENSOR = 4  # For nested tensor (strided or jagged)
CUSTOM_TYPE_BATCHMETA = 5  # For BatchMeta serialization
```

Is there any serialization performance comparison for BatchMeta? I'm wondering if pickle could be faster for BatchMeta objects.
```
else:
    return pickle.loads(frames[0])
# pickle fallback path: serialize() sets frame[0] to _PICKLE_FALLBACK_SENTINEL on failure.
if len(frames) >= 2 and frames[0] == _PICKLE_FALLBACK_SENTINEL:
```

Why do we have to handle this outside serial_utils.py?
```
if global_idx not in self.field_dtypes:
    self.field_dtypes[global_idx] = {}
self.field_dtypes[global_idx].update(dtype_value[i])
# Update field_schema_cache with new dtype info
```

What will happen if:

- We put a uniform tensor (same dtype & shape) in field A -> it only has one dtype and shape entry
- We then append tensors with a different dtype & shape to the same field A? Will we update the dtype and shape info to reflect the new changes?

A following question: if the user retrieves the data put in step 1, what will the BatchMeta look like? Will it say this is a nested or an ordinary tensor?
jianjunzhong left a comment:

Scripts in recipe/simple_use_case cannot run properly.
```
self.production_status = np.zeros(batch_size, dtype=np.int8)
for field_name, meta in self.field_schema.items():
    if meta.get("per_sample_shapes") is not None:
```

per_sample_shapes is only meaningful when is_nested=True, but the validation logic doesn't check this correlation.
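The missing check the reviewer describes could be added to schema validation roughly like this. The function name and exact error messages are illustrative assumptions:

```python
def validate_field_schema(field_schema: dict, batch_size: int) -> None:
    """Reject schemas where per_sample_shapes appears without is_nested=True."""
    for name, meta in field_schema.items():
        shapes = meta.get("per_sample_shapes")
        if shapes is not None:
            if not meta.get("is_nested", False):
                raise ValueError(f"field '{name}': per_sample_shapes set but is_nested is False")
            if len(shapes) != batch_size:
                raise ValueError(f"field '{name}': expected {batch_size} per-sample shapes, got {len(shapes)}")

ok = {"nested": {"is_nested": True, "per_sample_shapes": [(4, 3), (2, 4)]}}
validate_field_schema(ok, batch_size=2)  # passes silently

bad = {"x": {"is_nested": False, "per_sample_shapes": [(1,), (2,)]}}
try:
    validate_field_schema(bad, batch_size=2)
    raised = False
except ValueError:
    raised = True
assert raised
```

Running this from `__post_init__` would surface inconsistent schemas at construction time instead of during later storage operations.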
```
    return np.dtype(dtype_str)
except TypeError:
    pass
# Fallback: return as-is (e.g. plain Python type repr like "<class 'int'>")
```

Should record a warning: `logger.warning(f"Unknown dtype string '{dtype_str}', returning as-is")`
```
def test_storage_unit_data_dict_key():
    """StorageUnitData dict-key: gi 直接作为 key,clear 真正释放内存."""
```

It's better to use English instead of Chinese.

Columnar BatchMeta + Zero-Copy Default
1. Context & Motivation
Closes: [refactor] Convert BatchMeta from row-oriented to column-oriented layout
The current BatchMeta uses a row-oriented design (BatchMeta → List[SampleMeta] → Dict[str, FieldMeta]), which introduces scaling issues in high-throughput scenarios: hot paths (build_storage_meta_groups, add_fields, _filter_storage_data) involve nested loops over every sample × every field, incurred multiple times per PUT.

This PR refactors BatchMeta to a column-oriented (structure-of-arrays) design, reducing metadata complexity from O(B×F) to O(B) + O(F), and enables zero-copy serialization by default with automatic pickle fallback.

2. Key Changes
2.1 Columnar BatchMeta (metadata.py)

- BatchMeta.samples: List[SampleMeta] is replaced by parallel columns: global_indexes, partition_ids, production_status
- Per-sample FieldMeta objects (B×F instances) are replaced by a single field_schema dict (F entries)
- Batch readiness is checked with np.all() on an ndarray instead of a per-sample scan
- The public surface shrinks from BatchMeta, SampleMeta, FieldMeta to BatchMeta only; the SampleMeta and FieldMeta classes are removed entirely
- The field_schema dict supports three field types: Regular Tensor, Nested Tensor (is_nested), Non-Tensor (is_non_tensor)
- production_status is stored as an np.ndarray (int8), enabling O(1) readiness checks via np.all()
serial_utils.py)ZERO_COPY_SERIALIZATIONenvironment variable switch2.3 Storage & Transport Adaptation
- simple_backend.py / simple_backend_manager.py / controller.py: adapted to the columnar API; clear() uses del instead of None assignment to reduce memory fragmentation
- zmq_utils.py: ZMQ transport uses the new serialization utilities; frame count reduced from O(B) to F+1 (one metadata header + one frame per field)

2.4 Test Suite
- test_metadata.py: fully rewritten for the columnar API (net -799 lines)
- Remaining tests adapted to the new BatchMeta constructor

3. Benchmark Results
Tests were conducted in Docker (single-node Ray) across 7 payload sizes, comparing three configurations.
Throughput Comparison (Gbps)
Speedup vs Baseline (main-no-zerocopy)
Visualization
Resource Usage
Columnar layout reduces CPU time by eliminating per-sample object creation and pickle overhead:
4. API Breaking Changes
- BatchMeta.samples (List[SampleMeta]): removed
- SampleMeta class: removed
- FieldMeta class: removed
- sample.fields['x'].dtype → batch.field_schema['x']['dtype']
- BatchMeta(samples=[...]) → BatchMeta(global_indexes=..., partition_ids=..., field_schema=..., production_status=...)
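A small migration sketch for the breaking change above, with plain dicts standing in for the real classes (the old row-oriented access is shown only in comments, since those classes no longer exist):

```python
# Old (row-oriented) access, per the breaking-change list:
#   dtype = batch.samples[0].fields["input_ids"].dtype
# New (columnar) access: one schema entry per field, regardless of batch size.
batch = {
    "global_indexes": [0, 1, 2],
    "field_schema": {"input_ids": {"dtype": "torch.int64", "shape": (512,)}},
}
dtype = batch["field_schema"]["input_ids"]["dtype"]
assert dtype == "torch.int64"
assert len(batch["field_schema"]) == 1  # F entries, not B×F
```

Callers that previously iterated samples to read field metadata should now read field_schema once per batch.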
- metadata.py
- serial_utils.py, zmq_utils.py
- simple_backend.py, simple_backend_manager.py, base.py
- controller.py
- test_metadata.py + 7 test files
- put_benchmark.py

6. Conclusion
The columnar BatchMeta refactoring, combined with default zero-copy serialization, delivers O(B) + O(F) metadata complexity, higher transport throughput, and lower CPU usage.