Handle small tactical/openings NPZ batches by lukifer23 · Pull Request #102 · lukifer23/Matrix0

lukifer23 · 2025-10-12T21:55:06Z

Summary

clamp tactical and openings sampling to the available curriculum data, enabling replacement sampling only when needed
skip empty tactical/openings shards to avoid malformed outputs
add regression tests that cover sampling from undersized tactical and openings NPZ files
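The sampling rule described above can be sketched as follows. This is a minimal illustration, not the code in azchess/data_manager.py: the function name `draw_indices` is hypothetical, and the standard-library `random` module stands in for whatever sampling call the project actually uses, which the quoted diff does not show. Only the three assignments in the middle mirror the lines quoted in the review.

```python
import random
from typing import List, Optional


def draw_indices(batch_size: int, total_positions: int,
                 rng: Optional[random.Random] = None) -> List[int]:
    """Return row indices for one batch, or [] when nothing can be drawn."""
    rng = rng or random.Random()
    # Empty shard (or a non-positive request): skip rather than emit a malformed batch.
    if total_positions <= 0 or batch_size <= 0:
        return []
    # These three lines mirror the assignments quoted in the review.
    draw_size = min(batch_size, total_positions)
    replace = batch_size > total_positions
    target_size = batch_size if replace else draw_size
    if replace:
        # More rows requested than exist: duplicates are unavoidable.
        return rng.choices(range(total_positions), k=target_size)
    # Enough rows available: draw distinct indices.
    return rng.sample(range(total_positions), k=target_size)
```

With these assignments, `target_size` works out to `batch_size` whenever at least one position exists, so a 3-row shard asked for 8 rows returns 8 indices with repeats, while a 5-row shard asked for 2 rows returns 2 distinct indices.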

Testing

pytest tests/test_data_manager_batches.py
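A regression test of this kind might look roughly like the sketch below. It is not the content of tests/test_data_manager_batches.py: the helper `sample_npz_batch` and the array names `positions` and `labels` are assumptions made for illustration, and the real data manager API may differ.

```python
import os
import tempfile

import numpy as np


def sample_npz_batch(path, batch_size, rng):
    """Hypothetical helper: sample one batch from an NPZ shard, or None if it is empty."""
    with np.load(path) as data:
        positions = data["positions"]  # assumed array name
        labels = data["labels"]        # assumed array name
    total_positions = positions.shape[0]
    if total_positions == 0:
        return None  # skip empty shards
    draw_size = min(batch_size, total_positions)
    replace = batch_size > total_positions
    target_size = batch_size if replace else draw_size
    idx = rng.choice(total_positions, size=target_size, replace=replace)
    return positions[idx], labels[idx]


def test_undersized_and_empty_shards():
    rng = np.random.default_rng(0)
    with tempfile.TemporaryDirectory() as tmp:
        # Shard with fewer rows than the requested batch size.
        small = os.path.join(tmp, "small.npz")
        np.savez(small, positions=np.zeros((3, 4), dtype=np.float32),
                 labels=np.arange(3))
        batch = sample_npz_batch(small, 8, rng)
        assert batch is not None
        assert batch[0].shape == (8, 4)
        assert batch[1].shape == (8,)

        # Shard with no rows at all must be skipped.
        empty = os.path.join(tmp, "empty.npz")
        np.savez(empty, positions=np.zeros((0, 4), dtype=np.float32),
                 labels=np.zeros((0,), dtype=np.int64))
        assert sample_npz_batch(empty, 8, rng) is None
```

Writing the shards into a temporary directory keeps the test independent of any curriculum files in the repository.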

https://chatgpt.com/codex/tasks/task_e_68e6f991ddcc83239164246fad36d7cc

Copilot

Pull Request Overview

This PR addresses the handling of small tactical and openings NPZ batch files to prevent runtime errors when sampling more data than available. The changes enable robust sampling from undersized curriculum data sources and skip empty data files.

Key changes include:

Clamping tactical and openings sampling to the available curriculum data size
Enabling replacement sampling only when the batch size exceeds the available data
Adding empty data validation to skip malformed outputs
Including regression tests for undersized NPZ file scenarios

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File	Description
tests/test_data_manager_batches.py	Adds regression tests that verify batch sampling from undersized tactical and openings NPZ files
azchess/data_manager.py	Updates tactical and openings batch methods to handle small files with proper size validation and replacement sampling


Copilot · 2025-10-12T21:55:35Z

azchess/data_manager.py

+                draw_size = min(batch_size, total_positions)
+                replace = batch_size > total_positions
+                target_size = batch_size if replace else draw_size


[nitpick] The variable draw_size is misleading since it represents the sample size when replacement is disabled, not the size being drawn. Consider renaming to sample_size_no_replace or clamped_size for clarity.

Suggested change

-                draw_size = min(batch_size, total_positions)
-                replace = batch_size > total_positions
-                target_size = batch_size if replace else draw_size
+                sample_size_no_replace = min(batch_size, total_positions)
+                replace = batch_size > total_positions
+                target_size = batch_size if replace else sample_size_no_replace

Copilot · 2025-10-12T21:55:35Z

azchess/data_manager.py

+                draw_size = min(batch_size, total_positions)
+                replace = batch_size > total_positions
+                target_size = batch_size if replace else draw_size


[nitpick] The variable draw_size is misleading since it represents the sample size when replacement is disabled, not the size being drawn. Consider renaming to sample_size_no_replace or clamped_size for clarity.

Suggested change

-                draw_size = min(batch_size, total_positions)
-                replace = batch_size > total_positions
-                target_size = batch_size if replace else draw_size
+                clamped_size = min(batch_size, total_positions)
+                replace = batch_size > total_positions
+                target_size = batch_size if replace else clamped_size

Handle small tactical and openings batches

a14ac46

Copilot AI review requested due to automatic review settings October 12, 2025 21:55

lukifer23 added the codex label Oct 12, 2025 — with ChatGPT Codex Connector

lukifer23 merged commit d3109a5 into master Oct 12, 2025
1 check failed

lukifer23 deleted the codex/clamp-draw-size-and-add-regression-tests branch October 12, 2025 21:55

Copilot AI reviewed Oct 12, 2025

lukifer23 merged 1 commit into master from codex/clamp-draw-size-and-add-regression-tests