Ensure inference fallback respects dynamic policy size #95
Conversation
Pull Request Overview
This PR ensures that inference fallback logic respects dynamic policy tensor sizes rather than using hardcoded dimensions. The changes add a helper function to dynamically derive policy width from shared memory resources and update both server and client paths to use this derived width when creating fallback policy logits.
- Added `_get_policy_width_from_resource()` helper function to extract policy tensor dimensions
- Updated server fallback logic to derive policy width from worker resources
- Updated client fallback logic to use dynamic policy width instead of hardcoded 4672
```python
        return int(width)
    except (TypeError, ValueError):
        return 0
```
The function returns 0 for invalid width values, but later code uses this as a tensor dimension. A zero-width tensor would cause runtime errors. Consider returning a sensible default width or raising an exception to fail fast.
```python
policy_logits_np = np.zeros(
    (batch_size, policy_width), dtype=np.float32
)
```
When policy_width is 0 (from the helper function), this creates a tensor with shape (batch_size, 0) which will likely cause errors in downstream code expecting valid policy dimensions.
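The failure mode is easy to reproduce: NumPy happily constructs a `(batch_size, 0)` array, but any consumer that selects an action along the policy axis breaks immediately:

```python
import numpy as np

batch_size, policy_width = 4, 0
policy_logits_np = np.zeros((batch_size, policy_width), dtype=np.float32)
print(policy_logits_np.shape)  # (4, 0)

# Downstream code that picks an action per row fails on the empty axis:
try:
    np.argmax(policy_logits_np, axis=1)
except ValueError as exc:
    print("argmax failed:", exc)
```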
```python
if policy_width <= 0:
    self.logger.debug(
        "Falling back to zero-width policy logits due to missing shape information"
    )
```
Similar to the server path, creating a zero-width policy tensor when policy_width is 0 will cause runtime errors. The fallback should ensure a valid tensor dimension.
```python
    )
)
policy_width = max(1, policy_width)
```
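The `max(1, policy_width)` clamp in the diff guarantees at least a width-1 placeholder distribution even when shape information is missing. A small sketch of the resulting behavior (the wrapper function name is illustrative):

```python
import numpy as np

def make_fallback_logits(batch_size: int, derived_width: int) -> np.ndarray:
    # Clamp so downstream consumers never see a zero-width policy axis.
    policy_width = max(1, derived_width)
    return np.zeros((batch_size, policy_width), dtype=np.float32)

print(make_fallback_logits(2, 4672).shape)  # (2, 4672)
print(make_fallback_logits(2, 0).shape)     # (2, 1)
```

A width-1 tensor is still semantically wrong for a real policy head, but unlike a zero-width one it keeps operations such as `argmax` from raising, which is what the fallback path needs.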
Summary
Testing
https://chatgpt.com/codex/tasks/task_e_68d1bc54bd888323b9feb57ff68200d9