[TEST] Refactor instruction normalization and expand test coverage by RainRat · Pull Request #302 · RainRat/pythonevolverstage

RainRat · 2026-02-05T11:01:16Z

This PR improves the robustness of the normalize_instruction utility in evolverstage.py and addresses a significant gap in test coverage.

Changes:

Refactored normalize_instruction:
- Replaced the fragile re.split logic with a robust regular expression (re.match) that correctly parses Redcode instructions even with non-standard formatting.
- Implemented default handling: missing modifiers now default to .I, and missing addressing modes default to $.
- Added support for case-insensitivity (e.g., mov -> MOV).
- Improved whitespace handling, including support for tabs and varied spacing around commas.
- Added trailing comment stripping to prevent parsing errors when comments are present on the same line as an instruction.
- Ensures consistent output format: OPCODE.MODIFIER MODE VALUE,MODE VALUE.
Expanded Test Suite:
- Updated tests/test_instruction_normalization.py from basic happy-path tests to a comprehensive suite of 12 tests.
- Added specific test cases for:
  - Missing modifiers.
  - Missing addressing modes.
  - Extra whitespace and tabs.
  - Case-insensitivity.
  - Trailing comments.
  - Various separators (comma vs space).

Impact:

These changes prevent potential crashes and warrior corruption during the unarchiving and normalization processes, making the evolver much more resilient to varied Redcode source files. All existing and new tests pass successfully.

PR created automatically by Jules for task 4604263877508189389 started by @RainRat

- Completely rewrote the brittle `normalize_instruction` function in `evolverstage.py` to use a robust regex-based parser. - Added support for missing modifiers (defaults to .I), missing addressing modes (defaults to $), case-insensitivity, varied whitespace/tabs, and trailing comments. - Expanded `tests/test_instruction_normalization.py` with 12 comprehensive test cases covering these edge cases. - Verified that all 118 tests in the suite pass. Co-authored-by: RainRat <20098977+RainRat@users.noreply.github.com>

google-labs-jules · 2026-02-05T11:01:17Z

👋 Jules, reporting for duty! I'm here to lend a hand with this pull request.

When you start a review, I'll add a 👀 emoji to each comment to let you know I've read it. I'll focus on feedback directed at me and will do my best to stay out of conversations between you and other bots or reviewers to keep the noise down.

I'll push a commit with your requested changes shortly after. Please note there might be a delay between these steps, but rest assured I'm on the job!

For more direct control, you can switch me to Reactive Mode. When this mode is on, I will only act on comments where you specifically mention me with @jules. You can find this option in the Pull Request section of your global Jules UI settings. You can always switch back!

New to Jules? Learn more at jules.google/docs.

For security, I will only act on instructions from the user who triggered this task.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 30a417e187

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-02-05T11:02:43Z

evolverstage.py

+    if not clean_instr:
+        return ""


Reject comment-only input instead of returning empty output

Returning "" for comment-only lines makes callers treat a non-instruction as successfully normalized, which is harmful in the unarchive workflow: the loop increments its line counter before calling normalize_instruction, so comment/header lines consume the warrior length budget without writing an instruction and the DAT padding step may no longer fill to the intended length. This can silently produce truncated warriors when importing third-party .red files that include metadata comments.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-02-05T11:02:43Z

evolverstage.py

+                value = int(op[1:])
+            except ValueError:
+                value = 0


Preserve parse failures instead of coercing bad operands to zero

The new operand parser converts any non-numeric operand suffix (for example malformed tokens like $foo) into 0, which silently rewrites invalid instructions into valid-but-different code. In normalization paths this corrupts warrior behavior and makes bad input hard to detect; previously these cases raised and could be surfaced by callers (e.g., warning/skip logic in normalization mode) instead of mutating semantics to $0/#0.

Useful? React with 👍 / 👎.

chatgpt-codex-connector bot reviewed Feb 5, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TEST] Refactor instruction normalization and expand test coverage#302

[TEST] Refactor instruction normalization and expand test coverage#302
RainRat wants to merge 1 commit intomainfrom
improve-instruction-normalization-4604263877508189389

RainRat commented Feb 5, 2026

Uh oh!

google-labs-jules bot commented Feb 5, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Feb 5, 2026

Uh oh!

chatgpt-codex-connector bot Feb 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

RainRat commented Feb 5, 2026

Changes:

Impact:

Uh oh!

google-labs-jules bot commented Feb 5, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant