Hook Engine + Chained Command Rewriting (PR #131 Part 1) by ahundt · Pull Request #156 · rtk-ai/rtk

ahundt · 2026-02-16T23:59:26Z

PR 131 Part 1: Hook Engine + Chained Command Rewriting

Branch: feat/rust-hooks-v2 | Base: master | Tests: 541 pass
Closes: #112 | Split from: PR #131
New dep: which = "7"
PR: #156

Context

FlorianBruniaux requested splitting PR #131 (52 files, 8K+ additions) into separate PRs:

1. Gemini CLI support -- standalone, no deps on the rest
2. Data safety rules (rm->trash, git reset --hard->stash) + rtk.*.md files
3. Chained command rewriting (cd && git status) -- note: #141 (feat(hooks): add cross-platform Node.js/Bun hook for Windows support) also implements this
4. Rust-based hooks -- the hook infrastructure changes

This PR combines items 3 + 4 because they are architecturally inseparable: the hook protocol handler calls lexer::tokenize() then analysis::parse_chain() to process chained commands. Separating them would require duplicating the lexer.

Coordination with PR #141: FlorianBruniaux noted overlap with #141's JS-based hook for Windows. This PR achieves Windows support via compiled Rust binary instead -- no bash, node, or bun required. CI/CD already builds Windows binaries. exec.rs uses cfg!(windows) for shell selection.

Merge Sequence

1. This PR -> master (foundation)
2. Retarget PR 2 (data safety) and PR 3 (Gemini) from feat/rust-hooks-v2 -> master
3. PR 2 and PR 3 can merge in any order (zero file conflicts between them)

Summary

Replaces the 204-line bash hook with a native Rust binary that provides quote-aware chained command rewriting. Closes #112 where cd /path && git status only rewrote cd.

Impact: Captures ~12-20M tokens/month in previously-missed optimizations across chained commands.

Why Rust over bash:

Chained commands work (cd && git status rewrites both)
Extensible (data safety rules in Part 2)
Debuggable (rtk hook check shows exact rewrites)
Multi-platform (Windows support, no JS dependencies)
Backward compatible (legacy .sh becomes 4-line shim)

`rtk hook claude` -- Claude Code PreToolUse handler

Reads JSON from stdin, applies rewriting, outputs JSON to stdout. Fail-open: malformed input exits 0 with no output so Claude proceeds unchanged.

stdin:  {"tool_input":{"command":"git status"}}
stdout: {"hookSpecificOutput":{"permissionDecision":"allow","updatedInput":{"command":"rtk run -c 'git status'"}}}
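The fail-open behavior and the response shape above can be sketched as follows. This is a minimal, std-only illustration and not the code in claude_hook.rs: `HookOutcome`, `fail_open`, `json_escape`, and `allow_with_command` are hypothetical names, and the JSON is built by hand only to keep the example dependency-free.

```rust
/// What one hook invocation prints (if anything) and its exit code.
/// Hypothetical type, used only for this sketch.
#[derive(Debug, PartialEq)]
struct HookOutcome {
    stdout: Option<String>,
    exit_code: i32,
}

/// Fail-open: any error produces no output and exit code 0,
/// so the caller proceeds with the original, unmodified command.
fn fail_open<E>(result: Result<String, E>) -> HookOutcome {
    match result {
        Ok(json) => HookOutcome { stdout: Some(json), exit_code: 0 },
        Err(_) => HookOutcome { stdout: None, exit_code: 0 },
    }
}

/// Escape a string for embedding inside a JSON string literal.
fn json_escape(s: &str) -> String {
    let mut out = String::with_capacity(s.len());
    for c in s.chars() {
        match c {
            '"' => out.push_str("\\\""),
            '\\' => out.push_str("\\\\"),
            '\n' => out.push_str("\\n"),
            '\r' => out.push_str("\\r"),
            '\t' => out.push_str("\\t"),
            c if (c as u32) < 0x20 => out.push_str(&format!("\\u{:04x}", c as u32)),
            c => out.push(c),
        }
    }
    out
}

/// Build the "allow with updated command" response in the shape shown
/// in the stdout example of the PR description.
fn allow_with_command(command: &str) -> String {
    format!(
        r#"{{"hookSpecificOutput":{{"permissionDecision":"allow","updatedInput":{{"command":"{}"}}}}}}"#,
        json_escape(command)
    )
}
```

A handler built this way would call `fail_open` on the result of parsing and rewriting stdin, print `stdout` only when it is `Some`, and always exit with `exit_code`.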

Chained command rewriting (closes #112)

Before: cd /tmp && git status -- hook only saw cd, missed git status
After: lexer splits on &&/||/; respecting quotes, each command wrapped independently

git commit -m "Fix && Bug" is NOT split (quote-aware).
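For reviewers who want a feel for the quote-aware split, here is a simplified sketch. It is not the lexer in src/cmd/lexer.rs: `Segment`, `split_chain`, and `push_command` are hypothetical names, and the sketch ignores heredocs, `$(...)`, and single `&` or `|`.

```rust
/// One piece of a chained command line: a command, or the operator joining two commands.
#[derive(Debug, PartialEq, Clone)]
enum Segment {
    Command(String),
    Operator(String),
}

/// Split `input` on unquoted `&&`, `||`, and `;`.
fn split_chain(input: &str) -> Vec<Segment> {
    let chars: Vec<char> = input.chars().collect();
    let mut segments = Vec::new();
    let mut current = String::new();
    let mut quote: Option<char> = None; // Some('"') or Some('\'') while inside quotes
    let mut i = 0;

    while i < chars.len() {
        let c = chars[i];
        // A backslash escapes the next character, except inside single quotes.
        if c == '\\' && quote != Some('\'') && i + 1 < chars.len() {
            current.push(c);
            current.push(chars[i + 1]);
            i += 2;
            continue;
        }
        match quote {
            Some(q) => {
                if c == q {
                    quote = None; // closing quote
                }
                current.push(c);
                i += 1;
            }
            None => {
                let next = chars.get(i + 1).copied();
                let op = match (c, next) {
                    ('&', Some('&')) => Some("&&"),
                    ('|', Some('|')) => Some("||"),
                    (';', _) => Some(";"),
                    _ => None,
                };
                if let Some(op) = op {
                    push_command(&mut segments, &mut current);
                    segments.push(Segment::Operator(op.to_string()));
                    i += op.len();
                    continue;
                }
                if c == '"' || c == '\'' {
                    quote = Some(c); // opening quote
                }
                current.push(c);
                i += 1;
            }
        }
    }
    push_command(&mut segments, &mut current);
    segments
}

/// Flush the accumulated text as a command, skipping empty pieces.
fn push_command(segments: &mut Vec<Segment>, current: &mut String) {
    let trimmed = current.trim();
    if !trimmed.is_empty() {
        segments.push(Segment::Command(trimmed.to_string()));
    }
    current.clear();
}
```

With this sketch, `cd /tmp && git status` yields two commands joined by `&&`, while `git commit -m "Fix && Bug"` stays a single command because the `&&` sits inside double quotes.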

`rtk run -c <command>` -- Command executor

Parses chains, detects shellisms (globs/pipes/subshells -> passthrough to sh/cmd), handles builtins (cd/export/pwd), applies output filters, prevents recursion via RTK_ACTIVE env guard.
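The shellism check, the platform shell selection, and the recursion guard could look roughly like this. It is a sketch under assumptions, not the code in exec.rs: the metacharacter list is illustrative, the function names are hypothetical, and the guard semantics (set `RTK_ACTIVE` on the child, check it on entry) are one plausible reading of "RTK_ACTIVE env guard".

```rust
use std::env;
use std::process::Command;

/// Name of the recursion-guard variable mentioned in the PR description.
const GUARD_VAR: &str = "RTK_ACTIVE";

/// Illustrative (not exhaustive) set of characters treated as shellisms:
/// globs, pipes, redirects, and subshell/expansion syntax.
const SHELL_METACHARS: &[char] = &['*', '?', '[', '|', '<', '>', '$', '`', '(', ')'];

/// Returns true when the command probably needs a real shell.
/// Simplification: quoting is ignored, so a quoted `*` also counts.
fn needs_shell(command: &str) -> bool {
    command.chars().any(|c| SHELL_METACHARS.contains(&c))
}

/// Returns true when this process was started from inside another guarded run.
#[allow(dead_code)]
fn is_nested_run() -> bool {
    env::var_os(GUARD_VAR).is_some()
}

/// Build (but do not spawn) the passthrough command for the platform shell,
/// marking the child so that a nested invocation can detect the recursion.
fn passthrough_command(command: &str) -> Command {
    let (shell, flag) = if cfg!(windows) { ("cmd", "/C") } else { ("sh", "-c") };
    let mut child = Command::new(shell);
    child.arg(flag).arg(command).env(GUARD_VAR, "1");
    child
}
```

An executor following this shape would check `is_nested_run()` first and skip all rewriting when it returns true, then use `needs_shell` to decide between native handling and `passthrough_command`.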

`rtk hook check` -- Debugger

rtk hook check "cd /tmp && git status"
# Output: rtk run -c 'cd /tmp' && rtk run -c 'git status'
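The rewritten form shown in that output can be reproduced with a small sketch. The function names are hypothetical and the quoting is standard POSIX single-quoting; the PR's actual quoting rules may differ.

```rust
/// Quote `s` for a POSIX shell by wrapping it in single quotes.
/// An embedded single quote is written as '\'' (close, escaped quote, reopen).
fn single_quote(s: &str) -> String {
    format!("'{}'", s.replace('\'', "'\\''"))
}

/// Wrap one command in the `rtk run -c '<command>'` form.
fn wrap_command(command: &str) -> String {
    format!("rtk run -c {}", single_quote(command))
}

/// Rebuild a chain from already-split commands and the operators between them.
/// `operators[i]` joins `commands[i]` and `commands[i + 1]`.
fn rewrite_chain(commands: &[&str], operators: &[&str]) -> String {
    let mut out = String::new();
    for (i, command) in commands.iter().enumerate() {
        out.push_str(&wrap_command(command));
        if i + 1 < commands.len() {
            if let Some(op) = operators.get(i) {
                out.push(' ');
                out.push_str(op);
                out.push(' ');
            }
        }
    }
    out
}
```

Calling `rewrite_chain(&["cd /tmp", "git status"], &["&&"])` produces the same string as the `rtk hook check` example.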

Changes

16 files changed (+2969, -221)

New (src/cmd/): mod.rs, hook.rs, claude_hook.rs, lexer.rs, analysis.rs, builtins.rs, exec.rs, filters.rs, predicates.rs, test_helpers.rs

Modified: src/main.rs (+Commands::Run, +Commands::Hook), src/init.rs (register binary hook), hooks/rtk-rewrite.sh (204-line script -> 4-line shim), Cargo.toml (+which), INSTALL.md (+Windows section)

Intentionally excluded (stacked PRs):

Safety rules -> PR 2 (feat/data-safety-rules-v2)
Gemini support -> PR 3 (feat/gemini-support-v2)

Review Guide

Focus areas:

src/cmd/lexer.rs + analysis.rs -- Chain parsing correctness (quote handling)
src/cmd/claude_hook.rs -- Protocol compliance, fail-open design
src/cmd/exec.rs -- Builtin handling, Windows shell selection (cfg!(windows))
src/cmd/hook.rs -- Shared decision logic (used by Parts 2 and 3)

Implementation Notes

Binary size: Compiled with LTO + stripping. Size increase from which dependency minimal (<0.1 MB). Full size impact measurable after all 3 parts merge (PR #131 reported 5.1 MB total, +0.3 MB from combined deps).

Backward compatible: All existing RTK features work unchanged. Legacy bash hook becomes 4-line shim forwarding to rtk hook claude.

Test Plan

cargo test -- 541 tests pass (hook:22, claude_hook:18, lexer:28, analysis:10, builtins:8, exec:22, filters:5, predicates:4)
echo '{"tool_input":{"command":"git status"}}' | cargo run -- hook claude -- JSON rewrite works
echo '{"tool_input":{"command":"cd /tmp && git status"}}' | cargo run -- hook claude -- chain split works
cargo run -- hook check "git status" -- text debugger works
cargo run -- run -c "echo hello" -- executor works
grep 'cfg!(windows)' src/cmd/exec.rs -- Windows shell selection present

Related PRs (Split from PR #131)

Part	PR	Description
1	#156	Hook Engine + Chained Commands (this PR)
2	#157	Data Safety Rules
3	#158	Gemini CLI Support

Merge order: Part 1 first → retarget Parts 2 & 3 to master → merge in any order

Rust binary replaces 204-line bash script as Claude Code PreToolUse hook. Adds rtk hook claude, rtk run -c, and Windows support via cfg!(windows). Closes rtk-ai#112 (chained commands missed).

Based on updated master (70c3786) which includes:
- Hook audit mode (rtk-ai#151)
- Claude Code agents and skills (d8f4659)
- tee raw output feature (rtk-ai#134)

Migrated from feat/rust-hooks (571bd86) with conflict resolution for:
- src/main.rs: Commands enum (preserved both hook audit + our hook commands)
- src/init.rs: Hook registration (integrated both approaches)

New files (src/cmd/ module):
- mod.rs: Module declarations (10 modules, excluding safety/trash/gemini for PR 1)
- hook.rs: Shared hook decision logic (21 tests, 3 safety tests removed for PR 2)
- claude_hook.rs: Claude Code JSON protocol handler (18 tests)
- lexer.rs: Quote-aware tokenizer (28 tests)
- analysis.rs: Chain parsing and shellism detection (10 tests)
- builtins.rs: cd/export/pwd/echo/true/false (8 tests)
- exec.rs: Command executor with recursion guard (22 tests, safety dispatch removed for PR 2)
- filters.rs: Output filter registry (5 tests)
- predicates.rs: Context predicates (4 tests)
- test_helpers.rs: Test utilities

Modified files:
- src/main.rs: Added Commands::Run, Commands::Hook, HookCommands enum, routing
- src/init.rs: Changed patch_settings_json to use rtk hook claude binary command
- hooks/rtk-rewrite.sh: Replaced 204-line bash script with 4-line shim (exec rtk hook claude)
- Cargo.toml: Added which = 7 for PATH resolution
- INSTALL.md: Added Windows installation section

Windows support:
- exec.rs:175-176: cfg!(windows) selects cmd /C vs sh -c for shell passthrough
- predicates.rs:26: USERPROFILE fallback for Windows home directory
- No bash, node, or bun dependency - rtk hook claude is a compiled Rust binary

Tests: All 541 tests pass

pszymkowiak · 2026-02-18T21:40:59Z

Thanks Andrew for the clean split from #131, and the architecture is genuinely well thought out: the lexer, fail-open design, RAII guard, and deny(clippy::print_stdout) are all excellent.

However, I have a critical concern that I think is a regression:

The hook wraps everything in rtk run -c '...' instead of routing to specialized filters.

The current bash hook does:
git status → rtk git status (uses src/git.rs, 80% savings)
cargo test → rtk cargo test (uses src/runner.rs, 90% savings)
gh pr view 123 → rtk gh pr view 123 (uses src/gh_cmd.rs, 87% savings)

This PR does:
git status → rtk run -c 'git status' (exec.rs → strip_ansi only, ~0% savings)
cargo test → rtk run -c 'cargo test' (exec.rs → strip_ansi only, ~0% savings)

The entire value of RTK is the specialized filters per command. rtk cargo test shows only failures (90% token reduction). rtk run -c 'cargo test' just runs the command and strips ANSI codes. That's a massive regression for every user.

Did you test this with real commands and compare token savings? I'd expect rtk run -c 'cargo test' to produce significantly more output than rtk cargo test.

What I'd expect instead:
The hook should still route to rtk git status, rtk cargo test, etc. The chained command support is the real win here (cd /tmp && git status → cd /tmp && rtk git status), but the routing to specialized filters must be preserved.

A few other items:

Streaming: run_passthrough uses .output() which buffers everything. A cargo build --release (2+ min) shows zero output until completion. That's a UX regression.
Hardcoded path: /Users/athundt/.claude/... in claude_hook.rs comments — please remove.
cd persistence: builtins.rs comment says it "maintains session state across hook calls" but each hook invocation is a new process, so cd can't persist. The comment is misleading.
Secrets integration: We're working on a secrets vault feature (encrypt/decrypt pass in the hook). The 4-line shim replacing rtk-rewrite.sh removes our decrypt logic. We'll need to port that to Rust, which is fine and actually better, but we need to coordinate.

The foundation here is solid. The lexer, chain parsing, and fail-open protocol handling are exactly what we need. But the command routing needs to preserve the specialized filters; that's RTK's core value.

Happy to discuss the best approach. Would it make sense to have rtk run -c detect known commands and dispatch to their specialized modules internally?

pszymkowiak · 2026-02-18T21:45:19Z

I tested the PR locally and found a critical issue: the hook routes everything through rtk run -c, bypassing all specialized filters.

Here's what I measured:

Command	raw	rtk (current)	rtk run -c (this PR)
git log -10	792 tok	119 (85% saved)	792 (0%)
git status	38	7 (82%)	38 (0%)
cargo test	2,429	8 (99.7%)	2,425 (0%)
cargo clippy	2,545	156 (94%)	2,527 (0%)
grep 'fn run'	941	534 (43%)	941 (0%)
ls src/	479	106 (78%)	479 (0%)
Total	7,249	952 (87%)	7,227 (0.3%)

…commands

Replaces stub check_for_hook_inner with full tokenize+native-path dispatch. Adds route_native_command() with replace_first_word/route_pnpm/route_npx helpers to route single parsed commands to optimized RTK subcommands. Chains (&&/||/;) and shellisms still use rtk run -c. No safety integration (PR rtk-ai#157 adds that). Mirrors ~/.claude/hooks/rtk-rewrite.sh routing table. Corrects shell script vitest double-run bug for pnpm vitest run flags.

ahundt · 2026-02-19T08:06:37Z

Thanks @pszymkowiak, I found that independently too and just pushed a fix. Hopefully it works now!

Also, there is one dependency that could optionally be added to make ANSI stripping more robust and improve token reduction, but I left it out in favor of a small regex for now to minimize deps. strip-ansi-escapes strips ANSI escape sequences from byte sequences using the vte terminal parser, so it handles the full escape sequence space (CSI, OSC, private-mode params, two-byte sequences) that a hand-rolled regex misses. https://crates.io/crates/strip-ansi-escapes

rtk has no `tail` subcommand — routing to "rtk tail" was silently broken (rtk would error "unrecognized subcommand"). Remove the Route entry so the command falls through to `rtk run -c '...'` correctly. Move the log-tailing test cases from test_routing_native_commands (which asserted the broken path) into test_routing_fallbacks_to_rtk_run where they correctly verify the rtk-run-c fallback behavior.

Port tests added during the ROUTES table integration that were missing from the v2 worktree:

registry.rs:
- 12 classify tests for Python/Go commands (pytest, go×4, ruff×2, pip×3, golangci-lint) that verify PATTERNS/RULES and ROUTES alignment
- 11 lookup tests (test_lookup_*, test_no_duplicate_binaries_in_routes, test_lookup_is_o1_consistent) that verify O(1) HashMap routing

hook.rs:
- Extend test_routing_native_commands from 20 to 47 cases covering all ROUTES entries: docker, kubectl, curl, eslint, tsc, prettier, playwright, prisma, pytest, golangci-lint, ruff, pip, gh variants
- Add test_routing_subcommand_filter_fallback (14 cases) verifying that Only[] subcommand filters correctly reject unmatched subcommands

Total: 545 → 569 tests (+24)

ahundt · 2026-02-19T10:30:25Z

OK, I made it a much better router that lives in registry.rs, so all the "add a new tool" code is in one place. It should be much more efficient than the previous two versions and easier to extend, to better address the bug you mentioned.
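To make the routing idea concrete, here is a rough sketch of a table-driven router with a subcommand filter and an `rtk run -c` fallback. It is not the contents of registry.rs: `Subcommands`, `routes`, and `route` are hypothetical names, and the table entries are a small illustrative subset chosen from commands mentioned in this thread.

```rust
use std::collections::HashMap;

/// Which subcommands of a binary have a specialized rtk filter.
#[derive(Debug, Clone, Copy)]
enum Subcommands {
    /// Every invocation of the binary is routed natively.
    Any,
    /// Only the listed subcommands are routed natively.
    Only(&'static [&'static str]),
}

/// Illustrative routing table keyed by binary name (average O(1) lookup).
fn routes() -> HashMap<&'static str, Subcommands> {
    HashMap::from([
        ("git", Subcommands::Only(&["status", "log", "diff"])),
        ("cargo", Subcommands::Only(&["test", "clippy", "build"])),
        ("ls", Subcommands::Any),
    ])
}

/// Route one already-split command: `rtk <command>` when a specialized
/// filter is registered, otherwise the generic `rtk run -c '<command>'`.
fn route(command: &str, routes: &HashMap<&'static str, Subcommands>) -> String {
    let mut words = command.split_whitespace();
    let binary = words.next();
    let subcommand = words.next();

    let native = match binary.and_then(|b| routes.get(b)) {
        Some(Subcommands::Any) => true,
        Some(Subcommands::Only(allowed)) => {
            subcommand.map_or(false, |s| allowed.iter().any(|a| *a == s))
        }
        None => false,
    };

    if native {
        format!("rtk {command}")
    } else {
        // Fallback: POSIX single-quoting of the whole command.
        format!("rtk run -c '{}'", command.replace('\'', "'\\''"))
    }
}
```

Under this sketch `git status` becomes `rtk git status`, while a binary with no table entry, such as `tail`, falls through to `rtk run -c '...'`. Keeping the table in one place means adding a tool is a single new entry.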

Three integration tests that simulate the full hook pipeline from scratch: raw command → check_for_hook (lexer + router) → rewritten rtk cmd → execute both → assert rtk output has fewer tokens than raw

Tests:
- test_e2e_git_status_saves_tokens: verifies ≥40% savings vs raw git status
- test_e2e_ls_saves_tokens: verifies ≥40% savings vs raw ls -la
- test_e2e_git_log_saves_tokens: verifies ≤5% overhead (already-compact input)

Each test first asserts the lexer+router produced the correct rewrite, then executes both commands and compares whitespace-delimited token counts.

Run with: cargo test e2e -- --ignored
Requires: cargo install --path . (rtk on PATH) + git repo
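The savings comparison these tests describe comes down to two small calculations, sketched below. `count_tokens` and `savings_percent` are hypothetical helper names, not the test code in this PR; the whitespace-delimited count follows the commit message above.

```rust
/// Count whitespace-delimited tokens, the proxy used for output size.
fn count_tokens(text: &str) -> usize {
    text.split_whitespace().count()
}

/// Percentage of tokens saved by the filtered output relative to the raw output.
/// A negative result means the filtered output is larger (overhead).
fn savings_percent(raw_tokens: usize, filtered_tokens: usize) -> f64 {
    if raw_tokens == 0 {
        return 0.0; // nothing to save on empty output
    }
    (1.0 - filtered_tokens as f64 / raw_tokens as f64) * 100.0
}
```

Applied to the figures measured earlier in this thread, git status at 38 raw tokens and 7 filtered tokens gives about 82% savings, which would clear the ≥40% threshold.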

ahundt · 2026-02-20T00:28:10Z

@pszymkowiak I also added a few small e2e tests to confirm the behavior actually uses the rtk internals and those pass.

This was referenced Feb 17, 2026

Data Safety Rules + Extensible Rule System (PR #131 Part 2) #157

Open

Gemini CLI Hook Support (PR #131 Part 3) #158

Open

ahundt changed the title ~~feat: Rust-based hook engine with chained command rewriting~~ Hook Engine + Chained Command Rewriting (PR #131 Part 1) Feb 17, 2026

This was referenced Feb 17, 2026

Gemini CLI support, rm→trash / git→stash data safety rtk.*.md rules, chained command rewriting, Rust-based hooks #131

Closed

feat(hooks): add cross-platform Node.js/Bun hook for Windows support #141

Closed

This was referenced Feb 17, 2026

Optional feature for safely remapping commands? (Eg rm -> trash) #115

Open

Hook: chained commands (cd dir && cmd) are never rewritten #112

Closed

aeppling added the P1-critical (blocks users, fix ASAP) label Feb 18, 2026

pszymkowiak added the invalid (This doesn't seem right) label Feb 18, 2026

ahundt added 2 commits February 19, 2026 05:07

pszymkowiak mentioned this pull request Feb 20, 2026

feat(hook): native cross-platform (Windows & more) hook-rewrite command #150

Open

6 tasks

ahundt wants to merge 5 commits into rtk-ai:master from ahundt:feat/rust-hooks-v2