fix(builtins): prevent awk parser panic on multi-byte UTF-8 by chaliy · Pull Request #476 · everruns/bashkit

chaliy · 2026-03-02T03:46:57Z

Summary

Add current_char() and advance() helpers for char-boundary safe parsing
Replace all 66+ chars().nth(self.pos) calls with current_char()
Fix string, regex, identifier, and comment parsing for multi-byte chars
Fix matches_keyword lookahead to use slice-based char access

Closes #395

The awk parser used byte offsets with chars().nth() (char index), causing panics when multi-byte UTF-8 appeared in comments, strings, or regex patterns. Added current_char()/advance() helpers for safe char-boundary handling and replaced all chars().nth(byte_offset) patterns with byte-safe slicing. Closes #395

chaliy force-pushed the claude/fix-395-Y2nIj branch from 5f75382 to 49a1814 Compare March 2, 2026 18:05

chaliy merged commit f527d64 into main Mar 2, 2026
17 checks passed

chaliy deleted the claude/fix-395-Y2nIj branch March 12, 2026 03:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(builtins): prevent awk parser panic on multi-byte UTF-8#476

fix(builtins): prevent awk parser panic on multi-byte UTF-8#476
chaliy merged 1 commit intomainfrom
claude/fix-395-Y2nIj

chaliy commented Mar 2, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

chaliy commented Mar 2, 2026

Summary

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants