Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
13 changes: 5 additions & 8 deletions AGENTS.md
Original file line number Diff line number Diff line change
Expand Up @@ -37,11 +37,6 @@ agents/
tools/ # Agent tools
load-progress.sh # Read past context (JSONL)
record-progress.sh # Write progress logs (JSONL)
skill-bench/ # Claude Code Skill testing framework
runner.sh # Test runner
cases/ # Test case definitions (TOML, test prompts in English)
tools/ # Check scripts for validating test results
logs/ # Test execution logs
claude-plugin/ # Claude Code Plugin structure
skills/ # Individual skill definitions
mise.toml # Task definitions (fmt, clippy, test, pre-commit)
Expand All @@ -50,11 +45,13 @@ mise.toml # Task definitions (fmt, clippy, test, pre-commit)

## Skill-Bench Testing

`agents/skill-bench/` contains the testing framework for Claude Code skills:
`tests/` contains skill test cases using the `skill-bench` CLI:

- **Test cases are in English** - All `test_prompt` values in TOML files must be English
- **Run tests**: `./agents/skill-bench/runner.sh` (executes inside container)
- **Test pattern**: `./agents/skill-bench/runner.sh "cases/*/*.toml"`
- **List tests**: `skill-bench list tests`
- **Run tests**: `skill-bench run tests`
- **Filter by skill**: `skill-bench run tests --skill patent-search`
- **Log directory**: `skill-bench run tests --log logs/`

## Tools

Expand Down
4 changes: 2 additions & 2 deletions Cargo.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

1 change: 0 additions & 1 deletion agents/skill-bench/.gitignore

This file was deleted.

This file was deleted.

34 changes: 0 additions & 34 deletions agents/skill-bench/cases/patent-assignee-check/functional.toml

This file was deleted.

177 changes: 0 additions & 177 deletions agents/skill-bench/runner.sh

This file was deleted.

37 changes: 0 additions & 37 deletions agents/skill-bench/tools/check-mcp-loaded.sh

This file was deleted.

62 changes: 0 additions & 62 deletions agents/skill-bench/tools/check-mcp-success.sh

This file was deleted.

Loading
Loading