@@ -73,7 +81,7 @@ Evals are **LLM-judged integration tests** that verify skills and hooks work cor
 | Field | Type | Required | Description |
 |-------|------|----------|-------------|
 |`version`|`number`|**Yes**| Eval config format version. Currently `1`. |
-|`engine`|`string`|**Yes**| Agent runtime to use: `"claude-code"`, `"copilot"`, `"codex"`, `"cursor"`. |
+|`engine`|`string`|**Yes**| Agent runtime to use. Supported values: `"claude-code"`, `"copilot"`, `"codex"`, `"cursor"`. Current headless eval support is shown in [Platform Eval Entry Points](#platform-eval-entry-points). |
 |`timeout`|`number`| No | Max seconds per eval case. Default `120`. |
 |`judge`|`string`| No | Model used for LLM-as-judge assessment. Default: same as engine model. |
 |`sandbox.network`|`bool`| No | Allow network access in sandbox. Default `false`. |
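
Putting the fields above together, a minimal eval config might look like the following sketch. The file name and the YAML layout (including nesting `sandbox.network` as a `network` key under `sandbox`) are assumptions; only the field names, types, and defaults come from the table:

```yaml
# evals.yaml — hypothetical file name, for illustration only
version: 1              # eval config format version (required; currently 1)
engine: "claude-code"   # agent runtime (required)
timeout: 180            # optional; overrides the 120-second per-case default
judge: "claude-code"    # optional; defaults to the engine's model
sandbox:
  network: false        # optional; network access is off by default
```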