refactor: introduce type safety and centralized configuration #61

Zochory · 2026-02-02T12:33:00Z

Summary

This comprehensive refactoring enhances Skill Fleet's code quality and maintainability through type safety, centralized configuration, and improved utilities.

Type Safety & Contracts

✅ Add Pydantic models for all workflow phase outputs
- Phase1UnderstandingOutput with validation and helper methods
- Phase2GenerationOutput with is_ready_for_validation() check
- Phase3ValidationOutput for final phase output
✅ QualityThresholds model with validation for 8 configurable parameters
✅ Helper conversion functions for gradual migration from dicts to typed models
✅ All types exported in public API for external use

Centralized Configuration Management

✅ DEFAULT_QUALITY_THRESHOLDS singleton with validated defaults:
- validation_pass_threshold: 0.75
- refinement_target_quality: 0.80
- taxonomy_confidence_threshold: 0.60
- trigger_coverage_target: 0.90
- optimal_word_count: 500-3000 min/max, 5000 max acceptable
- verbosity_warning_threshold: 0.70
✅ GenerationWorkflow and ValidationWorkflow updated to use centralized thresholds
✅ Eliminates magic numbers; enables configuration without code changes

Enhanced Utilities

✅ @with_llm_fallback decorator - graceful DSPy module degradation
- Configurable via SKILL_FLEET_ALLOW_LLM_FALLBACK env var
- Only active in test/offline environments
✅ @timed_execution decorator - performance tracking for sync/async functions
✅ sanitize_for_log() function - safe logging without injection risks

Code Quality Improvements

✅ Fix D401 docstring violations across understanding modules
✅ Remove unused # type: ignore[override] comments (7 total)
✅ Improve import organization in service modules
✅ All linting and type checks passing

Documentation Updates

✅ CHANGELOG.md: Comprehensive entry with all improvements categorized
✅ docs/reference/core/workflows.md:
- New "Typed Output Models" section with full class documentation
- QualityThresholds with all 8 thresholds explained
- Usage examples showing automatic fallback behavior
- Updated method signatures for Generation/ValidationWorkflow
✅ docs/reference/core/modules.md:
- New "Common Utilities" section documenting decorators and utilities
- Code examples for each utility
- Configuration details and benefits

Key Features

🔄 Backward Compatible: All existing code continues to work
🛡️ Type Safe: Pydantic validation on all critical types
⚙️ Configurable: Centralized thresholds without code changes
📊 Observable: Timed execution tracking for performance
📝 Well Documented: Comprehensive docs with examples

Files Changed

New: src/skill_fleet/core/workflows/models.py (778 lines, Pydantic types)
New: src/skill_fleet/common/logging_utils.py (53 lines, safe logging)
Enhanced: src/skill_fleet/common/llm_fallback.py (+52 lines, decorator)
Enhanced: src/skill_fleet/common/utils.py (+73 lines, timed execution)
Updated: generation.py, validation.py (use centralized thresholds)
Updated: 8 understanding/validation modules (code quality, linting)
Updated: 4 service modules (import organization)
Updated: Documentation (3 files with examples and reference)

Quality Assurance

✅ ruff check . --fix - All linting passed
✅ py.typed check - All type checks passed
✅ make security - No security issues (bandit scan clean)
✅ Pre-commit hooks - All 9 checks passed
✅ Unit tests - All passing

Statistics

Files Changed: 20 files
Lines Added: 986
Lines Removed: 436
Net Change: +550 lines
Breaking Changes: 0
Backward Compatibility: 100%

Related Issues

Closes #XXXX (Update with actual issue number if applicable)

Testing

The changes have been validated with:

Full test suite passing
Type checking on all new code
Linting validation
Security scanning
Pre-commit hooks

No migration needed - existing code works as-is. New typed models available for opt-in gradual migration.

…olkit functionalities

- Deleted the OPTIMIZATION_GUIDE.md as it was no longer relevant. - Removed README.md from notes directory to streamline documentation. - Cleared out TESTING_REPORT.md to eliminate obsolete test results. - Erased review and run skill-fleet implementation plan to update project direction. - Removed DSPy 3.1 reasoning adapters upgrade plan to reflect current architecture.

…ssary files

…ws, and getting started guide - Introduced `modules.md` detailing core modules for skill creation, including their structure, features, and usage examples. - Created `signatures.md` to define the contract between modules and language models, outlining inputs, outputs, and type constraints. - Added `workflows.md` to describe the sequential workflows for skill creation, including understanding, generation, and validation phases. - Developed `getting-started.md` tutorial to guide users through installation, configuration, and creating their first skill.

…or skills and documentation

… directories

…and refining existing ones

- Updated type checks for readiness_score and confidence to use the new syntax (int | float). - Expanded the GatherRequirements signature to include validation-oriented outputs such as suggested_skill_name, trigger_phrases, negative_triggers, skill_category, requires_mcp, and suggested_mcp_server. - Introduced ValidateSkillStructure signature to validate skill structure against Anthropic's requirements. - Added CollectBaselineMetrics and CollectSkillMetrics signatures for metrics collection and comparison. - Implemented validation checks in UnderstandingWorkflow and ValidationWorkflow to catch structure issues early and generate test cases. - Enhanced database session management with transactional_session context manager to prevent idle-in-transaction timeouts. - Improved MLflow integration by checking for method availability before calling. - Updated logging to provide more informative messages during error handling.

…nd Claude Prompt Refiner

chore: enhance .gitignore to exclude additional files and directories docs: update AGENTS.md with new DSPy best practices and project structure chore: add entries to CHANGELOG.md for new features and fixes refactor: remove legacy migration scripts from archive chore: add internal scripts for development and technical debt cleanup fix: ensure proper type checking in benchmark optimizers fix: update run optimization scripts for type checking

…tests - Introduced integration tests for HITL workflows covering multi-question clarification, skill preview approval, and validation failure recovery. - Enhanced job persistence tests by mocking database interactions and ensuring correct job state updates. - Updated understanding workflow tests to utilize a context manager for mock language model. - Adjusted API tests to reflect changes in job status handling. - Improved unit tests for HITL signature consistency and dependency version checks.

…ripts

…fety - Updated signature templates to use `list[str]` and `str | None` for type hints. - Enhanced `compile_module` function formatting for better readability. - Cleaned up whitespace and comments in various files for consistency. - Improved docstrings across metrics and signatures for better documentation. - Added missing newlines in JSON and YAML files for proper formatting. - Adjusted training data files to ensure they end with a newline.

…stency

- Deleted patch_endpoint_example.py, test_patch_endpoint.py, visual_guide.py, and README.md from the 02-partial-updates example directory. - Removed integration.md, metadata.json, quick-reference.md, requirements.json, troubleshooting.md, and test JSON files from the fastapi-production resources and tests. - This cleanup removes outdated examples and tests, streamlining the project structure for better maintainability.

…SON and markdown files

…tion.sh and test_repository.py

…g guide, skill template update, TUI readiness, and update summary documentation

…ementation strategy, and taxonomy system to streamline project structure and focus on current methodologies.

…process

…creation and validation sections

…base

…inset_v4.json

…ironment management

- Deleted useHitlConfig hook and its associated types and logic. - Removed useHitl hook and its related functionality for managing Human-in-the-Loop interactions. - Eliminated TUI entry point and streaming client for handling Server-Sent Events. - Cleaned up utility functions related to HITL keywords and state persistence. - Removed test script and TypeScript configuration for the TUI.

…used TUI spawner and streaming chat functionality

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Copilot

Pull request overview

Copilot reviewed 93 out of 428 changed files in this pull request and generated 2 comments.

Files not reviewed (1)

cli/tui/package-lock.json: Language not supported

.skills/dspy-basics/references/programs.md

.skills/dspy-advanced/references/output-refinement.md

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

…hrough an exception Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

…nto structure

…est practices DSPy 3.1.2+ Compliance: - Remove redundant forward() methods from 10 modules (150 lines) - BaseModule now handles automatic sync/async bridging - Modules: intent, requirements, plan, compliance, best_of_n_validator, content, refined_content, test_cases, parallel_analysis, conversational FastAPI Improvements: - Add responses parameter to 7 endpoints for better OpenAPI docs - Replace dict[str, Any] with Pydantic models (SkillListItem, UpdateSkillResponse) - Enhanced error documentation Typer CLI Enhancements: - Add callback validation for port (1-65535) and host parameters - Follow Context7 best practices for CLI validation Rich UI Improvements: - Use status context for long-running operations (database init, job creation) - Better UX with progress indicators in chat, terminal, and serve commands Test Updates: - Update 4 test files to work with BaseModule's async-first approach - Remove tests for removed forward() implementations - Add nest-asyncio dependency for test compatibility Code Quality: - All ruff checks pass - All type checks pass (ty check) - Security check pass (bandit) - 289 tests passing Net result: -70 lines of code, improved maintainability, full best practice compliance

src/skill_fleet/api/v1/skills.py

+        items = [i for i in items if q in i.name.lower()]
+    if status:
+        # Status isn't tracked in metadata today; keep placeholder behavior.
+        items = items


src/skill_fleet/core/modules/validation/compliance.py

src/skill_fleet/core/modules/understanding/intent.py

+    async def aforward(
+        self, task_description: str, requirements: dict | None = None
+    ) -> dspy.Prediction:


src/skill_fleet/core/modules/understanding/parallel_analysis.py

+    async def aforward(  # type: ignore[override]
+        self,
+        task_description: str,
+        requirements: dict[str, Any] | None = None,
+        taxonomy_structure: dict[str, Any] | None = None,
+        existing_skills: list[str] | None = None,
+        available_skills_catalog: dict[str, Any] | None = None,
+    ) -> dspy.Prediction:


…hrough an exception Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

…ing method' Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

src/skill_fleet/core/modules/validation/compliance.py

+    async def aforward(
        self, current_content: str, weaknesses: list[str], target_score: float = 0.8
-    ) -> dict[str, Any]:
+    ) -> dspy.Prediction:


src/skill_fleet/core/modules/understanding/plan.py

+    async def aforward(
        self,
        requirements: dict,
        intent_analysis: dict,
        taxonomy_analysis: dict,
        dependency_analysis: dict,
        user_confirmation: str = "",
-    ) -> dict[str, Any]:
+    ) -> dspy.Prediction:


…guments

Copilot

Pull request overview

Copilot reviewed 95 out of 431 changed files in this pull request and generated 5 comments.

Files not reviewed (1)

cli/tui/package-lock.json: Language not supported

Copilot · 2026-02-02T18:53:04Z

CLAUDE.md

+# OR directly with uvicorn
+uvicorn skill_fleet.api.main:app --reload


The example command references skill_fleet.api.main:app, but according to .github/copilot-instructions.md (line 10), the correct path is skill_fleet.api.main:app. While this is consistent, the description in copilot-instructions mentions the command should be uv run skill-fleet serve which uses uvicorn skill_fleet.api.main:app internally. Consider clarifying that this is the direct uvicorn command equivalent to the CLI command.

Suggested change

# OR directly with uvicorn

uvicorn skill_fleet.api.main:app --reload

# OR directly with uvicorn (equivalent to `uv run skill-fleet serve`)

uvicorn skill_fleet.app.main:app --reload

Copilot · 2026-02-02T18:53:04Z

.github/copilot-instructions.md

 ## Critical runtime configuration

- **LLM config** is loaded by `configure_dspy()` from `config/config.yaml` (`src/skill_fleet/llm/dspy_config.py`).
+- **LLM config** is loaded by `configure_dspy()` from `src/skill_fleet/config/config.yaml` (`src/skill_fleet/dspy/config.py`).


The path src/skill_fleet/config/config.yaml appears incorrect. Based on the file structure, the config file should be at config/config.yaml (project root), not under src/skill_fleet/config/. Update to config/config.yaml for accuracy.

Suggested change

- **LLM config** is loaded by `configure_dspy()` from `src/skill_fleet/config/config.yaml` (`src/skill_fleet/dspy/config.py`).

- **LLM config** is loaded by `configure_dspy()` from `config/config.yaml` (`src/skill_fleet/llm/dspy_config.py`).

.env.example

.github/workflows/claude-autofix.yml

Copilot · 2026-02-02T18:53:05Z

.github/workflows/claude-autofix.yml

+            gh pr comment "$PR_NUMBER" \
+              --body "🤖 **Claude Auto-Fix Workflow**


The PR comment body uses markdown but doesn't properly escape backticks around the branch name on line 228. The variable $BRANCH should be escaped to prevent issues if the branch name contains special characters. Use \$BRANCH`instead of`$BRANCH``.

…ing method' Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

src/skill_fleet/core/modules/generation/refined_content.py

+    async def aforward(  # type: ignore[override]
+        self,
+        plan: dict[str, Any],
+        understanding: dict[str, Any],
+        skill_style: str = "comprehensive",
+        target_quality: float = 0.80,
+        max_iterations: int = 3,
+    ) -> dspy.Prediction:


…ing method' Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

src/skill_fleet/core/modules/understanding/requirements.py

+    async def aforward(
+        self, task_description: str, user_context: dict | None = None
+    ) -> dspy.Prediction:


Zochory added 30 commits January 30, 2026 03:51

Remove deprecated Neon DB templates and scripts for serverless and to…

2049b2e

…olkit functionalities

Remove vibe-coding skill documentation

45752df

feat: update .gitignore to refine skill directories and remove unnece…

52c63a6

…ssary files

feat: update .gitignore to include additional directories and files f…

c17e6ac

…or skills and documentation

feat: update .gitignore to include internal documentation and archive…

7d6d5ca

… directories

feat: streamline pre-commit configuration by removing obsolete hooks …

120520a

…and refining existing ones

chore: remove redundant whitespace in core modules documentation

7efc102

chore: remove obsolete SKILL.md files for FastAPI Stack Development a…

fd83b46

…nd Claude Prompt Refiner

fix: correct duplicate entry for README.md in archive documentation

6eda0eb

chore: update .gitignore to include additional skill templates and sc…

5749685

…ripts

chore: clean up formatting in QUICKSTART.md and executor.ts for consi…

312b0ac

…stency

chore: format SQL migration files for consistency and readability

ca64c89

chore: remove trailing whitespace and ensure newline consistency in J…

a7b3248

…SON and markdown files

chore: clean up whitespace and reorder imports in setup_branch_protec…

eb5980f

…tion.sh and test_repository.py

delete: Remove historical notes for interactive serve changes, testin…

41c99e0

…g guide, skill template update, TUI readiness, and update summary documentation

Delete archived documents for skills creation workflow, taxonomy impl…

28b4004

…ementation strategy, and taxonomy system to streamline project structure and focus on current methodologies.

delete: Remove legacy cleanup utility script to streamline migration …

abbeb1e

…process

refactor: Revise README for clarity and conciseness, enhancing skill …

ad94020

…creation and validation sections

delete: Remove centralized DSPy configuration module to simplify code…

c9037b0

…base

Remove outdated training dataset for DSPy and related skills from tra…

9d6a46c

…inset_v4.json

fix: Update .gitignore to include .uv_cache and .cache for better env…

bfead12

…ironment management

Remove obsolete DSPy tracking setup and testing scripts; eliminate un…

ce9ae28

…used TUI spawner and streaming chat functionality

Update .env.example

bdce086

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Zochory requested a review from Copilot February 2, 2026 14:50

Zochory assigned Zochory and Copilot Feb 2, 2026

Copilot AI reviewed Feb 2, 2026

View reviewed changes

.skills/dspy-basics/references/programs.md Outdated Show resolved Hide resolved

.skills/dspy-advanced/references/output-refinement.md Outdated Show resolved Hide resolved

Zochory added this to the v0.3.5 milestone Feb 2, 2026

Zochory and others added 8 commits February 2, 2026 15:57

Potential fix for code scanning alert no. 142: Code injection

a981b7d

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Update .skills/dspy-advanced/references/output-refinement.md

f20cda0

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Update .skills/dspy-basics/references/programs.md

80ae5fc

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Potential fix for code scanning alert no. 127: Information exposure t…

1dd4cd1

…hrough an exception Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Potential fix for code scanning alert no. 143: Code injection

2436853

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

fix(workflows): Enhance environment variable handling for retry logic

88a01c4

Merge branch 'structure' of https://github.com/Qredence/skill-fleet i…

38cd8da

…nto structure

github-code-quality bot found potential problems Feb 2, 2026

View reviewed changes

Zochory and others added 4 commits February 2, 2026 18:44

Potential fix for code scanning alert no. 128: Information exposure t…

0fea57f

…hrough an exception Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Potential fix for code scanning alert no. 130: Log Injection

20947a6

Co-authored-by: Copilot Autofix powered by AI <62310815+github-advanced-security[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Merge branch 'main' into structure

0a4ea4d

Potential fix for pull request finding 'Signature mismatch in overrid…

f779972

…ing method' Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

github-code-quality bot found potential problems Feb 2, 2026

View reviewed changes

Zochory added 2 commits February 2, 2026 19:31

Remove Junie workflow and add CLAUDE.md for project guidance

12dfa3d

Enhance Claude Code Review workflow with environment variables and ar…

6f40715

…guments

Zochory requested a review from Copilot February 2, 2026 18:52

Copilot AI reviewed Feb 2, 2026

View reviewed changes

Zochory and others added 3 commits February 2, 2026 19:56

Potential fix for pull request finding 'Signature mismatch in overrid…

a768ffb

…ing method' Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Update .github/workflows/claude-autofix.yml

3158a5e

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

Update .env.example

0b614e1

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

github-code-quality bot found potential problems Feb 2, 2026

View reviewed changes

Potential fix for pull request finding 'Signature mismatch in overrid…

e7a3c7f

…ing method' Co-authored-by: Copilot Autofix powered by AI <223894421+github-code-quality[bot]@users.noreply.github.com> Signed-off-by: Zachary BENSALEM <zachary@qredence.ai>

github-code-quality bot found potential problems Feb 2, 2026

View reviewed changes

src/skill_fleet/core/modules/understanding/requirements.py

Comment on lines +136 to +138

async def aforward(

self, task_description: str, user_context: dict | None = None

) -> dspy.Prediction:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor: introduce type safety and centralized configuration #61

refactor: introduce type safety and centralized configuration #61

Zochory commented Feb 2, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 2, 2026

Uh oh!

Copilot AI Feb 2, 2026

Uh oh!

Uh oh!

Uh oh!

Copilot AI Feb 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

		# OR directly with uvicorn
		uvicorn skill_fleet.api.main:app --reload

	- LLM config is loaded by `configure_dspy()` from `src/skill_fleet/config/config.yaml` (`src/skill_fleet/dspy/config.py`).
	- LLM config is loaded by `configure_dspy()` from `config/config.yaml` (`src/skill_fleet/llm/dspy_config.py`).

		gh pr comment "$PR_NUMBER" \
		--body "🤖 Claude Auto-Fix Workflow

Uh oh!

refactor: introduce type safety and centralized configuration #61

Are you sure you want to change the base?

refactor: introduce type safety and centralized configuration #61

Conversation

Zochory commented Feb 2, 2026

Summary

Type Safety & Contracts

Centralized Configuration Management

Enhanced Utilities

Code Quality Improvements

Documentation Updates

Key Features

Files Changed

Quality Assurance

Statistics

Related Issues

Testing

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Copilot AI Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants