[Pelis Agent Factory Advisor] Agentic Workflow Advisory Report — March 2026 #1320
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it expired on 2026-03-23T03:42:37.909Z.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Executive Summary
The
gh-aw-firewallrepository has a mature and security-focused agentic workflow portfolio of 21 workflows — well above average for its size — with strong coverage of security scanning, smoke testing, CI fault investigation, and documentation maintenance. The primary gaps are in meta-monitoring (no workflow health manager despite multiple recent workflow failures), issue triage (issues are auto-assigned to Copilot but never labeled/categorized first), and code quality automation (no simplifier or schema drift checker for this structurally complex codebase).🎓 Patterns Learned from Pelis Agent Factory
Key Patterns from the Documentation Site
From crawling the Pelis Agent Factory and its 19-part series:
From the
githubnext/agenticsRepositoryThe agentics repo showcases:
daily-repo-goals: Goal-driven daily automation that tracks progress toward explicit targetsdaily-workflow-sync: Keeps workflow templates up to date from upstream sourcesimport-workflow: Slash command to import workflows from registrieslink-checker: Automated broken link detection across docsmaintainer: Generalist daily maintenance agentComparison to This Repo
📋 Current Agentic Workflow Inventory
build-testci-cd-gaps-assessmentci-doctorcli-flag-consistency-checkerdependency-security-monitordoc-maintainerissue-duplication-detectorissue-monsterpelis-agent-factory-advisorplansecret-digger-*security-guardsecurity-reviewsmoke-chrootsmoke-claudesmoke-codexsmoke-copilottest-coverage-improverupdate-release-notes🚀 Actionable Recommendations
P0 — Implement Immediately
🏥 Issue Triage Agent
What: Automatically label incoming issues with categories like
bug,feature,documentation,question,security,performancebased on content analysis.Why: The Issue Monster currently dispatches every open issue to Copilot agents, but without prior triage, agents work on unlabeled issues with no priority signal. Multiple recent failures (#1308, #1291, #1287, #1284, #1283, #1282) would benefit from the
buglabel applied immediately for faster triage. In Pelis Factory, triage is the foundational workflow everything else builds on.How: Classic triage pattern — trigger on
issues: opened, analyze title+body, apply label, comment with reasoning.Effort: Low (1-2 hours)
Example:
📊 Workflow Health Manager
What: A meta-monitoring workflow that inspects all 21+ agentic workflows, detects unhealthy patterns (repeated failures, no-ops, high cost, zero PRs), and creates targeted issues.
Why: The open issues list shows 6+ recent workflow failures (#1308, #1291, #1287, #1284, #1283, #1282) but there's no automated agent tracking these patterns over time. In Pelis Factory, the Workflow Health Manager created 40 issues, directly caused 14 PRs, and caught problems like missing runtime files and configuration drift. At 21 workflows and growing, this is now needed.
How: Use
agentic-workflowstool to get status + recent logs. Analyze failure patterns usingcache-memoryto track trends across runs. Create issues for workflows with 3+ consecutive failures or zero successful runs in 7 days.Effort: Medium (2-4 hours)
Example trigger:
P1 — Plan for Near-Term
🔍 Breaking Change Checker
What: On each PR, detect changes that could break backward compatibility: CLI flag removals/renames, API changes, Docker Compose schema changes, configuration file format changes.
Why: This is a security/firewall tool used in CI pipelines by other teams. Breaking changes to
awfCLI flags or Docker Compose structure could silently break downstream workflows. The codebase hassrc/types.ts(WrapperConfig),src/cli.ts(flag definitions), andcontainers/*/Dockerfile— all prime sources of breaking changes.How: Trigger on PRs, use bash to diff flag definitions and type signatures against main, create alert issues for backward-incompatible changes.
Effort: Medium
🎭 Static Analysis Report (zizmor/actionlint/poutine)
What: Daily workflow that runs security scanners (zizmor for GitHub Actions security, actionlint for syntax, poutine for supply chain) on all
.lock.ymlfiles and posts findings as a discussion.Why: This repo is a security tool — its own workflow security should be exemplary. The
security-reviewworkflow is comprehensive but narrative; a dedicated scanner workflow provides machine-readable, reproducible findings. Pelis Factory's Static Analysis Report created 57 discussions + 12 Zizmor security reports. Theagenticworkflows-compiletool already supports--zizmor,--poutine, and--actionlint.How: Daily schedule, run
gh aw compile --zizmor --poutine --actionlint, post structured findings.Effort: Low
⚡ CI Optimization Coach
What: Weekly analysis of CI workflow performance (build times, cache hit rates, flakiness, redundant steps) with PRs to optimize.
Why: CI Doctor fixes failures reactively; CI Coach improves proactively. With 4 smoke workflows running every 12 hours and multiple integration test suites, there are likely optimization opportunities. In Pelis Factory, CI Coach had a 100% merge rate on 9 PRs.
How: Analyze GitHub Actions timing data, identify slowest steps, propose caching improvements, parallelization, or test deduplication.
Effort: Medium
🏗️ Code Simplifier
What: Daily agent that reviews recently-changed code and proposes simplification PRs without changing functionality.
Why:
docker-manager.tshas only 18% test coverage and is the most complex file in the codebase (250 statements).cli.tshas 0% coverage. After rapid development, complexity accumulates. The Code Simplifier in Pelis Factory achieved 83% merge rate on PRs. For a security tool, simpler code = fewer audit surface area issues.How: Daily schedule, look at commits from last 3 days, identify TypeScript files with complexity issues, propose refactoring PRs.
Effort: Medium
P2 — Consider for Roadmap
🗂️ Schema Consistency Checker
What: Weekly check that
WrapperConfiginterface insrc/types.ts, CLI flag definitions insrc/cli.ts, documentation indocs/, and theaction.ymlschema all stay in sync.Why: Types drift is common in TypeScript codebases. The AGENTS.md already documents a complex config model; drift between implementation and docs could mislead contributors. Pelis Factory's Schema Checker created 55 analysis discussions.
Effort: Medium
📦 Changeset Auto-Generator
What: When a release is tagged, automatically generate a structured changeset (version bump type + categorized changelog) as a PR, complementing the existing
update-release-notesworkflow.Why:
update-release-notesenhances existing release notes but a full changeset workflow would propose the version bump (major/minor/patch) based on commit analysis. Pelis Factory's Changeset had 78% PR merge rate.Effort: Low-Medium
🔭 MCP Inspector
What: Weekly validation that all MCP servers configured in workflow files (GitHub, playwright, etc.) are reachable and exposing the expected tools.
Why: The smoke workflows use MCP servers (ghcr.io/github/gh-aw-mcpg); a silent MCP misconfiguration would fail all smoke tests. The
agenticworkflows-mcp-inspecttool makes this easy.Effort: Low
🔄 Mergefest (Auto-Merge Main into PRs)
What: Hourly workflow that merges the main branch into stale PR branches to prevent large merge conflicts.
Why: The repo has multiple long-running PRs (dependency updates, docs). Keeping them current reduces review friction. Pelis Factory called this an "orchestrator workflow."
Effort: Low
P3 — Future Ideas
📈 Agent Performance Metrics Collector
Track daily performance across all workflows: cost, turn counts, merge rates, no-op rates. Use
agentic-workflows.logs+cache-memoryto build trend data. Pelis Factory's Metrics Collector created 41 daily discussions.🌳 Issue Arborist
Automatically link related issues as sub-issues (e.g., link all CI failure issues under a parent tracking issue). Pelis created 18 parent issues and 77 reports.
🤝 PR Review Comment Responder
Slash command
/addressthat reads unresolved PR review comments and creates sub-issues or draft PRs to address them.🎪 Daily Multi-Device Docs Tester
Use Playwright to test the Astro/Starlight docs site (at
docs-site/) on mobile and desktop viewports. The docs site deploys to GitHub Pages — ensuring it renders correctly on all devices adds quality. Pelis achieved 100% merge rate on 2 PRs from this workflow.📈 Maturity Assessment
Current Overall Level: 3.5/5 — "Accomplished Practitioner"
Target Level: 4.5/5 — "Advanced Practitioner"
Gap to close: Add workflow health manager (P0), issue triage (P0), breaking change checker (P1), CI coach (P1), and static analysis report (P1) to reach level 4.5.
🔄 Comparison with Best Practices
What This Repository Does Exceptionally Well
secret-digger,cli-flag-consistency-checker, andsecurity-guardare all tailored specifically to this security tool's domain.What Could Improve
docker-manager.ts(18%) andcli.ts(0%) represent the most critical paths in a security tool. Weekly coverage improver may not be aggressive enough.--zizmor --poutine --actionlint— not using these in an automated daily report is a missed opportunity for a security-first repo.Unique Opportunities Given This Repository's Domain
secret-diggerand smoke workflows already test containment. A daily "Firewall Escape Report" discussion summarizing all escape attempts would be valuable public documentation of security posture.tests/fixtures/) to ensure only legitimate domains are being whitelisted.squidandagentimages.Generated by Pelis Agent Factory Advisor — March 16, 2026. Sources: Pelis Agent Factory docs, githubnext/agentics, repository workflow analysis.
Beta Was this translation helpful? Give feedback.
All reactions