📊 Lockfile Statistics Analysis - November 6, 2025 #3305
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 1 week ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Agentic Workflow Lock File Statistics - November 6, 2025
Executive Summary
This comprehensive analysis examines 71 lock files across the gh-aw repository, revealing patterns in workflow design, trigger usage, safe outputs, and structural characteristics. The analysis shows that workflows in this repository are predominantly manual (workflow_dispatch: 82%) and scheduled (schedule: 51%), with an average file size of 207 KB and 5.4 jobs per workflow. The GitHub MCP server dominates tool usage, and the "audits" discussion category is the most popular destination for workflow outputs.
Key Highlights:
Full Report Details
File Size Distribution
The lock files in this repository are generally substantial in size, reflecting comprehensive workflow definitions with detailed agent instructions, MCP configurations, and multi-job orchestration.
Statistics:
opencode.lock.yml(22 KB) - A shared configuration filepoem-bot.lock.yml(381 KB) - Feature-rich creative bot with extensive promptsSize Distribution Analysis
The predominance of large files (>200 KB) indicates that most workflows include:
Trigger Analysis
Most Popular Triggers
Workflows in this repository utilize a diverse set of triggers, with manual and scheduled triggers dominating:
Insights
workflow_dispatch, enabling developers to run analysis and agents on-demand/archie,/scoutCommon Trigger Combinations
Based on the data, typical combinations include:
Schedule Patterns
The repository uses various cron schedules for periodic execution:
0 9 * * *0 0,6,12,18 * * *0/10 * * * *0 6 * * 00 2 * * 1-50 15 * * 1Typical Patterns:
Safe Outputs Analysis
Agentic workflows use "safe outputs" to communicate results securely without direct repository write access. This analysis shows diverse output strategies across the repository.
Safe Output Types Distribution
Discussion Categories
For workflows using
create-discussion, the most popular categories are:Note: "audits" and "Audits" appear to be the same category with case inconsistency.
Safe Output Strategy Insights
Example Workflows by Output Type
lockfile-stats,audit-workflows,daily-firewall-report,github-mcp-tools-reportdaily-doc-updater,security-fix-pr,unbloat-docs,semantic-function-refactorarchie,scout,dev-hawk,ci-doctorissue-classifier,smoke-detector,duplicate-code-detectorStructural Characteristics
Job Complexity
Workflows in this repository exhibit moderate to high complexity with multi-job orchestration:
Typical Lock File Structure
Based on statistical analysis, a typical
.lock.ymlfile has:Job Orchestration Patterns
Workflows typically follow these job patterns:
Complexity Distribution
Most workflows (>70%) are moderately complex with:
Permission Patterns
Agentic workflows follow a least-privilege model with specific permissions per job.
Most Common Permissions
Permission Distribution Analysis
Typical Permission Patterns
Interactive Agents (archie, scout):
contents: read,issues: read,pull-requests: readissues: write,pull-requests: write,discussions: writePR Creation Workflows:
contents: readcontents: write,pull-requests: writeAnalysis/Audit Workflows:
contents: read,actions: readdiscussions: writeTool & MCP Patterns
MCP Server Usage
The GitHub MCP server is ubiquitous across workflows, providing GitHub API interaction capabilities.
Most Common GitHub MCP Tools:
mcp__github__search_*(code, issues, PRs, users, repos)mcp__github__pull_request_readmcp__github__list_*(workflows, commits, branches, releases, issues, PRs)mcp__github__issue_readmcp__github__get_*(commit, file_contents, me, pull_request, etc.)Tool Configuration Patterns
Based on the lock file structure, typical tool configurations include:
MCP Server Insights
shared/mcp/serena.md)Interesting Findings
1. Workflow_Dispatch Ubiquity
82% of workflows support manual triggering, suggesting a developer-first approach where engineers want control over when agents run. This contrasts with traditional CI/CD where automatic triggers dominate.
2. Large File Sizes Indicate Rich Prompts
The average 207 KB file size is substantial for YAML. This suggests workflows embed:
3. "Audits" Discussion Category Dominance
The "audits" category is used by 14+ workflows, making it the primary destination for automated reports. This suggests the repository uses Discussions as a centralized reporting dashboard.
4. Multi-Output Workflows Are Rare
Most workflows use a single safe output mechanism. Only 2 workflows (
lockfile-stats,poem-bot) useupdate-issue, suggesting workflows prefer creating new content over updating existing content.5. Comment-Activated Agents
17% of workflows respond to comments, enabling conversational UX where developers invoke agents via slash commands (
/archie,/scout) similar to Slack bots.6. High-Frequency Monitoring
3 workflows run every 10 minutes (
0/10 * * * *), indicating real-time monitoring needs for critical infrastructure or firewall protection.7. Job Naming Conventions
Common job names follow a pattern:
pre_activation,activation,agent,detection,add_comment,create_discussion,missing_tool,update_reaction. This suggests a standardized workflow template.8. Shared MCP Configurations
Multiple workflows import from
shared/mcp/serena.md, indicating a reusable MCP configuration strategy that reduces duplication.9. Minimal Push Triggers
Only 7% use
pushtriggers, suggesting workflows are designed for analysis and reporting rather than traditional CI/CD build/test automation.10. Concurrency Control
Most workflows use concurrency groups like
gh-aw-${{ github.workflow }}-${{ github.event.issue.number }}to prevent multiple simultaneous runs on the same issue/PR, ensuring sequential processing.Recommendations
Based on this analysis, here are recommendations for workflow authors and repository maintainers:
1. Standardize Discussion Categories
Consolidate "audits" and "Audits" to a single category (lowercase recommended) to avoid fragmentation.
2. Consider File Size Optimization
At 207 KB average, lock files are substantial. Consider:
3. Document Trigger Combinations
Create a guide explaining when to use:
workflow_dispatchonly (manual tasks)scheduleonly (periodic reports)workflow_dispatch + schedule(flexible automation)issues + issue_comment(interactive agents)4. Expand MCP Server Usage
While GitHub MCP dominates, consider integrating additional MCP servers for:
5. Template Workflows
Given the consistent job naming patterns, create template workflows with:
6. Monitor Large Workflows
poem-bot.lock.yml(381 KB) is notably large. Review for:7. Enhance Update-Issue Pattern
Only 2 workflows use
update-issue. This could be useful for:8. Security Permissions Review
Only 4 workflows access security-events. Consider if more workflows should integrate security scanning for:
Methodology
Analysis Tools & Approach
Data Sources
.lock.ymlfiles from.github/workflows/and subdirectoriesgh-aw compilefrom corresponding.mdfilesCache Memory
Analysis scripts and historical data stored in
/tmp/gh-aw/cache-memory/:scripts/: Reusable analysis scriptshistory/2025-11-06-analysis.json: Aggregated statisticspatterns/: Extracted patterns for future analysisLock Files Analyzed
All 71 lock files in the repository, including:
Generated by Lockfile Statistics Analysis Agent
Analysis Date: November 6, 2025
Total Files Analyzed: 71
Repository: githubnext/gh-aw
Beta Was this translation helpful? Give feedback.
All reactions