📊 Agentic Workflow Lock File Statistics - November 30, 2025 #5107
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 3 days ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Agentic Workflow Lock File Statistics - November 30, 2025
This report provides comprehensive statistical analysis of all 99 agentic workflow lock files (
.lock.yml) in the githubnext/gh-aw repository, revealing usage patterns, structural characteristics, and interesting insights about how agentic workflows are configured and deployed.Executive Summary
workflow_dispatch(73 workflows, 73.7%)create-discussion(36 workflows)Key Finding: The vast majority of workflows (94.9%) are substantial in size (>100KB), indicating complex, multi-step agentic processes. Nearly three-quarters of workflows support manual triggering via
workflow_dispatch, showing emphasis on on-demand execution alongside scheduled automation.Full Report Details
File Size Distribution
Statistics:
arxiv.lock.yml(80.2 KB)poem-bot.lock.yml(491.9 KB)Insight: All workflows exceed 50KB, reflecting the substantial complexity and comprehensive instructions embedded in agentic workflows. The lack of small workflows suggests a baseline complexity floor for useful agentic automation.
Trigger Analysis
Most Popular Triggers
workflow_dispatchschedulecommandreactionpull_requestskip-if-matchpushworkflow_runstop-afterissuesworkflow_callCommon Trigger Combinations
The most prevalent trigger patterns reveal design philosophies:
schedule + workflow_dispatchworkflow_dispatch(only)command(only)schedule + skip-if-match + workflow_dispatchpull_request + reaction + schedule + workflow_dispatchPattern Analysis:
Insight: The dominance of
schedule + workflow_dispatchshows teams want regular automated execution with manual override capability—a best practice for production agentic workflows.Schedule Patterns
From 58 workflows with
scheduletriggers, we identified 121 total cron schedules (some workflows have multiple schedules):0 9 * * *0 14 * * 1-50 0,6,12,18 * * *0 9 * * 10 8 * * *Schedule Distribution:
Insight: Morning UTC hours (8-9 AM) are most popular, suggesting workflows designed to have results ready for European/early US business hours.
Safe Outputs Analysis
Safe outputs enable agents to create GitHub content (discussions, issues, PRs, comments) safely.
Safe Output Types Distribution
create-discussionadd-commentcreate-issuecreate-pull-requestTotal: 87 safe output configurations across 71 workflows (71.7% of all workflows)
Insight:
create-discussionis the most popular safe output, indicating preference for persistent, threaded conversations over transient or scattered comments.Safe Output Combinations
Most workflows use a single safe output type, but some combine multiple:
create-discussion(only)create-issue(only)add-comment(only)create-pull-request(only)add-comment + create-pull-requestcreate-discussion + create-pull-requestadd-comment + create-issueNotable:
craft.lock.ymluses 6 different safe output configurations, making it the most versatile workflow.Discussion Categories
For workflows using
create-discussion, the target categories are:auditsGeneraldevartifactssecurityresearchreportsdaily-newsannouncementsInsight: The "audits" category is the most popular destination, showing heavy usage of agentic workflows for analysis, reporting, and quality monitoring.
Structural Characteristics
Job Complexity
poem-bot.lock.yml)Top 5 Most Complex Workflows (by steps):
poem-bot.lock.yml- 129 steps, 15 jobscloclo.lock.yml- 102 steps, 14 jobsunbloat-docs.lock.yml- 96 steps, 10 jobstechnical-doc-writer.lock.yml- 96 steps, 10 jobsq.lock.yml- 90 steps, 13 jobsInsight: The average workflow contains ~7 jobs with ~10 steps each. The most complex workflows (100+ steps) typically involve documentation generation, code analysis, or creative content creation requiring multiple agent interactions.
Average Lock File Structure
Based on statistical analysis, a typical
.lock.ymlfile has:schedule + workflow_dispatch)Timeout Patterns
Average Timeout: 14 minutes
Insight: Most workflows use conservative 10-15 minute timeouts, balancing execution time against cost and responsiveness. Longer timeouts (30-45 min) are used for complex analysis or documentation workflows.
Tool & MCP Patterns
MCP Server Usage
Workflows Using MCP Servers: 97 out of 99 (98.0%)
githubplaywrightdeepwikiarxivInsight: The GitHub MCP server is virtually universal (98%), reflecting that agentic workflows are deeply integrated with GitHub's platform. Specialized MCP servers like Playwright, DeepWiki, and ArXiv appear in specific use cases requiring web automation or research capabilities.
Engine Distribution
claudecloclo,commit-changes-analyzer,daily-multi-device-docs-testercopilotglossary-maintainer,poem-bot,smoke-copilot,smoke-copilot-no-firewallcodexchangeset,daily-factNote: Only 11 workflows explicitly specify an engine in frontmatter. Most (88) use the system default, likely Claude-based given the gh-aw implementation.
Interesting Findings
1. The "Poem Bot" is the Most Complex Workflow
poem-bot.lock.ymltops all complexity metrics:This creative content generation workflow demonstrates that artistic/creative tasks can be as complex as technical ones.
2. Documentation Workflows Dominate Size Rankings
Top 5 largest workflows:
poem-bot.lock.yml(491.9 KB)cloclo.lock.yml(390.8 KB)q.lock.yml(382.0 KB)pr-nitpick-reviewer.lock.yml(379.8 KB)copilot-session-insights.lock.yml(378.6 KB)Many involve documentation, code review, or analysis—tasks requiring extensive context and instructions.
3. Morning UTC is Prime Time
The
0 9 * * *schedule (9 AM UTC) is used by 12 workflows—more than any other single time. This suggests optimization for results availability during European/US morning hours.4. Weekday Workflows Outnumber 24/7 Ones
34 workflows use weekday-only schedules (
* * * * 1-5), showing preference for business-hours-aligned automation over continuous monitoring.5. Multi-Output Versatility
craft.lock.ymluses 6 safe output configurations, making it the Swiss Army knife of workflows—able to create discussions, issues, PRs, and comments depending on context.6. "Command" Triggers Enable CLI Integration
13 workflows use the custom
commandtrigger, enabling CLI-driven agentic automation viagh awcommands—a unique gh-aw feature for developer-friendly invocation.7. Reaction Triggers for Interactive Workflows
10 workflows use
reactiontriggers, allowing users to trigger agent actions via emoji reactions—a creative, low-friction interaction pattern.8. Minimal Engine Specification
Only 11% of workflows explicitly specify an engine. This suggests strong defaults and trust in the platform's engine selection.
9. No Workflows Under 50 KB
Every single workflow exceeds 50 KB, with 95% exceeding 100 KB. This indicates a baseline complexity threshold—agentic workflows are inherently substantial.
10. Test Workflows Are Isolated
7 workflows live in
tests/subdirectories, showing good organizational hygiene for test vs. production workflows.Recommendations
Based on this analysis, here are recommendations for agentic workflow development:
1. Use the "Golden Pattern" for Triggers
Combine
schedule + workflow_dispatch:This provides automated execution with manual override—the most flexible approach used by 47 workflows.
2. Target 10-15 Minute Timeouts
Most workflows succeed within 10-15 minutes. Use this as your initial timeout, adjusting upward only if needed:
3. Prefer
create-discussionfor Persistent OutputWith 36 uses,
create-discussionis the most popular safe output. Discussions provide:4. Include the GitHub MCP Server
98% of workflows use it. Unless you have specific reasons not to, include:
5. Weekday Schedules for Team-Focused Workflows
If your workflow generates content for human review, use weekday-only schedules:
6. Start Simple, Scale Complexity
Average workflows have ~7 jobs and ~70 steps. If you're exceeding 100 steps, consider whether you're trying to do too much in one workflow.
7. Use the "audits" Category for Analysis Workflows
With 9 workflows targeting it, "audits" has become the de facto standard for analysis and monitoring workflows.
Methodology
/tmp/gh-aw/cache-memory/.lock.ymlfiles in.github/workflows/and subdirectoriescache-memory/history/2025-11-30.jsonAnalysis Scripts (available in cache memory):
analyze_lockfiles.sh- Main data extractionextract_frontmatter.sh- Frontmatter parsingtrigger_combinations.sh- Trigger pattern analysissafe_outputs_analysis.sh- Safe outputs deep-diveinteresting_findings.sh- Insight generationgenerate_stats.sh- Statistical summariesHistorical Context
This is the first comprehensive statistical analysis of gh-aw lock files. Future runs will enable trend analysis:
Baseline Established: 2025-11-30
Conclusion
The agentic workflow ecosystem in githubnext/gh-aw is mature, sophisticated, and well-structured. With 99 workflows averaging 286 KB and 70 steps each, these are substantial automation systems. The heavy use of safe outputs (71.7%), universal GitHub MCP integration (98%), and flexible trigger patterns (69.7% multi-trigger) demonstrate thoughtful design focused on controlled, observable automation.
The dominance of the
schedule + workflow_dispatchpattern and the preference forcreate-discussionoutputs reveal a community that values both automated execution and human oversight, with persistent, categorized discussion threads as the primary output mechanism.As agentic workflows continue to evolve, this baseline analysis provides a foundation for tracking trends and optimizing practices.
📊 Generated by Lockfile Statistics Analysis Agent on 2025-11-30 03:35 UTC
Analysis scripts available in
/tmp/gh-aw/cache-memory/scripts/for reuseBeta Was this translation helpful? Give feedback.
All reactions