📊 Agentic Workflow Lock File Statistics - November 2025 #3024
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 1 week ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Agentic Workflow Lock File Statistics - 2025-11-03
This comprehensive analysis examines 66 agentic workflow lock files (
.lock.yml) in thegithubnext/gh-awrepository to identify structural patterns, usage trends, and characteristics of agentic workflows at scale.Executive Summary
Dataset Overview:
Key Findings:
workflow_dispatchFile Size Distribution
File Size Distribution
The majority of lock files fall in the 200-300 KB range, indicating consistent workflow complexity across the repository.
Size Statistics:
example-permissions-warning.lock.yml(82.3 KB)poem-bot.lock.yml(371.7 KB)The tight clustering around 200 KB suggests standardized workflow patterns and consistent instruction complexity.
Trigger Analysis
Trigger Analysis
Most Popular Triggers
Workflows in this repository use a diverse set of triggers, with strong preference for manual invocation combined with scheduled automation.
workflow_dispatchschedulepull_requestissue_commentissuesworkflow_rundiscussion_commentdiscussionpull_request_review_commentpushKey Insights:
Common Trigger Combinations
schedule+workflow_dispatchworkflow_dispatchonlypull_request+schedule+workflow_dispatchworkflow_runissue_commentissuesThe dominance of
schedule+workflow_dispatch(40.9%) shows a preference for workflows that run periodically but can also be invoked on-demand.Schedule Patterns
Most Common Cron Schedules:
0 0,6,12,18 * * *0 9 * * *0 6 * * 00 2 * * 1-50 0 * * *Schedule Frequency Distribution:
Observation: The 4x daily schedule (
0 0,6,12,18 * * *) is popular for monitoring and reporting workflows that need regular but not continuous updates.Safe Outputs Analysis
Safe Outputs Analysis
Safe outputs are the approved mechanisms for agentic workflows to communicate results back to GitHub.
Safe Output Types Distribution
create-discussioncreate-issueadd-commentcreate-pull-requestupdate-issueTotal workflows with safe outputs: 68 job instances across 66 workflows (some use multiple)
Key Findings:
poem-bot.lock.ymluses 5 different types)Workflows with Multiple Safe Outputs
poem-bot.lock.ymlcraft.lock.ymltechnical-doc-writer.lock.ymlunbloat-docs.lock.ymlq.lock.ymlsmoke-detector.lock.ymlci-doctor.lock.ymlInsight: Workflows that interact with PRs or issues often need multiple output types to provide comprehensive feedback (e.g., create issue + add comment to explain).
Structural Characteristics
Structural Characteristics
Job Complexity
poem-bot.lock.ymlJob Count Distribution:
Steps per Job
Steps Distribution:
Average Lock File Structure
Based on median values, a typical agentic workflow in this repository has:
schedule+workflow_dispatchgh-aw-${{ github.workflow }}Permission Patterns
Permission Patterns
Most Common Permissions
contentspull-requestsissuesactionsdiscussionssecurity-eventsrepository-projectsattestationschecksdeploymentsmodelspackagespagesstatusesTotal Permission Grants: 233 read permissions, 0 explicit write permissions in analyzed data
Key Observations:
contents: readto access repository codeissuesandpull-requestspermissions for safe outputsactions: readto inspect workflow runs and artifactsPermission Distribution
Based on the number of permissions requested per workflow:
Security Posture: The repository follows the principle of least privilege, with workflows requesting only the permissions they need. Write operations are funneled through safe output mechanisms rather than broad write permissions.
Timeout Configuration
Timeout Configuration
Timeout Distribution
Timeout Statistics:
Insights:
Tool & MCP Patterns
Tool & MCP Patterns
MCP (Model Context Protocol) Server Usage
MCP Tool Invocations Detected:
githubplaywrightdeepwikiarxivUniversal GitHub MCP Usage: Every single workflow (100%) uses the GitHub MCP server, demonstrating its critical role in agentic workflows. With 1,888 tool invocations across 66 workflows, that's an average of ~28.6 GitHub MCP tool calls per workflow.
Specialized MCP Servers:
Common Tool Patterns
Based on the GitHub MCP server dominance and file analysis:
Core Tool Categories:
Observation: The GitHub MCP server provides the essential toolkit for agentic workflows, handling ~97% of all MCP interactions.
Concurrency Patterns
Concurrency Patterns
Concurrency groups prevent multiple instances of the same workflow from running simultaneously.
Concurrency Group Distribution
gh-aw-${{ github.workflow }}gh-aw-${{ github.workflow }}-${{ github.event.issue.number || github.event.pull_request.number }}gh-aw-${{ github.workflow }}-${{ github.event.pull_request.number || github.ref }}dev-workflow-${{ github.ref }}Concurrency Strategies:
Insight: The dominant pattern (workflow-level concurrency) prevents race conditions and resource conflicts for scheduled/on-demand workflows.
Interesting Findings
Interesting Findings
1. Standardized Workflow Architecture
Analysis reveals a highly consistent workflow structure:
daily-*,smoke-*,test-*)This standardization suggests the repository uses a template or framework for generating workflows.
2. Safe Output Preferences
Create-discussion is the clear winner (36% of outputs), likely because:
3. Multi-Purpose Workflows
Some workflows support extensive trigger combinations:
4. The "4x Daily" Pattern
Five workflows use the
0 0,6,12,18 * * *schedule (00:00, 06:00, 12:00, 18:00 UTC). This appears to be a standard for:5. Conservative Timeout Philosophy
With 85% of workflows timing out at ≤20 minutes:
6. Read-Only Permission Model
Despite workflows creating issues, PRs, and discussions, the base permissions are read-only. Write operations go through:
github-scriptor direct API calls with job tokensThis architecture provides security through forced code review of write operations.
7. Test/Firewall Workflows
8 workflows (12%) are minimal "test" or "firewall" workflows with only 2 jobs and <100 KB size:
test-post-steps.lock.ymltest-svelte.lock.ymltest-secret-masking.lock.ymltest-jqschema.lock.ymltest-manual-approval.lock.ymldev.firewall.lock.ymlfirewall.lock.ymlexample-permissions-warning.lock.ymlThese likely serve as integration tests or safety checks for the agentic workflow system.
Recommendations
Recommendations
Based on this statistical analysis, here are recommendations for optimizing and evolving agentic workflows:
1. Standardize on Common Patterns
Finding: 71% of workflows are 200-300 KB with 6 jobs and 9 steps/job.
Recommendation:
poem-bot.lock.ymlwith 16 jobs)2. Optimize Timeout Configurations
Finding: 42% use 10-minute timeouts, but average is 14.5 minutes.
Recommendation:
3. Expand Safe Output Usage
Finding: Create-discussion is most popular (36%), but some workflows have no safe outputs (10%).
Recommendation:
4. Leverage Multi-Trigger Patterns
Finding: Only 5 workflows use
pull_request+schedule+workflow_dispatch.Recommendation:
issue_commenttriggers for interactive "command" workflows5. Investigate Outliers
High-complexity workflows:
poem-bot.lock.yml(16 jobs, 371 KB, 5 safe output types)q.lock.yml(14 jobs, 308 KB)scout.lock.yml(14 jobs)Recommendation: Review these for potential optimization or splitting into multiple workflows.
6. Expand MCP Server Usage
Finding: GitHub MCP is universal, but specialized servers (playwright, deepwiki, arxiv) are rare.
Recommendation:
7. Schedule Optimization
Finding: 5 workflows run 4x daily, but most run 1x daily.
Recommendation:
push,workflow_run) instead of frequent schedulesMethodology
Methodology
Analysis Approach
This statistical analysis was performed using automated scripts stored in
/tmp/gh-aw/cache-memory/scripts/:Data Collection:
findandglobto locate all.lock.ymlfiles in.github/workflows/yaml.safe_load()du -bStatistical Analysis:
statisticsmodulecollections.CounterPattern Detection:
mcp__*__)Data Validation:
Data Sources
.lock.ymlfiles in.github/workflows/githubnext/gh-awLimitations
Scripts Used
All analysis scripts are stored in
/tmp/gh-aw/cache-memory/scripts/:analyze_lockfiles.py: Main analysis script for file sizes, jobs, steps, permissionsfile_sizes.sh: Shell script for file size collectionGenerated by Lockfile Statistics Analysis Agent
Date: 2025-11-03
Repository: githubnext/gh-aw
Analysis Version: 1.0
Total Lock Files: 66
Total Size Analyzed: 13.36 MB
Beta Was this translation helpful? Give feedback.
All reactions