π€ AI-powered developer experience platform demo for PingCAP - transforming 37K+ GitHub stars into actionable insights
Lovingly created for the TidDB community with the help of Calude AI.
Transforming PingCAP's 37,000+ GitHub stars into an AI-powered developer experience platform
This project demonstrates how to leverage TiDB's massive community engagement to create an intelligent developer support system. Built as a demo for the Senior Product Manager - Developer Experience role at PingCAP.
# Clone the repository
git clone https://github.com/yourusername/tidb-community-intelligence.git
cd tidb-community-intelligence
# Set up environment
python -m venv venv
venv\Scripts\activate # Windows
# source venv/bin/activate # Mac/Linux
# Install dependencies
pip install streamlit requests plotly pandas
# Collect community data
cd src
python simple_collector_basic.py
# Launch demo
cd ../demo
streamlit run basic_app.py- Real-time similarity matching with TiDB community issues
- Semantic understanding of technical problems
- Confidence scoring for solution quality
- Personalized recommendations based on technology stack
- Success patterns from similar developer environments
- Proactive issue prevention insights
- Automated pattern extraction from 37K+ GitHub stars
- Trend identification for emerging problems
- Performance insights by configuration type
- Business impact projections
- Implementation roadmap
- ROI analysis and success metrics
Data Collection β Pattern Analysis β AI Recommendations β Developer Interface
β β β β
GitHub API NLP Processing Machine Learning Streamlit Demo
Community Data Issue Clustering Similarity Matching Interactive UI
| Metric | Current State | Target State | Impact |
|---|---|---|---|
| Developer Onboarding | 2-3 days | < 1 day | 60% faster |
| Issue Resolution | 24-48 hours | < 4 hours | 80% faster |
| Community Self-Service | 30% | 80% | 150% improvement |
| Developer Satisfaction | +20 NPS | +50 NPS | 2.5x improvement |
User: "TiDB connection timeout in Kubernetes"
AI: Found 5 similar issues with 94% resolution rate
β Most effective solution: adjust connection pool settings
β Used by 23 companies with similar k8s setup
User Stack: [Kubernetes, Docker, MySQL]
AI Insights:
β 89% of migrations use these TiDB settings
β Common gotcha: charset configuration
β Recommended monitoring: these 3 metrics
tidb-community-intelligence/
βββ src/ # Data collection scripts
β βββ simple_collector_basic.py # Basic data collector
β βββ data_collector.py # Advanced collector (requires pandas)
βββ demo/ # Demo applications
β βββ basic_app.py # Basic Streamlit demo
β βββ advanced_app.py # Full-featured demo
βββ data/ # Collected data (auto-generated)
βββ docs/ # Documentation
βββ requirements.txt # Python dependencies
βββ README.md # This file
pip install streamlit requests plotly
cd src && python simple_collector_basic.py
cd ../demo && streamlit run basic_app.py# With conda (recommended)
conda create -n tidb-intelligence python=3.10 -y
conda activate tidb-intelligence
conda install -c conda-forge pandas numpy scikit-learn streamlit plotly requests -y
# Or with pip
pip install -r requirements.txt
# Run advanced collector and demo
cd src && python data_collector.py
cd ../demo && streamlit run advanced_app.py- Manual Support Burden: 40% of support tickets are repetitive
- Slow Onboarding: New developers take 2-3 days to get productive
- Knowledge Fragmentation: Community wisdom scattered across platforms
- No Predictive Insights: Issues only addressed after they occur
TiDB Community Intelligence Platform - An AI-powered system that:
- Instantly matches user problems with community solutions
- Predicts and prevents common issues before they happen
- Personalizes guidance based on developer's tech stack
- Scales community knowledge without proportional support staff increase
- Network effects - more community data improves recommendations
- Unique positioning - leverages TiDB's strong community engagement
- Defensible moat - proprietary knowledge graph from TiDB-specific patterns
- β Real-time community data ingestion
- β Basic AI similarity matching
- β Web interface prototype
- π― Target: 80% search accuracy, 50% faster onboarding
- π Advanced semantic understanding
- π Predictive issue classification
- π Multi-modal recommendations
- π― Target: 90% solution confidence, 70% ticket reduction