Skip to content

Conversation

@takurot
Copy link
Owner

@takurot takurot commented Jan 24, 2026

Summary

This PR implements the offline training and evaluation pipeline for the AI Sidecar (Phase 8), enabling the system to learn optimal caching policies from query logs.

Changes

  • P8-1: Training Pipeline: Added to train a GBDT model (XGBoost/Sklearn) on system metrics (, , 'latency' must be run as root..., ) and export it to ONNX.
  • P8-2: Evaluation Pipeline: Added to simulate the model's impact on historical data, estimating Cost Savings and P99 improvements.
  • P8-3: ONNX Validation: Added robust ONNX structure and runtime inference checks to ensuring model validity before deployment.
  • Dependencies: Added , , , to .

Verification

  • Training: Verified runs successfully on synthetic/logged data.
  • Evaluation: Verified produces a simulation report (Estimated ~27% P99 improvement on test logs).
  • Tests: Regressions tests passed ().

@takurot takurot merged commit 950eddf into main Jan 24, 2026
4 checks passed
@takurot takurot deleted the feature/ai-model-training branch January 24, 2026 04:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants