Implement Phase 8: Offline AI Model Training & Evaluation #47

takurot · 2026-01-24T04:18:50Z

Summary

This PR implements the offline training and evaluation pipeline for the AI Sidecar (Phase 8), enabling the system to learn optimal caching policies from query logs.

Changes

P8-1: Training Pipeline: Added to train a GBDT model (XGBoost/Sklearn) on system metrics (, , 'latency' must be run as root..., ) and export it to ONNX.
P8-2: Evaluation Pipeline: Added to simulate the model's impact on historical data, estimating Cost Savings and P99 improvements.
P8-3: ONNX Validation: Added robust ONNX structure and runtime inference checks to ensuring model validity before deployment.
Dependencies: Added , , , to .

Verification

Training: Verified runs successfully on synthetic/logged data.
Evaluation: Verified produces a simulation report (Estimated ~27% P99 improvement on test logs).
Tests: Regressions tests passed ().

takurot added 3 commits January 24, 2026 13:18

Implement Phase 8: Offline AI Model Training & Evaluation Pipeline

26fe2f9

Fix python linting errors (flake8)

2a879f7

Format python code with black

b512b13

takurot merged commit 950eddf into main Jan 24, 2026
4 checks passed

takurot deleted the feature/ai-model-training branch January 24, 2026 04:42

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Phase 8: Offline AI Model Training & Evaluation #47

Implement Phase 8: Offline AI Model Training & Evaluation #47

takurot commented Jan 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Implement Phase 8: Offline AI Model Training & Evaluation #47

Implement Phase 8: Offline AI Model Training & Evaluation #47

Conversation

takurot commented Jan 24, 2026

Summary

Changes

Verification

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants