MLOps CI/CD with TensorFlow and CML 🤖

A complete MLOps pipeline demonstrating automated machine learning model training, evaluation, and reporting using TensorFlow, GitHub Actions, and Continuous Machine Learning (CML). This project showcases best practices for ML automation, model performance tracking, and reproducible machine learning workflows.

🎯 Project Overview

This repository implements an end-to-end machine learning pipeline that:

Automatically trains a TensorFlow neural network on synthetic linear data
Evaluates model performance with MAE, MSE, and R²
Generates a plot of the training data, test data, and predictions
Posts the metrics and plot as CML comments on GitHub PRs
Ensures reproducibility through version-controlled ML workflows

Key Features

🔄 Automated CI/CD Pipeline: Triggered on every push/PR
📊 Performance Tracking: MAE, MSE, R² score monitoring
📈 Visual Analytics: Automated plot generation and publishing
🚀 Production Ready: Near-perfect fit on the noise-free synthetic data (R² ≈ 1.0)
📋 Comprehensive Reporting: Detailed model configuration and metrics

🏗️ Project Structure

├── .github/workflows/
│   └── cml.yml                 # GitHub Actions CI/CD workflow
├── model.py                    # Main ML training script  
├── requirements.txt            # Python dependencies
├── README.md                   # Project documentation
├── metrics.txt                 # Generated model performance metrics
└── model_results.png          # Generated visualization plot

🚀 Quick Start

Prerequisites

Python 3.8+
GitHub repository with Actions enabled
Basic understanding of TensorFlow and MLOps

Setup Instructions

Clone the repository:

git clone https://github.com/dev-opsss/MLOps-CI.git
cd MLOps-CI

Install dependencies:
```
pip install -r requirements.txt
```
Run locally (optional):
```
python model.py
```
Enable GitHub Actions:
- Push to your repository to trigger the automated pipeline
- Check the Actions tab for workflow execution
- View CML reports in PR comments

🤖 Model Architecture

Neural Network Design

import tensorflow as tf

model = tf.keras.Sequential([
    # One Dense unit with no activation computes y = w*x + b (linear regression)
    tf.keras.layers.Dense(1, input_shape=(1,))
])

Key Specifications

Framework: TensorFlow 2.20+
Model Type: Sequential Neural Network
Architecture: Single Dense Layer (Linear Regression)
Optimizer: Adam (learning_rate=0.1)
Loss Function: Mean Squared Error
Training Epochs: 200
Data Normalization: StandardScaler applied
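To make the specification above concrete, here is a minimal pure-Python sketch of what a single Dense unit trained with Adam on mean squared error computes. It is not the code in model.py: the function name `train_linear_adam`, the zero initialization, the four toy points, and the choice to standardize both feature and target are assumptions made for illustration, and the validation split is left out.

```python
import math

def train_linear_adam(xs, ys, epochs=200, lr=0.1,
                      beta1=0.9, beta2=0.999, eps=1e-7):
    """Fit y ~ w*x + b with full-batch Adam on mean squared error."""
    w, b = 0.0, 0.0          # assumption: zero init (Keras draws the kernel randomly)
    m = [0.0, 0.0]           # first-moment estimates for (w, b)
    v = [0.0, 0.0]           # second-moment estimates for (w, b)
    n = len(xs)
    for t in range(1, epochs + 1):
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        grads = [
            2.0 * sum(e * x for e, x in zip(errors, xs)) / n,  # dMSE/dw
            2.0 * sum(errors) / n,                             # dMSE/db
        ]
        params = [w, b]
        for i, g in enumerate(grads):
            m[i] = beta1 * m[i] + (1.0 - beta1) * g
            v[i] = beta2 * v[i] + (1.0 - beta2) * g * g
            m_hat = m[i] / (1.0 - beta1 ** t)   # bias correction
            v_hat = v[i] / (1.0 - beta2 ** t)
            params[i] -= lr * m_hat / (math.sqrt(v_hat) + eps)
        w, b = params
    mse = sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / n
    return w, b, mse

# Toy standardized feature (zero mean, unit variance). Because y = x + 10 has
# slope 1, the standardized target equals the standardized feature.
xs = [-1.5, -0.5, 0.5, 1.5]
scale = math.sqrt(sum(x * x for x in xs) / len(xs))
xs = [x / scale for x in xs]
ys = list(xs)

w, b, mse = train_linear_adam(xs, ys)
print(f"w={w:.4f} b={b:.4f} mse={mse:.2e}")
```

With standardized inputs the optimum is w = 1 and b = 0, which is why a relatively large learning rate of 0.1 can settle within 200 epochs.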

📊 Dataset Details

Synthetic Linear Data

Relationship: y = x + 10
Total Samples: 50
Feature Range: X ∈ [-100, 96] (step=4)
Target Range: y ∈ [-90, 106] (step=4)
Train/Test Split: 70/30 (shuffled)
Validation Split: 20% of training data
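The numbers above can be reproduced with a short sketch. It uses only the standard library rather than NumPy so it runs anywhere; the function names `make_dataset` and `shuffled_split` are hypothetical, and the actual model.py may generate and split the data differently.

```python
import random

def make_dataset(start=-100, stop=100, step=4, offset=10):
    """Return (X, y) lists for the noise-free relationship y = x + offset."""
    X = list(range(start, stop, step))   # -100, -96, ..., 96 -> 50 values
    y = [x + offset for x in X]          # -90, -86, ..., 106
    return X, y

def shuffled_split(X, y, train_fraction=0.7, seed=42):
    """Shuffle the (x, y) pairs together, then cut them into train and test parts."""
    pairs = list(zip(X, y))
    random.Random(seed).shuffle(pairs)
    n_train = round(len(pairs) * train_fraction)  # 50 * 0.7 -> 35
    return pairs[:n_train], pairs[n_train:]

X, y = make_dataset()
train, test = shuffled_split(X, y)
print(len(X), min(X), max(X), min(y), max(y))
print(len(train), len(test))
```

Shuffling the pairs together keeps every x aligned with its y, and fixing the seed makes the split repeatable across CI runs.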

Data Preprocessing

Random shuffling before the split, so test points fall inside the training range instead of beyond it
Feature standardization (zero mean, unit variance) for stable training
Reshaping features to shape (n_samples, 1), as the Dense layer expects
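A minimal sketch of the standardization and reshaping steps, assuming the usual practice of computing the statistics on the training set only. The helper names and sample values are hypothetical; scikit-learn's StandardScaler does the equivalent with the population standard deviation.

```python
import math

def fit_standardizer(values):
    """Return (mean, std) of the values, using the population standard deviation."""
    mean = sum(values) / len(values)
    variance = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, math.sqrt(variance)

def standardize(values, mean, std):
    """Scale values and return one single-feature row per sample, i.e. shape (n, 1)."""
    return [[(v - mean) / std] for v in values]

train_x = [-100.0, -40.0, 0.0, 48.0, 96.0]   # illustrative values only
test_x = [-60.0, 20.0]

mean, std = fit_standardizer(train_x)          # statistics from the training set only
train_scaled = standardize(train_x, mean, std)
test_scaled = standardize(test_x, mean, std)   # reuse them; do not refit on test data
```

Reusing the training statistics on the test set keeps information about the held-out samples from leaking into preprocessing.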

🎯 Performance Metrics

Exceptional Results Achieved

Mean Absolute Error = 0.000709
Mean Squared Error = 0.000001  
R² Score = 1.000000
Final Training Loss = 4.62e-10
Final Validation Loss = 1.89e-10

Model Performance Indicators

✅ Near-zero error (MAE < 0.001)
✅ Near-perfect fit (R² rounds to 1.000000)
✅ No overfitting (validation loss ≈ training loss)
✅ Production-ready performance levels
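For reference, the three reported metrics can be computed as in the sketch below. The function name `regression_metrics` and the sample values are hypothetical; the real script may rely on a library implementation instead.

```python
def regression_metrics(y_true, y_pred):
    """Return MAE, MSE, and R² for two equal-length sequences of numbers."""
    n = len(y_true)
    errors = [p - t for t, p in zip(y_true, y_pred)]
    mae = sum(abs(e) for e in errors) / n
    mse = sum(e * e for e in errors) / n
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)                   # residual sum of squares
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)    # total sum of squares
    r2 = 1.0 - ss_res / ss_tot
    return {"mae": mae, "mse": mse, "r2": r2}

y_true = [-90.0, -50.0, 10.0, 106.0]    # illustrative targets
y_pred = [-89.5, -50.5, 10.5, 105.5]    # each prediction is off by 0.5
metrics = regression_metrics(y_true, y_pred)
print(metrics)
```

Because R² is one minus the ratio of residual to total variation, a target spread in the thousands combined with errors near 0.001 yields a value that prints as 1.000000 at six decimals even though the fit is not mathematically exact.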

🔄 CI/CD Pipeline

GitHub Actions Workflow

The automated pipeline (.github/workflows/cml.yml) performs:

1. Environment Setup
   - Ubuntu latest runner
   - Python dependencies installation
   - CML tools configuration
2. Model Training
   - Execute model.py script
   - Generate performance metrics
   - Create visualization plots
3. Report Generation
   - Publish model results visualization
   - Create comprehensive performance report
   - Post automated comments on PRs
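The hand-off between training and reporting can be sketched as follows: the training step writes metrics.txt, and the report step turns it into Markdown. The function names, section titles, and the plain Markdown image link are assumptions for illustration; the real workflow builds report.md in the shell, and how the image is embedded depends on the CML version in use.

```python
from pathlib import Path
import tempfile

def format_metrics(mae, mse, r2):
    """Render the metrics in the 'Name = value' layout with six decimals."""
    return "\n".join([
        f"Mean Absolute Error = {mae:.6f}",
        f"Mean Squared Error = {mse:.6f}",
        f"R² Score = {r2:.6f}",
    ]) + "\n"

def build_report(metrics_text, plot_path="model_results.png"):
    """Assemble a Markdown report body (section titles are hypothetical)."""
    lines = ["## Model Performance Metrics", ""]
    lines += [f"- {line}" for line in metrics_text.splitlines()]
    lines += ["", "## Training Results", "", f"![Model results]({plot_path})", ""]
    return "\n".join(lines)

# Write and read back metrics.txt in a scratch directory to mimic the two CI steps.
with tempfile.TemporaryDirectory() as workdir:
    metrics_path = Path(workdir) / "metrics.txt"
    metrics_path.write_text(format_metrics(0.000709, 0.000001, 1.0), encoding="utf-8")
    report = build_report(metrics_path.read_text(encoding="utf-8"))

print(report)
```

Keeping the metrics in a plain text file means the report step needs no Python state from the training step, which suits separate workflow steps.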

Workflow Triggers

Push events: Any commit to the main branch
Pull requests: Automatic model evaluation on PRs
Manual dispatch: On-demand workflow execution

📈 Visualization & Reporting

Automated Plots

The pipeline generates publication-ready visualizations showing:

Training data points (blue scatter)
Test data points (green scatter)
Model predictions (red scatter)
True relationship line (black dashed)
Performance metrics overlay

CML Reports

Automated GitHub comments include:

📊 Model Performance Metrics
📈 Training Result Visualizations
🔧 Model Configuration Details
📋 Training Process Summary
🎯 Results Analysis & Status

🛠️ Configuration

Requirements (requirements.txt)

tensorflow>=2.20.0
numpy>=2.3.0
matplotlib>=3.10.0

Key Model Parameters

# Training Configuration
EPOCHS = 200
LEARNING_RATE = 0.1
BATCH_SIZE = 35  # Full batch training
VALIDATION_SPLIT = 0.2
TRAIN_TEST_SPLIT = 0.7

# Data Configuration  
RANDOM_SEED = 42
FEATURE_RANGE = (-100, 96)
STEP_SIZE = 4
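The sample counts implied by these parameters can be checked with a few lines of arithmetic. This sketch assumes the validation fraction is held out from the 35 training samples before batching, as Keras does with `validation_split`; whether model.py passes it that way is an assumption.

```python
import math

FEATURE_RANGE = (-100, 96)
STEP_SIZE = 4
TRAIN_TEST_SPLIT = 0.7
VALIDATION_SPLIT = 0.2
BATCH_SIZE = 35

n_samples = (FEATURE_RANGE[1] - FEATURE_RANGE[0]) // STEP_SIZE + 1  # 50
n_train = round(n_samples * TRAIN_TEST_SPLIT)                        # 35
n_test = n_samples - n_train                                         # 15
n_val = round(n_train * VALIDATION_SPLIT)                            # 7
n_fit = n_train - n_val                                              # 28

# One gradient step per epoch when the batch covers every fitted sample.
steps_per_epoch = math.ceil(n_fit / BATCH_SIZE)                      # 1

print(n_samples, n_train, n_test, n_val, n_fit, steps_per_epoch)
```

Under that assumption only 28 samples are used for gradient updates, so a batch size of 35 still covers them all and each epoch is a single full-batch step.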

🔧 Development Guidelines

Running Locally

# Install dependencies
pip install -r requirements.txt

# Execute training script
python model.py

# View generated files
ls -la *.png *.txt

Modifying the Model

To experiment with different architectures:

# Example: Multi-layer network
model = tf.keras.Sequential([
    tf.keras.layers.Dense(64, activation='relu', input_shape=(1,)),
    tf.keras.layers.Dense(32, activation='relu'),
    tf.keras.layers.Dense(1)
])

Custom Datasets

To use your own data:

# Replace synthetic data generation
X = your_features.reshape(-1, 1)
y = your_targets.reshape(-1, 1)

📋 Troubleshooting

Common Issues & Solutions

1. GitHub Actions Permission Errors

# Add to workflow permissions
permissions:
  contents: read
  pull-requests: write
  issues: write

2. TensorFlow Version Compatibility

# Ensure compatible versions (quote the requirement so the shell does not treat ">" as a redirect)
pip install "tensorflow>=2.20.0"

3. CML Report Generation Issues

# Use heredoc syntax for complex reports
cat << 'EOF' >> report.md
# Your markdown content
EOF

🏆 Project Achievements

Technical Accomplishments

✅ Perfect Model Performance: R² = 1.000000
✅ Automated MLOps Pipeline: End-to-end automation
✅ Comprehensive Testing: Training & validation monitoring
✅ Production Readiness: MAE below 0.001
✅ Reproducible Workflows: Version-controlled ML pipeline

Best Practices Implemented

Data normalization for training stability
Proper train/test splitting with shuffling
Comprehensive metrics tracking (MAE, MSE, R²)
Automated visualization generation
CI/CD integration with GitHub Actions
Version control for ML experiments

🤝 Contributing

Fork the repository
Create a feature branch (git checkout -b feature/improvement)
Commit your changes (git commit -am 'Add improvement')
Push to the branch (git push origin feature/improvement)
Create a Pull Request

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

TensorFlow Team for the excellent ML framework
Iterative.ai for CML (Continuous Machine Learning)
GitHub for Actions CI/CD platform
Open Source Community for inspiration and best practices

📊 Latest Results

Last Updated: Automatically updated by CML workflow

For the most recent model performance and visualizations, check the latest GitHub Actions run or PR comments.

This project demonstrates production-ready MLOps practices with automated model training, evaluation, and reporting. Perfect for learning CI/CD for machine learning workflows! 🚀
