Automated neural network architecture discovery using Differential Evolution. Achieved 69.91% accuracy on UCI Adult dataset, beating random search baseline by +0.88% and discovering a novel efficiency-complexity tradeoff.
- Live Interactive Demo: try it now!
- Full Colab Notebook: run the experiments yourself
- Best Architecture: `[21, 48, 11]` (3-layer hourglass pattern)
- Test Accuracy: 69.91% ± 0.12%
- Improvement over Baseline: +2.67% (67.24% → 69.91%)
- Beat Random Search: +0.88% with same computational budget
- Search Time: 33.6 minutes on Tesla T4 GPU
Through systematic ablation, I discovered that single-trial evaluation outperforms multi-trial averaging at short search horizons:
| Configuration | Accuracy | Time | Result |
|---|---|---|---|
| Single-trial | 70.12% | 1,108s | Optimal |
| Full system (multi-trial) | 69.88% | 2,021s | −0.24%, ~82% more time |
Insight: At short horizons (≤8 generations), population diversity creates more variance than random initialization. Multi-trial averaging adds overhead without reducing overall noise.
Impact: 2x speedup for rapid prototyping without accuracy loss.
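The tradeoff behind this finding can be illustrated with a toy simulation (all numbers below are synthetic, not the project's measurements): averaging k noisy trials shrinks the estimator's standard deviation by roughly √k, but multiplies evaluation cost by k. The project's result is that, at short horizons, that variance reduction is too small to repay its cost.

```python
import numpy as np

rng = np.random.default_rng(0)
true_acc, trial_noise = 0.70, 0.005  # synthetic accuracy and per-trial noise

def estimate(k):
    """Score one architecture as the mean of k noisy training trials."""
    return rng.normal(true_acc, trial_noise, size=k).mean()

stds = {}
for k in (1, 3, 5):
    scores = [estimate(k) for _ in range(2000)]
    stds[k] = float(np.std(scores))
    print(f"k={k}: estimator std={stds[k]:.4f}, evaluation cost={k}x")
```

The standard deviation falls like 1/√k while cost grows linearly in k, which is why multi-trial averaging only pays off when the search runs long enough for that extra precision to matter.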
Visit the Live Demo to explore results interactively.
# Clone repository
git clone https://github.com/omar-camara/nas-differential-evolution.git
cd nas-differential-evolution
# Install dependencies
pip install -r requirements.txt
# Option 1: Run in Colab (recommended)
# Open notebooks/Neural_Architecure_Search_DE.ipynb in Google Colab
# Option 2: Run locally (requires GPU)
python -m notebooks.Neural_Architecure_Search_DE

from src import EnhancedNASEngine
import numpy as np
# Create dummy data
X_train = np.random.randn(1000, 80)
X_test = np.random.randn(200, 80)
y_train = np.random.randint(0, 2, (1000, 1))
y_test = np.random.randint(0, 2, (200, 1))
# Run mini search
nas = EnhancedNASEngine(X_train, X_test, y_train, y_test, budget=60, max_layers=3)
nas.run_differential_evolution(pop_size=5, max_generations=3)
print(f"Best architecture: {nas.best_architecture}")
print(f"Best accuracy: {nas.best_accuracy:.4f}")

| Method | Best Architecture | Accuracy | Evaluations |
|---|---|---|---|
| Differential Evolution | [21, 48, 11] | 69.91% | 144 |
| Random Search | [25, 35, 20] | 69.03% | 144 |
| Advantage | - | +0.88% | Same budget |
Conclusion: Guided evolutionary search outperforms random exploration with identical computational cost.
Systematic component removal to measure impact:
| Configuration | Accuracy | Time | Finding |
|---|---|---|---|
| Single Trial | 70.12% | 1,108s | Best |
| Minimal (No Features) | 69.97% | 1,021s | Also efficient |
| Full System | 69.88% | 2,021s | Baseline |
| No Adaptive DE | 69.93% | 2,129s | Minimal impact |
| No LR Scheduler | 69.96% | 2,134s | Minimal impact |
Key Insight: At short search horizons, simpler evaluation strategies are superior.
Discovered Pattern: Hourglass
Layer 1: 21 neurons (compress)
Layer 2: 48 neurons (expand)
Layer 3: 11 neurons (compress)
This asymmetric design was automatically discovered and outperforms intuitive symmetric patterns.
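A minimal PyTorch sketch of the discovered `[21, 48, 11]` network, assuming 80 input features and 2 output classes as listed below (`build_mlp` is an illustrative helper, not the repository's API):

```python
import torch
import torch.nn as nn

def build_mlp(hidden_sizes, in_features=80, num_classes=2, dropout=0.2):
    """Stack Linear -> ReLU -> Dropout blocks, then a classification head."""
    layers, prev = [], in_features
    for width in hidden_sizes:
        layers += [nn.Linear(prev, width), nn.ReLU(), nn.Dropout(dropout)]
        prev = width
    layers.append(nn.Linear(prev, num_classes))
    return nn.Sequential(*layers)

# The discovered hourglass: compress (21) -> expand (48) -> compress (11)
model = build_mlp([21, 48, 11])
out = model(torch.randn(4, 80))
print(out.shape)  # torch.Size([4, 2])
```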
- Algorithm: Differential Evolution (DE/rand/1)
- Search Space: 1-4 layers, 80 neuron budget
- Population: 8 individuals
- Generations: 8 iterations
- Mutation Factor (F): 0.8 (adaptive)
- Crossover Rate (CR): 0.7
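The DE/rand/1 update with these F and CR settings can be sketched in NumPy (`de_rand_1_step` is an illustrative helper on a real-coded architecture vector; the actual engine lives in `src`):

```python
import numpy as np

rng = np.random.default_rng(0)

def de_rand_1_step(pop, i, F=0.8, CR=0.7):
    """One DE/rand/1 mutation + binomial crossover for individual i.

    pop: (pop_size, dim) array of real-coded layer widths.
    """
    candidates = [j for j in range(len(pop)) if j != i]
    r1, r2, r3 = rng.choice(candidates, size=3, replace=False)
    mutant = pop[r1] + F * (pop[r2] - pop[r3])     # DE/rand/1 mutation
    mask = rng.random(pop.shape[1]) < CR           # binomial crossover
    mask[rng.integers(pop.shape[1])] = True        # guarantee one mutant gene
    return np.where(mask, mutant, pop[i])

pop = rng.uniform(1, 80, size=(8, 3))  # 8 individuals, 3 layer widths each
trial = de_rand_1_step(pop, 0)
print(trial.shape)  # (3,)
```

In the full engine the trial vector replaces individual `i` only if its evaluated accuracy is at least as good (greedy selection).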
- Framework: PyTorch 2.0+ with CUDA
- Optimizer: Adam (lr=0.001, weight_decay=0.001)
- Loss: CrossEntropyLoss
- Regularization: Dropout (0.2), L2 (0.001)
- Early Stopping: Patience=10 epochs
- LR Scheduling: ReduceLROnPlateau
- Batch Size: 256
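One training step with the settings above might look like the following sketch (the two-layer model and random batch are placeholders; the real model comes from the search):

```python
import torch
import torch.nn as nn

# Placeholder model; the search supplies the real architecture.
model = nn.Sequential(nn.Linear(80, 21), nn.ReLU(), nn.Linear(21, 2))
optimizer = torch.optim.Adam(model.parameters(), lr=0.001, weight_decay=0.001)
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(optimizer, patience=3)
criterion = nn.CrossEntropyLoss()

x = torch.randn(256, 80)           # one batch of 256 samples
y = torch.randint(0, 2, (256,))

optimizer.zero_grad()
loss = criterion(model(x), y)
loss.backward()
nn.utils.clip_grad_norm_(model.parameters(), max_norm=1.0)  # gradient clipping
optimizer.step()
scheduler.step(loss.item())        # in practice, step on validation loss
```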
- Name: UCI Adult Income
- Size: 39,073 training, 9,769 test samples
- Features: 80 (after preprocessing)
- Task: Binary classification (income >$50K)
- Preprocessing: StandardScaler + OneHotEncoder
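The StandardScaler + OneHotEncoder preprocessing can be combined in a single `ColumnTransformer`; the toy frame below stands in for the Adult data and its column names are illustrative:

```python
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.preprocessing import StandardScaler, OneHotEncoder

# Toy stand-in for the Adult dataset (columns are illustrative)
df = pd.DataFrame({
    "age": [25, 38, 52],
    "hours_per_week": [40, 50, 60],
    "workclass": ["Private", "State-gov", "Private"],
})

pre = ColumnTransformer([
    ("num", StandardScaler(), ["age", "hours_per_week"]),
    ("cat", OneHotEncoder(handle_unknown="ignore"), ["workclass"]),
])
X = pre.fit_transform(df)
print(X.shape)  # (3, 4): 2 scaled numeric + 2 one-hot columns
```

One-hot encoding of the categorical columns is what expands the raw attributes to the 80 features used in the search.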
- GPU acceleration (10-50x speedup)
- Evaluation caching (avoids redundant training)
- Model checkpointing (saves progress every 5 generations)
- Gradient clipping (max_norm=1.0)
- Adaptive parameters (F adjusts based on success rate)
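Evaluation caching deserves a note because DE frequently revisits architectures. The idea reduces to memoizing the expensive evaluation on the architecture tuple; this sketch uses `functools.lru_cache` with a placeholder scoring function, not the repository's implementation:

```python
from functools import lru_cache

train_count = 0  # counts how many architectures are actually trained

@lru_cache(maxsize=None)
def evaluate(architecture):
    """Stand-in for the expensive train-and-evaluate step."""
    global train_count
    train_count += 1
    return sum(architecture) / 100.0  # placeholder "accuracy"

evaluate((21, 48, 11))
acc = evaluate((21, 48, 11))  # cache hit: no second training run
print(train_count, acc)  # 1 0.8
```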
- Purpose: Prove DE is optimizing, not just lucky
- Setup: Same evaluation budget (144 architectures)
- Result: DE found 0.88% better architecture
- Conclusion: Guided search > random exploration
- Purpose: Measure impact of each component
- Configurations: 5 systematic variations
- Key Finding: Single-trial optimal at short horizons
- Impact: 2x speedup for rapid prototyping
- Method: 5 independent trials per final evaluation
- Metrics: Mean, standard deviation, confidence intervals
- Result: 69.91% Β± 0.12% (reproducible)
nas-differential-evolution/
├── notebooks/
│   └── NAS_Complete_Notebook.ipynb     # Full implementation & experiments
├── deployment/
│   ├── app.py                          # Gradio interactive demo
│   └── requirements.txt                # Demo dependencies
├── results/
│   ├── comprehensive_report.json       # All experimental results
│   ├── search_results.png              # Main visualization
│   ├── ablation_study.png              # Ablation analysis
│   └── search_space_visualization.png  # t-SNE plot
├── docs/
│   └── STUDY_GUIDE.md                  # Comprehensive documentation
├── README.md                           # This file
├── requirements.txt                    # Project dependencies
├── .gitignore                          # Git ignore rules
└── LICENSE                             # MIT License
Theory suggested multi-trial averaging would improve stability. Empirical testing showed it hurt performance at short horizons. Lesson: Always validate assumptions with experiments.
Optimization features should match problem scale. Features that help at long horizons can hurt at short ones. Lesson: Don't optimize prematurely.
The ablation "failure" became the project's most interesting finding. Lesson: Unexpected results often teach more than expected ones.
- Colab Notebook: Fully reproducible experiments with detailed explanations
- Live Demo: Interactive exploration of results
Contributions welcome! Please:
- Fork the repository
- Create a feature branch (`git checkout -b feature/improvement`)
- Commit changes (`git commit -m 'Add improvement'`)
- Push to the branch (`git push origin feature/improvement`)
- Open a Pull Request
This project is licensed under the MIT License - see LICENSE file for details.
- Dataset: UCI Machine Learning Repository - Adult Income dataset
- Framework: PyTorch team for excellent deep learning tools
- Inspiration: Storn & Price (1997) - Differential Evolution algorithm
- Platform: Hugging Face for free hosting
Omar
MS Computer Science, Syracuse University
Graduate Teaching Assistant
- Email: omarcamara000@gmail.com
- LinkedIn: https://www.linkedin.com/in/oc18/
- Hugging Face: @Username273183
- Lines of Code: ~1,500
- Experiments Run: 500+ architecture evaluations
- GPU Hours: ~40 hours on Tesla T4
- Development Time: 2 weeks
- Key Finding: Efficiency-complexity tradeoff in evaluation strategy
- Multi-objective optimization (accuracy + model size)
- Extended search space (skip connections, batch normalization)
- Distributed evaluation across multiple GPUs
- Transfer learning initialization
- Additional datasets and benchmarks
If you use this work, please cite:
@software{omar_nas_2025,
author = {Omar},
title = {Neural Architecture Search with Differential Evolution},
year = {2025},
institution = {Syracuse University},
url = {https://github.com/omar-camara/nas-differential-evolution},
note = {Interactive demo: https://huggingface.co/spaces/Username273183/nas-differential-evolution}
}

If you find this project useful, please consider giving it a star!
Built with ❤️ using PyTorch, Differential Evolution, and a lot of GPU hours
Last updated: December 2025
