
sudarshan710/fl_IDS


This project implements a privacy-preserving Intrusion Detection System (IDS) using Federated Learning across distributed edge nodes (cloud servers or VMs). The system enables collaborative training of IDS models without sharing raw data between nodes.

# Install necessary libraries
pip install -r requirements.txt

# To run the application
python3 app.py

🔧 Architecture Overview

  • Edge Nodes train local IDS models (e.g., SVM, Random Forest) using private data like network traffic logs and system logs.

  • Central Aggregator Server receives encrypted model updates (not raw data), aggregates them using Federated Averaging (FedAvg), and sends the updated global model back to the edge nodes for the next training round (see the aggregation sketch below).

  • Secure Communication is established using HTTPS/TLS protocols to encrypt data in transit.

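The aggregation step described above can be sketched as a sample-weighted average of the nodes' parameters. This is a minimal illustration assuming each update arrives as a list of NumPy arrays together with the node's sample count; the function name and data layout are assumptions, not the repository's actual code.

```python
import numpy as np

def fedavg(updates):
    """Federated Averaging: sample-weighted mean of local model parameters.

    `updates` is a list of (weights, n_samples) pairs, where `weights`
    is a list of NumPy arrays (one array per parameter group).
    """
    total = sum(n for _, n in updates)
    return [
        # Weight each node's contribution by its share of the training data.
        sum(w[i] * (n / total) for w, n in updates)
        for i in range(len(updates[0][0]))
    ]
```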

🔐 Privacy & Security Enhancements

  • Differential Privacy: Adds calibrated noise to model updates to protect individual data points (see the sketch after this list).
  • Homomorphic Encryption: Allows encrypted updates to be aggregated without decryption.
  • No Raw Data Transmission: Only model parameters (weights or gradients) are shared between nodes and the aggregator.
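As a rough illustration of the differential-privacy step, a node could clip its update's norm and add Gaussian noise before transmission. The clipping norm and noise scale below are illustrative placeholders, not values taken from this repository.

```python
import numpy as np

def privatize_update(weights, clip_norm=1.0, noise_scale=0.1, rng=None):
    """Clip the update's global L2 norm, then add Gaussian noise."""
    rng = rng or np.random.default_rng()
    flat = np.concatenate([w.ravel() for w in weights])
    # Scale the whole update down if it exceeds the clipping norm.
    factor = min(1.0, clip_norm / (np.linalg.norm(flat) + 1e-12))
    return [
        w * factor + rng.normal(0.0, noise_scale * clip_norm, size=w.shape)
        for w in weights
    ]
```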

🧠 Intrusion Detection Model

Each node:

  • Preprocesses its local logs (e.g., extracting IP addresses, traffic patterns, protocol types).
  • Performs consistent feature engineering to ensure compatibility across nodes.
  • Trains a local machine learning model to classify normal vs. malicious traffic (a training sketch follows below).
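A minimal sketch of a node's local training step with Scikit-learn is shown below; the feature layout and the randomly generated data are placeholders standing in for a node's preprocessed traffic logs.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Placeholder data: each row is one preprocessed connection record
# (e.g. duration, bytes transferred, protocol code); the label is
# 0 for normal traffic and 1 for malicious traffic.
X = np.random.rand(1000, 8)
y = np.random.randint(0, 2, size=1000)

# Local IDS model: scale features, then fit a Random Forest classifier.
local_model = make_pipeline(StandardScaler(), RandomForestClassifier(n_estimators=100))
local_model.fit(X, y)
print("training accuracy:", local_model.score(X, y))
```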

🔄 Model Update and Aggregation Flow

  • Local Training: Each node trains on its own data.
  • Model Update: Each node sends its learned parameters to the central server (an example submission endpoint is sketched below).
  • Federated Averaging: The central server aggregates the received updates into a global model.
  • Global Model Distribution: The server sends the updated global model back to the nodes for the next training round.
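Since the stack includes Flask, the model-update step could look roughly like the endpoint below. The route name, payload format, and in-memory buffer are assumptions for illustration, not the repository's actual API; in deployment the server would be served over HTTPS/TLS.

```python
import numpy as np
from flask import Flask, request, jsonify

app = Flask(__name__)
pending_updates = []  # (weights, n_samples) pairs collected for the current round

@app.route("/submit_update", methods=["POST"])
def submit_update():
    """Receive one node's model parameters (never its raw data)."""
    payload = request.get_json()
    weights = [np.asarray(w) for w in payload["weights"]]
    pending_updates.append((weights, payload["n_samples"]))
    return jsonify({"status": "received", "updates_this_round": len(pending_updates)})

# Once enough updates have arrived, the server would run a FedAvg routine
# (like the sketch above) and distribute the new global model to the nodes.
```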

🧰 Tech Stack

  • Languages: Python, Bash
  • ML Libraries: Scikit-learn, PyTorch
  • Security: OpenSSL, HTTPS/TLS
  • Web & API: Flask
  • Environment: Linux, Docker

Results and Analysis

The integration of Federated Learning (FL) with cloud computing provides an efficient, scalable, and privacy-preserving solution for large-scale anomaly detection tasks like Intrusion Detection Systems (IDS).

Traditional ML models rely on centralized data, which poses privacy, bandwidth, and scalability challenges. FL removes the need to transfer raw data: models are trained locally on edge devices, and only model updates are shared with a central cloud server for aggregation. This ensures:

  • Faster processing
  • Reduced bandwidth consumption
  • Improved data privacy

When combined with cloud computing, FL enables:

  • Real-time aggregation of model updates
  • Seamless scaling across distributed environments
  • Reduced infrastructure maintenance

This makes FL ideal for real-time applications in cybersecurity, healthcare, and finance.

Result plots (see the figures in the repository):

  • Local Model 1 (figure 2-1)
  • Local Model 2 (figure 2-2)
  • Federated Model (figure 2-3)
