Malware_detection

A full-stack malware detection project for end users.

🛡️ Malware Detection System

A full-stack malware classification project using Machine Learning & Modern Web Technologies.

📌 Overview

This project classifies files as malicious or benign using machine learning models trained on a custom malware dataset.
It also includes a frontend UI and a backend API, so users can upload files and receive predictions in real time.

This repository contains:

✔️ Machine Learning model for malware detection
✔️ Full dataset (CSV format)
✔️ Training script (train.py)
✔️ Backend API server
✔️ Frontend interface
✔️ Confusion matrix and evaluation results
✔️ Setup scripts and project structure

🚀 Features

Binary classification (Malware vs Benign)
Custom dataset with expanded features
Machine Learning pipeline (preprocessing → training → prediction)
Visualization of evaluation metrics
REST API backend for prediction
React/Node frontend for file upload + result display
100% open-source
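
As a rough illustration of how a client might talk to the REST API, the sketch below builds a multipart file upload with the Python standard library and posts it to the backend. The URL, port, route (`/predict`), form field name (`file`), and JSON response are all assumptions made for this example; they are not taken from the repository, so check the backend code for the real values.

```python
import json
import urllib.request
import uuid
from pathlib import Path
from typing import Tuple

# Hypothetical endpoint: the actual host, port, and route may differ.
API_URL = "http://localhost:5000/predict"


def build_multipart(field_name: str, filename: str, payload: bytes) -> Tuple[bytes, str]:
    """Encode one file as a multipart/form-data body.

    Returns the raw request body and the matching Content-Type header value.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="{field_name}"; filename="{filename}"\r\n'
        "Content-Type: application/octet-stream\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return head + payload + tail, f"multipart/form-data; boundary={boundary}"


def request_prediction(path: str, url: str = API_URL) -> dict:
    """Upload the file at `path` and return the decoded JSON reply.

    Assumes the backend accepts a form field named "file" and answers with JSON.
    """
    file_path = Path(path)
    body, content_type = build_multipart("file", file_path.name, file_path.read_bytes())
    request = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the backend running, `request_prediction("sample.exe")` would return whatever JSON the server sends back; the shape of that reply depends on the backend implementation.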

🧠 Machine Learning Model

Model and training setup:

XGBoost classifier (XGBClassifier)
StandardScaler for normalization
80/20 train-test split
Evaluation metrics:
- Accuracy
- Precision / Recall / F1-score
- Confusion matrix (saved as confusion_matrix.png)
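
To make the metrics above concrete, here is a minimal sketch of how accuracy, precision, recall, and F1-score follow from the four cells of a binary confusion matrix. It is a pure-Python illustration, not the code from train.py; the function names and the label convention (1 = malware, 0 = benign) are assumptions for this example.

```python
from typing import Dict, List, Tuple


def confusion_counts(y_true: List[int], y_pred: List[int]) -> Tuple[int, int, int, int]:
    """Return (tp, fp, fn, tn) for labels where 1 = malware and 0 = benign (assumed)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1  # malware correctly flagged
        elif truth == 0 and pred == 1:
            fp += 1  # benign file flagged as malware
        elif truth == 1 and pred == 0:
            fn += 1  # malware missed
        else:
            tn += 1  # benign file correctly passed
    return tp, fp, fn, tn


def binary_metrics(y_true: List[int], y_pred: List[int]) -> Dict[str, float]:
    """Compute accuracy, precision, recall, and F1 from the confusion counts."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred)
    total = tp + fp + fn + tn
    accuracy = (tp + tn) / total if total else 0.0
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Toy labels for illustration only; these are not results from the project.
    y_true = [1, 1, 1, 0, 0, 0, 1, 0]
    y_pred = [1, 0, 1, 0, 1, 0, 1, 0]
    print(confusion_counts(y_true, y_pred))
    print(binary_metrics(y_true, y_pred))
```

For malware detection, recall is often the number to watch, since a false negative means a malicious file was let through.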

Training script: train.py
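
The training script itself is not reproduced in this README. As a rough illustration of the 80/20 split and the standardization step listed above, the sketch below uses only the Python standard library as a stand-in for scikit-learn's `train_test_split` and `StandardScaler`. The function names, the seed, and the toy data are assumptions for this example, not the contents of train.py.

```python
import random
from statistics import mean, pstdev
from typing import List, Sequence, Tuple

Row = List[float]


def train_test_split_80_20(
    rows: Sequence[Row], labels: Sequence[int], seed: int = 42
) -> Tuple[List[Row], List[Row], List[int], List[int]]:
    """Shuffle row indices and split them 80/20 into train and test sets."""
    indices = list(range(len(rows)))
    random.Random(seed).shuffle(indices)
    cut = len(indices) * 4 // 5  # integer arithmetic for the 80% boundary
    train_idx, test_idx = indices[:cut], indices[cut:]
    return (
        [rows[i] for i in train_idx],
        [rows[i] for i in test_idx],
        [labels[i] for i in train_idx],
        [labels[i] for i in test_idx],
    )


def fit_scaler(train_rows: Sequence[Row]) -> Tuple[List[float], List[float]]:
    """Compute per-column mean and population standard deviation on training data only."""
    columns = list(zip(*train_rows))
    means = [mean(col) for col in columns]
    # A constant column has zero deviation; use 1.0 to avoid dividing by zero.
    stds = [pstdev(col) or 1.0 for col in columns]
    return means, stds


def transform(rows: Sequence[Row], means: Sequence[float], stds: Sequence[float]) -> List[Row]:
    """Apply (value - mean) / std to every feature using previously fitted statistics."""
    return [[(v - m) / s for v, m, s in zip(row, means, stds)] for row in rows]


if __name__ == "__main__":
    # Toy feature rows and labels for illustration only.
    rows = [[float(i), float(i % 3)] for i in range(10)]
    labels = [i % 2 for i in range(10)]
    X_train, X_test, y_train, y_test = train_test_split_80_20(rows, labels)
    means, stds = fit_scaler(X_train)
    X_train_scaled = transform(X_train, means, stds)
    X_test_scaled = transform(X_test, means, stds)  # reuse training statistics
    print(len(X_train_scaled), len(X_test_scaled))
```

Fitting the scaler on the training rows and reusing those statistics for the test rows keeps information from the test set out of preprocessing.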

📁 Project Structure

Malware_detection/
│
├── backend/                      # Backend API server
├── malware-frontend/             # Frontend application
├── confusion_matrix.png          # Model performance plot
├── malware_dataset_expanded.csv
├── malware_dataset_gmm_5000.csv
│
├── train.py                      # Model training script
├── run.sh                        # Script to run backend + frontend
├── package.json                  # Node project file
├── .gitignore
└── README.md

⚙️ Installation & Usage

✅ Clone the Repository

git clone https://github.com/Harry-Khatri/Malware_detection.git
cd Malware_detection

For Model Training
python3 train.py

For the Backend
cd backend
npm install
npm start

For the Frontend (in a second terminal, from the repository root)
cd malware-frontend
npm install
npm run dev
