Machine Learning based Malware Detection system that analyzes files and predicts malicious behavior using classification algorithms and feature extraction techniques.

mohityadav8/malwaredetector

📧 Spam Mail Detector: Machine Learning Project

A lightweight machine learning application that classifies messages as Spam or Ham (Not Spam) in real time using natural language processing and logistic regression. A simple Tkinter GUI is included for interactive use.

🚀 Features

Real-time spam detection

Clean Tkinter GUI

Fast and lightweight model

NLP-based text preprocessing

TF-IDF vectorization

Modular training & prediction pipeline
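The README does not spell out what the NLP-based preprocessing does. As a minimal sketch, assuming it amounts to lowercasing, stripping links and punctuation, and splitting on whitespace, it might look like the following. The function name `clean_and_tokenize` is hypothetical and is not taken from main.py.

```python
import re


def clean_and_tokenize(text):
    """Lowercase the text, drop URLs and non-letter characters, split on whitespace.

    Hypothetical helper: the project's actual cleaning rules may differ.
    """
    text = text.lower()
    text = re.sub(r"https?://\S+|www\.\S+", " ", text)  # replace links with a space
    text = re.sub(r"[^a-z\s]", " ", text)  # keep only letters and whitespace
    return text.split()


print(clean_and_tokenize("WIN a FREE prize!!! Visit http://example.com now"))
# ['win', 'a', 'free', 'prize', 'visit', 'now']
```

Note that scikit-learn's TfidfVectorizer already lowercases and tokenizes by default, so a separate step like this is only needed for extra cleaning such as removing links.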

🧠 How It Works

Load dataset (mail_data.csv)

Preprocess text (cleaning & tokenization)

Convert text → numerical vectors using TF-IDF

Train Logistic Regression model

Predict message as:

HAM (Not Spam)

SPAM
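The steps above can be sketched with scikit-learn roughly as follows. This is an illustration under stated assumptions, not the contents of main.py. A tiny in-memory DataFrame stands in for mail_data.csv so the example runs on its own, and the column names `Category` and `Message` and the helper `predict_label` are assumptions.

```python
import pandas as pd
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline

# Stand-in for pd.read_csv("mail_data.csv"). The column names
# "Category" and "Message" are assumptions, not confirmed by this README.
data = pd.DataFrame({
    "Category": ["spam", "spam", "spam", "spam", "ham", "ham", "ham", "ham"],
    "Message": [
        "win a free prize now",
        "claim your free cash prize",
        "urgent winner claim cash now",
        "free entry win cash today",
        "are we still meeting for lunch",
        "see you at lunch tomorrow",
        "thanks for the meeting notes",
        "call me after the meeting tomorrow",
    ],
})

# TF-IDF turns each message into a numeric vector; logistic regression
# then learns to separate the two classes.
model = make_pipeline(TfidfVectorizer(lowercase=True), LogisticRegression())
model.fit(data["Message"], data["Category"])


def predict_label(message):
    """Return the predicted class label ("spam" or "ham") for one message."""
    return model.predict([message])[0]


print(predict_label("claim your free prize"))
print(predict_label("see you at lunch tomorrow"))
```

With the real dataset, a held-out split (for example via `sklearn.model_selection.train_test_split`) would give a more honest accuracy estimate than checking predictions on the training messages.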

🛠 Tech Stack

Language

Python

Libraries

scikit-learn

pandas

numpy

tkinter

Concepts Used

Machine Learning Classification

Natural Language Processing (NLP)

TF-IDF Vectorization

Logistic Regression
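To make the TF-IDF concept concrete, here is a small worked example in plain Python using the textbook definition: term frequency times the log of inverse document frequency. It is a sketch for intuition only. scikit-learn's TfidfVectorizer applies a smoothed IDF and L2 normalization by default, so its numbers will differ. The function name `tf_idf` is hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Textbook TF-IDF: tf = count / doc length, idf = log(N / document frequency)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Number of documents in which each term appears at least once.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    weights = []
    for tokens in tokenized:
        counts = Counter(tokens)
        weights.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


weights = tf_idf(["free prize now", "meeting now"])
print(weights[0])
```

Here "now" appears in both documents, so its IDF is log(2/2) = 0 and it carries no weight, while "free" appears in only one document and gets (1/3) * log(2).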

Project Structure

spam-mail-detector/
│── main.py
│── mail_data.csv
│── README.md

⚙️ Installation & Run

1️⃣ Clone Repository

git clone https://github.com/mohityadav8/spam-mail-detector-.git
cd spam-mail-detector-

2️⃣ Create Virtual Environment

Windows

python -m venv venv
venv\Scripts\activate

Linux/Mac

python3 -m venv venv
source venv/bin/activate

3️⃣ Install Dependencies

pip install pandas numpy scikit-learn

4️⃣ Run Application

python main.py

🎯 Output

The GUI will open and allow you to enter any message. The model will instantly classify it as Spam or Ham.
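A minimal sketch of how such a window could be wired up with Tkinter is shown below. The names `classify_message` and `launch_gui` are hypothetical, the layout is an assumption, and `predict` stands for any callable that returns "spam" or "ham" for a message. The actual main.py may be organized differently.

```python
def classify_message(text, predict):
    """Return the text to display for a message, using the `predict` callable."""
    text = text.strip()
    if not text:
        return "Please enter a message."
    return "SPAM" if predict(text) == "spam" else "HAM (Not Spam)"


def launch_gui(predict):
    """Open a small window with a text box, a button, and a result label."""
    # Imported here so classify_message can be used without a display.
    import tkinter as tk

    root = tk.Tk()
    root.title("Spam Mail Detector")

    entry = tk.Text(root, height=6, width=50)
    entry.pack(padx=10, pady=10)

    result = tk.Label(root, text="")

    def on_check():
        # "1.0" is line 1, character 0; "end" is the end of the text box.
        result.config(text=classify_message(entry.get("1.0", "end"), predict))

    tk.Button(root, text="Check", command=on_check).pack()
    result.pack(pady=10)
    root.mainloop()
```

Calling `launch_gui(my_predict_function)` opens the window. Keeping the classification logic in a plain function separate from the widgets makes it possible to test it without opening a window.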

📌 Future Improvements

Convert to Web App (Flask/Django)

Add Deep Learning model (LSTM/BERT)

Deploy online

Improve model accuracy with a larger dataset

👨‍💻 Author

Mohit Yadav
