A Machine Learning project that detects whether a text message or email is Spam or Ham. This project includes a model training script and a user-friendly Graphical User Interface (GUI) built with Tkinter.
- Machine Learning: Uses Logistic Regression to classify text with ~96% accuracy.
- Text Processing: Implements TF-IDF (Term Frequency-Inverse Document Frequency) to convert text into numerical features.
- GUI Application: A clean desktop interface to test custom messages instantly.
- Reproducibility: Scripts to train the model from scratch and save it for inference.
- Language: Python 3.x
- Libraries: Scikit-Learn, Pandas, NumPy, Pickle
- GUI: Tkinter (Standard Python Interface)
SPAMCLASSIFIER/
βββ .venv/ # Virtual environment (ignored in git)
βββ app.py # GUI Application for testing
βββ main.py # Script to train and evaluate the model
βββ spam.csv # Dataset file
βββ README.md # Project documentation
βββ .gitignore # Files to ignore (pkl, venv, etc.)