NdV_Code_By_RibkaA_Ass_7.py by ribkaaramalla322 · Pull Request #563 · ndvtechsyssolutions/Internships

ribkaaramalla322 · 2025-07-10T09:13:00Z

This project builds a spam detection classification model using supervised learning techniques. The dataset consists of labeled SMS messages (spam or ham) and is preprocessed by encoding labels and vectorizing text using CountVectorizer. The dataset is split into training and testing sets (80/20) to evaluate generalization. Two models are trained: Logistic Regression and Naive Bayes, and their performances are compared. Evaluation metrics include accuracy, precision, recall, F1-score, confusion matrix, and ROC curve. The Logistic Regression model's feature importance is visualized to interpret the most influential words in spam classification. Confusion matrices are plotted to compare predicted vs actual outcomes. The ROC curve visually confirms the model’s predictive power. This project demonstrates effective text classification and model explainability using machine learning.

NdV_Code_By_RibkaA_Ass_7.py

f7eb1c5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NdV_Code_By_RibkaA_Ass_7.py#563

NdV_Code_By_RibkaA_Ass_7.py#563
ribkaaramalla322 wants to merge 1 commit intondvtechsyssolutions:mainfrom
ribkaaramalla322:patch-19

ribkaaramalla322 commented Jul 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ribkaaramalla322 commented Jul 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant