Product Review Sentiment Analysis

An end-to-end NLP project that classifies product reviews as Positive or Negative using TF-IDF features and a Linear SVM model, deployed as an interactive Streamlit web application.

📌 Overview

This project demonstrates a complete sentiment analysis workflow:

Data ingestion from the Amazon Fine Food Reviews dataset (Kaggle).
Text preprocessing using NLTK (tokenization, stopword removal, lemmatization).
Feature extraction using TF-IDF (unigrams + bigrams).
Model training with Linear SVM (and Logistic Regression benchmark).
Model persistence (sentiment_model.pkl, vectorizer.pkl) with joblib.
Web UI for single review and bulk CSV predictions using Streamlit.
Ready for deployment on Streamlit Community Cloud.

🧠 Architecture

Data Layer
- Raw reviews from Kaggle (data/Reviews.csv).
- Columns Text and Score used for sentiment modeling.
Preprocessing & Feature Layer
- Custom text cleaner:
  - lowercase
  - HTML & URL removal
  - digits & punctuation removal
  - stopword removal
  - lemmatization
- TF-IDF vectorization (max_features=50,000, ngram_range=(1,2)).
Model Layer
- Binary labels:
  - Score 4–5 → positive
  - Score 1–2 → negative
- Neutral (Score = 3) reviews removed.
- Models evaluated:
  - Logistic Regression
  - Linear SVM (chosen as final model).
Deployment Layer
- app.py (Streamlit app) for:
  - Single review prediction
  - CSV upload (review column) for bulk prediction
- Hosted locally or on Streamlit Community Cloud.

📂 Project Structure

product-review-sentiment-analysis/
├── app.py
├── requirements.txt
├── README.md
├── data/
│   └── Reviews.csv
├── models/
│   ├── sentiment_model.pkl
│   └── vectorizer.pkl
└── notebooks/
    └── product_review_sentiment.ipynb

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
models		models
notebooks		notebooks
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Product Review Sentiment Analysis

📌 Overview

🧠 Architecture

📂 Project Structure

About

Uh oh!

Releases

Packages

Languages

License

ArjunPramod/Product-Review-Sentiment-Analysis

Folders and files

Latest commit

History

Repository files navigation

Product Review Sentiment Analysis

📌 Overview

🧠 Architecture

📂 Project Structure

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages