Text & Audio Sentiment Analyzer

This project performs sentiment analysis on both text and audio inputs using Natural Language Processing and Machine Learning techniques. The system analyzes customer reviews from the Amazon Alexa dataset and classifies sentiment into Positive, Negative, or Neutral.

For audio input, the system converts speech into text using Speech Recognition, extracts MFCC (Mel Frequency Cepstral Coefficients) features from audio signals, and then predicts sentiment using trained machine learning models.

The project demonstrates an end-to-end sentiment analysis pipeline including preprocessing, feature extraction, model training, and deployment using a Flask web application.

Dataset

The project uses the Amazon Alexa Reviews Dataset (amazon_alexa.tsv), which contains user reviews for Amazon Alexa products.

Sentiment labels are generated using VADER Sentiment Analysis, which produces a compound score that is mapped to three classes:

Positive
Negative
Neutral

Features

Text sentiment analysis
Audio sentiment analysis
Speech-to-text transcription
Text preprocessing using NLP techniques
MFCC feature extraction for audio
TF-IDF vectorization for text
Handling class imbalance using SMOTE
Machine learning based sentiment classification
Flask based web interface

Methodology

Text Sentiment Analysis

1. Data Collection

Amazon Alexa Reviews Dataset

2. Preprocessing

Text cleaning
Tokenization
Stopwords removal

3. Feature Extraction

TF-IDF Vectorization

4. Sentiment Classification

Naïve Bayes
Logistic Regression
VADER Sentiment Analysis

Audio Sentiment Analysis

1. Data Collection

Custom audio clips generated using gTTS

2. Preprocessing

Audio processing using Librosa

3. Feature Extraction

MFCC (Mel Frequency Cepstral Coefficients) extraction using Librosa

4. Audio Transcription

Convert audio to text using SpeechRecognition

5. Sentiment Classification

Naïve Bayes
Logistic Regression
VADER Sentiment Analysis

Machine Learning Models

The following machine learning models are used for sentiment classification:

Logistic Regression
Multinomial Naïve Bayes

These models are trained on features extracted using TF-IDF for text and MFCC for audio.

Technologies Used

Python
Flask
Scikit-learn
NLTK
VADER Sentiment Analysis
Librosa
SpeechRecognition
Pandas
NumPy

Project Workflow

Load Amazon Alexa review dataset
Clean and preprocess review text
Generate sentiment scores using VADER
Convert text into numerical features using TF-IDF
Extract MFCC features from audio signals
Handle class imbalance using SMOTE
Train Logistic Regression and Naïve Bayes models
Convert audio to text using Speech Recognition
Predict sentiment for both text and audio inputs

How to Run the Project

Clone the repository

git clone https://github.com/your-username/Text-Audio-Sentiment-Analyzer.git

Install dependencies

pip install -r requirements.txt

Run the application

python app.py

Open in browser

http://127.0.0.1:5000

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
audios		audios
templates		templates
Audio sentiment.png		Audio sentiment.png
README.md		README.md
Text sentiment.png		Text sentiment.png
amazon_alexa.csv		amazon_alexa.csv
amazon_alexa.tsv		amazon_alexa.tsv
amazon_alexa_with_sentiment.csv		amazon_alexa_with_sentiment.csv
amazon_alexa_with_sentiment.tsv		amazon_alexa_with_sentiment.tsv
app.py		app.py
audio sentiment workflow.png		audio sentiment workflow.png
audio_model.pkl		audio_model.pkl
audio_scaler.pkl		audio_scaler.pkl
audio_sentiment.py		audio_sentiment.py
audio_to_text.py		audio_to_text.py
datasets.py		datasets.py
logistic_regression.pkl		logistic_regression.pkl
naive_bayes.pkl		naive_bayes.pkl
review_sentiment.ipynb		review_sentiment.ipynb
sentiment_analysis_results.csv		sentiment_analysis_results.csv
sentiment_model.py		sentiment_model.py
text sentiment workflow.png		text sentiment workflow.png
transcribed_audio.csv		transcribed_audio.csv
transcribed_audio_with_sentiment.csv		transcribed_audio_with_sentiment.csv
vectorizer.pkl		vectorizer.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Text & Audio Sentiment Analyzer

Dataset

Features

Methodology

Text Sentiment Analysis

Audio Sentiment Analysis

Machine Learning Models

Technologies Used

Project Workflow

How to Run the Project

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Text & Audio Sentiment Analyzer

Dataset

Features

Methodology

Text Sentiment Analysis

Audio Sentiment Analysis

Machine Learning Models

Technologies Used

Project Workflow

How to Run the Project

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages