Project Title : Predicting Emotion from Speech Using CNN-LSTM Model for a Speech Emotion Recognition (SER) System.

This project is a part of the AAI-501 course in the Applied Artificial Intelligence Program at the University of San Diego (USD).

Project Status: [Complete]

Installation

Download the CREMA-D dataset from Kaggle, which is available under an ODC Attribution license.

Launch Jupyter notebook and open the G7_CNN_LSTM_HuBERT_MFCC_Augmented.ipynb file from this repository.

The G7_CNN_LSTM_HuBERT_MFCC_Augmented.ipynb is the final version of the CNN-LSTM model including comparisons with the pre-trained HuBERT model from Meta.

Required libraries to be installed including:

import math
import os
import librosa
import librosa.display
import numpy as np
import pandas as pd
from sklearn.preprocessing import LabelEncoder
from sklearn.model_selection import train_test_split
import torch
import torch.nn as nn
import torch.optim as optim
from torch.utils.data import DataLoader, TensorDataset
from sklearn.metrics import classification_report, confusion_matrix
import seaborn as sns
import matplotlib.pyplot as plt
from sklearn.preprocessing import StandardScaler
from collections import Counter

For HuBERT Model

Import the following (in addition to the above)

from transformers import Wav2Vec2Processor, HubertModel, Wav2Vec2Model
from transformers import Wav2Vec2FeatureExtractor
from torch.utils.data import DataLoader, TensorDataset, Dataset
from sklearn.cluster import KMeans
from sklearn.decomposition import PCA

Please note, to utilize CUDA, the appropriate PyTorch CUDA version needs to be installed in your environment via pip or conda.

Project Intro/Objective

To classify 6 emotions from speech. The emotions are Anger, Sad, Neutral, Disgust, Fear, and Happy.

Partner(s)/Contributor(s)

• Donavan Trigg

• Dean P. Simmer

• Payal Patel

Methods Used

• Classification

• Machine Learning

• Neural Networks

• Deep Learning

Technologies

• Python

• Jupyter Notebook

• PyTorch

Project Description

Leveraging CNNs and LSTMS to train and classify emotions from speech.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
.gitignore		.gitignore
AAI501_Final_Project_G7_EDA.ipynb		AAI501_Final_Project_G7_EDA.ipynb
AAI501_Final_Project_Group7_Main.ipynb		AAI501_Final_Project_Group7_Main.ipynb
AAI_501_Final_Project_G7_CNN_LSTM.ipynb		AAI_501_Final_Project_G7_CNN_LSTM.ipynb
AAI_501_Final_Project_G7_CNN_LSTM_Clustering.ipynb		AAI_501_Final_Project_G7_CNN_LSTM_Clustering.ipynb
AAI_501_Final_Project_G7_CNN_LSTM_HuBERT.ipynb		AAI_501_Final_Project_G7_CNN_LSTM_HuBERT.ipynb
AAI_501_Final_Project_G7_CNN_LSTM_HuBERT_MFCC.ipynb		AAI_501_Final_Project_G7_CNN_LSTM_HuBERT_MFCC.ipynb
AAI_501_Final_Project_G7_CNN_LSTM_SAD.ipynb		AAI_501_Final_Project_G7_CNN_LSTM_SAD.ipynb
AAI_501_Final_Project_Group7_CNN.ipynb		AAI_501_Final_Project_Group7_CNN.ipynb
AAI_501_Final_Project_Group7_CNN_3Features.ipynb		AAI_501_Final_Project_Group7_CNN_3Features.ipynb
G7_CNN_LSTM_HuBERT_MFCC_Augmented.ipynb		G7_CNN_LSTM_HuBERT_MFCC_Augmented.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Title : Predicting Emotion from Speech Using CNN-LSTM Model for a Speech Emotion Recognition (SER) System.

Project Status: [Complete]

Installation

Required libraries to be installed including:

Project Intro/Objective

Partner(s)/Contributor(s)

Methods Used

Technologies

Project Description

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

mojodean/MSAAI501FinalProject_Group7

Folders and files

Latest commit

History

Repository files navigation

Project Title : Predicting Emotion from Speech Using CNN-LSTM Model for a Speech Emotion Recognition (SER) System.

Project Status: [Complete]

Installation

Required libraries to be installed including:

Project Intro/Objective

Partner(s)/Contributor(s)

Methods Used

Technologies

Project Description

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages