GitHub - Ashishm1214/Movie_Recommender_system: A content based movie recommender system using cosine similarity

Movie Recommendation System

Overview

This project involves creating a movie recommendation system using a dataset containing movie attributes. The goal is to recommend movies similar to a given movie based on content similarity.

Dataset Link - https://www.kaggle.com/datasets/tmdb/tmdb-movie-metadata

Dataset

The dataset includes the following columns: budget, genres, homepage, id, keywords, original_language, original_title, overview, popularity, production_companies, runtime, spoken_languages, status, tagline, title, vote_average, and vote_count.

Data Preprocessing

Handling Missing Values: Missing values were identified and handled. Specifically:
- overview had 3 missing values, which were removed.
- Other columns had no missing values.
- No duplicate entries were found in the dataset.
Data Transformation:
- Genres, Keywords, Cast, and Crew: Extracted and converted relevant information to a list format.
- Tags Creation: Combined overview, genres, keywords, cast, and crew into a single tags column for each movie.
- Text Normalization: Applied stemming to the tags column using the Porter Stemmer to standardize the text data.

Feature Extraction

Count Vectorization: Converted the tags into numerical features using CountVectorizer with a maximum of 5000 features and excluding common stop words.

Similarity Measurement

Cosine Similarity: Calculated pairwise cosine similarity between movies based on their vectorized tags to quantify content similarity.

Recommendation System

Functionality:
- The recommend function finds the most similar movies to a given movie based on cosine similarity scores.
- It excludes the movie itself and returns the top 5 most similar movies.

Example

For the movie "Avatar", the system recommended:

"Titan A.E."
"Small Soldiers"
"Independence Day"
"Ender's Game"
"Aliens vs Predator: Requiem"

Conclusion

This recommendation system effectively suggests movies similar in content to the input movie, enhancing user experience by providing relevant suggestions based on textual analysis.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Movie-Recommender-System.ipynb		Movie-Recommender-System.ipynb
README.md		README.md
appMovie.py		appMovie.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Movie Recommendation System

Overview

Dataset

Data Preprocessing

Feature Extraction

Similarity Measurement

Recommendation System

Example

Conclusion

About

Uh oh!

Releases

Packages

Languages

Ashishm1214/Movie_Recommender_system

Folders and files

Latest commit

History

Repository files navigation

Movie Recommendation System

Overview

Dataset

Data Preprocessing

Feature Extraction

Similarity Measurement

Recommendation System

Example

Conclusion

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages