🎬 Movie Recommendation Engine

An intelligent, scalable Recommendation System built with FastAPI, LightGBM, and Annoy, designed to serve personalized movie suggestions in real time.

This project demonstrates how to combine content-based filtering (via embeddings) with learning-to-rank models for fine-tuned personalization.

🚀 Features

✅ Hybrid Recommendation System

Uses Annoy (Approximate Nearest Neighbors) for fast candidate generation based on movie metadata (genres, actors, keywords).
Uses LightGBM (LambdaRank) to rank the candidates based on user-specific behavioral patterns.

✅ Event-based Data Pipeline

Collects user interactions (viewed, liked, rated, etc.) in the events table.
Generates training labels automatically from ratings (or event scores).

✅ Fast Model Training

Annoy index for scalable similarity search.
LightGBM LambdaRank model for optimized ranking.

✅ Modular Codebase

DAO layer for all DB interactions.
Service layer for model logic and ML pipeline.
REST endpoints for training and recommendation.

✅ Extensible Architecture

Easily add new metadata fields (e.g., directors, tags).
Switch between test and production databases.
Compatible with AWS / Docker deployment.

⚙️ Tech Stack

Component	Technology
Backend Framework	FastAPI(Recommendor) / Express (Expose to customer)
Database	PostgreSQL
ORM / Data Access	SQLAlchemy
ML Model	LightGBM (LambdaRank Objective)
Approximate Nearest Neighbors	Annoy
Feature Engineering	Pandas, NumPy
Environment	Python 3.10+
Model Serving	FastAPI service layer
Visualization / Docs	Swagger UI (FastAPI built-in)
Deployment Ready For	Docker / Uvicorn / Gunicorn

👩🏻‍💻 Set-up

1️⃣ Clone the repository

git clone https://github.com/your-username/movie-recommender.git cd movie-recommender

2️⃣ Create and activate a virtual environment

python -m venv venv source venv/bin/activate # (Linux / macOS) venv\Scripts\activate # (Windows)

3️⃣ Install dependencies

pip install -r requirements.txt

4️⃣ Configure environment variables

Create a .env file in the project root with the following content: DATABASE_URL=postgresql://username:password@localhost:5432/movies_db

5️⃣ Run the Node.js Backend (API Gateway + DB Layer)

cd backend
npm install
npm run dev

6️⃣ Set up the PostgreSQL database

Ensure PostgreSQL is running and create the database: psql -U postgres -c "CREATE DATABASE movies_db;"
Create tables npx drizzle-kit generate --name=db_init npx drizzle-kit migrate
Seed dummy data using cron files
- create some dummy users
- import movies data from https://www.themoviedb.org/
- create some dummy ratings

7️⃣ Run the FastAPI server

uvicorn main:app --reload

Access the Swagger UI at: http://localhost:8000/docs

💡 Future Improvements

✅ User Embeddings: Incorporate matrix factorization or neural embeddings to better capture user–movie relationships.

✅ Hybrid Re-Ranking: Blend collaborative signals with content-based attributes (genre, keywords, cast).

✅ Context-Aware Recommendations: Integrate time-based and situational relevance (recent activity, seasonality).

✅ Feedback Loop: Implement reinforcement learning or Bayesian updates to fine-tune model weights.

✅ Scalability: Move to distributed training (LightGBM on GPU or Dask) and serve via model registry (e.g., MLflow).

✅ CI/CD Pipeline: Add automated retraining and deployment workflows (Cron Jobs + Docker + AWS ECS).

🏗️ Architecture Overview

flowchart TD
    A[User Interactions] -->|events, ratings| B[(PostgreSQL DB)]
    B -->|fetch metadata| C[MoviesDAO and EventsDAO]
    C --> D[Feature Preparation Layer]
    D --> E[Annoy Index Builder]
    E --> F[Annoy Index File]
    D --> G[LightGBM Trainer]
    G --> H[Ranker Model File]
    F & H --> I[Recommendation Service]
    I --> J[FastAPI Endpoints]
    J --> K[Client or Frontend]

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
backend		backend
cron		cron
recommendor		recommendor
.gitattributes		.gitattributes
README.md		README.md
docker.yaml		docker.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎬 Movie Recommendation Engine

🚀 Features

⚙️ Tech Stack

👩🏻‍💻 Set-up

1️⃣ Clone the repository

2️⃣ Create and activate a virtual environment

3️⃣ Install dependencies

4️⃣ Configure environment variables

5️⃣ Run the Node.js Backend (API Gateway + DB Layer)

6️⃣ Set up the PostgreSQL database

7️⃣ Run the FastAPI server

💡 Future Improvements

🏗️ Architecture Overview

About

Uh oh!

Releases

Packages

Languages

kushCT/recommendation-engine

Folders and files

Latest commit

History

Repository files navigation

🎬 Movie Recommendation Engine

🚀 Features

⚙️ Tech Stack

👩🏻‍💻 Set-up

1️⃣ Clone the repository

2️⃣ Create and activate a virtual environment

3️⃣ Install dependencies

4️⃣ Configure environment variables

5️⃣ Run the Node.js Backend (API Gateway + DB Layer)

6️⃣ Set up the PostgreSQL database

7️⃣ Run the FastAPI server

💡 Future Improvements

🏗️ Architecture Overview

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages