likithakaraka/netflix_data_analysis

📺 Netflix Data Analysis Project

This project cleans a Netflix dataset with Python (pandas), queries it with SQL through sqlite3, and visualizes the results in Power BI. The goal is to explore content distribution, types, durations, and trends over time.

πŸ“ Folder Structure

netflix-data-analysis/
├── data/
│   ├── netflix_titles.xlsx      # Raw Netflix dataset
│   └── CLEANED_DATA.xlsx        # Cleaned data after preprocessing
├── scripts/
│   ├── clean.py                 # Python script for data cleaning
│   └── clean_using_sql.py       # Python script to query data using SQLite
├── database/
│   └── cleaned_data.db          # SQLite database generated from cleaned data
├── dashboard/
│   └── netflix_dashboard.pbix   # Power BI dashboard file
├── README.md                    # Project documentation


📊 Key Tasks Performed

✅ Data Cleaning

  • Removed or handled null values
  • Standardized column formats (e.g., date_added, duration)
  • Resolved data type mismatches (e.g., year to integer)
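
The cleaning steps above can be sketched roughly as follows. This is a minimal illustration, not the contents of clean.py: the column names (title, country, date_added, release_year, duration), the "Unknown" placeholder, the date format, and the function name clean_netflix are all assumptions made for the example, and a tiny in-memory table stands in for netflix_titles.xlsx.

```python
import pandas as pd


def clean_netflix(df: pd.DataFrame) -> pd.DataFrame:
    """Hypothetical cleaning helper; column names are assumed, not taken from clean.py."""
    # Handle nulls: drop rows with no title, fill missing countries with a placeholder.
    out = df.dropna(subset=["title"]).copy()
    out["country"] = out["country"].fillna("Unknown")

    # Standardize date_added: trim stray whitespace, then parse to datetime.
    # The "%B %d, %Y" format is an assumption; values that do not match become NaT.
    out["date_added"] = pd.to_datetime(
        out["date_added"].str.strip(), format="%B %d, %Y", errors="coerce"
    )

    # Standardize duration: split "90 min" / "2 Seasons" into a number and a unit.
    parts = out["duration"].str.extract(
        r"(?P<duration_value>\d+)\s*(?P<duration_unit>\w+)"
    )
    out["duration_value"] = pd.to_numeric(
        parts["duration_value"], errors="coerce"
    ).astype("Int64")
    out["duration_unit"] = parts["duration_unit"].str.lower().str.rstrip("s")

    # Resolve type mismatches: store the year as a nullable integer.
    out["release_year"] = pd.to_numeric(
        out["release_year"], errors="coerce"
    ).astype("Int64")

    return out.reset_index(drop=True)


# Made-up sample rows used only to exercise the helper.
raw = pd.DataFrame(
    {
        "title": ["A", "B", None, "D"],
        "type": ["Movie", "TV Show", "Movie", "Movie"],
        "country": ["India", None, "United States", "Japan"],
        "date_added": [" September 25, 2021", "January 1, 2020", "March 3, 2019", None],
        "release_year": [2020.0, "2019", 2018, None],
        "duration": ["90 min", "2 Seasons", "100 min", None],
    }
)

cleaned = clean_netflix(raw)
print(cleaned.dtypes)
```

Splitting duration into a numeric value and a unit is one possible design choice; it lets movies (minutes) and TV shows (seasons) be compared separately later on.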

✅ Data Querying

  • Used sqlite3 and pandas.read_sql() to run SQL queries on the dataset
  • Extracted insights like:
    • Number of shows per year
    • Movies vs TV shows distribution
    • Country-wise production frequency
    • Duration patterns by content type
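
A rough sketch of this querying approach is shown below. It is not the contents of clean_using_sql.py: the table name netflix, the column names, and the sample rows are assumptions for illustration, and an in-memory SQLite database is used instead of cleaned_data.db.

```python
import sqlite3

import pandas as pd

# Made-up rows standing in for the cleaned dataset.
titles = pd.DataFrame(
    {
        "title": ["A", "B", "C", "D"],
        "type": ["Movie", "TV Show", "Movie", "Movie"],
        "country": ["India", "United States", "India", "Japan"],
        "release_year": [2019, 2020, 2020, 2020],
    }
)

# Load the DataFrame into an in-memory SQLite table (hypothetical name: netflix).
conn = sqlite3.connect(":memory:")
titles.to_sql("netflix", conn, index=False, if_exists="replace")

# Number of titles per release year.
per_year = pd.read_sql(
    "SELECT release_year, COUNT(*) AS n_titles "
    "FROM netflix GROUP BY release_year ORDER BY release_year",
    conn,
)

# Movies vs TV shows distribution.
by_type = pd.read_sql(
    "SELECT type, COUNT(*) AS n_titles "
    "FROM netflix GROUP BY type ORDER BY n_titles DESC",
    conn,
)

# Country-wise production frequency.
by_country = pd.read_sql(
    "SELECT country, COUNT(*) AS n_titles "
    "FROM netflix GROUP BY country ORDER BY n_titles DESC, country",
    conn,
)

conn.close()

print(per_year)
print(by_type)
print(by_country)
```

Passing a file path such as the project's database file to sqlite3.connect instead of ":memory:" would persist the table between runs.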

✅ Dashboard (Power BI)

  • Map of content distribution by country 🌍
  • Line/bar chart showing release trends over the years 📈
  • Comparison of durations across content types ⏱
  • Filters by genre, country, and type

🧰 Tools & Technologies Used

  • Python (pandas, sqlite3)
  • Power BI (interactive dashboards)
  • SQLite (in-memory and persistent databases)
  • Excel (for initial and final data handling)

About

Netflix data analysis using Python, SQL, and Power BI
