Skip to content

End-to-end data science and machine learning analysis of 1M+ real estate listings from Divar, including EDA, statistical analysis, geospatial clustering, and price/rent prediction using Random Forest and LightGBM.

Notifications You must be signed in to change notification settings

mehdirt/Divar-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

63 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

πŸ“Š Divar Project

Python License: MIT

🌟 Project Description

This project analyzes data from the Divar platform (an advertising company in Iran), including Exploratory Data Analysis (EDA), statistical analysis, recommender system, and price/rent prediction. The main goal is to use machine learning techniques to better understand the data and provide predictive models. πŸš€

πŸ‘₯ Contributors

  • Mahdi πŸ‘¨β€πŸ’»
  • Ramin πŸ‘¨β€πŸ’»

πŸ“‚ Dataset

The dataset used in this project is not available in the repository due to its large size (approximately 1 million records πŸ“ˆ). It consists of 64 columns, including:

  • 20 numerical columns πŸ”’
  • 44 categorical columns 🏷️

πŸ—‚οΈ Project Structure

The divar_project repository is divided into 5 main sections in src/:

  1. EDA (Exploratory Data Analysis) πŸ”: Exploratory analysis of data to understand distributions, relationships, and patterns.
  2. Statistical_analysis πŸ“Š: Advanced statistical analyses such as statistical tests and statistical modeling.
  3. recommender_system πŸ€–: Implementation of a recommender system for suggesting products or ads.
  4. prediction_price πŸ’° and prediction_rent 🏠: Predictive models for purchase price and rent. These two sections are in a shared folder.

πŸ› οΈ Technologies and Libraries Used

  • Data Processing: pandas πŸ“Š, numpy πŸ”’, scipy πŸ”
  • Visualization: matplotlib πŸ“ˆ, seaborn 🎨, plotly πŸ“Š, geopandas πŸ—ΊοΈ
  • Machine Learning: sklearn πŸ€–, scipy πŸ”¬
  • Algorithms and Models: k-means πŸ“, DBSCAN πŸ”„, LightGBM 🌟, Random Forest Regressor 🌲

πŸ“‹ Prerequisites

  • Python 3.11 🐍
  • Install required libraries via pip install -r requirements.txt (the requirements.txt file should be available in the repository).

πŸš€ How to Run

  1. Clone the repository: git clone https://github.com/username/divar_project.git πŸ“₯
  2. Navigate to the project directory: cd divar_project πŸ“
  3. Create a virtual environment (optional): python -m venv env πŸ—οΈ
  4. Install libraries: pip install -r requirements.txt πŸ“¦
  5. For each section, run the corresponding scripts (e.g., for EDA: python eda/main.py ▢️).

⚠️ Important Notes

  • The original data is not uploaded to the repository due to its size. Please download the data from the relevant source and place it in the data/ folder. πŸ’Ύ
  • For questions or collaboration, use Issues or Pull Requests. πŸ’¬

πŸ“œ License

This project is released under the MIT License. πŸ“„

About

End-to-end data science and machine learning analysis of 1M+ real estate listings from Divar, including EDA, statistical analysis, geospatial clustering, and price/rent prediction using Random Forest and LightGBM.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •