Skip to content

FelipeAvila2/ML-House-prices-Advanced-Regression

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

House Prices Advanced Regression

Overview

We tackled the Kaggle challenge 'House Prices - Advanced Regression Techniques'. It consists of predicting with Machine Learning Regression models the prices of suburbans houses in Ames, Iowa (USA).

Dataset

Dataset is given by the Kaggle competition. It contains 43 categorical and 36 numerical variables describing the characteristics of residential homes in Ames, Iowa (USA).

The data description file can be found here

Main Steps

  • Dataset exploration
  • Selection of features (for numerical and categorical variables)
  • Data cleaning
  • Feature engineering
  • Trainning + testing the model
  • Improving Predictions
  • Final testing

How to run the code

  1. Either clone the repository or download the files
  2. Install requirements (requirements.txt)
  3. Download the dataset from Kaggle
  4. Open the notebook: House-prices-Advanced-Regression/main.ipynb
  5. Run the notebook

Techniques and tools

  • Data visualization : correlation matrix, histograms, scatterplots,bars - [Matplotlib, Seaborn]
  • Features tweeking :masked variables, one hot encoding, grouping and new feature creation (Neighborhood mean prices).
  • Standardization : StandardScaler
  • PCA (principal component analysis)
  • Pycaret
  • Random Forest regressor
  • Hyperparameter tuning: gridsearch

Model

The model that we selected, after doing the Pycaret, was RandomForestRegressor.

Pycaret

image

This were the hyperparameters:

  • n_estimators=100
  • max_leaf_nodes=40
  • max_depth=10

Final score

Test/Train Score
Test 0.85
Train 0.88

About the team

Ironbuddies

Felipe de Ávila Granja Linkedin Kaggle

Luc fley My other projects

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors