CSCK503 Machine Learning in Practice

Emissions Model Project

Overview

This project aims to develop machine learning models to predict CO2 emissions and categorize boroughs Low, Medium, or High Emission Areas as well as into emission intensity clusters based on road characteristics and vehicle types.

Data

The project utilizes a dataset containing information on road characteristics and pollution caused by different types of vehicles. The dataset includes features such as borough name, road length, type of pollutant emitted, and the amount of pollution caused by petrol, diesel, and electric vehicles.

Exploratory Data Analysis (EDA)

Exploratory data analysis was conducted to understand the characteristics of the dataset.

Model Building

Regression Model

A linear regression model was trained to predict CO2 emissions based on road length and pollution caused by different types of vehicles.

Classification Model

A random forest classifier was trained to categorize boroughs into Low, Medium, or High Emission Areas based on their emissions profile.

Clustering Model

A K-Means clustering model was trained to group boroughs into clusters based on pollution caused by different types of vehicles.

Model Evaluation

The performance of each model was assessed using appropriate evaluation metrics.

Conclusion

The machine learning models developed in this project provide valuable insights into CO2 emissions patterns and help categorise boroughs based on emission intensity. Further improvements and refinements to the models could be explored in future work.

References

Scikit-learn Documentation - Documentation for scikit-learn library.
Pandas Documentation - Documentation for Pandas library.

Contributors

[Ibrahim Amr]
[Alexandros Arcudis]
[Chantal Maskell]
[Aleksander Palamarczuk]

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.gitignore		.gitignore
README.md		README.md
emissions_data_cleanup.ipynb		emissions_data_cleanup.ipynb
emissions_model_training_k_mean_clustering.ipynb		emissions_model_training_k_mean_clustering.ipynb
emissions_model_training_linear_regression.ipynb		emissions_model_training_linear_regression.ipynb
emissions_model_training_random_forest.ipynb		emissions_model_training_random_forest.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CSCK503 Machine Learning in Practice

Emissions Model Project

Overview

Data

Exploratory Data Analysis (EDA)

Model Building

Regression Model

Classification Model

Clustering Model

Model Evaluation

Conclusion

References

Contributors

About

Uh oh!

Releases

Packages

Languages

chantalmaskell/ML_Predict_Co2_Emissions

Folders and files

Latest commit

History

Repository files navigation

CSCK503 Machine Learning in Practice

Emissions Model Project

Overview

Data

Exploratory Data Analysis (EDA)

Model Building

Regression Model

Classification Model

Clustering Model

Model Evaluation

Conclusion

References

Contributors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages