Final Project - Coffee Quality

Background & Objective

I am a big coffee fan and asked myself whether it is possible to predict the quality of a coffee from a set of given parameters. I found a dataset from the Coffee Quality Institute and used it to try out different regressors and answer that question. For the modeling part I used Python, with the column "cupper points" as the target. The visualization part was done mostly in Tableau.

Dataset

In this project, I used the **Coffee Quality** dataset provided on Kaggle.

Data: The dataset contains information on 1339 different coffees that the Coffee Quality Institute tested and published in January 2018. The columns include, for example:

Quality Measures

Aroma
Flavor
Aftertaste
Acidity
Body
Balance
Uniformity
Cup Cleanliness
Sweetness
Moisture
Defects

Bean Metadata

Processing Method
Color
Species (arabica / robusta)

Farm Metadata

Owner
Country of Origin
Farm Name
Lot Number
Mill
Company
Altitude
Region

Workflow

After importing all necessary libraries and loading the data from SQL, I did some data exploration. Once I had a feel for the data, I cleaned it: dropping duplicates (there were none), renaming columns, dropping unnecessary columns, dealing with null values, reducing the number of unique values in columns that had too many, adjusting outliers in numerical columns, etc. After transforming the numerical columns with StandardScaler and Normalizer, I tried different regressors:

Linear Regression
Decision Tree Regressor
KNN Regressor
Random Forest Regressor

I ran each of them on the whole dataset (categorical & numerical) and on the numerical columns only (standard scaled / normalized).
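
The comparison described above can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the code from the notebook: the data is synthetic (random numbers standing in for the numerical columns), and the split ratio, random seeds and hyperparameters are assumptions chosen only to make the example run.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.linear_model import LinearRegression
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsRegressor
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import Normalizer, StandardScaler
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in for the numerical columns (NOT the real coffee data).
rng = np.random.default_rng(42)
X = rng.normal(loc=7.5, scale=0.4, size=(500, 5))
y = X @ np.array([0.3, 0.3, 0.2, 0.1, 0.1]) + rng.normal(scale=0.1, size=500)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42
)

# StandardScaler standardizes each column (feature);
# Normalizer rescales each row (sample) to unit norm.
scalers = {"standard": StandardScaler, "normalizer": Normalizer}
regressors = {
    "linear": lambda: LinearRegression(),
    "tree": lambda: DecisionTreeRegressor(random_state=42),
    "knn": lambda: KNeighborsRegressor(),
    "forest": lambda: RandomForestRegressor(n_estimators=50, random_state=42),
}

scores = {}
for scaler_name, make_scaler in scalers.items():
    for reg_name, make_regressor in regressors.items():
        # Fit the scaler on the training data only, then the regressor.
        model = make_pipeline(make_scaler(), make_regressor())
        model.fit(X_train, y_train)
        # score() returns the R2 score on the held-out test data.
        scores[(scaler_name, reg_name)] = model.score(X_test, y_test)

# Print all combinations, best R2 first.
for (scaler_name, reg_name), r2 in sorted(
    scores.items(), key=lambda item: item[1], reverse=True
):
    print(f"{scaler_name:>10} + {reg_name:<6} R2 = {r2:.4f}")
```

Wrapping the scaler and the regressor in one pipeline keeps the scaler from seeing the test data. The ranking printed for this synthetic data says nothing about the ranking on the coffee dataset.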

The last step was clustering the data and visualizing it in Tableau.

Conclusion

The data transformed with the StandardScaler performed much better than the normalized data. The best regressor was the Random Forest Regressor on the numerical-only dataset. But its R2 score is still only 0.6829, so the quality of a coffee cannot be fully predicted from these parameters.
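
For readers unfamiliar with the metric: the R2 score (coefficient of determination) is 1 minus the ratio of the squared prediction errors to the squared deviations from the mean, so a value of 0.6829 means the model accounts for roughly 68% of the variance in the target on the evaluated data. A minimal sketch of the calculation, with made-up scores that are not taken from the dataset:

```python
def r2_score(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = sum(y_true) / len(y_true)
    # Squared errors of the predictions.
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    # Squared deviations of the true values from their mean.
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


# Hypothetical cupper-point scores, made up for illustration only.
actual = [7.0, 7.5, 8.0, 8.5]
predicted = [7.2, 7.4, 8.1, 8.2]

print(round(r2_score(actual, predicted), 4))
```

A model that always predicts the mean of the true values scores 0, and a perfect model scores 1.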
