CSC466FinalProject

Aditi Gajjar, Anagha Sikha, Othilia Norell, Soren Paetau, Nicholas Tan
agajjar@calpoly.edu / arsikha@calpoly.edu / onorell@calpoly.edu / spaetau@calpoly.edu / nktan@calpoly.edu

HOW TO RUN CODE

Q1 (Random Forest on Feature Importance):

Run python3 randomForest.py. To run the code on a different subset of attributes (all, without the best attributes, or without the worst attributes), update line 34 in randomForest.py; instructions are in that file.

Q2 (Apriori Rules): All code is contained within a Jupyter notebook.

Q3 (Clustering):

For KMeans, run python3 kmeans.py spotify_songs.csv [-p] [-m] [-n] [-t]. The optional flags are -p (kmeans_plus), -m (Manhattan distance), -n (normalize data), and -t (testing).
For DBSCAN, run python3 dbscan.py spotify_songs.csv [-m], where -m selects Manhattan distance.
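
The optional flags can be combined freely, so scripting the runs may be easier than typing each variant. Below is a minimal sketch that assembles the command lines; the helper names `kmeans_command` and `dbscan_command` are hypothetical and not part of the project, and only the flags listed in this README are used.

```python
# Hypothetical helpers (not part of the project) that assemble the
# clustering command lines from the flags documented in this README.

def kmeans_command(csv_path, plus=False, manhattan=False,
                   normalize=False, testing=False):
    """Return the argv list for a kmeans.py run with the chosen flags."""
    cmd = ["python3", "kmeans.py", csv_path]
    flags = [
        ("-p", plus),       # kmeans_plus
        ("-m", manhattan),  # Manhattan distance
        ("-n", normalize),  # normalize data
        ("-t", testing),    # testing
    ]
    cmd += [flag for flag, enabled in flags if enabled]
    return cmd


def dbscan_command(csv_path, manhattan=False):
    """Return the argv list for a dbscan.py run."""
    cmd = ["python3", "dbscan.py", csv_path]
    if manhattan:
        cmd.append("-m")  # Manhattan distance
    return cmd


if __name__ == "__main__":
    # Print the command; pass the list to subprocess.run(...) to execute it.
    print(" ".join(kmeans_command("spotify_songs.csv", plus=True, normalize=True)))
```

Each returned list can be passed directly to `subprocess.run` from the folder that contains the clustering scripts.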

Q4 (Decision Tree vs Random Forest):

To run C4.5, run python3 InduceC45.py spotify_new_train.csv in the terminal.
To classify the test dataset with the decision tree from InduceC45.py, run python3 classify.py spotify_new_test.csv spotify_new_train.json
To run Random Forests, run python3 randomForest.py spotify_songs_new.csv (example: python3 randomForest.py spotify_songs_new.csv 9 600 50)
Results of the random forest classification using all 9 attributes are in spotify_songs_new_rf_all.txt.
Results from hyperparameter tuning of the random forest classification, using 4 attributes, are in spotify_songs_new_rf.txt.
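
The Q4 commands above are meant to run in order, so a small driver script can help. The sketch below is hypothetical and not part of the project. It assumes that the scripts are run from the Q4 folder and that InduceC45.py produces the spotify_new_train.json file that classify.py reads. By default it only prints the commands.

```python
import shlex
import subprocess

# The Q4 commands exactly as listed in this README, in run order.
Q4_STEPS = [
    "python3 InduceC45.py spotify_new_train.csv",
    "python3 classify.py spotify_new_test.csv spotify_new_train.json",
    "python3 randomForest.py spotify_songs_new.csv 9 600 50",
]


def q4_argv():
    """Split each command string into an argv list."""
    return [shlex.split(step) for step in Q4_STEPS]


def run_q4(dry_run=True):
    """Print each step; execute it only when dry_run is False."""
    steps = q4_argv()
    for argv in steps:
        print(" ".join(argv))
        if not dry_run:
            # check=True stops the sequence if a step fails.
            subprocess.run(argv, check=True)
    return steps


if __name__ == "__main__":
    run_q4(dry_run=True)
```

Calling `run_q4(dry_run=False)` executes the steps and stops at the first failure.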

Q5 (Collaborative Filtering):

See the README in CollabFiltering. Use the -sp flag with the test cases given.

