Speed_Dating_DataAnalysis

Analyzing the data in Speed_date_data Using data from kaggle. https://www.kaggle.com/datasets/mexwell/speed-dating/data

Using cleaned_speed_data by performing a cleaning data process on the data in advance

Code that changes the parameters of the classification model and finds the best combination using various Scaling and Encoding methods. We use 2 Encoding method(One-hot Encoding / Label Encoding), 2 Scaler (MinMax / Robust), 2 Parameter of classifier model(k = 3, 5) so the number of total case is 8.

Instructions

Files exist for cases 1 through 8. When each file is executed, it outputs a data inspection process and proceeds with the preprocessing process. At this time, each file has a different method of scaling and encoding. In the Kn classifier process, the accuracy can be obtained by converting the k value.

Knn allows us to predict about new data. We also do the k-means cluster process The evaluation for each model proceeds at the end.

Data Information

gender(integer) : Gender of the person evaluated. female is 0 / male is 1 age(integer): age of the person evaluated income(float): income of the person evaluated career(object): carrer of the person evaluated dec(int): whether this individual was a match (rater perspective) attr(float): attractiveness of the person evaluated by rater sinc(float): sincerity of the person evaluated by rater intel(float): intelligence of the person evaluated by rater fun(float): fun of the person evaluated by rater amb(float): ambitiousness of the person evaluated by rater shar(float): degree of shared interest of the person evaluated by rater like(float): overall rating prob(float): whether the rater believed that interest would be reciprocated met(int):whether the two had met prior to the speed date -> Because We cleaning data already, most data types change from integers to floats.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
Classification_KNN.py		Classification_KNN.py
DataInspection_Exploration.py		DataInspection_Exploration.py
LabelEncoding_MinMax.py		LabelEncoding_MinMax.py
LabelEncoding_Robust.py		LabelEncoding_Robust.py
OneHotEncoding_MinMax.py		OneHotEncoding_MinMax.py
OneHotEncoding_Robust.py		OneHotEncoding_Robust.py
README.md		README.md
case1.py		case1.py
case2.py		case2.py
case3.py		case3.py
case4.py		case4.py
case5.py		case5.py
case6.py		case6.py
case7.py		case7.py
case8.py		case8.py
cleaned_speed_data.csv		cleaned_speed_data.csv
evaluation_analysis.py		evaluation_analysis.py
evaluation_analysis_for_Label.py		evaluation_analysis_for_Label.py
full_clustering.py		full_clustering.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Speed_Dating_DataAnalysis

Instructions

Data Information

About

Uh oh!

Releases

Packages

Languages

chanudevelop/Speed_Dating_DataAnalysis

Folders and files

Latest commit

History

Repository files navigation

Speed_Dating_DataAnalysis

Instructions

Data Information

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages