Skip to content

An exploratory analysis on all fighters in the UFC (Ultimate Fighting Championship) and their respective matches.

Notifications You must be signed in to change notification settings

estgarci/UFC-Data-Analysis

Repository files navigation

28 Years of UFC History

Extraction, preparation, and analysis of the Ultimate Fighting Championship historical data.

Purpose

The purpose of this project was to use python for data mining and analysis. The main objective was to perform an exploratory analysis on a historical data-set.

Central questions

1. What's the standard deviation of a fighter’s height in each weight division?
2. How does height distribution look like in each weight division?

Data extraction

I built a web-scraping Python script that downloads public data from www.ufcstats.com. The raw dataset contains a historical roster of fighters in the UFC, from the year 1993 to present.

Data preparation

Built a script using Python that predicts a fighter's missing value based on their name. Used historical names from the U.S national database www.datagov.org to determine if a fighter belongs to the female or male division based on the relative proportion of males/females. The classifyer attained 96% precision and 70% recall.

Data Analysis

A detailed explanation of the analysis can be found in this project's Python jupyter-notebook.

About

An exploratory analysis on all fighters in the UFC (Ultimate Fighting Championship) and their respective matches.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published