Creation of two metrics to analyze team performances in the UEFA Champions League from the 2020-21 to the 2024-25 season, using data from Football Reference.
There are four steps to create these two metrics, as follows:
We select the match statistics that are relevant for measuring the quality of a team's performance in a football match. We chose the following statistics, which come from the Football Reference website:
npxG E (Non-Penalty Expected Goals), SoT (Number of Shots on Target), xAG E (Expected Assisted Goals), SCA (Number of Shot-Creating Actions), GCA (Number of Goal-Creating Actions), PrgP (Number of Progressive Passes), PrgC (Number of Progressive Carries), STO (Number of Successful Take-Ons), Tkl (Number of Tackles), Int (Number of Interceptions), Blocks (Number of Blocks).
For each statistic, we compute the empirical cumulative distribution function (ECDF) over the values of that statistic from all UEFA Champions League team performances between the 2020-21 and 2024-25 seasons. We multiply the resulting value by 100 to obtain a more readable score between 0 and 100.
Here is the formula that we use for each statistic:

$$\text{Score}_{s, t, m} = \frac{100}{n} \sum_{i=1}^{n} \mathbf{1}\{x_{s, i} \leq v_{s, t, m}\}$$

Where:
- $v_{s, t, m}$ is the value of the statistic $s$ for team $t$ in match $m$.
- $x_{s, i}$ is the value of the statistic $s$ for the team performance $i$.
- $n$ is the number of team performances from the 2020-21 to 2024-25 Champions League seasons.
- $\mathbf{1}\{x_{s, i} \leq v_{s, t, m}\}$ is an indicator function that equals 1 if $x_{s, i} \leq v_{s, t, m}$ and 0 otherwise.
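
As an illustration, the ECDF score described above can be sketched in a few lines of Python. This is a minimal sketch and not necessarily the implementation used in this project: the function name `ecdf_score` and the sample values are hypothetical, chosen only to show the computation.

```python
def ecdf_score(reference_values, value):
    """Return 100 * ECDF(value): the percentage of reference values <= value.

    reference_values: values of one statistic over all team performances
                      (the x_{s, i} in the definitions above).
    value:            the value of that statistic for one team in one match
                      (the v_{s, t, m} in the definitions above).
    """
    values = list(reference_values)
    if not values:
        raise ValueError("reference_values must not be empty")
    # Indicator function: count the performances with x_{s, i} <= v_{s, t, m}
    count = sum(1 for x in values if x <= value)
    return 100.0 * count / len(values)


# Hypothetical shots-on-target values for five team performances
sot_values = [2, 4, 4, 6, 9]
print(ecdf_score(sot_values, 4))  # 60.0, since 3 of the 5 values are <= 4
```

With this definition, a score of 60 means that the performance is at least as good as 60% of all performances in the reference set for that statistic. Ties count in favor of the evaluated performance because the comparison is `<=`.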
For all the statistics presented in the first step, we apply the
When the