USA Methodology

The Math

How are win probabilities calculated?

Given two teams, H and A, we use each team's Elo rating entering the game. Let \( E_H \) and \( E_A \) represent the Elo ratings for the home and away teams, respectively. If the game is not at a neutral site, we add a league-wide and season-specific value of home field advantage, \( \text{HFA} \), to the home team's Elo rating.

The probability that the home team wins is calculated using the following logistic formula:

\[P(H) = \frac{1}{1 + 10^{(E_A - (E_H + \text{HFA}))/400}}\]

The probability of the away team winning is simply: \[P(A) = 1 - P(H)\]

In games played at a neutral site, we do not apply home field advantage to either team. In these cases, win probabilities are based solely on the teams' Elo ratings.
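
As an illustration, the win probability formula can be sketched in Python. The function name, argument names, and the sample ratings below are our own choices for this example and are not taken from the production model.

```python
def home_win_probability(elo_home: float, elo_away: float,
                         hfa: float = 0.0, neutral_site: bool = False) -> float:
    """Probability that the home team wins, using the logistic Elo formula."""
    # At a neutral site, no home field advantage is applied to either team.
    boost = 0.0 if neutral_site else hfa
    return 1.0 / (1.0 + 10 ** ((elo_away - (elo_home + boost)) / 400))


# Hypothetical ratings and home field advantage, for illustration only.
p_home = home_win_probability(1550, 1500, hfa=35)
p_away = 1.0 - p_home
```

Two evenly rated teams at a neutral site each come out at exactly 0.5, which is a quick sanity check on any implementation.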

How is home field advantage calculated?

We estimate home field advantage by assuming that, over the course of a full season, the Elo ratings of home and away teams are equal on average. Under this assumption, any observed edge for the home team can be attributed to home field advantage alone. To estimate the value of home field advantage, we calculate the actual percentage of games won by the home team, let the home and away Elo ratings cancel in the above equation, and solve for \( \text{HFA} \).

For example, if the home team won 55% of games, we would solve:

\[ 0.55 = \frac{1}{1 + 10^{-\text{HFA}/400}} \]

to get \( \text{HFA} = 400 \log_{10}(0.55 / 0.45) \approx 34.9 \). This means that the average home team plays about 34.9 Elo points stronger at home than it would at a neutral site.
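
Rearranging the equation gives a closed form, \( \text{HFA} = 400 \log_{10}\big(p / (1 - p)\big) \), where \( p \) is the observed home win rate. A minimal Python sketch of that rearrangement follows; the function name is hypothetical.

```python
import math


def estimate_hfa(home_win_rate: float) -> float:
    """Solve p = 1 / (1 + 10 ** (-HFA / 400)) for HFA."""
    if not 0.0 < home_win_rate < 1.0:
        raise ValueError("home_win_rate must be strictly between 0 and 1")
    return 400 * math.log10(home_win_rate / (1.0 - home_win_rate))


# Home teams winning 55% of games, as in the example above.
hfa = estimate_hfa(0.55)
```

A home win rate of exactly 50% yields a home field advantage of zero, and a rate below 50% yields a negative value.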

How are Elo ratings adjusted after each game?

The change in a team's Elo rating after a game depends on three components: the weighting factor, the scaling factor, and the game result.

Weighting Factor

The weighting factor, denoted by \( k \), dictates how quickly Elo ratings change relative to the importance of the game. The larger the value of \( k \), the quicker the ratings change. Leagues with shorter seasons (like the NFL) will have larger weighting factors since each individual game carries more weight and there are fewer opportunities for the ratings to self-correct. Leagues with longer seasons (such as MLB) will have smaller weighting factors to avoid overreacting to extreme results. Each league and season has its own weighting factor, calibrated based on the length of the season and the parity of the league.

Scaling Factor

The scaling factor, denoted \( f(MoV) \), is a function that dictates how quickly Elo ratings change relative to the margin of victory. Larger margins of victory lead to larger Elo adjustments, while closer games result in smaller changes. The function used varies by league, but it is common to use the natural log and square root transformations to avoid inflating the Elo ratings in blowouts. For tie games, a small positive value (less than 1) is used as the margin of victory to ensure the ratings shift slightly.

Game Result

The game result is measured as the difference between the actual result of the game and the model's pre-game probability that the home team wins. The actual result is 1 if the home team won, 0 if the away team won, and 0.5 if the game ended in a tie.

Putting It All Together

The Elo rating change for the home team is calculated as:

\[ \Delta_H = k \times f(MoV) \times (\text{Home Result} - \text{Home Win Prob}) \]

The away team's Elo rating is adjusted by the same amount in the opposite direction.
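
The update can be sketched in Python as follows. The scaling function used here, the natural log of the margin plus one, is an assumption made for this example; the actual function varies by league. The function names are likewise hypothetical.

```python
import math


def mov_scale(margin: float) -> float:
    """Hypothetical scaling factor f(MoV): natural log of (margin + 1)."""
    return math.log(margin + 1.0)


def update_elo(elo_home: float, elo_away: float, home_result: float,
               home_win_prob: float, k: float, margin: float):
    """Return the (home, away) Elo ratings after one game.

    home_result is 1 for a home win, 0 for an away win, 0.5 for a tie.
    For a tie, pass a small positive margin below 1, as described above.
    """
    delta_home = k * mov_scale(margin) * (home_result - home_win_prob)
    # The away team moves by the same amount in the opposite direction.
    return elo_home + delta_home, elo_away - delta_home


# Hypothetical game: evenly matched teams, home side wins by 3.
new_home, new_away = update_elo(1500, 1500, home_result=1.0,
                                home_win_prob=0.5, k=20, margin=3)
```

Because the two adjustments are equal and opposite, the sum of the two ratings is unchanged by the update.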

How are tie probabilities calculated?

This was the most complex and sensitive part of the modeling process. Intuitively, we expect ties to be most likely when teams are evenly matched and to become less likely as the difference in the Elo ratings increases.

As in the win probability calculation, we begin by calculating the difference in Elo ratings between the two teams (including the adjustment for home field advantage). We then normalize this difference by a scaling denominator that captures how tie frequency behaves across leagues and seasons. Specifically, the Elo difference is divided by a function of the following:

Weighting factor \( k \)
Standard deviation \( \sigma \) of the proportion of games ending in a tie
Slope from a regression of seasonal tie proportions on \( k \)
Number of points a win is worth in the standings (2 for hockey, 3 for soccer)

Denote the result of this calculation as \( D \). From there, we model the probability of the game ending in a tie with a normal density centered at 0, evaluated at \( D \):

\[ P(T) = \frac{1}{\sigma \sqrt{2\pi}} e^{-\frac{1}{2} D^2} \]
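
A short Python sketch of this last step is shown below. The exact form of the scaling denominator is not spelled out here, so the sketch takes it as a precomputed input named `scale`; that parameter, like the function name, is a placeholder of our own.

```python
import math


def tie_probability(elo_home: float, elo_away: float, hfa: float,
                    scale: float, sigma: float) -> float:
    """Evaluate the tie formula above for one game.

    `scale` stands in for the scaling denominator (a function of k, sigma,
    the regression slope, and points per win); it is assumed to be supplied.
    """
    # Normalized Elo difference, including home field advantage.
    d = ((elo_home + hfa) - elo_away) / scale
    return math.exp(-0.5 * d * d) / (sigma * math.sqrt(2.0 * math.pi))
```

The value peaks when the adjusted ratings are equal (\( D = 0 \)) and falls off symmetrically as either team becomes the stronger side, which matches the intuition described earlier.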

How do you know the probabilities are accurate?

We tested the accuracy of our predicted win and tie probabilities using a chi-squared goodness of fit test, which checked whether the outcomes of the games were consistent with the probabilities assigned by the model.

For each season, we divided the range of predicted probabilities into 100 equal-width bins, each spanning 0.01. Every game was assigned to a bin based on the model's predicted win probability for the home team. For example, a game with a win probability of 0.743 was placed in the 0.74-0.75 bin.

Within each bin, we counted:

Observed wins: How many times the home team actually won
Expected wins: Total number of games in the bin multiplied by the average predicted win probability for games in the bin

From here, we calculated the chi-squared statistic to compare the observed and expected win counts across all bins to ensure that the win probabilities derived by our model were in accordance with how the games actually played out. This process was repeated for tie probabilities where applicable.
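
The binning and the statistic can be sketched in Python as follows. This is a simplified version under stated assumptions: it sums \( (O - E)^2 / E \) over non-empty bins using win counts only, and the function name and return shape are our own.

```python
from collections import defaultdict


def chi_squared_calibration(predictions, outcomes, n_bins=100):
    """Return (chi-squared statistic, number of non-empty bins).

    predictions: predicted home win probabilities in [0, 1]
    outcomes:    1 if the home team won, 0 otherwise
    """
    games = defaultdict(int)       # games per bin
    observed = defaultdict(float)  # observed home wins per bin
    prob_sum = defaultdict(float)  # sum of predicted probabilities per bin
    for p, won in zip(predictions, outcomes):
        # Equal-width bins; p = 1.0 goes in the last bin. Floating-point
        # rounding can shift a value that sits exactly on a bin edge.
        b = min(int(p * n_bins), n_bins - 1)
        games[b] += 1
        observed[b] += won
        prob_sum[b] += p
    stat = 0.0
    for b in games:
        # Games in the bin times the mean predicted probability in the bin.
        expected = prob_sum[b]
        if expected > 0:
            stat += (observed[b] - expected) ** 2 / expected
    return stat, len(games)
```

A p-value can then be read from a chi-squared distribution, for example with `scipy.stats.chi2.sf(stat, dof)`; the appropriate degrees of freedom depend on how many bins are populated and are left to the reader here.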

The results show that the model is very well calibrated:

In leagues without ties, the p-value for the goodness of fit test exceeded 0.99 in every season, indicating that the actual game outcomes closely matched the predicted win probabilities.
In leagues with ties, the goodness of fit test returned a p-value above 0.90 for both win and tie probabilities in each season, again indicating excellent fit.

In other words, the goodness of fit tests found no evidence of miscalibration in any season, which supports our view that the model's win and tie probabilities reflect reality.

How are the probabilities for the current season projections handled?

Season projections are handled much like the historical simulations, with two key exceptions.

First, early-season results are volatile, and it takes time to identify league-wide trends. For the first 25% of the current season, we use the same weighting factor and home field advantage from the previous season. After this point, we have enough data to estimate both values more reliably based on how the current season is unfolding.

Second, because new games are played every day, we constantly get new information about the league. Thus, both the weighting factor and home field advantage are recalculated daily as the season progresses. This means that Elo ratings from earlier in the season are likely to shift as more games and information become available. However, once the first quarter of the season is complete, these shifts tend to be small and have minimal impact on the long-term projections.
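
The two exceptions amount to a simple selection rule, sketched below in Python. The function and argument names are hypothetical, and the daily re-estimation of the current-season values is assumed to happen elsewhere.

```python
def season_parameters(games_played: int, games_in_season: int,
                      previous: tuple, current_estimate: tuple) -> tuple:
    """Choose the (weighting factor, home field advantage) pair to use today.

    previous:         values carried over from last season
    current_estimate: values re-estimated today from this season's games
    """
    # During the first 25% of the season, fall back on last season's values.
    if games_played < 0.25 * games_in_season:
        return previous
    return current_estimate


# Hypothetical values: 10 of 100 games played, so last season's pair is used.
k, hfa = season_parameters(10, 100, previous=(20, 35), current_estimate=(22, 30))
```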