In this notebook I want to show how you can calculate the win probability of a chess player given the engine evaluation of the position. My goal was to reproduce the numbers in this post. Converting an engine evaluation to a winning chance is useful because it accounts for how effectively humans convert a position compared to an engine.
Lichess calculates the winning chances using a logistic regression. The winning chance of the White player is calculated as
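For reference, a minimal Python sketch of that logistic mapping, using the coefficient Lichess publishes for its accuracy metric; treat the exact constant and the 0-to-1 scaling as assumptions here rather than the code Lichess actually runs:

```python
import math

def win_chance_white(centipawns: float) -> float:
    """Map a centipawn evaluation to White's winning chance in [0, 1].

    The coefficient below is the one Lichess publishes for its accuracy
    metric (roughly 0.00368 per centipawn); it is an assumption here.
    """
    return 0.5 + 0.5 * (2 / (1 + math.exp(-0.00368208 * centipawns)) - 1)

print(win_chance_white(0))    # 0.5  -> equal position
print(win_chance_white(100))  # ~0.59 -> slight edge for White
```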
The blogpost claims the exponent is based on real game data. So I downloaded a bunch of pro games and decided to try to reproduce this number. I analyzed over 1000 games with Stockfish and calculated an exponent of
If we plot all positions and their evaluations, we can see the following distribution: at the beginning of the game the outcome is unclear, and the longer the game goes on, the more decisive the evaluations tend to become.
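A hedged sketch of how such a plot can be produced, assuming the evaluated positions sit in a SQLite table named `positions` with `ply` and `eval_cp` columns (the file and column names are illustrative):

```python
import sqlite3
import matplotlib.pyplot as plt
import pandas as pd

# Hypothetical table layout: one row per evaluated position.
con = sqlite3.connect("positions.db")
df = pd.read_sql("SELECT ply, eval_cp FROM positions", con)

# Clip extreme (e.g. forced-mate) scores so the bulk of the data stays visible.
plt.hexbin(df["ply"], df["eval_cp"].clip(-1000, 1000), gridsize=60, bins="log")
plt.xlabel("ply")
plt.ylabel("evaluation (centipawns, clipped to ±1000)")
plt.title("Engine evaluation vs. game progress")
plt.colorbar(label="log10(count)")
plt.show()
```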
The games were parsed and each position's FEN written into a database. I then used Stockfish 15 to evaluate the positions. The search limit was set to three million nodes, which is more than the Lichess server analysis uses but still runs well on a normal PC. I also saved the Elo of both players and the move numbers.
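A condensed sketch of this pipeline, assuming python-chess and a local Stockfish binary; the file names and table layout are illustrative and not necessarily what the notebook uses:

```python
import sqlite3
import chess.engine
import chess.pgn

con = sqlite3.connect("positions.db")
con.execute(
    "CREATE TABLE IF NOT EXISTS positions "
    "(game_id INTEGER, ply INTEGER, fen TEXT, white_elo INTEGER, "
    "black_elo INTEGER, result TEXT, eval_cp INTEGER)"
)

engine = chess.engine.SimpleEngine.popen_uci("stockfish")
limit = chess.engine.Limit(nodes=3_000_000)  # three million nodes per position

with open("lichess_elite_2022-04.pgn") as pgn:
    game_id = 0
    while (game := chess.pgn.read_game(pgn)) is not None:
        headers = game.headers
        board = game.board()
        for ply, move in enumerate(game.mainline_moves(), start=1):
            board.push(move)
            info = engine.analyse(board, limit)
            # Score from White's point of view; mates mapped to a large value.
            cp = info["score"].white().score(mate_score=10_000)
            con.execute(
                "INSERT INTO positions VALUES (?, ?, ?, ?, ?, ?, ?)",
                (game_id, ply, board.fen(),
                 int(headers.get("WhiteElo", 0)), int(headers.get("BlackElo", 0)),
                 headers.get("Result", "*"), cp),
            )
        game_id += 1
        con.commit()

engine.quit()
con.close()
```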
For the next step I used scikit-learn. All samples were weighted such that each game has the same effect on the result. To fit the logistic regression I used the LogisticRegressionCV estimator from scikit-learn, which uses cross-validation to find the best hyperparameters.
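A minimal sketch of that fitting step, assuming one row per position with the centipawn evaluation as the single feature and a binary "did White win" label; draws are dropped here for simplicity, and the column names follow the earlier sketch:

```python
import sqlite3
import pandas as pd
from sklearn.linear_model import LogisticRegressionCV

con = sqlite3.connect("positions.db")
df = pd.read_sql("SELECT game_id, eval_cp, result FROM positions", con)

# Binary target: did White win? (Draws are dropped in this simplified sketch.)
df = df[df["result"].isin(["1-0", "0-1"])]
y = (df["result"] == "1-0").astype(int).to_numpy()
X = df[["eval_cp"]].to_numpy()

# Weight each position by 1 / (positions in its game) so every game
# contributes equally to the fit, regardless of its length.
weights = 1.0 / df.groupby("game_id")["game_id"].transform("size").to_numpy()

clf = LogisticRegressionCV(Cs=10, cv=5, scoring="accuracy", max_iter=1000)
clf.fit(X, y, sample_weight=weights)

print("coefficient:", clf.coef_[0][0])  # plays the role of the exponent
print("chosen C:", clf.C_[0])           # regularization picked by CV
```

The fitted coefficient on the evaluation feature corresponds to the exponent in the winning-chance formula, and `C_` reports which regularization strength the cross-validation selected.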
The results are pretty interesting. I got an exponent of
The function is the following:
Using different hyperparameters yields highly varying results. The most important hyperparameter seems to be the regularization strength. A low regularization (
Using the Elo difference of both players can increase the accuracy to
It is easy to extend this model to use more features or to predict win/draw/loss chances instead of only win/loss chances. Ply numbers and the Elo of the players are already implemented in the notebook.
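As a rough illustration of both extensions, here is a sketch that adds the Elo difference and the ply number as features and fits a multinomial logistic regression to get separate win/draw/loss probabilities (again with illustrative column names, not the notebook's exact code):

```python
import sqlite3
import pandas as pd
from sklearn.linear_model import LogisticRegression

con = sqlite3.connect("positions.db")
df = pd.read_sql(
    "SELECT game_id, ply, eval_cp, white_elo, black_elo, result FROM positions", con
)
df = df[df["result"].isin(["1-0", "0-1", "1/2-1/2"])]

# Extra features: rating difference and game progress.
df["elo_diff"] = df["white_elo"] - df["black_elo"]
X = df[["eval_cp", "elo_diff", "ply"]].to_numpy()

# Three-class target this time: 0 = Black wins, 1 = draw, 2 = White wins.
y = df["result"].map({"0-1": 0, "1/2-1/2": 1, "1-0": 2}).to_numpy()

# Same per-game weighting as before.
weights = 1.0 / df.groupby("game_id")["game_id"].transform("size").to_numpy()

clf = LogisticRegression(max_iter=1000)
clf.fit(X, y, sample_weight=weights)

# Predicted (Black win, draw, White win) chances for an equal position
# between equally rated players at ply 40.
print(clf.predict_proba([[0, 0, 40]]))
```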
I analyzed games from the Lichess Elite Database from April 2022.
Footnotes
- The parameter C is the inverse of the regularization strength. ↩