php - Glicko-2 Rating System: Bug or exploit? - Stack Overflow

IT技术

更新时间：2025-04-080

admin管理员组
文章数量:1356522

Glicko-2 is a rating system used in chess, but can be used in many other situations. Glicko-2 is an improvement on Glicko-1, which addressed problems of the older ELO rating.

What makes Glicko-2 special in parison to version 1 is that it incorporates a higher rating deviation (RD) the longer someone has been inactive. It does this with the notion of a system constant which relates to time/rating periods.

An example write up from the author is found here: .pdf.
Within this document, he explains:

The Glicko-2 system works best when the number of games in a rating period is moderate to large, say an average of at least 10-15 games per player in a rating period. The length of time for a rating period is at the discretion of the administrator.

Making an assumption that a group of active chess players play 10-15 games on average in a 1 month time period, the administrator would then update ratings at the end of every month.

I needed a PHP Implementation of the Glicko-2 rating system and came across the following:

Glicko-2 JavaScript Implementation

The JavaScript had a small error, in which didn't let it match the technical write-up example, the author found it close enough, and didn't bother to debug.

Glicko-2 PHP Implementation

The PHP implementation was plagued with many bugs, but that wasn't apparent unless you did more than one rating period (which the technical write-up never shows expected values of)

Glicko-2 Calculator in Excel

Finally the Excel calculator seemed to be error-free and the most professional, done by someone in the chess munity. Once the JavaScript bug was solved, the JavaScript and Excel Calculator matched very closely with each other (albeit not perfect, could be within rounding error)

I had fixed the bugs (and submitted issues/patches to the authors) I could find on the PHP and JavaScript versions to match as closely to the Excel Calculator

Now I am 99% confident that I have an accurate Glicko-2 implementation (between the 3 of them) for analysis and that is when I came across something strange, and the topic of this discussion.

Given the suggested default for Glicko-2 for a new player:

Rating:      1500
RD:           350
Volatility:  0.06

If you face an average opponent of rating 1378 and RD 99 (Source) only once every rating period (1 month) for the next 12 periods (1 year) you will have accumulated an assumed National Class A (1800-1999) rating of 1852 when in reality you have only beat 12 average rated players over a span of 12 months.

Month   Rating      RD      Volatility      Class
1       1625        259     0.059999        National Class B
2       1682        225     0.059998        〃
3       1718        205     0.059997        〃
6       1784        174     0.059994        〃
12      1852        148     0.059988        National Class A
24      1922        127     0.059976        〃

If you face 2 average opponents every rating period, you can get to National Class A about 4-5 months, facing only 8-10 average opponents.

Month   Rating      RD      Volatility      Class
1       1672        215     0.059999        National Class B
2       1733        183     0.059997        〃
3       1770        166     0.059995        〃
4       1797        154     0.059993        〃
5       1819        146     0.059992        National Class A
6       1836        140     0.059991        〃

Are these assumptions accurate? Is there a bug in my calculator?

If it is not a bug, what are some ways of countering this besides:

Consider "true rating" to be lower bound of the deviation (Rating - RD)
Do not show inactive user's rating
Do not show users with less than N games

Glicko-2 is a rating system used in chess, but can be used in many other situations. Glicko-2 is an improvement on Glicko-1, which addressed problems of the older ELO rating.

What makes Glicko-2 special in parison to version 1 is that it incorporates a higher rating deviation (RD) the longer someone has been inactive. It does this with the notion of a system constant which relates to time/rating periods.

An example write up from the author is found here: http://www.glicko/glicko/glicko2.pdf.
Within this document, he explains:

The Glicko-2 system works best when the number of games in a rating period is moderate to large, say an average of at least 10-15 games per player in a rating period. The length of time for a rating period is at the discretion of the administrator.

Making an assumption that a group of active chess players play 10-15 games on average in a 1 month time period, the administrator would then update ratings at the end of every month.

I needed a PHP Implementation of the Glicko-2 rating system and came across the following:

Glicko-2 JavaScript Implementation

The JavaScript had a small error, in which didn't let it match the technical write-up example, the author found it close enough, and didn't bother to debug.

Glicko-2 PHP Implementation

The PHP implementation was plagued with many bugs, but that wasn't apparent unless you did more than one rating period (which the technical write-up never shows expected values of)

Glicko-2 Calculator in Excel

Finally the Excel calculator seemed to be error-free and the most professional, done by someone in the chess munity. Once the JavaScript bug was solved, the JavaScript and Excel Calculator matched very closely with each other (albeit not perfect, could be within rounding error)

I had fixed the bugs (and submitted issues/patches to the authors) I could find on the PHP and JavaScript versions to match as closely to the Excel Calculator

Now I am 99% confident that I have an accurate Glicko-2 implementation (between the 3 of them) for analysis and that is when I came across something strange, and the topic of this discussion.

Given the suggested default for Glicko-2 for a new player:

Rating:      1500
RD:           350
Volatility:  0.06

If you face an average opponent of rating 1378 and RD 99 (Source) only once every rating period (1 month) for the next 12 periods (1 year) you will have accumulated an assumed National Class A (1800-1999) rating of 1852 when in reality you have only beat 12 average rated players over a span of 12 months.

Month   Rating      RD      Volatility      Class
1       1625        259     0.059999        National Class B
2       1682        225     0.059998        〃
3       1718        205     0.059997        〃
6       1784        174     0.059994        〃
12      1852        148     0.059988        National Class A
24      1922        127     0.059976        〃

If you face 2 average opponents every rating period, you can get to National Class A about 4-5 months, facing only 8-10 average opponents.

Month   Rating      RD      Volatility      Class
1       1672        215     0.059999        National Class B
2       1733        183     0.059997        〃
3       1770        166     0.059995        〃
4       1797        154     0.059993        〃
5       1819        146     0.059992        National Class A
6       1836        140     0.059991        〃

Are these assumptions accurate? Is there a bug in my calculator?

If it is not a bug, what are some ways of countering this besides:

Consider "true rating" to be lower bound of the deviation (Rating - RD)
Do not show inactive user's rating
Do not show users with less than N games

Share Improve this question edited Aug 21, 2012 at 12:13 asked Aug 21, 2012 at 12:08 ParoX 5,94125 gold badges86 silver badges155 bronze badges

As you are not asking an actual programming question, this would be better at math.stackexchange. – BlueRaja - Danny Pflughoeft Commented Aug 21, 2012 at 14:06
It's possible that this is really a bug. – ParoX Commented Aug 21, 2012 at 15:09
In which case, you could give us the expected oute, and we might be able to help track down the bug. But determining if it really is a bug still involves no programming, only math, and thus is a better fit for that site. – BlueRaja - Danny Pflughoeft Commented Aug 21, 2012 at 15:10

Add a ment |

1 Answer 1

Sorted by: Reset to default 9

It may seem counter-intuitive but this is actually a correct result. If you continuously play average players, but you always win, regardless of the time periods, you're demonstrating you have a high ranking (not an average ranking even though your opponents are average). A player who is average (has a 'true' average rank), playing opponents of exactly the same 'true' rank (average) should win and lose about 50% of the time. A player with a 'true' rank that is very high, will win a larger percentage of the time when playing average players which depends on just how far apart their ranks are, but lets say it's a high enough rank that they should win 90% of the time. That means for ever 10 games played against an average player, this highly ranked player should lose 1 of them.

What you've effectively modeled is a player that has a rank high enough to win every single game against an average player (more than 12 or 24 games without a loss) which means their score will continue to go up unbounded if they continue to win, because they've never lost. Their demonstrating an ability that (until a loss happens) should have a rank separation large enough to approach an expected win ratio of 100%.

本文标签： phpGlicko2 Rating System Bug or exploitStack Overflow

版权声明：本文标题：php - Glicko-2 Rating System: Bug or exploit? - Stack Overflow 内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.betaflare.com/web/1744068518a2585457.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

编程频道|软件玩家 - 软件改变生活！

php - Glicko-2 Rating System: Bug or exploit? - Stack Overflow

1 Answer 1

更多相关文章

php - Glicko-2 Rating System: Bug or exploit? - Stack Overflow

发表评论

推荐文章

javascript - How to manually add an item into datasource in Kendo UI Combobox - Stack Overflow

笔记本Win10没有WLAN选项的解决方法

javascript - Raphael.js bar chart with tutorial - Stack Overflow

javascript - How to update a ref with a signal in Solid.js to control table scroll? - Stack Overflow

javascript - connect to walletconnect with web3modal - Stack Overflow

热门文章

javascript - Set and Get array from cookies with js-cookie and JSON.parse - Stack Overflow

javascript - How to use source map to find minification error - Stack Overflow

datetime - Make AutoHotKey produce uppercase text in timestamp - Stack Overflow

visual c++ - how to return a struct from V8 C++ function to javascript module - Stack Overflow

javascript - Update in MySQL from Node.js - How to tell if zero rows are effected? - Stack Overflow

typescript - Why is assignment in while statement discouraged in Javascript? - Stack Overflow

How to create a pure JavaScript project using Maven? - Stack Overflow

javascript - Working with a message having repeated field - Stack Overflow

win10控制面板快捷键_Windows Update在哪 Win10自动更新关闭方法【详解】

javascript - Zend Framework CSSJS minifier-obfuscator? - Stack Overflow

最新文章

WIN11，如何同时连接有线网络与WLAN无线网络

安可信esp01wifi模块使用（超级坑）

解决windows中安装VMware后宿主机wifi网卡无法正常使用的问题

使用手机连接树莓派（无需电脑，只需要一台手机）

电脑有网，浏览器连不上网，其他应用却能用

javascript - Scroll to anchor with fixed header, content hidden behind header, margin and top padding not working - Stack Overfl

single page application - Urls to assets should be subpath of base path of vite proxy - Stack Overflow

javascript - How to change the text of the label box dynamically in jsp page? - Stack Overflow

javascript - How do I check if a FormData file is empty? - Stack Overflow

How to Make a Flutter App Responsive for Different Mobile Screen Sizes? - Stack Overflow

惠普OMEN 15-CE001TX 2EF91PA参数报价

苹果新款MacBook Pro 15英寸 i732GB1TBVega Pro 20参数报价

联想Y330A-PSE L参数报价

神舟战神Z7 D6 i7-12650H16GB512GBRTX4050旗舰版参数报价

神舟战神Z7 D6 i7-12650H16GB1TBRTX4050参数报价