@SelfmateMan:
Nope, I explicitly used the words used in the title. This is from an article he wrote 9 years later: en.chessbase.com/post/the-elo-rating-system-correcting-the-expectancy-tables
A quote from the article:
"I want to call attention to two main things here. You can see that in the middle region of the graph (for instance within the red box), the white line is steeper than the black trend. This means that rating favorites are not scoring as well as their ratings say they should. Also, look out at the edges, for instance within the blue box. You can see that due to the "400-point rule", the expected score (for the heavy rating favorites) levels off at 92%, but in reality the heavy favorites are scoring much higher than 92%."
So yes, there are parts of the curve where higher-rated players score worse than the model predicts, but there's also a point where they score higher.
The portion where the higher-rated score worse is when the difference is 100-400 points, while above that there's a flip and higher rated players score much higher than the model predicts.
As a reminder, we're talking about a >500 point difference in this lichess case.
Now, this is all consistent with his earlier paper, of course.
Games where the difference is 100-400 points are much more common than games with the very large rating differences, so as a criticism of Elo, that's a fair point (and you'll notice all his graphs in the original paper cut off before we get to the wonderland of >400 rating point differences, and the big focus was on 0-200).
It's just not applicable here, because we're talking about the portion of the curve where the model is mistaken in the other direction.
Further, my point wasn't about the predicted scores from Elo at all. The predictions could be too high, too low, or spot on, it doesn't matter.
My point was that empirical results from games where the players were 500 points apart showed an expected score very close to that implied by the -15 for a loss/+1 for a win ratio.
In summary, I neither am nor was confused. I appreciate your concern though :)
Nope, I explicitly used the words used in the title. This is from an article he wrote 9 years later: en.chessbase.com/post/the-elo-rating-system-correcting-the-expectancy-tables
A quote from the article:
"I want to call attention to two main things here. You can see that in the middle region of the graph (for instance within the red box), the white line is steeper than the black trend. This means that rating favorites are not scoring as well as their ratings say they should. Also, look out at the edges, for instance within the blue box. You can see that due to the "400-point rule", the expected score (for the heavy rating favorites) levels off at 92%, but in reality the heavy favorites are scoring much higher than 92%."
So yes, there are parts of the curve where higher-rated players score worse than the model predicts, but there's also a point where they score higher.
The portion where the higher-rated score worse is when the difference is 100-400 points, while above that there's a flip and higher rated players score much higher than the model predicts.
As a reminder, we're talking about a >500 point difference in this lichess case.
Now, this is all consistent with his earlier paper, of course.
Games where the difference is 100-400 points are much more common than games with the very large rating differences, so as a criticism of Elo, that's a fair point (and you'll notice all his graphs in the original paper cut off before we get to the wonderland of >400 rating point differences, and the big focus was on 0-200).
It's just not applicable here, because we're talking about the portion of the curve where the model is mistaken in the other direction.
Further, my point wasn't about the predicted scores from Elo at all. The predictions could be too high, too low, or spot on, it doesn't matter.
My point was that empirical results from games where the players were 500 points apart showed an expected score very close to that implied by the -15 for a loss/+1 for a win ratio.
In summary, I neither am nor was confused. I appreciate your concern though :)