Plain and simple: we have been using KD as a catch-all statistic for too long. Having a KD of 3 only means that you killed 3x as many people as killed you. This could mean you went 3-1 or 30-10. A high KD doesn’t always mean that your team wins, either. Players frequently cite KD as a reason for getting cut from teams or a reason why they are the best player, but the truth is: if you are not getting your team WINS you are not helping the team.
Our Codstats Rating is the probability a player gave his team of winning, based on his performance, divided by 50% (league average win%, obviously). This is done to make the average Rating = 1.0. This system is done much the same vein as the csgo website htlv.org's Rating 2.0.
Codstats.gg Rating takes all your stats from a game and tells you how likely an average team would be to win given your performance. I will go into more detail on the formula for Rating, but let's pretend for now that the formula, f, is simply:
_f(all stats) = Rating_
Importantly, the stats that go into Rating do change by game mode. Another shortfall of KD is that kills and deaths alone are not as important in Hardpoint as they are in Search & Destroy, where getting 5 kills wins the round. Rating solves this problem by allowing us to change the Rating function by mode and by player type.
Subs and AR players play very different roles in BO4, and our win probability models indicate a significant difference in the statistics that best describe their contributions to a team win. Rating allows us to break Sub/AR play apart, while still comparing all players on the same scale, win percentage. That is the most important takeaway from this whole section: every game, every type of player gets a Rating thats on the same scale 0-2.
Rating is broken down by mode and by Sub/AR role. Logit models are built to predict win probability for each of the 6 combinations (HP x sub, HP x AR, CTL x sub, … ). The predictions from the models are used to generate Ratings. Combinations of the following rate statistics are used in the 6 models to predict win probability:
For now, the exact combinations of the 6 models will be kept proprietary, but we will consider updating in the future.
Yes. Teams are trying to win games and the team with the higher KD is not crowned the champion. By modeling past performance using all of Black Ops 4 stats to train our models, Rating performs significantly better than KD.
No. With any stat, attempting to summarise performance with one number will have shortcomings. This stat will not encompass the shot-calling prowess a player may possess or the leadership a team needs to succeed. Conversely, a team with great Ratings may lack patience or finesse. However, we firmly believe the Codstats Rating system is a huge step forward for the entire Cod Community in terms of valuing players.
All single-mode Ratings are available in our leaderboards for CWL Fort Worth and will be available going forward. *Overall Ratings are the average Rating over all games played, which is to show the average win probability of playing with a player, not the sum of all his scores. Summing stats would show the win% of the “average round” which could be biased by lopsided results.
Here are two players compared by their performances at CWL Fort Worth:
By KD, the right-hand player had a much better event. I think we all know why I picked this matchup, so let’s dig deeper and look at the underlying stats by percentile in each mode. (*Note: in Deaths per minute - higher percentile is good still, the scale is inverted to keep things sensible)
So players were much more even in categories that matter. The left-hand player had more score per minute in SND and HP and fewer Deaths in Control. The left-hand player might be described as playing more at the extremes, high score and more deaths, where the right-hand is lower score and fewer deaths (except in control). Keep in mind that more than just these two variables go into the Codstats Rating, this is just a cross-section.
So who are these two players?
Obviously, in general, aKD correlates more strongly with Rating, but I choose an example where it did not to show where KD can be misleading. Nagafen plays his role very well and gets very high win percentages for his team in the process.
Here is a graph of every player at CWL Fort Worth in overall aKD vs. Rating. The blue line is the average relationship between the two.
Above the line = aKD doesn’t give them enough credit; Below = Kills alone not helping the team enough; Near the line = aKD and Rating similar descriptors of talent
Codstats Ratings added in all player categories in our leaderboards