5 Qualities Of Elite Computer Rankings
What elements are most important when developing your own computer based sports team ranking system? Today the developer of The Power Rank, Ed Feng discusses.
You're looking for an edge in the sports gambling markets. Anything that gets you an extra 1% against those wise crowds.
One powerful tool for gaining an edge is computer rankings. In contrast with the typical gut feeling about teams, computer rankings provide a data driven, unbiased view of sports.
They take data on games and filter it into a tidy list of teams. Good computer rankings provide not only a rank but also a rating, which can be converted into a point spread between two teams.
What should you look for in a ranking system?
And why should you listen to my advice on the topic?
How A Stanford Ph.D. Got Into The Sports Business
While I've been a life long sports fan, I never thought I would work in the industry. After getting my Ph.D. from Stanford in chemical engineering, I was one of many looking for an academic job.
That changed in 2008.
I discovered the paper behind Google's PageRank, the algorithm that started a billion dollar company. This algorithm sorted the complicated mess of the world wide web into search results that magically gave you the right website. I realized that this algorithm was based on the same math as my research in statistical physics.
This discovery inspired me to develop a new algorithm for ranking sports teams. I calculated some NFL rankings and sent an email off to my friends. Their interest inspired me to look at other sports. Now, here I am, writing for you.
What has this experience taught me about computer ranking systems? Let's look at 5 important factors.
#1 - Margin Of Victory
It should be obvious that a team ranking system should consider margin of victory in games.
Do you care that Amazon has lower prices than your neighborhood book store? No. It's the 40% discount on all titles that compels you to buy online.
The same lesson applies to computer rankings.
However, many well known rankings systems do not use margin of victory in ranking teams. The primary example is the Elo ranking system developed in the 1960's. With the power of modern computers, it's time to move past a system designed to be computed with pencil and paper.
And don't even get me started on the RPI rankings that the selection committee uses to pick teams for the NCAA men's basketball tournament each year.
#2 - Adjusting For Strength Of Schedule
In a nutshell, computer ranking systems take a statistic like margin of victory and adjust for strength of schedule. That's it.
This adjustment is more critical in some leagues than others. American college sports and international soccer have a huge disparity in the strength of teams. Beating Spain and Italy by a goal means something much different than beating Indonesia and Tahiti by the same margin.
This adjustment is less important in leagues such as the NFL in which a salary cap levels the playing field between teams. However, it can't be ignored.
#3 - Solving For Unknown Variables Simultaneously
Suppose you have a statistic like average margin of victory to rate a team. You want to adjust a team's raw statistic for the strength of opposition faced. This gives the rating of a team.
For example, suppose the New England Patriots have won their games by an average of 7 points. However, they have played 3 teams that each have a negative margin of victory.
You develop a method to adjust these margin of victories. New England's raw 7 points per game drops due to its poor competition.
After you make this adjustment for all teams, you're left with a new set of margin of victories, or ratings. So you do this again. If you're method works, these ratings will converge to a final value.
Beware ranking systems that iterate once or twice. They will do a poor job of accounting for strength of schedule since the ratings have yet to converge. If adjusting for strength of schedule seems like an after thought in the method's description, that's probably a bad sign.
Instead, look for ranking systems that solve for the unknown variables (in this case, the ratings) simultaneously. One way to do this is iteration. However, there are many other ways to perform the calculation as well.
#4 - Home Field Advantage
This is another obvious factor. In all sports, the home team wins more often than the road team. A ranking system should adjust the final margin of victory in a game for this factor.
For example, suppose Pittsburgh beats New England by 3 points at home. However, NFL home teams win by about 3 points. Hence, the margin of victory is zero for this game.
While home field advantage is still mysterious, there has been some interesting research into its causes. Check out the book Scorecasting by Tobias Moscovitz and Jon Wertheim for compelling evidence that referee bias is one factor that contributes to home advantage.
#5 - Diminishing Returns For Blow Outs
As mentioned previously, a good ranking system also gives each team a rating. The difference in the rating between two teams implies a point spread. In my NFL rankings, the spread between the best and worst team is less than 20 points.
With this typical spread, how does a ranking system account for a 59-0 win by New England over Tennessee? (This actually happened during the 2009 season.) It can't simply say that this game means New England is 59 points better than Tennessee, give or take home field advantage.
Good ranking systems give diminishing returns for a large margin of victory. Winning by 10 instead of 5 means much more than winning by 55 instead of 50.
To be honest, I never designed this feature into my ranking system. I was actually a bit worried about how overrated New England might be after destroying Tennessee in 2009.
However, New England didn't rise too much. The diminishing returns for blowouts was a nice little present from the math I developed.
Recommended Ranking Systems
I'm biased. I think my rankings at The Power Rank are pretty good.
However, here are three other ranking systems worth checking out.
Jeff Sagarin predictor. He developed the predictor rankings in the late 70's, well ahead of the times. While that date makes me wonder whether he simultaneously solves for unknown variables, his rankings have become well known. And please ignore his Elo rankings that do not account for margin of victory.
Dokter Entropy. His rankings seem pretty good from the description. In addition, entropy suggests that the method has roots in statistical physics, always a good sign.
Soccer Power Index. ESPN commissioned Nate Silver, the baseball turned American politics data maven, to rank countries in international soccer. I don't like how he uses club data to predict national teams, since it tends to overrate countries like England. But I do respect his work.
Computer rankings provide a data driven, unbiased look at team sports. While blindly wagering based on their predictions will not work in the long run, use them to back up your own judgements and find value in the markets.
Follow Ed on Twitter: @thepowerrank
And check out Ed's US sports team rankings and ratings at ThePowerRank.com