Real-Time Super Stats And Hockey Success
The NHL publishes data for what it calls "Real-Time Super Stats." These stats include most of the subjective judgments like blocked shots, giveaways, missed shots, and hits. Some coaches have structured their game plan around dominating in these areas, and there are players that are sought after specifically for their ability to succeed in one of these categories.
The question I'm going to look into today is how these categories contribute to winning hockey games. I'll be using a simple correlation analysis that looks at multiple years of data. Join me after the jump.
I collected data from NHL.com for four of the real-time super stats: blocked shots, hits, giveaways, and takeaways. I then aggregated the team totals over the last five seasons to come up with a large sample. I also aggregated the total number of regular season points earned by each team over the last five years to serve as a proxy for winning. The table below presents those five year totals:
|
Hits |
BS |
GV |
TK |
Reg. Pts. |
|
|
ANA |
8207 |
3854 |
3368 |
2591 |
490 |
|
ATL |
7088 |
5683 |
3870 |
3923 |
422 |
|
BOS |
7902 |
5022 |
2659 |
2305 |
451 |
|
BUF |
6703 |
5573 |
4426 |
2285 |
504 |
|
CAR |
8690 |
5763 |
2903 |
2955 |
469 |
|
CBJ |
8111 |
4956 |
2157 |
2371 |
398 |
|
CGY |
8032 |
4607 |
4029 |
3040 |
481 |
|
CHI |
7262 |
4712 |
2354 |
2438 |
440 |
|
COL |
6113 |
6129 |
3426 |
3582 |
449 |
|
DAL |
9425 |
5004 |
4305 |
3388 |
487 |
|
DET |
7286 |
4142 |
4045 |
2905 |
566 |
|
EDM |
6590 |
5972 |
5779 |
3080 |
401 |
|
FLA |
7830 |
5366 |
3189 |
2492 |
426 |
|
LAK |
8763 |
5235 |
4562 |
2079 |
408 |
|
MIN |
6302 |
5071 |
3370 |
2473 |
459 |
|
MTL |
8943 |
6391 |
4521 |
3601 |
468 |
|
NJD |
7034 |
4879 |
3525 |
2762 |
516 |
|
NSH |
6940 |
5015 |
3316 |
3251 |
495 |
|
NYI |
8601 |
6372 |
3998 |
4151 |
389 |
|
NYR |
9758 |
5752 |
3244 |
3173 |
473 |
|
OTT |
9272 |
5694 |
4054 |
3257 |
489 |
|
PHI |
7545 |
6010 |
4088 |
2812 |
439 |
|
PHX |
8027 |
5302 |
2647 |
2267 |
417 |
|
PIT |
8350 |
6048 |
3134 |
2173 |
465 |
|
SJS |
8639 |
4974 |
4206 |
2639 |
544 |
|
STL |
7607 |
5660 |
3333 |
2382 |
399 |
|
TBL |
7370 |
5365 |
3459 |
3012 |
402 |
|
TOR |
8756 |
5730 |
4386 |
2670 |
419 |
|
VAN |
6610 |
4807 |
3096 |
2665 |
488 |
|
WSH |
8090 |
5489 |
4537 |
3424 |
463 |
After collecting this data, I ran a correlation test between regular season points and each of the four individual real-time super stats. That gave me four "r" coefficients.
A quick stat refresher: an "r" coefficient measures the relationship between two variables. It can be between +1 and -1, where +1 indicates a perfectly positive relationship, -1 indicates a perfectly negative relationship, and 0 indicates no relationship at all. As a rule of thumb, an r between 0 and 0.09 indicates no correlation at all; an r between 0.1 and 0.3 indicates a small correlation; an r between 0.3 and 0.5 indicates a medium correlation; and an r between 0.5 and 1 indicates a large correlation. The same rules apply for negative numbers (between 0 and -0.09 means no correlation, and so on).
Here are the four "r" coefficients for the RTSS data:
Before I dive in, I want to talk about how rink bias affects these results. John Fischer over at In Lou We Trust did a marvelous job of looking at the RTSS stats at home and on the road for each team, and found that there were significant differences between the two totals. This would imply that the data reported by the NHL is skewed by the home scorer's definition of what constitutes a giveaway, hit, etc. However, I don't think the problem is that serious, as teams do in fact play very differently on the road, which would lead to significantly different totals between the two venues. And while there might still be a bias for some teams once you control for the differences in home and road play, it most likely is not enough to overcome the complete lack of a relationship in most of the variables.
With that out of the way, the first conclusion is pretty simple: hits, giveaways, and takeaways have no relationship to winning. Coaches who lament that their team isn't being physical enough, turning the puck over too often, or not stealing the puck off the opponent's stick are missing the big picture. They might say this because it makes for a good story line in the media, or because it's convenient to be able to sum up your team's problems in one, short sentence. However, if your team is losing, they're doing something wrong (or just getting unlucky), but it has nothing to do with hits, giveaways, or takeaways. These statistics are, at best, a means to an end, and focusing on their improvement as the solution to your problems is analogous to treating the symptoms instead of curing the disease.
The "r" coefficient for blocked shots was unique in that it demonstrated a strong relationship with regard to winning. The negative sign means that the more blocked shots a team was racking up, the less points they were earning in the standings. This makes intuitive sense as well, as a team that is blocking a ton of shots must be playing in their own end quite a bit. The opposing team thus has the time and space to fire a lot of pucks at the net, which means that you're likely losing the puck possession battle and a lot of games in the long run. These stats show that particular trend.
I think the next logical step is to apply these findings to individual players. Players that excel in takeaways (Pavel Datsyuk) or hits (Brooks Orpik) aren't bad investments because they have a lot of other valuable tools in their skill set. However, going after players only because they're really good at blocking shots, laying out hits, or taking the puck away isn't a wise decision. These skills don't correlate with team success in the long run, and since that's the whole point of playing the game, a team's money could be better spent elsewhere.
34 comments
|
0 recs |
Do you like this story?
Comments
i can see how this can be a negative for both Free Candy & Big Z. Both are RTSS leaders who are eating relatively large contracts on our blue-line. They look like they don’t have much offensive upsides but are key stay-at-home D’s.
Then again, Phoenix is a relatively successful team despite leading the league in BS (ignore the islanders, I know I do), which could be an argument against GP’s report here. Brooks was our best blueliner last year & Big Z is one of the 3 best defensemen in this relatively deep FA for D-men. I’m still happy about what we might have & if Z can show some kind of ability in the PP, we’re golden.
Is it October yet?
Just so I never have to answer this again. I'm from Singapore so whenever you see me online, I'm either sleep deprived or just waking up.
I don’t think of Orpik as an offensive dynamo either, but he did lead all Penguins d-men this year in ES pts/60. He’ll never get confused for Mike Green, but I think he’s got more in there than we give him credit for. He also brings a lot of other tools to the table, like good positioning, excellent penalty killing, leadership, etc. He was also a positive Corsi guy this year, so he’s doing a lot more on the ice than just leveling out hits.
As to Michalek, my previous posts on him have tried to illustrate just how difficult his situation in PHX has been. I think it’s safe to say he’ll be seeing a complete 180 in Pittsburgh in terms of competition and teammates, so I wouldn’t be surprised if his offensive numbers jump up. And in 2008, he put up a very good ES pts/60, so the potential is probably there. He also kills penalties, is sound positionally, and works really hard on the ice (you have to if your teammate is Ed Jovanovski).
Hockey Blogger at Pensburgh.com
Orpik …did lead all Penguins d-men this year in ES pts/60.
Wow. I didn’t know that. On the other hand, most of those points were assists, which we all know aren’t really worth much ;-)
by PensAreYourDaddy on Jul 29, 2010 10:56 AM EDT up reply actions
When I started to see him get some PP time with the first unit earlier in the season I suspected he’d have a good year for point production. But I also was pretty surprised to see that. And really it’s a double whammy with a guy like Orpik, who often gets the stigma of only being a defensive guy.
Still, I’d rather he didn’t get any breakaway chances anytime soon.
By the way GoPens!, awesome work again. As always.
Follow the Penguins on SBN @ Pensburgh.com, twitter and now on the official Pensburgh Facebook page
Thanks a lot Frank! It’s getting tougher and tougher to come up with stuff since this summer isn’t exactly “exciting.”
Hockey Blogger at Pensburgh.com
that and it’s really easy to be brave on the ice when you have a big physical guy like Brooks behind you handing out checks like it’s the first of the month.
Is it October yet?
Just so I never have to answer this again. I'm from Singapore so whenever you see me online, I'm either sleep deprived or just waking up.
regular season vs. playoffs
Nice work GoPens.
Perhaps the #s would indicate that TK/GV and Hits have more relevance in the playoffs? I would guess that hitting goes up by some ridiculous number once April comes around.
Off the top of my head, I remember one the games against Ottawa this spring where the Pens outhit the Sens by something like 25-6 in the first period. I think it was game 2 or 3, but I could be wrong.
also, I like the point about blocked shots. Didn’t Anton Volchenkov cash in big time because his team let the Pens go Rambo III in the offensive zone?
Thanks mcnulty.
I thought about the playoffs midway through writing this post. But I don’t think it would be too valuable to know how these stats change in the playoffs since it’s such a small sample and subject to a lot of luck (which we know all too well this year). But I might look into it for a 2nd part post.
And yes, that is a big part of Volchenkov’s game. He’s had better teammates than Michalek over the last couple of years, but his ES pts/60 have been abysmally low (even lower than Scuderi). I give him a lot of credit though for playing the minutes he does, but I think that his $4.25M yearly salary on NJ is an overpay.
Hockey Blogger at Pensburgh.com
There’s an important distinction to draw with a guy like Volchenkov: although teams that block a lot of shots have a lower winning percentage because of the effect you noticed, AV has an actual talent for blocking them; he blocks more shots than you’d expect given his time on ice. So while it may not be valuable for teams in the aggregate, the ability to block significantly more shots than other defenders in the league is a valuable trait to possess, ceteris paribus.
Only YOU can prevent idiots from commenting!
by Knee high to a duck on Jul 29, 2010 6:17 PM EDT up reply actions
You’re definitely right. AV might be one of the 10 or 15 d-men that actually has the ability to force more blocked shots than one would expect at random. I think that’s a very valuable asset for an individual player, especially on the PK.
But given that he’ll wear down faster due to all the rubber his body is taking, I think $4.25M for four years was too much. He plays really tough minutes, though he’s had more help in Phillips, Spezza, and Alfredsson than Michalek had in PHX. By the numbers, I’d also say that Michalek has more offensive upside, and we got him for $4M. Maybe that was a bargain, but compared to AV’s contract, it looks like the Devils overpaid.
Hockey Blogger at Pensburgh.com
I’m reasonably certain that it was six years for AV and yes, I’d much rather have the Michalek K than the Volchenkov K. Look at AV’s games-played over the last few years, the breakdown is already happening.
Only YOU can prevent idiots from commenting!
by Knee high to a duck on Jul 29, 2010 9:29 PM EDT up reply actions
You’re right, it was six years (I forgot to expand the page at nhlnumbers.com).
Hockey Blogger at Pensburgh.com
Thanks for another informative post, GP. You keep providing the substance and I’ll handle the fluff. We’ll get through this offseason eventually.
by PensAreYourDaddy on Jul 29, 2010 11:07 AM EDT reply actions
We obviously provide some the best 1-2 punch on SB Nation. I hear most people are pretty scared…
Hockey Blogger at Pensburgh.com
Interesting analysis, GoPens! I just have a comment about this quote:
hits, giveaways, and takeaways have no relationship to winning. Coaches who lament that their team isn’t being physical enough, turning the puck over too often, or not stealing the puck off the opponent’s stick are missing the big picture. They might say this because it makes for a good story line in the media, or because it’s convenient to be able to sum up your team’s problems in one, short sentence. However, if your team is losing, they’re doing something wrong (or just getting unlucky), but it has nothing to do with hits, giveaways, or takeaways.
I think you have this pretty much right, except that it should say: hits, giveaways, and takeaways, as recorded as official statistics….
Hits, giveaways, and takeaways, as recorded as official statistics, do not have anything to do with winning/losing, but clearly if your team starts turning the puck over more frequently or stealing it more frequently it will eventually affect the outcome of games.
Interesting stuff, though, and I would not have thought that the correlation would be so low for these things. I wonder if you could control for some other factors involved in winning (shots?,goals?), as correlating wins with hits, takeaways, and giveaways may result in omitted variable bias.
"90% of the game is physical. The other half is mental." - Yogi Berra
but clearly if your team starts turning the puck over more frequently or stealing it more frequently it will eventually affect the outcome of games.
The spread of giveaways and takeaways over the last five seasons has no relationship to winning. What this tells us is that there isn’t a team capable of racking up takeaways or giveaways at a level that actually affects winning percentage, so it’s pointless to focus on these stats. If you think your team is turning the puck over too often or not stealing it enough, chances are there’s a deeper problem that needs some attention.
My understanding is that omitted variable bias is only relevant to regressions analyses, not correlation tests.
Hockey Blogger at Pensburgh.com
Blocked Shots and Giveaways
Blocked shots increase with the number of the opponents total shots. Thus, blocked shots may be indicative of the skill of the blocker or the number of an opponent’s shot. The first is beneficial and the latter somewhat negative. Giveaways may be due to the skill of the players or the amount of time they have the puck. The more you have the puck, the more giveaways you are bound to have. Again players who protect the puck are beneficial as they give the puck away less per time on ice; however, if more giveaways are merely an indication of having the puck longer, they take on a completely different aspect.
Great article as it looks into stats not often correlated with wins and loses.
however, if more giveaways are merely an indication of having the puck longer, they take on a completely different aspect.
My guess is that overall, more giveaways are indicative of this.
Thus, blocked shots may be indicative of the skill of the blocker or the number of an opponent’s shot.
I think you’ll find this article very interesting.
Hockey Blogger at Pensburgh.com
Is it wrong that I read their blog title as Irrelevant Oiler Fans?
by PensAreYourDaddy on Jul 29, 2010 2:14 PM EDT up reply actions
Haha, no I don’t think so. They do an amazing amount of quality work, but I’m sure most people think of their stuff as irrelevant.
Hockey Blogger at Pensburgh.com
Blocked Shots
Perhaps the negative correlation will put to rest the myth of the excellence of the Scuderi/Gill pairing. They blocked a lot of shots because they were slow and had no exit passing ability. This left the puck in their defensive zone and required more and more blocking of shots due to the number of shots their opponent attempted. This is why each pairing needs at least one puck mover and ought to have a right shooter on the right and a left shooter on the left. Poor Scudedri playing on the right side despite being a left shooter frequently had to try to clear or make an exit pass on his backhand up the boards, so did Gonchar. The 2010-2011 Pens have three left and three right shooters and at least one puck mover on each pairing. By the way, I still feel that Orpik and Letang are the Pens’ best pairing, possessing speed, physicality and much offensive ability not possessed by the other two pairings, despite large salaries.
I agree with a lot of your assessment. I think Gill and Scuderi were good on the PK, but they aren’t great d-men in general.
And I get really excited when I think about the options we now have at defense for the upcoming years.
Hockey Blogger at Pensburgh.com
Blocked Shots
Thanks for the link, I recall that I had read it back in May. I am sure statistics would bear out that the more shots that are taken total, the more shots will be blocked. Thus, a team that allows many shots, regardless of quality or accuracy, will likely need to block more than a team against which there are fewer shots attempted. It would seem that such a team kept the puck in the opponents’ end of the ice and, thus, allowed less shots to be blocked. In fact, that is the theory behind Bylsma’s up-tempo game. Stated crudely, if you keep the puck in your offensive zone, your opponents can take less shots and you can take more, i.e., the best defense is a good sustained offense.
I am sure statistics would bear out that the more shots that are taken total, the more shots will be blocked.
Yup, that’s why I think the NHL should report BS%, which would be BS/(BS+MS+SOG). That will tell you if a team actually has a unique talent for shot-blocking, as opposed to the raw numbers, which tell you which teams are getting murdered at Corsi and losing the most games.
Hockey Blogger at Pensburgh.com
BS%
I rather think that a time of possession analysis in the various zones might be more revealing. I know that some teams (probably all) rely on such an analysis to coach players. For example if team A permits its opponents team B large minutes in team A’s defensive zone they had better have a goal tender who can stand on his head as he will get many shots on goal. Coaches need to see this as an indication that its defenders (forwards and defensemen) need to gain possession of the puck and either skate or pass the puck out of the defensive zone more rapidly. Various defensive strategies may accomplish this. First, the Lemaire 1-2-2 neutral zone trap clogs the neutral zone in an effort to prevent the attacking team from gaining its offensive zone. The 2-3 the Constantine left wing lock is more of an attacking (2 fore checkers rather than 1) form of the trap with three defenders rather than two tiers of two. Another, more risky but with more potential, is a defensive where the fore checkers, back checkers and defensemen aggressively attack the puck carrier and possible shooters to whom he may pass. This is the game played by Orpik and Letang and relies upon their speed, puck movement and physicality. Finally, the Gill and Scuderi and Eaton and McKee method of giving ground and maintaining a gap while passively limiting shots to the outside. This is positional hockey and seems to be the most prevalent thinking of coaches in the NHL. I believe it is the worst, but a team must play with the talent at hand. I shall address the offensive implications of a time in the offensive zone directly, but I should like to know your thoughts so far.
I’m not exactly sure what you would like me to respond to; mind providing specifics?
Hockey Blogger at Pensburgh.com
BS%
Would you agree that statistics as to time in the three zones would be more revealing than BS% as to defensive performance?
As to offense, clearly, time in team A’s offensive zone provides more good scoring opportunities. Just as clearly, it is time when team B cannot attempt shots. Both are products of speed and movement as well has puck protection. Stated simply, the faster you skate and the more you create movement, the more you will gain and keep in your offensive zone. This is improved by your ability to protect the puck and to avoid stupid passes to nobody. Thus, time in your offensive zone is just as revealing as time in your defensive zone. I should be very surprised if time in the offensive zone did not correlate positively with wins and time in the defensive zone would correlate negatively with winning. This would seem obvious, but such statistics are not kept where they are available to us.
Would you agree that time on ice in the offensive zone is a significant indicator of success?
Would you agree that statistics as to time in the three zones would be more revealing than BS% as to defensive performance?
Definitely. The NHL even collected that data from 1999-2002, but for some odd reason, it stopped. And I didn’t mean to imply BS% was a substitute for zone time data — only that BS% is better than looking at raw BS totals (at least if you’re trying to figure out who’s good at shot blocking).
Stated simply, the faster you skate and the more you create movement, the more you will gain and keep in your offensive zone.
Totally agree. I’m a big speed guy when it comes to evaluating players.
Would you agree that time on ice in the offensive zone is a significant indicator of success?
I think TOI in the o-zone relative to the d-zone is an extremely good indicator of success.
Hockey Blogger at Pensburgh.com
I like the idea a lot, but I’m not sure aggregating several years of data is the best method to identify the correlations you’re looking for. I think you may find your data sheds more light if you compare (in example) blocked shots for a given season, to total points in that given season for each team for a 5 year period, and then running your regression analysis.
Over a 5 year period, many teams will change personnel and coaches, the effect of which may be lost when the data is aggregated. We watch a lot of Caps and Pens games at my house (my roommate’s a Yinzer), and those two teams in particular have had their fortunes change dramatically in the last 5 years. The Caps and Pens went from 27th and 29th, respectively, in the league in ‘05-’06, to the Pens playing in two cup finals, one of which they won, and the Caps winning the President’s trophy this year.
Aggregating the data for the 5 years would smooth the differences between the good and bad years. Lets say the Pens committed tons of giveaways in ‘05-’06, but improved dramatically in that metric in the two years they made runs to the finals, that change would be lost in the process of aggregating the data.
Just a thought. In any case, it is ca very interesting analysis.
I am a hockey fan first, and a Caps fan second.
This isn’t a regression analysis; just looking at the correlation between two variables.
And the whole point of aggregating five years of data is to see if there is anything sustainable in the long run. It doesn’t mean anything if winning teams one year have a lot of takeaways, since there’s no effect over the five year span, which means that those stats were luck and randomness, as opposed to a sustainable ability or talent. And that’s really what we’re after here: sustainable talent. A lot of hockey is luck (you know that), and making inferences from it is silly. Looking at large samples of data helps to filter out that luck.
Hockey Blogger at Pensburgh.com
I think he raises a decent point though. If team A had 100 pts and 2000 takeaways one season 90 points and 1000 takeaways, and Team B had 95 points and 1500 takeaways twice, a relationship MAY form. But if you see 190pts and 3000 takeaways for both teams, the relationship muddies. Perhaps looking at mean or averages for a team over 5 years and compare the difference between that and each years results. It may yield more insightful data.
Good work though.
I like this, but even with your caveat, the use of home statistics bugs me. The problem isn’t just that teams play differently on the road, which is true, but the recording of those statistics is seriously flawed, and the only real way to combat that is to take stats from away games and throw out the home ones.
There are arenas in which the stats flunky pretty much appears to take entire periods off, and all you get are shots on goal (and even then it’s questionable).
only real way to combat that is to take stats from away games and throw out the home ones.
Not a bad idea. I’m going to repeat the process for road games only, and I’ll throw in a playoff analysis too. Stay tuned!
Hockey Blogger at Pensburgh.com






















