Revisiting Necessary Conditions – Offense

Posted on June 10, 2013 by Brent

If you remember back a couple of months, I did a couple posts that went into necessary vs sufficient conditions regarding the construction of a Super Bowl team. At the time, I looked at every team from the last 10 years and used their PPG and PPA to gauge their relative percentile ranking. However, since then I’ve gone back and adjusted each team’s performance to account for the league average PPG that season. This adjustment will help account for the general offensive inflation the league has seen over the past decade.
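The exact formula behind the adjustment isn't spelled out here, but one plausible reading is a simple percentage above or below that season's league-average PPG. Here's a minimal sketch under that assumption; the function name and the sample numbers are hypothetical, not taken from the data set:

```python
def relative_scoring(team_ppg: float, league_avg_ppg: float) -> float:
    """Return team scoring as a percentage above (+) or below (-) league average."""
    if league_avg_ppg <= 0:
        raise ValueError("league_avg_ppg must be positive")
    return (team_ppg / league_avg_ppg - 1.0) * 100.0


# Hypothetical inputs for illustration only (not a real team or season).
example = relative_scoring(team_ppg=26.1, league_avg_ppg=22.5)
print(f"{example:+.1f}% relative to league average")
```

Because each team is compared only to its own season's average, a 26-PPG team in a low-scoring year rates higher than a 26-PPG team in a high-scoring year, which is the point of the adjustment.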

Today, we’re looking at the offensive side of the ball.

The question is, how good are Super Bowl winning teams on offense? The idea here is to get an idea of what a truly optimal team construction strategy would look like. There is a salary cap, and the number of roster spots means every team must sacrifice somewhere in order to improve a different area. What is the best mix of offense and defense?

My last look came to this general conclusion (ignoring Special Teams for now): Teams should focus on building an above-average offense. Once that’s assured, the team should focus 100% on developing the best defense possible.

After adjusting each team’s performance, does that still hold? And if so, what does it mean for the Eagles?

First, let’s revisit the best offenses. My main data set only goes back to 2003, but for this, I went back to 2000 to make sure I was including some all-time greats (to give us a sense of just how great they were). You’ve seen this before, but here are the best offenses in recent history:

The teams highlighted Red LOST in the Super Bowl. Teams in yellow WON the Super Bowl. We can see the overwhelming dominance of the Patriots, as well as some less-heralded performances, like the 2011 Packers.

However, we also see a lot of red. Just as in our original look, it seems that a great offense can go a very long way towards winning a Super Bowl, but can’t guarantee a win.

We do have to be careful with the sample size here. 5 of the top 16 offenses since 2000 went to the Super Bowl. The 1-4 record of those teams in the final game may just be bad luck.

How about all the Super Bowl winners?

Here are the winning teams from the last 10 years:

As you can see, the only team to win a Super Bowl in the last ten years with a below-average offense was the 2008 Steelers, and that team was just 1% off the mark.

Additionally, the average offensive performance of Super Bowl winners is +16%.

It appears as though our previous conclusion, at least on the offensive side of the ball, stands. An above-average offense is close to a necessary condition for winning the Super Bowl.

That should be good news for Eagles fans, since it explains why the focus of this offseason has really been on Offense, despite the terrible defensive performance of last year. Given Chip Kelly’s background and skill-set, meeting the league-average offensive threshold SHOULD be close to guaranteed for the Eagles, if not this year, then soon (likely depending on the QB situation).

But wait! What about the Losers?

If there have been Super Bowl LOSING teams that did not have a strong offense, then we may just be over-extrapolating based on what is likely the result of chance.

Here are the LOSERS from the past 10 years:

Combining this chart with the Winners chart shows us that just 3 teams have even made it to the Super Bowl with below average offense in the past 10 years, and the worst among them was San Francisco this year. The 49ers were just 4% off the league-average mark.

Tomorrow we’ll look at the defense, but it appears as though our original conclusion not only stands, but looks stronger. You cannot win the Super Bowl with a bad offense. Not only that, but you can’t even MAKE IT without a league-average offense.

Vick Notes and the NBA Finals

Posted on June 7, 2013 by Brent

I think we all expected the QB battle to be a major point of focus for the Eagles this offseason, and it is indeed playing out that way. The national media hasn’t really jumped in yet, though they will. The local beat writers have written a lot on it though, with Vick’s apparent displeasure with the competition being the current story. Tommy at Iggles Blitz had some thoughts yesterday that I agree with and that are worth checking out.

I’m firmly on record as saying that Nick Foles is the better choice. Not only that, IT’S NOT CLOSE. That obviously reflects my personal philosophy and is not a prediction of how Chip Kelly will decide, but I did find a recent Vick quote that was encouraging (for me):

“When you have a strong arm, you can attack all areas of the field, but we’ve got multiple quarterbacks with strong arms; I think that’s not the determining factor,” Vick said. “I think you’ve just got to be able to make good decisions with the football, that’s what’s most important.”

Beyond the obvious importance of QB decision-making, we have been led to believe that Chip Kelly is especially critical of this area of the game. If that’s true, and Vick seems to think it is, there should be very little chance of Vick winning the job. HOWEVER, I wanted to highlight something that does bode well for Vick.

Vick has a career Passer Rating of just 80.6 and a completion percentage of 56.3%. Neither is reflective of a good starting QB.

BUT

Many people overlook the fact that Vick has improved significantly since joining the Eagles. Perhaps it was Andy Reid’s magic or maybe Vick just figured some things out. Regardless, look at his splits:

ATL: 74 Games Played, 53.8% completion, 75.7 Rating, 1.36 TD/INT Ratio

PHI: 47 Games Played, 60.1% completion, 87.8 Rating, 1.73 TD/INT Ratio

Also, his YPA increased from 6.7 to 7.6, meaning the higher completion percentage is not a result of just throwing easier (shorter) passes.

Now a big part of that improvement was his stellar 2010 campaign, which no right-minded person should expect him to duplicate. Note, though, that his rating of 78.1 last year (his worst as an Eagle) was better than all but ONE season with the Falcons. In fact, his best year with the Falcons was in 2002 when he recorded a rating of 81.6.

To be clearer, his worst year so far with the Eagles was nearly as good as his BEST year with the Falcons. That’s not to say he’s now good enough (I still like Foles better), but to ignore his improvement over the past few years is unfair.

The Eagles version of Michael Vick has been a much better “decision-maker” than the Falcons version ever was. Time will tell if that’s good enough for Chip Kelly.

————–

Allow me to delve into the NBA for a moment. If you hate basketball, you can leave now. If, however, you are like most non-NBA fans and just aren’t that enamored with the game, I encourage you to watch this year’s Finals (Game 1 was last night). I won’t go into detail, but believe me when I say that watching these games MAY TURN YOU INTO A TRUE FAN.

If you follow the league casually and occasionally tune in to whatever game is on TV, you probably have not seen anything close to what basketball can be when played at the highest level. The biggest knock on the NBA is that there are only a handful of teams worth watching, and some of those aren’t even worth it until the playoffs. I completely agree with that. However, in the Spurs and the Heat, we’ve now got two of the best teams in recent history going at it in the Finals. You should be watching.

Last night was one of the best-played games of basketball I’ve ever seen and a great example of what the NBA SHOULD be on a more consistent basis. That obviously isn’t happening anytime soon, so you’ve got to appreciate the opportunities when they present themselves.

Also, Grantland has its problems, but it’s a FANTASTIC site for basketball. Zach Lowe does a particularly good job in explaining the strategy behind the game (at the very least, check out the first link below). Here are some excellent articles to get you up to speed:

Notes for the Summer and a few stats

Posted on June 5, 2013 by Brent

Now that summer is here, I’ve decided to scale back the posts a bit. Ideally, this will mean continued daily posts, though of a shorter variety. There’s less relevant information to discuss, and I’d rather not just ramble every day (I try to make every post interesting/thought-provoking/or in some other way valuable). However, that may mean an occasional day without a post; I know you’re all heartbroken. I’ve assembled a lot of data and want to do some higher level things that require more than a few hours work. On days without posts, you can rest assured that I’m spending some time on these larger projects.

We’ll obviously ramp back up as the season approaches.

————————-

In the meantime, I’m working on my articles for the Almanac. Here are a few notes to come out of that:

– Eli Manning has a career Passer Rating of just 82.7 and a TD/INT ratio of 1.47.

– Jason Campbell’s career Passer Rating is 82.5 and his TD/INT ratio is 1.46.

– Donovan McNabb’s career Rating is 85.6 and his TD/INT ratio is 2.0.

Eli Manning is likely headed to the Hall of Fame. You may commence vomiting now…

——–

There is no statistic more important for evaluating college QBs than completion percentage. I’m straying dangerously close to my Almanac stuff now, but obviously that will be more detailed. For now, I’ll just give you a chart, with Pro Passer Rating on the Y-axis and College completion % on the X-axis:

The correlation value is a moderate .324. Given the difficulty of projecting human performance in addition to all the other variables involved, that’s actually an extremely strong indicator.
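For readers who want to reproduce this kind of number, here is a minimal sketch of the standard Pearson correlation coefficient. The sample values are made up purely to exercise the function; they are not the quarterbacks in the chart:

```python
import math


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length (at least 2 points)")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when one variable is constant")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical data: college completion % (x) and pro passer rating (y).
college_completion_pct = [55.0, 58.5, 61.0, 63.5, 66.0, 68.5]
pro_passer_rating = [74.0, 82.0, 71.5, 88.0, 79.0, 91.5]
print(round(pearson_r(college_completion_pct, pro_passer_rating), 3))

# For context, r = 0.324 corresponds to r squared of roughly 0.105.
print(round(0.324 ** 2, 3))
```

Squaring the coefficient gives the share of variance in one variable that a straight-line fit on the other accounts for, which is a useful companion number when judging how strong a relationship really is.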

Kyle Boller had a college completion percentage of just 47.8%.

He was selected #19 overall in the 2003 NFL draft.

27 Pro Bowlers were selected after him (7 more went undrafted, including Tony Romo).

TPR Update

Posted on June 4, 2013 by Brent

I’ve added prospect ratings from Draft Ace to the TPR system. Again, the idea is to get as many reasonable ratings as possible and derive a “consensus” rating for each prospect. That measure then gets adjusted for positional risk and impact to give us a final prospect ranking. I’m not too familiar with Draft Ace, but they’ve performed well over the past 5 years (according to the Huddle Report), so in they go.
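As a rough illustration of the idea (not the actual TPR formula, which isn't given here), the sketch below averages several sources into a consensus and then scales it by a positional multiplier. The multipliers, prospect names, and ratings are all hypothetical:

```python
from statistics import mean

# Hypothetical positional multipliers standing in for "risk and impact".
# These are NOT the real TPR adjustments.
POSITION_ADJUSTMENT = {"QB": 1.10, "OT": 1.00, "S": 0.95}


def consensus(ratings):
    """Average the ratings from every available source."""
    if not ratings:
        raise ValueError("need at least one rating")
    return mean(ratings)


def rank_prospects(prospects):
    """prospects: list of (name, position, [ratings]). Returns (name, score), best first."""
    scored = []
    for name, position, ratings in prospects:
        score = consensus(ratings) * POSITION_ADJUSTMENT[position]
        scored.append((name, round(score, 2)))
    return sorted(scored, key=lambda item: item[1], reverse=True)


board = rank_prospects([
    ("Prospect A", "QB", [88, 84, 79]),
    ("Prospect B", "OT", [92, 90, 91]),
    ("Prospect C", "S", [90, 93, 89]),
])
for name, score in board:
    print(name, score)
```

With only a handful of sources, a single new rating can move the average noticeably, which is one way a prospect's spot can shift a lot after adding just one ranking.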

I’m not going to go through the entire list again (I’ll update the TPR Tab above though), but here are the major takeaways:

– Lane Johnson improved 1 spot, moving from #9 overall to #8. Not a meaningful change, but still.

– Matt Barkley falls from the #15 overall prospect to #34, a very big drop considering it’s due to the addition of just one ranking. However, given where the Eagles drafted him (#98), he still qualifies as a great value pick.

– Zach Ertz falls, but just 3 spots, from #50 to #53. This is a pick to keep a close eye on. Seems like a bit of a reach (not a huge one), but also fits the Eagles very well (for what we think they want to do). It’s safe to assume he was ranked much higher than #53 on the team’s board; let’s hope that ranking was accurate.

– Bennie Logan, unfortunately, does not benefit greatly from the update. He does improve by 4 spots, but remains a definite “reach”, taken almost a full round early (29 spots). I’m most disappointed by this pick, and nothing I’ve heard or seen since draft day has changed that. If the team really liked him, then fine, but it’s very likely they could have slid down to draft him more in line with his value.

I understand that there might have been another team interested in him, but Logan doesn’t appear to be the type of player for whom the risk of losing outweighs the benefit of trading down and trying to take him lower.

– Jordan Poyer, picked #218 in the draft, rates as the #75 prospect overall on the TPR board. He hasn’t practiced yet (graduation rules), but he’s the guy to watch from the late rounders.

– Ryan Nassib jumps 8 spots and becomes the top QB and the #13 overall prospect.

– Geno Smith falls 8 spots to become the #19 overall prospect. I (along with the rest of the universe) am bearish on Geno Smith, not least because he landed in a terrible spot. The Jets have a miserable recent history with Quarterbacks. If Smith does fail, we won’t know if he was just overrated to begin with or if he wasn’t developed correctly. Unfortunately for him, the chances of the second possibility are relatively high.

– John Cyprien and Kenny Vaccaro both fall, to become the #32 and #33 prospects. While Vaccaro was taken earlier, it means even if Cyprien had been available for the Eagles at #35, he would not have been as big a “value” pick as initially indicated. That makes me feel a bit better, given that I really wanted him going into round two.

– The biggest “reaches” of the first round haven’t changed much, and our current bust watch-list is as follows:

Kyle Long, EJ Manuel, DJ Hayden, Justin Pugh, Matt Elam, Travis Frederick, Eric Reid

That’s all for now. Check the TPR Tab for the updated list if you’re interested (I’ll update it within 10 minutes of this post).

Back from Vacation; Odds and Ends

Posted on June 3, 2013 by Brent

Just back from vacation, trying to catch up (I had close to zero internet access). Doesn’t look like I missed much, as the “off-season” has finally arrived. OTAs are happening, but I tend to believe they lead to far more overreaction and hype than genuine intelligence.

Don’t read into the day-to-day depth chart (who’s running with the 1s and so on) too much. Kelly is just getting a feel for every player and will likely use this as an opportunity to test some potential offensive ideas out and see how various personnel groups handle it.

I do, however, think the high-tempo offenses are a very good thing. The risk is that they aren’t coordinated correctly and end up too frantic and scattered. However, if done correctly they:

1) give more reps to everyone, which should help ease the offensive learning curve. It also gives the coaching staff more tape on everyone, meaning players that are lower on the depth chart should have a better chance of getting serious consideration.

2) maximize the inherent advantage of the offense. As everyone knows, prior to the snap, the offense knows the play and the defense does not. Standing at the line for a while or taking a long time in the huddle mitigates this advantage, as it allows the defense to swap personnel and gives them time to read the offensive alignment.

The no-huddle minimizes this time, and therefore takes full advantage of the natural information asymmetry at the snap. It’s not easy (or it’d be more common), but running sprint-paced practices is obviously a key step towards being successful.

3) While it’s tough to tell without watching practice, you’d think lots of reps would also help the overall fitness level of the team. I’ll be keeping an eye on this during the season, particularly as it relates to the O-line play late in the game.

If the O-line is in better physical shape and the opposing D-line can’t rotate (no time with the no huddle), that should translate into a late-game advantage for the Eagles.

——-

I missed this. It’s another great example of why I love having Jerry Jones in our division. As Tommy said, the Cowboys had Sharrif Floyd ranked #5 overall and he was available at their #18 pick. For most people, that’d be a no-brainer pick, immediately followed by a draft room celebration. There are only a handful of elite players in each draft, and getting them is usually very expensive if you don’t have a high pick to begin with. If the Cowboys believed Floyd was one of them (as their draft board suggests), then the decision to trade down is absolutely outrageous.

As readers here know, the key to the draft is two-fold: Find elite players (who are usually selected in the top 15), and maximize value (sticking to “tiers” and getting those players with the lowest possible pick).

The Cowboys obviously do not believe in this strategy, which goes a long way to explaining why they’ve won just 2 playoff games since their 1995 Super Bowl win.

—–

Though most people have probably moved on, I’ve found additional ratings for my TPR draft rankings. Haven’t yet incorporated them, but I will soon. As I’ve explained, consensus rankings should be more accurate than any individual ranking (over the long-term), and each additional set of realistic ratings should improve the overall set.

—–

Thanks to everyone who pre-ordered their 2013 Almanac. We’re doing our best to make sure it’s worth much more than you paid for it.

One of the Coolest NFL stats charts ever…

Posted on May 23, 2013 by Brent

Bonus post before I go on vacation, and this one is awesome.

Remember that area chart I showed you that illustrated how the Eagles playmakers changed over time? Well I gave Jared (who did the 4th down series) my complete data and he put this together. Enjoy. (Sorry you have to click to open it, WordPress apparently doesn’t support Java, so I can’t embed it).

BTW: It’s interactive, play with it…

Final Fourth Down Thoughts

Posted on May 23, 2013 by Brent

I hope you all enjoyed the 4th down series. Thanks again to Jared for doing the research. Today I wanted to give a few thoughts of my own about the data and its implementation. (Go read if you haven’t yet, or see the 4th Down tab above for the strategy chart).

It will surprise nobody that I come down on the side of being more aggressive. The simple fact is that Coaches have been PROVEN to make sub-optimal decisions in certain situations. While we don’t know for sure why this happens, I agree with Jared that the most likely reason is essentially “groupthink” or a “herd mentality” along with slightly misaligned incentives.

The coach is incentivized to KEEP HIS JOB, not to win. Normally those things go hand in hand, and it’s very difficult to keep your job if you don’t win. However, in certain game situations (for example when a team is losing by a lot) coaches clearly make decisions that aren’t aimed at maximizing the odds of winning, like kicking field goals to minimize the margin of loss. Additionally, the “optimal” decision for coaches is NOT “whatever provides the greatest chance to win”. It’s more complicated than that.

The “optimal” decision, given the coach’s incentives, is one that achieves TWO goals: win the game, AND minimize criticism of said coach.

Looking at the results, I do not believe all of these coaches are ignorant of the statistically “optimal” decisions. Some likely are, but given the amount of money at stake and the number of very smart people in league front offices, you can be sure at least a few coaches realize what they’re missing.

The upshot is that this represents a potentially large INEFFICIENCY in the way the game is currently played. Some day a coach will take advantage of it. However, note that just because you play the odds correctly doesn’t mean you’ll be rewarded. This may be another reason for coaches’ reticence. This “aggressive” strategy WILL WORK, but not every time (as several commenters have noted). The benefits will only be clear after a LONG time. Most coaches don’t have the job security to wait that long while being criticized by beat writers for whom anything with a decimal is considered “analytics”.

While I hope (and expect) Chip Kelly to be among the more aggressive coaches in the league, I think it’s EXTREMELY unlikely that he makes a significant departure from what we see now. At the end of the day, Chip wants to keep his job. Unfortunately, such incentive misalignment, however slight, inhibits the pace of innovation in the sport (as it does in many industries).

I will certainly keep an eye on Chip’s 4th down strategy and we’ll discuss it here during the season.

I’d also like to address the points made by a few commenters about the overall utility of something like the 4th Down Strategy Chart.

– First, as explained in the first post, each team would, in practice, adjust the chart to account for the relative strength of the opposing defense. This is not a one-size-fits-all chart. However, given the HORRIBLE success rates, it’s pretty clear that team-to-team differences are not accounting for the overall results.

– I would not, though, blindly follow that chart. The research explicitly excludes end of half and end of game situations. TIME REMAINING becomes a huge factor in those cases, completely altering optimal strategy.

– I would, however, ALMOST NEVER PUNT with less than 2 yards to gain on 4th down. It is extremely difficult for a defense to stop the offense from gaining just 1 yard. I get the sense that many people don’t realize just how small a distance that is. Today’s homework is to grab a ruler and measure out three feet (a yard).

Also, let’s attack this psychologically. Think back to last season or picture yourself during a game. Your defense has just forced the offense into 4th and 1. Are you hoping for a punt? Or are you hoping the offense goes for it, so that your team can stop it and gain “momentum”?

I don’t care where on the field that situation takes place, most people are hoping for a punt (as is the defense!).

In general, if you (as an offense) are doing things the defense WANTS YOU TO DO, you’re doing it wrong!

Several people have mentioned the “momentum” surrendered by going for it on fourth down and not converting, and while I think the concept of “momentum” is largely exaggerated (though not nonexistent), you must also factor in the demoralizing effect that converting has on the opposing defense.

– Even on your own 1 yard line, I’d strongly consider going for it on 4th and 1. The median NET punting average for the league last year was approximately 40 yards. Using this number tells us that if you punt from your own goal line, you can expect the other team to start its possession around the 40 yard line. For some teams, that’s already in field goal range. For everyone else, it’s just a few yards outside.

So, if you punt from your own goal line you are essentially giving the other team 3 points, with the potential for 7. If you go for it and fail, in all likelihood you are giving the other team 7 points. However, if you go for it, you have decent odds of converting, meaning you’ve now added the possibility of scoring 7, scoring 3, and allowing 0 points to the situation.

At a high level, going for it sounds like the better option to me. Now that I have last season’s play-by-play data (procured last weekend), I will take a look and see if that’s actually the case. Our 4th Down Chart suggests it is.
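To make the comparison concrete, here is a back-of-the-envelope sketch of the expected-points arithmetic. Every probability below is a placeholder invented for illustration, not a figure from the play-by-play data, so treat the output as a template for the calculation and not as a result:

```python
def expected_net_points(outcomes):
    """outcomes: list of (probability, net_points) pairs that sum to probability 1."""
    total = sum(p for p, _ in outcomes)
    if abs(total - 1.0) > 1e-9:
        raise ValueError("probabilities must sum to 1")
    return sum(p * pts for p, pts in outcomes)


# HYPOTHETICAL outcome distributions (net points from the offense's view).
PUNT = [(0.25, -7), (0.35, -3), (0.40, 0)]             # opponent starts near your 40
AFTER_CONVERSION = [(0.10, 7), (0.10, 3), (0.80, 0)]   # drive continues
AFTER_FAILURE = [(0.85, -7), (0.12, -3), (0.03, 0)]    # opponent ball at your 1


def go_for_it_value(p_convert):
    """Expected net points of going for it, given a conversion probability."""
    return (p_convert * expected_net_points(AFTER_CONVERSION)
            + (1.0 - p_convert) * expected_net_points(AFTER_FAILURE))


def break_even_conversion_rate():
    """Conversion probability at which going for it equals punting."""
    punt = expected_net_points(PUNT)
    success = expected_net_points(AFTER_CONVERSION)
    failure = expected_net_points(AFTER_FAILURE)
    return (punt - failure) / (success - failure)


print("punt:", round(expected_net_points(PUNT), 2))
print("go for it at 65%:", round(go_for_it_value(0.65), 2))
print("break-even conversion rate:", round(break_even_conversion_rate(), 3))
```

The break-even rate is the useful output: if the real 4th-and-1 conversion rate sits above it, going for it wins on expected points under these assumptions, and swapping in real probabilities only changes the inputs, not the method.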

– I would ALMOST NEVER punt after crossing midfield. Unless it’s a late-game situation or there are a large number of yards to gain (8-9+), IT DOESN’T MAKE ANY SENSE! You’re already past midfield, meaning you’re not guaranteeing the other team any points if you don’t convert. This is where I expect Chip to be aggressive. It’s a more “defensible” decision and less likely to immediately back-fire. That means the reputational risk is minimized, allowing the coach to weigh the “win the game” side of his incentives more strongly.

– Lastly, I completely agree with the chart regarding 4th and 4 or fewer yards to go situations in field goal range. Kicking a field goal when you’ve got 4th and 1 is ridiculous (unless it’s late in the game or time’s running out in the first half). It’s 1 yard, go get it. It’s the statistically optimal decision, and 7 points is a LOT more impactful than 3. For those of you who buy into the “momentum” game, how much does 3 points get you? Close to none… Kicking a field goal with less than 4 yards to gain is a gutless (and stupid) decision.

That’s all for now. I’ll be on vacation, starting tomorrow and running through next week. So probably no posting. I encourage you to explore the archives though, I’ve tried to make it as easy as possible by giving you tools and shortcuts on the sidebar.

4th Down Decisions: Part 3 – Which Coaches are the Worst/Best?

Posted on May 22, 2013 by Brent

Time for the final part of our 4th Down Decision series. Today we look at individual coaches. Again, you can follow Jared at @jaredscohen.

Part 3

Decisions by Coach

Now we’re getting into some fun stuff. Which coaches have the highest pass rate? We’ve already established that as a whole, coaches are far too conservative. But are there any who appear to ‘get it’ more than their peers?

Like Moneyball, have any of them figured out that an overlooked (and more aggressive) approach might lead to better performance and more wins?

Let’s take a look. Below is a table of all the NFL coaches and their 2012 regular season optimal decision percentage:

Or, for a more interesting look, click this graphic:

Now, I know what you’re going to say.

Norv Turner???

Ron Rivera???

Marvin Lewis???

Not exactly a murderers’ row of coaching legends (Andy Reid’s up there too, by the way). And look at some of the coaches at the very bottom. The Seahawks? Packers? Falcons? They all had good years.

So what’s the deal? Am I just some crazy idiot from his mom’s basement?

Shockingly, I don’t think so. I don’t think our data or our conclusions are wrong. Although when you illustrate it a different way, there are still questions.

This chart illustrates coaching optimal decision rate (pass rate) against number of regular season wins. Now, what would be best would probably be point differential or Pythagorean wins or something slightly different, but what’s still interesting is that there appears to be a negative relationship between wins and optimal decision-making. Finding the correlation gives us a -0.34, so a slight negative relationship.

So, what gives?

Well, first, there’s a question of causality. Does this data mean that making the ‘optimal’ decisions actually prevents you from winning more games??? Should everyone just punt the living daylights out of the ball on every fourth down? Well, at least kickers and punters would be happy.

One could interpret it that way, but I think that would be wrong. I think anyone who thinks that’s the case has actually got their causality backwards.

What I think is a far more likely scenario, is that the worse a team is, the more often it’s playing from behind. And I think teams who are behind and trying to come back are often more aggressive. That aggression helps those teams to make ‘more optimal’ decisions during games. But unfortunately, those teams are behind on the scoreboard for a reason, and most of the time, they lose.

I think the causality works the other way. Poorer teams are more frequently losing, and therefore more focused on maximizing their total points (e.g., making a comeback). So they make more optimal decisions. The teams that are ahead most of the time are more likely to play conservative to hold their lead, so their decisions may be sub-optimal.

I tried to adjust for this by removing plays late in the 2nd/4th quarters and when the score was out of hand, but maybe that wasn’t enough.

Because while I’d like to give Norv and the other guys credit where it’s due, I have to think it’s largely driven by circumstances (but would love to hear other theories as well).

There’s also a way we can check for this.

Decisions by scoring differential

If we take a look at optimal decision rate based on what the score is, we can see if all coaches behave differently when their team is behind and trying to come back.

If the hypothesis is true, that coaches make ‘more optimal’ decisions when they’re trailing, then we’ll see that in the data, and that could explain why losing coaches (or in Norv’s case, fired coaches) have much higher optimal decision rates.

Hmmm…

When coaches are behind, they make optimal decisions roughly 10% more often than when they’re ahead. And in the fourth quarter, that gap is even bigger.

Looks like we may have an answer. Or at least some indication that when you’re behind in a game, you make better calls.

The optimal decisions themselves are more aggressive than most coaches in the NFL. Teams that are behind are often more aggressive to catch up. Therefore, teams who are behind more often (and lose more often) will naturally make more ‘optimal’ decisions.
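For anyone who wants to run the same kind of check on their own play-by-play data, a minimal sketch of the split looks like this. The input format (a score margin and a flag for whether the call matched the chart) is an assumption, and the sample plays are invented:

```python
from collections import defaultdict


def optimal_rate_by_state(plays):
    """plays: iterable of (score_margin, was_optimal), margin from the offense's view.

    Returns {state: share of 4th-down calls that matched the optimal decision}.
    """
    counts = defaultdict(lambda: [0, 0])  # state -> [optimal calls, total calls]
    for margin, was_optimal in plays:
        if margin < 0:
            state = "trailing"
        elif margin > 0:
            state = "leading"
        else:
            state = "tied"
        counts[state][0] += int(was_optimal)
        counts[state][1] += 1
    return {state: optimal / total for state, (optimal, total) in counts.items()}


# Invented sample: (score margin at the time of the play, did the call match the chart?)
sample_plays = [(-7, True), (-3, True), (-10, False), (7, False), (3, True), (0, False)]
rates = optimal_rate_by_state(sample_plays)
for state, rate in sorted(rates.items()):
    print(f"{state}: {rate:.0%}")
```

Splitting further by quarter would only require adding the quarter to the grouping key.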

Makes sense to me, but I was all excited to start the Norv Turner is better than Bill Belichick bandwagon.

I’ll have to put that on hold for now.

Implications

So what have we learned from all this?

We know that coaches are far too conservative, particularly when faced with short yardage situations in opposing territory. They kick the ball far too often rather than trying to convert.

Our data is pretty clear on this point. Coaches only make the right call 15% of the time when they’re supposed to be going for a first down, and when they’re in opposing territory, they make the right call less than half the time.

Well, the natural question is: why?

It can’t be a lack of information. NFL coaches and management have all kinds of data at their disposal (certainly more than I do), and plenty of talented folks. And it’s not like these ideas haven’t made it into the mainstream. High school coaches have made news for never punting, and analysts continue to harp on the conservative tendencies at the NFL level.

At the end of the day, I still maintain that it all comes down to risk aversion. NFL coaches (and most professionals) have one primary goal. To stay employed. And taking a strategy that goes against conventional wisdom exposes you to criticism if the outcomes don’t work out.

I read a quote from Mark Cuban not too long ago where he felt the idea that coaches wouldn’t try anything to win games was laughable. If I remember correctly, his thinking was that professional sports are so competitive and the need to win so great that of course coaches take every opportunity they can to get better. (and in googling around, I can’t seem to find it, so maybe I’m misremembering)

Cuban certainly has more experience in professional sports than I do, but I think at least in the NFL the data clearly suggests that’s not the case.

Coaching at the NFL level is the highest professional position a football coach can ever get. It typically takes years and years of work in all kinds of low-paying jobs (what does a quality coach even do?) and moving from city to city with the hope of one day snagging one of those 32 openings.

Oh, and once you get one of those slots, you can’t screw up, because you’ve spent your entire life specializing in a sport with a fixed number of teams and exactly one relevant professional league. You’ve invested your entire career to get to the top of the pyramid, but there’s nowhere to go but down. You’ve got to stay up there. It’s not like you can go coach pro baseball.

So with a lack of transferable skills outside of football, and the inability to create a startup NFL franchise to coach, NFL coaches are in something of a bind. Unless they have bulletproof job security (and in the long-run, no one has that, right Coach Reid?) they have a clear incentive problem to try such an easily observable strategy.

Why? Because it’s quite possible the outcomes won’t work out, and people will crucify you if they don’t. See the criticism Belichick got for his 4th down decision against the Colts some years ago. He made the right decision, but since the outcome didn’t work out, all of a sudden he made a terrible mistake.

It’s the problem of evaluating decision-making skill based on the outcome. The equivalent would be telling a poker player he screwed up when he got all his money in with the best hand and someone drew a miracle card to beat him.

If you made all the optimal decisions you could make, and let’s say you were running at a 50% pass rate before, you might do something differently on ~30 plays a year.

Some of those plays might work, but some might not, and the ones people will focus on will be the ones in the situations of highest leverage where the outcome of the game hangs in the balance.

Now, I think making more ‘optimal’ decisions could swing a game or two a year in your favor, but it could also swing a game or two against you (like Belichick and the Colts). When faced with that possibility, it’s no surprise coaches aren’t chomping at the bit to test out the theory.

For a coach to successfully try this (and it’ll only take one to succeed before others join), I would argue one of three scenarios needs to happen:

– An edict needs to come down from the owner themselves, mandating the change to football strategy (of course, what kind of coach would want to be in that kind of situation?). The most likely suspects would be new analytically inclined owners like the Khan family in Jacksonville, or owners that like to involve themselves in football operations (Jerry Jones, Dan Snyder, the late Al Davis).

– A coach fresh off a Super Bowl win uses the capital/credibility from his victory to test it out (you could imagine Belichick doing this, maybe Sean Payton still has enough cred, but I think you pretty much need to be a Harbaugh).

– A brand new coach in the first year of a contract just lets it rip and goes for it, putting their NFL credibility at risk because they at least could always go back to college.

If you think I’m suggesting Chip Kelly go for it in his first year with the Eagles, you’d be absolutely correct.

4th Down Decisions: Part 2 – How often do NFL Coaches make the right call?

Posted on May 21, 2013 by Brent

If you missed part one (posted yesterday), I encourage you to read it before moving to today’s continuation. At the end of yesterday’s post, we arrived at a default 4th Down Strategy chart, essentially a cheat sheet that tells coaches when to go for it and when to punt/kick a FG. For future reference, I have added the chart as a permanent fixture that can be accessed through one of the menu tabs at the top of the site. That should make watching the games more fun (or frustrating since you’ll know in real-time when bad decisions are being made).

Today, we move to grading. Using that chart, how do NFL Coaches perform? As I mentioned yesterday, this research was done by Jared Cohen; you can follow him on Twitter at @jaredscohen.

Fair warning, this is a very detailed analysis (more of a research project) and therefore is longer than the typical blog post. Please read it when you have some time. If that’s not possible, feel free to skip to the charts.

Part 2

Methodology

To examine 4th down coaching decisions, I took the following steps.

– I downloaded a comprehensive set of all fourth down plays from the 2012 regular season, along with a set of key variables I could track and control for:
1. Distance to go for a first down/touchdown
2. Quarter and clock time of the play (e.g., Q1, 14:30)
3. Field position (e.g., own 35 yard line)
4. Scoring margin (e.g., team up by 3 points)

– Each play was segmented by the choice of its coach as either a punt, FG attempt, or conversion attempt (by rush or pass)

– Based on the distance for conversion and field position, I compared the fourth down play call to the optimal strategy matrix (the strategy card), to see what the ‘right’ choice would be

– A play call in which the coach made the optimal decision was termed a ‘Pass’, while a play call that was not optimal (e.g., punting instead of trying to convert the 4th down) was termed a ‘Fail’
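For the programmers in the audience, here is a minimal sketch of that grading step in Python. It is not the actual code behind the analysis; the labels, the `FourthDown` record, and the tiny two-cell card are all made up for illustration, and the yard-line convention (yards from your own goal line, so 91 means the opponent’s 9) is an assumption. The two sample plays are calls discussed in this post: the Titans going for it on 4th and 1 from the opposing 37, and the Seahawks kicking on 4th and 2 from the Arizona 9, both spots where the card says go for it.

```python
from dataclasses import dataclass

# Hypothetical labels for the three kinds of call.
GO, FG, PUNT = "go", "fg", "punt"


@dataclass
class FourthDown:
    coach: str
    yards_to_go: int
    yardline: int  # yards from own goal line (assumed convention: 91 = opponent's 9)
    call: str      # what the coach actually did: GO, FG, or PUNT


def grade(play: FourthDown, card: dict) -> int:
    """Return 1 for a 'Pass' (call matches the card) or 0 for a 'Fail'."""
    optimal = card[(play.yards_to_go, play.yardline)]
    return 1 if play.call == optimal else 0


# Toy card holding only the two cells needed here, keyed by (yards to go, yardline).
toy_card = {(1, 63): GO, (2, 91): GO}

titans = FourthDown("Munchak", yards_to_go=1, yardline=63, call=GO)    # went for it
seahawks = FourthDown("Carroll", yards_to_go=2, yardline=91, call=FG)  # kicked

print(grade(titans, toy_card), grade(seahawks, toy_card))  # prints: 1 0
```

Note that the grade never looks at whether the play worked, only at whether the call matched the card.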

Pretty simple right?

Now, before getting to any data, I should also note that I excluded a number of specific plays, for reasons which I’ll explain. Remember, the goal of the analysis is to determine whether coaching decisions are optimal under normal circumstances. The key word here is normal.

– If a team was either leading or trailing by more than 14 points (two touchdowns), I excluded the decisions, reasoning that coaches would be making decisions differently than normal behavior (e.g., trying to catch up)

– If the distance required was longer than 10 yards (e.g., 4th and 12 yards to go), I excluded it. I did this largely because those situations usually aren’t decisions for the coach. It’s a pretty clear field goal attempt or punt depending on where you are, and my major area of focus was on situations where a coach could decide to go for it

– Plays were also excluded if they occurred in the last 2 minutes of the 2nd quarter or the last 5 minutes of the 4th quarter, as coaching behavior will also change significantly. In the 2nd quarter, it’s because a team can’t maintain possession past halftime. In the 4th quarter, it’s because the game is ending and teams will no longer be trying to maximize their total points; they’ll be more focused on gaining/maintaining a lead.
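The three exclusions above boil down to a simple filter. Here is a rough sketch of it in Python; the function name and parameters are hypothetical, and I’m assuming the cutoffs are inclusive at exactly 2:00 and 5:00 (the write-up doesn’t say) and ignoring overtime entirely.

```python
def is_normal_situation(margin: int, yards_to_go: int,
                        quarter: int, seconds_left: int) -> bool:
    """Return True if a fourth down survives all three exclusions.

    margin: offense's score minus defense's score at the snap
    seconds_left: seconds remaining in the current quarter
    """
    if abs(margin) > 14:                      # leading or trailing by more than two TDs
        return False
    if yards_to_go > 10:                      # e.g. 4th and 12
        return False
    if quarter == 2 and seconds_left <= 120:  # last 2 minutes of the half (assumed inclusive)
        return False
    if quarter == 4 and seconds_left <= 300:  # last 5 minutes of the game (assumed inclusive)
        return False
    return True


# A 4th and 3 while down three points midway through the first quarter stays in.
print(is_normal_situation(margin=-3, yards_to_go=3, quarter=1, seconds_left=600))  # True
```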

So while the aforementioned decisions could be interesting, they were in situations which are inherently not-normal. The main goal is to see what coaches do in a typical situation. Even after subtracting all these conditions, we have over 2,100 fourth down calls to evaluate. That should be plenty.

So what did we see?

Results

We saw a pretty large number of failures. I’d never want to play blackjack with these guys.

Below is a chart of the overall grade for NFL coaches’ fourth down decisions, by quarter.

Yes, you’re reading that right. When making a decision as to what to do on 4th down, the NFL coaching body as a whole makes the ‘optimal’ decision just slightly over half the time.

Think about that for a minute. In just about half of all normal 4th down situations, coaches are making decisions that fail to maximize their number of expected points (and we should expect, actual points).

That seems kind of strange. And yet it also seems completely believable in a league where practically ALL coaches are far too conservative.

But let’s spend some more time peeling back this coaching decision onion and look at a few more specific cuts of the data that can give us more insight into exactly where these decisions are happening.

This will include:

– Decisions by optimal decision (what kinds of decisions are the most frequently screwed up)

– Decisions by field position (how do decisions vary by where you are on the field)

– Decisions by yards to go (does optimal decision-making vary by distance)

– Decisions by coach (which coaches appear to have the highest grades)

– Decisions by scoring differential (does decision-making change when you’re ahead/behind)

– Some fun with coaches (looking at specific game decisions to understand exactly what the implications are)

But before we get to that, there are a few caveats to all this analysis, which I want to make clear. This is to head off complaints and anti-analytics folks who may have already commented about how I live in my mom’s basement.

– This analysis accepts the illustrated decision matrix as optimal, when in reality, that may not hold completely. It’s based on my interpretation of Brian Burke’s work, which I think is logical and is the leading model that I’ve seen. (I also ran these numbers with an alternative model generated for college football, and the results were consistent with expectations: the grades were much worse, since college teams should kick field goals much less frequently than NFL teams do, given the hash marks, the lower kicking talent level, etc.)

– These optimal decisions do not take into account the talent/performance of the teams in question. They assume equal teams are playing each other. So could a team with a great offense merit different ‘optimal’ choices where they go for it more often? Of course. You could also adjust for the defense of your opponent, the skill of your kickers, the opposing punt return man, home field advantage, weather, or any recent lunar eclipse. This doesn’t have any of those adjustments.

– When we get into very granular cuts of data (specifically with coaches), we start to run into potential sample size issues.

All of this is to say that yes, there are concerns with this (or any) piece of analysis. The goal of this isn’t to find incontrovertible proof, or establish new football dogma, it’s to investigate an issue and understand potential implications.

Analytics can serve as a helpful guide and show you where some issues might be, but I’m not going to pretend the conclusions are absolute.

Decisions by decisions (From the Department of Redundancy Department)

One of the reasons I wanted to look into 4th downs at all was because I’m continually frustrated by coaches kicking field goals or punting.

Coaches are too conservative; just about any research in football has suggested as much. So when I put this sample together, I wanted to see what my data looked like.

The first thing I did was filter all the fourth downs based on what the ‘optimal decision’ actually was. Remember, each fourth down, based on our strategy chart, was either ‘Go for it,’ ‘Field Goal,’ or ‘Punt’, based on the action that would maximize your expected points.

So, if we sort our data by what the optimal decision should be, we can see whether NFL coaches are screwing up opportunities to punt, kick field goals, or go for it (hint – my money is on go for it)

As TMQ’s Gregg Easterbrook would exclaim, ‘Ye gods!’

The good news for NFL coaches is that when they’re supposed to be conservative, they are fantastic about it. In situations where coaches should be kicking a field goal or punting, they decide to do just that 97% and 99% of the time, respectively. Amazingly optimal performance!

Of course, such high rates would suggest that our coaches are being extremely conservative, which probably seeps over into fourth downs where they should be trying to convert. And sure enough, the abysmal 15% pass rate on fourth downs where teams should be going for it is exactly what we’d expect to see from overly conservative managers.

That means that, when faced with a fourth down where the best outcome is to go for it, coaches choose to kick away (punt or FG) about 85% of the time.
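If you want to reproduce this kind of cut yourself, the bookkeeping is just a grouped pass rate. Below is a small Python sketch, assuming each play has already been reduced to an (optimal call, actual call) pair. The function name is hypothetical and the six sample plays are invented purely to show the mechanics; they are not from the data set.

```python
from collections import defaultdict


def pass_rate_by_optimal(plays):
    """plays: iterable of (optimal, actual) pairs.

    Returns {optimal call: share of plays where the coach matched it}.
    """
    totals = defaultdict(int)
    passes = defaultdict(int)
    for optimal, actual in plays:
        totals[optimal] += 1
        if actual == optimal:
            passes[optimal] += 1
    return {call: passes[call] / totals[call] for call in totals}


# Invented sample: four 'go' situations, one punt situation, one FG situation.
sample = [
    ("go", "punt"), ("go", "fg"), ("go", "go"), ("go", "punt"),
    ("punt", "punt"), ("fg", "fg"),
]
print(pass_rate_by_optimal(sample))  # {'go': 0.25, 'punt': 1.0, 'fg': 1.0}
```

The fail rate for any bucket is simply one minus its pass rate, which is how a 15% pass rate turns into kicking away about 85% of the time.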

Wow. That seems insanely high. Making the wrong choice 85% of the time? I feel like in most jobs those kinds of choices get you fired. If you make 85% of the burgers wrong at McDonald’s, you’ll most definitely be out of a job.

But to give you a sense of what these failing decisions actually look like, I’ve included a sample from my data set below:

All of these are example ‘fails’ by NFL coaches when the best choice would be to go for a first down. Some of them seem pretty obvious. Pete Carroll and the Seahawks had 4th and 2 from the Arizona 9 yard line and elected to kick a field goal (which by the way, they missed). Some of them, like Dennis Allen opting for a FG from the Chargers 33 on 4th and 6, seem a bit more arguable.

But across all those possible decisions, only 15% of them were the ‘optimal’ choice. Even with my earlier caveats (not adjusted for team ability or game situation), it still seems like something is systematically wrong with NFL coaches.

Decisions by Field Position

So we know what type of decisions coaches mess up. They’re too conservative and should be going for it more often.

But let’s keep going, and ask ourselves, are these decisions happening all over the field? Are they happening more in some areas than others?

I broke down the field into four main zones, and looked at the data that way.

– Own territory – Anywhere between your goal line and the 50 yard line

– The ‘Maroon Zone’ – A term I borrowed as an homage to TMQ, who has railed against over-conservative coaching for years. My definition of the Maroon Zone is in opposing territory, but not as far as the opposing 35 yard line. Too far for a field goal, but surely too close to punt!

– FG Range – Any position between the opposing 35 yard line and the opposing 20 yard line. From the 35 yard line a field goal would be 52 yards, which is more or less the regular range of today’s NFL kickers. We could split hairs and move it back a few more yards, but this was where I decided to draw the line.

– Red Zone – Anywhere from the opposing 20 yard line to the opposing goal line
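Here is how those four bins might look as a Python function, offered as a sketch rather than the code actually used. Field position is expressed as yards from your own goal line (so 65 is the opposing 35), which is an assumed convention. The definitions above overlap at the edges, so the boundary handling is my guess: the 50 counts as own territory, the opposing 35 counts as FG Range, and the opposing 20 counts as Red Zone. The kick distance adds 17 yards to the line of scrimmage (10 for the end zone plus roughly 7 for the snap and hold), which matches the 52-yard figure quoted for the 35.

```python
def field_zone(yardline: int) -> str:
    """Bin a field position given as yards from your own goal line (1-99)."""
    if not 1 <= yardline <= 99:
        raise ValueError("yardline must be between 1 and 99")
    if yardline <= 50:
        return "Own territory"
    if yardline < 65:        # opposing 49 through opposing 36
        return "Maroon Zone"
    if yardline < 80:        # opposing 35 through opposing 21 (assumed boundaries)
        return "FG Range"
    return "Red Zone"        # opposing 20 to the goal line


def field_goal_distance(yardline: int) -> int:
    """Kick length in yards: distance to the goal line plus 17."""
    return (100 - yardline) + 17


print(field_zone(63))           # opposing 37 -> Maroon Zone
print(field_goal_distance(65))  # opposing 35 -> 52
```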

So I set these bins and filtered the fourth downs. Where on the field are teams making more suboptimal decisions?

I should’ve saved the ‘Ye Gods’ for this, huh?

One of the first things that jumps out is that coaches make the most optimal decisions in their own territory. This makes sense, as we know our coaches are big on punting, and in their own territory, that’s more likely the right decision. (Of course, in an absolute sense, getting only two-thirds of decisions right isn’t exactly fantastic)

The other thing that jumps out is the performance in the Maroon Zone, so opposing territory but a bit too far for most field goals. Coaches are only making the right decision about 32% of the time here.

Again, some of you might be wondering what types of decisions this entails, so I’ve pulled a sampling of Maroon Zone decisions. This table illustrates eight examples of Maroon Zone decisions from my data set, all from the first quarter of the first week of this season. It includes the coach, matchup, down and distance, score position, decision (both actual and optimal), and grade.

Let’s take the first row, when Mike Munchak and the Titans, facing a fourth and 1 from the opposing 37 yard line, faced a decision. You’ll see the Titans elected to pass, and that the optimal decision was to go for it. For that ‘passing’ decision, they received a ‘1’ grade.

Contrast that with Mike McCarthy and the Packers in their game against the 49ers. McCarthy had a decision on fourth and 3 from the 49ers 45 yard line. At the time, the Packers were down three points. With arguably the best quarterback in football and a stable of talented receivers, did McCarthy choose to go for it?

No, the Packers punted. Now, you could argue that the 49ers have a great defense and field position is key and blah blah blah. I’m not saying those arguments have no merit, I’m just saying I would’ve gone for it, and in that situation, going for it is the right move.

You may ask what happened on those plays. What were the outcomes? Did the Titans convert the first down? Did the Packers’ punt give them great field position?

Well, frankly, I don’t care what happened. Judging a decision based on the outcomes creates a whole set of biases which I don’t want to influence our analysis. The goal of this is to understand whether coaches are making the right decisions, not whether those decisions ended up working out. To me, that means we should keep the outcomes completely outside of this conversation.

Decisions by Yards to Go

So coaches aren’t going for it enough, and although they make the wrong decision most of the time whenever they’re in opposing territory, it’s at its worst when they’re beyond field goal range.

That’s interesting, if not completely unexpected.

But how is their decision-making impacted by the distance required for a first down? Is there any difference in a coach’s decision-making whether it’s 8 yards to go vs. 3 yards to go?

Again, not a shocking result.

When the optimal decision is a more conservative approach (like punting on a fourth and ten), coaches almost always get it right.

But as the distance to convert shrinks, performance gets remarkably worse, especially around 2-3 yards to go, when coaches are only getting it right one-fifth of the time.

Again, it’s the conservative approach that does them in. When coaches should be trying for conversions, they’re punting the ball away or attempting a field goal. What’s interesting is that with only one yard to go, they’re actually a bit better. I feel like it’s a gap based on aesthetics more than anything else. “One measly yard? We can go get that!” the coach may say to himself. But push it back another three feet and it somehow becomes impossible.

Now we’ve seen just how bad NFL coaches (as a group) are when it comes to optimal 4th down decision-making. Tomorrow, we’ll look at individual coaches to see who is the best (or least bad as the case may be). The results are shocking…

4th Down Decisions: Part 1 – Creating an Optimal Strategy Card

Posted on May 20, 2013 by Brent

Today is the first installment of a series of posts about 4th Down Decision-making in the NFL. These are going to be a bit long, but extremely interesting for fans interested in the high-level analysis of the game (which should be just about everyone who comes here).

I’ve broken the entire analysis down into 3 parts:

Part 1 (today) will discuss the overall idea behind the analysis and build to a 4th Down Strategy Chart

Part 2 (tomorrow) will use that strategy chart to grade NFL coaches and show us how often the “right” decisions are made

Part 3 (Wednesday) will show the results for individual coaches and discuss potential implications of the entire analysis.

This was authored by Jared Cohen, with limited editing/input from me. You might remember Jared from his previous posts here, specifically his timely explanation, shortly before the Super Bowl, of why running a kick out from the back of the end zone is usually a GOOD decision. Of course, soon after that analysis, Jacoby Jones ran a kick out from the back of the end zone in the SB, resulting in a 108 yard touchdown…

Also, by way of qualifications, Jared is a:

– Two-time Jeopardy champion, and the man who brought Family Guy to life in Final Jeopardy. You can buy his e-book about the whole experience for just $0.99 here.

– MBA from the Booth School of Business at the University of Chicago, now a management consultant at Booz & Co.

– Play-charter for Football Outsiders for the 2012 season

You can follow him on twitter at @jaredscohen.

Part 1

Always double down on eleven.

It’s a rule of thumb any blackjack player will tell you. Applicable in almost every situation, at any casino, when your cards come up eleven and the dealer isn’t showing an Ace, you double down. No matter how much you’re betting, no matter how bad your luck has been, no matter which random foreign country your expressionless automaton of a dealer is from, you double down on eleven.

Why?

Because it gives you the best chance at winning.

And how do we know that doubling down gives you the best chance at winning?

Because people have researched it. People have done their homework, complete with statistics programs, random number generators and complicated looking equations (or as it’s known in some parts of the country, “witchcraft”).

They even have a card which tells you what to do. Since Blackjack is a game with well-defined rules and structure, there can be a clear strategy that, if executed, will help you win as much as possible (or, to be technically accurate, help you lose as little as possible).

While you don’t have to do what the card tells you, it’s definitely more right than your ‘gut’ instincts or homebrewed system. It’s based on data.

Yet, some players will insist on hitting their 12 against a dealer 6. They’ll refuse to double down on 11. They’ll split 10s!

Simply put, they’re doing it wrong.

So how does this relate to football? Well, I was wondering if NFL coaches are doing the exact same thing, eschewing data in favor of their own gut.

What I found was a little surprising.

Now, football is a complex game, with a lot of situations and decisions. For this analysis, I decided to focus on what has long been a frustrating aspect of many Sundays spent with the Red Zone channel…Fourth Downs.

That shouldn’t come as a shock. Fourth downs are the most direct view into a coach’s risk tolerance and personal philosophy.

And in thinking back to my blackjack discussion, I wondered what NFL coaches would look like if there were an optimal strategy for fourth downs. There’s already a chart for when you’re supposed to go for a two point conversion. If there were a basic strategy 4th down card, how would coaches perform when compared to it? How close would they be to the best strategy?

That is to say, do these coaches know to double down on eleven? Or are they the guy sitting at the table refusing to split their pair of eights?

In these articles, I’ll describe the methodology behind the analysis, illustrate some results, and discuss some of the implications.

Context and Expected Points

To start things off, I wondered if anyone had already done research into optimal fourth down strategy. I know lots of smart football fans out there have already spent time on it, so I figured it would already be established in the literature (and would save me the time of having to establish it).

Turns out it has been, and we’ll get into it in some detail, but before we get to that, we need to just establish the foundation for such an optimal strategy chart. And that means some background in the idea of expected points.

Originally conceived back in the 1980s, expected points has gained a fair amount of traction with the football analytics community. Even ESPN uses it. Here’s a little bit of their explanation, which I’m cribbing because it’s clear.

Based on statistical analysis of 10 years of NFL play-by-play data, ESPN has created a formula that assigns an “expected points” value to the team with the ball at the start of each play based on the game situation. Expected points (EP) accounts for factors such as down, distance to go, field position, home-field advantage and time remaining.

The value it puts out is on a scale from about minus-3 to 7, and it basically represents “which team is likely to score next, and how many points?” It represents the likely points not just on the current drive but also on the next drive or any subsequent drive until the score changes or the half ends. A lower value indicates a more favorable situation for the defense (i.e. fourth-and-20 from your own 1-yard line could be close to minus-3 EP), and a higher value represents a more favorable situation for the offense (i.e. first-and-goal is generally worth 6 EP).

Essentially, expected points is a way to evaluate football situations against each other. Because each play is based on its situation (what down it is, field position, and how many yards to convert a first down), expected points serves as a normalized metric for the value of possession in any situation. What’s more, after a play you can compare the expected points values from the old and new situations to determine how valuable the play was. Here’s an example from ESPN’s explanation:

From your own 20-yard line, an 8-yard gain on third-and-10 is worth about minus-0.2 EPA because you don’t get a first down; the same 8 yards on third-and-7 is worth 1.4 EPA for converting a long third down and keeping the drive alive. EPA knows that not all yards are created equal.
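In code, the comparison ESPN describes is nothing more than a subtraction: the expected points of the new situation minus the expected points of the old one. Here is a tiny Python illustration. The before-and-after EP values are made up (ESPN’s excerpt only gives the resulting EPA figures), chosen so the differences land on the minus-0.2 and 1.4 quoted above, and the function name is my own.

```python
def expected_points_added(ep_before: float, ep_after: float) -> float:
    """EPA of a play: EP of the resulting situation minus EP of the starting one."""
    return ep_after - ep_before


# Hypothetical EP values, not ESPN's actual numbers.
# Third-and-10, gain of 8: now fourth-and-2, so the situation got slightly worse.
short_of_sticks = expected_points_added(ep_before=1.0, ep_after=0.8)
# Third-and-7, gain of 8: first down, so the drive stays alive.
converted = expected_points_added(ep_before=0.4, ep_after=1.8)

print(round(short_of_sticks, 1), round(converted, 1))  # -0.2 1.4
```

Same eight yards, very different value, which is the whole point of the metric.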

So, using expected points as our basis, we can identify the best option (the one that creates the largest increase in expected points) for any given fourth down situation. And when I say we, I mean Brian Burke.

Burke has already done a fair amount of research on the optimal fourth down strategy, which I’ve leveraged as the base for this analysis.

Using expected points values and historical data, Burke worked out what the ‘optimal’ fourth down decision should be (FG, punt, or going for it). Depending on your field position and yards to go, he put together a view of what decision would be best for a given fourth down. His chart of that strategy is below:

Depending on where your given fourth down situation falls, the best option is illustrated here. For example, if you had a fourth down and two yards to go on your opponent’s 10 yard line, the recommended option would be to go for it. Alternatively, if you faced a fourth down and had eight yards to go to convert it from your opponent’s 10 yard line, you should kick a field goal.

Make sense?

This chart can serve as the basis for our basic fourth down strategy chart.

Put another way, based on my rough transcription, it would look something like this. Apologies for the lack of labels, but:

– The columns show yards to go (so 1 = 4th and 1)

– The rows show yard-line on the field, from your own end zone (so 5 = own 5 yard line, 95 = opponent’s 5 yard line).

– Together, cell 2 x 2 shows the correct decision when faced with 4th and 2 from your own 2 yard line.
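For anyone who wants to use the card programmatically, the layout described above maps naturally onto a lookup table keyed by (yards to go, yard-line). The Python sketch below is only an illustration: the names are hypothetical, and the card contains just the two cells spelled out earlier in this post (4th and 2 and 4th and 8 from the opponent’s 10), not the full transcription.

```python
def to_yardline(side: str, yard: int) -> int:
    """Convert ('own', 35) or ('opp', 10) into yards from your own end zone."""
    if side not in ("own", "opp"):
        raise ValueError("side must be 'own' or 'opp'")
    return yard if side == "own" else 100 - yard


# Partial card keyed by (yards to go, yardline); only cells stated in the text.
PARTIAL_CARD = {
    (2, 90): "go for it",   # 4th and 2 from the opponent's 10
    (8, 90): "field goal",  # 4th and 8 from the opponent's 10
}


def recommended(card: dict, yards_to_go: int, side: str, yard: int):
    """Look up the card's call; returns None for cells not transcribed here."""
    return card.get((yards_to_go, to_yardline(side, yard)))


print(recommended(PARTIAL_CARD, 2, "opp", 10))  # go for it
print(recommended(PARTIAL_CARD, 8, "opp", 10))  # field goal
```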

Looks just like a blackjack strategy card, doesn’t it?

Note that there are a few strange suggestions in the card which are most likely just statistical anomalies that will disappear with more data. The most obvious example of this is cell 2 x 98 (or what to do when faced with a 4th and 2 from the opponent’s 2 yard line). The chart says kick the field goal, but it makes very little sense to kick from the 2 yard line while going for it from the 1 and 3-6 yard lines.

Also, in practice, teams could adjust the card each week to account for the relative strength of the opposing defense. If you’re playing a very weak defense, your odds of converting 4th downs go up, so you’d see a few more Green blocks above.

Now that we have an “optimal strategy”, or at least a “default”, we can compare it to real life decisions and see how often coaches make the right call. Come back tomorrow for the results in Part 2.

Eagles Rewind

Objective and Analytical Analysis of The Philadelphia Eagles and the NFL