Which WAR Statistic Is Best for Evaluating Pitchers?
When analyzing baseball players, there are a million metrics out there from which to choose.
There is, of course, our very own nERD metric, which aims to tell how much more a player contributes to run production compared to a league average player. Traditionalists still like using numbers like batting average, pitcher's wins and RBI, while the newer school analysts focus on weighted on base average (wOBA), isolated power (ISO) and weighted runs created (wRC+), among many other things.
Most fans gravitate to Wins Above Replacement, or WAR. For pitchers, you get the truest sense of their value by looking at all their stats, including ERA, Fielding Independent Pitching (FIP), strikeout and walk percentages, and ERA+ just to name a few. But WAR has become the go-to stat for most who are ranking players and pitchers.
However, if you have used WAR for pitchers recently, you've certainly noticed a difference in the numbers websites use to calculate just how much more valuable a particular pitcher has been over a league replacement player.
First, a quick overview. Baseball Reference calculates their WAR (bWAR) by using a formula that includes a pitcher's earned run average (ERA) into the equation. FanGraphs (fWAR) uses FIP to make their calculations. In order to understand the difference, you need to know what those two stats mean.
ERA is simply the number of earned runs a pitcher would allow over the course of nine innings. A pitcher with a 3.00 ERA would allow, on average, three runs a game if they pitched nine innings a game every time out. FIP is a sabermetric stat that looks at only the things that a pitcher can control; strikeouts, walks, hit by pitches and home runs. It seeks to remove any numerical information that involves balls put in the field of play. Therefore, pitchers who have high strikeout rates, low walk rates, and give up few home runs typically have a lower FIP. However, if a pitcher is a ground ball pitcher and happens to give up a lot of hits, or perhaps plays in front of a defense with minimal range, their ERA can sometimes be elevated due to that shoddy defense or plain 'ol bad luck, while their FIP would stay low.
The value in FIP is that it is a forward-looking metric, aimed at predicting how much success a pitcher will have moving forward. ERA is simply a measurement of what a pitcher has already done.
The table below shows the top 25 starting pitchers in Major League Baseball last year, according to their FanGraphs Wins Above Replacement (fWAR). In the column next to fWAR is their Baseball Reference Wins Above Replacement (bWAR). You'll notice some differences.
Name | Team | fWAR | bWAR | Difference |
---|---|---|---|---|
Corey Kluber | Indians | 7.3 | 7.4 | -0.1 |
Clayton Kershaw | Dodgers | 7.2 | 7.5 | -0.3 |
Felix Hernandez | Mariners | 6.2 | 6.8 | -0.6 |
David Price | - - - | 6.1 | 4.6 | 1.5 |
Phil Hughes | Twins | 6.1 | 4.3 | 1.8 |
Jon Lester | - - - | 6.1 | 4.5 | 1.6 |
Max Scherzer | Tigers | 5.6 | 6 | -0.4 |
Chris Sale | White Sox | 5.4 | 6.6 | -1.2 |
Jose Quintana | White Sox | 5.3 | 3.5 | 1.8 |
Jordan Zimmermann | Nationals | 5.2 | 4.9 | 0.3 |
Adam Wainwright | Cardinals | 4.5 | 6.1 | -1.6 |
Garrett Richards | Angels | 4.3 | 4.4 | -0.1 |
Stephen Strasburg | Nationals | 4.3 | 3.5 | 0.8 |
Johnny Cueto | Reds | 4.1 | 6.4 | -2.3 |
Jeff Samardzija | - - - | 4.1 | 3.7 | 0.4 |
Dallas Keuchel | Astros | 3.9 | 5.1 | -1.2 |
Zack Greinke | Dodgers | 3.9 | 4.3 | -0.4 |
Cole Hamels | Phillies | 3.8 | 6.6 | -2.8 |
James Shields | Royals | 3.7 | 3.3 | 0.4 |
Madison Bumgarner | Giants | 3.6 | 4 | -0.4 |
Mark Buehrle | Blue Jays | 3.5 | 3.7 | -0.2 |
Hiroki Kuroda | Yankees | 3.5 | 2.4 | 1.1 |
Scott Kazmir | Athletics | 3.3 | 1.6 | 1.7 |
Justin Verlander | Tigers | 3.3 | 1.1 | 2.2 |
Sonny Gray | Athletics | 3.3 | 3.1 | 0.2 |
The pitchers with "negative" differentials in the table above are those whose bWAR are substantially higher than their fWAR. In other words, they likely had an ERA which was better than their FIP. The most extreme example above is Cole Hamels, whose fWAR of 3.8 was only 18th-best in all of baseball. However, his bWAR of 6.6 was tied for 4th-best.
Justin Verlander, meanwhile, posted an fWAR of 3.3, 24th-best in baseball last season. However, his bWAR of 1.1 was tied for 171st in the Majors.
Just by looking at the fWAR rankings, you would believe that Hamels was worth only half a win more than Verlander in 2014, whereas bWAR puts the difference at 5.5 wins. That is a five-win disparity. Now, look at their numbers last year. Which do you think is the more accurate gauge?
Player | ERA | FIP | K/9 | BB/9 | W | L | AVG | HR/9 |
---|---|---|---|---|---|---|---|---|
Cole Hamels | 2.46 | 3.07 | 8.71 | 2.59 | 9 | 9 | .231 | 0.62 |
Justin Verlander | 4.54 | 3.74 | 6.95 | 2.84 | 15 | 12 | .271 | 0.79 |
Looking at those numbers, the fWAR calculations make no sense. Hamels was clearly a superior pitcher last season, and by more than just half a win. In this case, the bWAR numbers make much more sense.
There were other examples, like David Price, who had the fourth-highest fWAR in the league despite giving up more hits than any pitcher in baseball. Phil Hughes had the fifth-highest fWAR in baseball, thanks to a FIP of 2.65. However, his ERA of 3.52 certainly hurt his bWAR, putting him tied for 18th overall.
Here's a look at the top 15 pitchers in each league, according to fWAR and bWAR.
Position | Player | fWAR | Player | bWAR |
---|---|---|---|---|
1 | Corey Kluber | 7.3 | Clayton Kershaw | 7.5 |
2 | Clayton Kershaw | 7.2 | Corey Kluber | 7.4 |
3 | Felix Hernandez | 6.2 | Felix Hernandez | 6.8 |
4 | David Price | 6.1 | Cole Hamels | 6.6 |
5 | Phil Hughes | 6.1 | Chris Sale | 6.6 |
6 | Jon Lester | 6.1 | Johnny Cueto | 6.4 |
7 | Max Scherzer | 5.6 | Adam Wainwright | 6.1 |
8 | Chris Sale | 5.4 | Max Scherzer | 6 |
9 | Jose Quintana | 5.3 | Jake Arrieta | 5.3 |
10 | Jordan Zimmerman | 5.2 | Dallas Keuchel | 5.1 |
11 | Adam Wainwright | 4.5 | Tanner Roark | 5.1 |
12 | Garrett Richards | 4.3 | Jordan Zimmerman | 4.9 |
13 | Stephen Strasburg | 4.3 | Henderson Alvarez | 4.6 |
14 | Johnny Cueto | 4.1 | David Price | 4.6 |
15 | Jeff Samardzija | 4.1 | Jon Lester | 4.5 |
 |  |  | Doug Fister | 4.5 |
Looking at that list, it's easy to nitpick from both sides. Was Price really worth more wins than all but three pitchers in baseball last season? His fWAR says yes. Was Hamels really worth as many wins above replacement as Chris Sale? Baseball Reference says he was. Baseball Reference also says Tanner Roark was slightly better than Jordan Zimmerman last year, but you and I know better. And I know Phil Hughes had an outstanding year last year, but there's no way he was worth half a run more than Sale or Max Scherzer.
For pitchers, I prefer to use bWAR, because I prefer judging pitchers based on how many runs they actually gave up rather than on a formula that does its best to calculate how many of those runs were really their responsibility. While FIP is an extremely useful metric and should always be used in conjunction with ERA, I still tend to favor ERA when making the ultimate call of "who is the better pitcher?"
That being said, anyone analyzing pitchers should really be using both sets of numbers and understanding where the differences come from. Perhaps Hamels is neither the fourth-best pitcher in baseball, nor is he 18th-best. Maybe he's somewhere in the middle.
That, to me, is the lesson in all this. Use all the numbers together and make your judgments based on everything. It's really the only way to get a true sense of just who the best pitchers in baseball are.
But in a pinch, for pitchers, go with bWAR.