Checking the Numbers: Ultimate Matchups

June 19, 2009

In the first inning of an August 13, 2006 game between the Dodgers and the Giants, fans bore witness to an historic matchup, as Greg Maddux squared off against Barry Bonds. The two had faced each other many times throughout their illustrious careers prior to this specific game, with Bonds accruing 34 hits in 120 at-bats against the four-time Cy Young Award winner, but this particular matchup marked the first time in the history of the sport that a 300-game winner would do battle with a member of the 700 home-run club. Even though both players were on the wrong side of 40 years old, this heavyweight matchup still garnered main-event status. Neither disappointed in that first plate appearance, with both players displaying everything we had come to know about their respective games in a short, two-pitch sequence. After Bonds exhibited his tremendous batting eye, taking a two-seamer with incredible movement that sailed just outside for a ball, he smashed the second pitch up the middle. Maddux, not so arguably the greatest-fielding pitcher of all time, snared the scorching liner in what seemed like a millisecond and doubled Ray Durham off of first base. This confluence of events exemplified the intrigue of the batter vs. pitcher matchup.

What should the hurler throw in a specific situation? How should the hitter alter his approach from the previous trip to the plate? Can he decipher any tells or tipping of pitches? Does the pitcher know which offering the batter expecting? Many more such questions could flood the batterpitchermatchup@gmail.com inbox, furthering the concept that much of the variance in the sport can be accredited to different showdowns; some good, some great, and some awful. It stands to reason then that when the best of the best square off, the advantages become relatively unclear. This weekend, the Royals will play the Cardinals, and though Zack Greinke and Albert Pujols will not get a chance to face each other, many are left wondering if a showdown of that magnitude would result in the standard Pujols mastery, or if Greinke would make Albert look like Luis Pujols at the plate.

Entering this season, Greinke and Pujols have faced each other in eight different plate appearances, and Pujols has gone 3-for-6 with a double, two walks, and one strikeout. His .500/.625/.667 line may lead some to believe that he owns Greinke, or simply knows how to handle the Royals ace, but that’s not necessarily true. In actuality, the Pujols-Greinke matchup perfectly illustrates the main reason why batter/pitcher statistics need to be taken with handfuls of salt. For starters, the first three plate appearances between these two took place on June 25, 2004, during Greinke’s rookie campaign. The next three were amassed eleven months later, in a May 20, 2005 contest. The two would not meet again until June of 2007, nearly three years after their initial encounter, when Greinke induced a Pujols groundout on June 14, and issued him a walk on June 20. Suffice to say, the Greinke from 2004-07 was an entirely different pitcher than the one who figured things out down the stretch last year, and has held hitters to .223/.259/.319 this year.

In addition to dealing with different circumstances inherent in a data set, the problem of small sample sizes persists. Knowing what Pujols has done in eight plate appearances against Greinke over a four-year span that ended two seasons ago does not help us understand what will happen in their next meeting. Just as 10 PAs for a hitter would be assigned heavy doses of skepticism when trying to figure out his talent level, definitive claims based on such small samples of data between a batter and pitcher must be avoided. When trying to deduce the odds of success in a given matchup, his projection and his prior data are going to be much more accurate predictors than eight or so plate appearances spread out over four seasons. In the case of Bonds vs. Maddux, things are a little different, since they had faced each other several times each season while simultaneously evolving as players. In addition, by the time their data became statistically significant, both were nearing the end of their careers.

This does not preclude us from investigating or having some fun, however, as something may exist in the data with regards to how the collective group of best hitters fares against top-of-the-line pitchers. To answer, I queried my database for the aggregate batter-pitcher matchup data, from 1999-2008, when pitchers with at least 100 innings and an ERA no higher than 3.85 in a season faced hitters with 400+ PA and an OPS of 900 or better. Here are the slash stats and OPS for the elite matchups during each season:


Year    AVG/ OBP/ SLG   OPS
1999   .279/.348/.512   860
2000   .281/.368/.488   856
2001   .284/.365/.523   888
2002   .285/.375/.512   887
2003   .271/.354/.488   842
2004   .288/.385/.528   913
2005   .278/.360/.504   864
2006   .273/.361/.503   864
2007   .284/.372/.527   899
2008   .286/.367/.525   892

The data here is not at all cut and dried in terms of evaluating which side got the best of the other. Context is key, as we need to know how these slash stats compare to what the hitters did against everyone else. Looking at 1999, the hitters in the sample produced an 860 OPS against the elites. Compare that to the 995 OPS the hitters boasted at the expense of all other pitchers, and it becomes easier to see that the pitchers really did do a good job in making the hitters appear mortal at the dish. The table below quantifies the deltas, or differences, between the OPS of these hitters against the elite pitchers and the OPS allowed by everyone else:


Year   Elite  Other   Delta
1999    860    995   -.135
2000    856   1009   -.153
2001    888   1036   -.148
2002    887   1029   -.142
2003    842    999   -.157
2004    913   1006   -.093
2005    864    996   -.132
2006    864   1001   -.137
2007    899   1003   -.104
2008    892    997   -.105

The pitchers won out in each of these seasons, and by fairly large margins at that. Keep in mind, though, that this data holds little predictive value outside of the fact that elite pitchers are bound to fare much better than others when facing the top-tier hitters. The elite pitchers didn’t exactly turn these hitters into Bloomquist clones, but routinely reducing an opposing player’s OPS by over 100 points is not necessarily an easy feat. There will certainly be hitters that perform at a high level regardless of whether Johan Santana or Radhames Liz is toeing the rubber, but the changes in talent and circumstances, as well as the small samples of data accrued over longer time spans, make specific batter-pitcher data difficult to gauge and therefore less meaningful. Is it interesting to know that Hitter A is 7-for-29 against Pitcher B in his career? While nobody would refute its titillating nature, or deny the presence of a hitter’s ability to adjust their approach based on past experiences with specific hurlers, the knowledge of Hitter A’s .325 batting average over the past three seasons is more likely to indicate his chances of success in this specific plate appearance than his .241 average against our hypothetical pitcher. Matchup data between batters and pitchers can be interesting and noteworthy, but we have to be careful not to treat it as gospel, while also understanding what the information does and does not explain.

A version of this story originally appeared on ESPN Insider .

Thank you for reading

This is a free article. If you enjoyed it, consider subscribing to Baseball Prospectus. Subscriptions support ongoing public baseball research and analysis in an increasingly proprietary environment.

Subscribe now

Eric Seidman

Latest Articles

You need to be logged in to comment. Login or Subscribe

jmkearns

6/19

I don't see that this tells us anything at all, other than verifying that good pitchers are, indeed, good pitchers. Am I missing something?

At the very least, it seems a comparison between the elite hitters' deltas and the average deltas against those same pitchers would be in order. Here we're being told that elite hitters fare worse against elite pitchers without any context for evaluating how much worse any other players fare.

Reply to jmkearns

sde1015

6/19

I was thinking something similar. I bet if you flipped this around and looked at OPS allowed by elite pitchers, you'd find that it's higher when they face elite hitters than it is when they face the others. You could then conclude that the hitters won in each of these seasons.

Reply to sde1015

jrbdmb

6/19

Would the above data (at least tentatively) support the old baseball that good pitching beats good hitting?

Reply to jrbdmb

jrbdmb

6/19

Or to expand a bit, if you put together the chart of elite pitcher ERA's against elite hitters as mentioned above, who takes the bigger hit - the pitcher's ERA or the batter's OPS?

Reply to jrbdmb

agentsteel53

6/20

... and vice versa?

Reply to agentsteel53

Oleoay

6/19

Eric you nibbled on an idea I was wondering about for awhile, though in a different context. Basically I was wondering about high OPS teams and why they don't do as well in getting through the playoffs to the world series. So the avenue I was interested in was how much good pitching and bullpen usage patterns with more platoon advantages being used surprsses OBP and ISO compared to league average (or below average) pitching.

Reply to Oleoay

tphoskin

6/19

shouldn't the way to do this is a weighted breakdown of how a player does against the pitchers arsenal? Wouldn't that be a better description of how Hitter X would face on average against Pitcher Y?

Reply to tphoskin

Mario66

6/19

Isn't this experiment somewhat tautological -- i.e., these particular pitchers were included in the data set in part because of their success against elite hitters? Is there some sort of cause-and-effect circularity that is happening here? I don't know, just asking.

Reply to Mario66

bsolow

6/19

It's certainly possible. On the other hand, you could conceivably think of a pitcher that is good but not great, and dominates everyone in the league except elite hitters, who torch him. He could give up a lot of solo home runs and still have excellent aggregate numbers -- I don't know if this is a good example, but Curt Schilling at one point was giving up 35-ish homers per year, but with very good aggregate ERAs. It's possible that Schilling was dominant against everyone who wasn't elite, but the elites tended to take him deep.

Reply to bsolow

Oleoay

6/20

He was giving up a lot of home runs, but his BB/9 was very low and so was his BABIP against. So, he gave up a lot of home runs but they were usually solo shots.

Reply to Oleoay

blcartwright

6/20

After showing how much the elite batters production was reduced by the elite pitching, I would have then repeated the process to show the delta of the pitchers vs all and pitchers vs elite hitters. I would guess that the average delta of the batters is just about the same as the average delta of the pitchers.

The practical effect of this is that the more elite a league is, in terms of both batting and pitching (shown here by subsets of data), the league total slash lines won't change that much - what will change is the variance of the performances. There will be fewer and fewer outlier performances, eventually converging to where all the batters and all the pitchers perform at league average, even though they are the ultimate elite.

Reply to blcartwright

jepson

6/20

One other issue here is the disparate means of selection between pitchers and hitters: since ERA is inherently a noisier stat than OPS, it seems likely that the study will end up including greater share of pitching mediocrities who happen to be temporarily lucky in their Strand Rate. I would prefer to see pitchers selected by OPS against - not that this isn't also prone to short-term flukiness, but at least it is now more or less balanced on both sides of the ledger.

But I agree - the conclusion that "the pitchers won out" is the real howler in this article. Was this an attempt to offer a nice simple conclusion palatable to the ESPN audience?

Reply to jepson

EJSeidman

6/21

Yeah, I ran into a bit of a word count issue, not necessarily getting in everything I wanted to. The ultimate method would involve comparing the Pitcher's OPS vs Elite/vs Everyone to the Batter's OPS vs Elite/vs Everyone to see which had a larger spread. The likely answer is that they are equivalent and there is no advantage.

Reply to EJSeidman

jepson

6/20

Also, wasn't this issue studied pretty comprehensively in The Book? They found no effect in either direction: elite hitters fare worse against elite pitchers than against average ones - but not disproportionately so - and vice versa.

Reply to jepson

irablum

6/20

for Historic dates, how about this one:

September 14, 1965.

The scene, the perfectly temperature controlled Astrodome. What makes this day special? this game between the Giants and the Astros special? Well, For the game, patrolling centerfield for the Giants as he had for more than 10 years. First in New York, and then in San Francisco. There stood Willie Mays. But the game was not going that well for the Giants. After scoring a first inning run on Jim Ray Hart's single, and then another run on in the 5th, again on a single by Jim Ray Hart, Starter Ron Herbel gave up 2 in the bottom of the sixth on a triple by Eddie Kasko, a sac fly by the pitcher Bob Bruce. He was replaced by Bill Henry after giving up a two out single to Joe Morgan, who let him score on a double by Rusty Staub. Henry got the final out of the inning, but gave up a hit to lead off the bottom of the seventh. After spitballer Gaylord Perry got the first out on a strikeout of Bob Aspromonte he loaded the bases on a single and an intentional walk. After pitcher Bob Bruce reached on an error by Hal Lanier (which plated two runs), San Francisco Manager Herman Franks turned to the old lefthander in his bullpen that he had acquired earlier in the year after he had been released by the Mets. In that move, Herman Franks made history by calling in for 363 game winner Warren Spahn. Why? because the day before, Centerfielder Willie Mays had hit his 500th home run. This marked the first time a 500 home run hitter had ever shared the field with a 300 game winner. (though they were on the same team).

They would play together through the rest of the year, with Spahn pitching 4 more times, once as a starter, and each time, Mays was patrolling center. The Giants would finish second in 1965, and Spahn retired at the end of the year, and the game wouldn't see another 300 game winner until Gaylord Perry in 1982.

Reply to irablum

roycewebb

6/21

Anyone remember Cory Snyder vs. Roger Clemens?

http://www.baseball-reference.com/sotd/archives/1261

Reply to roycewebb

swatdogg9

6/22

"The ultimate method would involve comparing the Pitcher's OPS vs Elite/vs Everyone to the Batter's OPS vs Elite/vs Everyone to see which had a larger spread."

it's a shame that this article couldn't be expanded, b/c i'd love to see the numbers that you refer to above. that being said, i like the idea behind the article, and the information that is presented is logical and concise... although it doesn't necessarily come to any groundbreaking conclusion, the fact that the numbers aren't shocking is a conclusion in itself.

being new to the advanced statistical analysis contained on this site, the article reads well. i'm admittedly not as sophisticated as many of the previous readers, but probably more knowledgable than an espn.com reader, and for me this was an enjoyable read.

as a side note, i wish i could find more articles about defensive metrics, especially for individual players. anyone have any suggestions?

Reply to swatdogg9

EJSeidman

6/22

Jared, I'll likely be posting the numbers in question at some point tonight or tomorrow, either here in the comments or perhaps in Unfiltered, but to give a short answer, the advantage isn't clear - both hitters and pitchers deviate similarly against non-elites than against elites.

The real conclusion I wanted to be taken away is that if you have a choice between 1,200 PA of a player over the last 3 seasons or 10-25 PA in a specific batter-pitcher matchup, always tske the 1,200 PA.

Reply to EJSeidman

EJSeidman

6/23

OK, as promised, here are the deltas for pitchers by year, keeping in mind that their OPS allowed from 1999-2008 is equal to the OPS numbers from 1999-2008 posted by the hitters.

1999: 222
2000: 225
2001: 258
2002: 254
2003: 212
2004: 280
2005: 217
2006: 215
2007: 264
2008: 242

So it seems here that things are in fact much more even, and if anything, things are tipping in favor of the hitters, as they made the elite pitchers deviate from the mean more than pitchers made them deviate.

Reply to EJSeidman

Checking the Numbers: Ultimate Matchups

Thank you for reading

Latest Articles

MLU: Bratt Frustrates Opposing Hitters $

Box Score Banter: Knuckling (Way, Way) Up B

The Most Dominated Teams of All-Time: 18-19 $

Golden Age: April 19-27 B

Yoshinobu Yamamoto Was Too Good To Be Great Right Away $

Eric Seidman

Latest Articles

MLU: Bratt Frustrates Opposing Hitters $

Box Score Banter: Knuckling (Way, Way) Up B

The Most Dominated Teams of All-Time: 18-19 $