â€‹Believe it or not, most of our writers didn't enter the world sporting an @baseballprospectus.com address; with a few exceptions, they started out somewhere else. In an effort to up your reading pleasure while tipping our caps to some of the most illuminating work being done elsewhere on the internet, we'll be yielding the stage once a week to the best and brightest baseball writers, researchers and thinkers from outside of the BP umbrella. If you'd like to nominate a guest contributor (including yourself), please drop us a line.
Craig Glaser is an Application Developer at Bloomberg Sports, where he helped design and implement the algorithms that make Bloomberg’s fantasy baseball tool Front Office tick. He has previously written articles for The Hardball Times, Surviving the Citi, Amazin’ Avenue, and his own site, Sabometrics. A member of SABR, he has recently participated in panels at the SABR Analytics and 50th Anniversary of the Mets conferences. In a prior life, he studied Experimental Economics and Cognitive Psychology at NYU, focusing on how people perceive probabilities, a field of study that continues to color his view of life and the sport of baseball. You can find his musings about sports, probability, and everything else on Twitter @Sabometrics.
When sample size is invoked in baseball research, it is almost always prefaced with the word “small.” Analysts often attempt to identify the true talent level of an individual player, and for that purpose, a single baseball season can be maddeningly brief. We’ve all watched enough Shane Spencers to have some (though often not enough) perspective when we see a Will Middlebrooks come up and have an extremely impressive first few games as a major leaguer. Of course, as anyone who has lived through a mediocre season knows, playing 162 games takes time. The 2011 season featured 2,429 games and 185,245 plate appearances. If you’re not focused on a specific player, the season is anything but small.
I’ve heard that you see something new in every game of baseball you watch. I’m not sure I would go quite that far, but the long season allows for incredibly rare peaks to go along with the typical valleys. Nohitters are one such sort of peak and, while they often say more about the length of the season and the probabilities involved than the skill of the pitcher, it’s always fun to see the pitchers who are good and lucky enough to achieve one get their moment in the sun.
On April 3, 2012, Mets starter Jon Niese completed six innings against the Atlanta Braves without giving up a hit. No Met has ever thrown a nohitter in the franchise’s 50year history, and Mets fans have always been eager to see one. Niese gave them hope. I may have been the only Mets fan in the world with mixed feelings about it.
A few weeks before Niese’s outing, I was asked to take part in The 50th Anniversary of the New York Met Conference held at my Alma Mater, Hofstra University. As part of my panel (titled “By The Numbers: Statistics and Analytics”) I was asked to prepare a 510 minute presentation on a topic of my choosing. I decided to examine just how unlikely it is that the Mets have never thrown a nohitter. I had started doing my research, and everything was coming together nicely.
While I had always rooted hard for a Mets nohitter before, I now had a selfish reason to hope that it didn’t happen in the first month of the season. I knew that my research would be more pertinent and more fun to present if the no nohitter streak was still intact. So it was with a combination of relief and sadness that I watched Freddie Freeman record a hit in the seventh. I figured that Mets fans could wait another month, and that I’d be able to enjoy the eventual nohitter more fully after giving my presentation.
My research focused on three questions:
 How unlikely was it that the Mets had never thrown a nohitter?
 What was the Mets’ expected number of nohitters, given the number of games the team had played?
 How many nohitters would have been equally as unlikely as the zero nohitters they actually had?
All of these questions can be answered by looking at a binary probability distribution, but in order to do that, you need to estimate one key piece of information—the probability that a start would result in a nohitter. I used two models to estimate this piece of information—a naïve model and a model used by Rob Neyer and Bill James based on Out Percentage. Additionally, to make the presentation more interesting, I decided to abandon sound statistics and add a third model, looking at all of the former Mets pitchers who have thrown nohitters for other teams.
The primary question—how unlikely is it that the Mets have never thrown a nohitter—can be answered by a very simple equation. You take the probability of not throwing a nohitter (1p) and raise it to the number of opportunities the Mets had to throw a nohitter (g,) the number of games they have played. To calculate the probability of n nohitters, the calculation becomes a little more complex: (1p)^n * p^(gn) * nCg, where nCg is the number of combinations of n nohitters that could be thrown in g games (think back to high school math.)
The naïve model assumes that each start in the major leagues is equally likely to become a nohitter. Between the birth of the Mets in 1962 and May 27th, 2012, there were 209,764 starts made by majorleague pitchers, with 131 ending up as nohitters. This gives us a p(nohitter) of .000625.
While nohitters always involve some amount of luck, they are not completely random. We should expect the NeyerJames model to be more precise than the naïve model. The logic behind their method is simple—you look at the two stats that matter for nohitters—outs and hits allowed—and calculate the “out percentage” (outs / (outs + hits).) You then raise this out percentage to the 26th power. (There is an average of one out on the bases, which makes 26 more accurate than 27.) This gives us a better estimate of the probability of a nohitter, since it captures the quality of the team’s pitching and defense and the effects of the stadium the games took place in. (Note: All numbers are from 1962 – 4/27/12.)
Team 
Out Percentage 
p(NoHitter) 
Expected NoHitters 
Actual NoHitters 
0.765 
0.000933 
7.5 
10 

0.757 
0.000715 
5.7 
0 

0.757 
0.000708 
5.7 
10 

0.756 
0.000705 
5.6 
5 

0.755 
0.000675 
5.4 
6 
When you look at their out percentage, the fact that the Mets have never thrown a nohitter becomes even more amazing. The Mets’ out percentage of .757 is the second highest over the past 50 season, trailing only that of the Dodgers. This means that the Mets were the secondmost likely team to throw a nohitter. This is not surprising, since the Mets have often featured great pitching and played in parks that favored pitching over offense. The new p(nohitter) when using the Mets’ out percentage rises to .000718, an increase of about 15 percent.
We can now get to the heart of the matter and use our two probabilities to start answering some questions:
Model 
p(nohitter) 
Starts 
p(no nohitters) 
Most likely # of NoHitters 
Expected NoHitters 
Naïve 
0.000625 
8008 
0.006694 
5 
4.99 
NeyerJames 
0.000718 
8008 
0.003177 
5 
5.74 
According to the naïve model, there is only about a .67% chance that a team with 8,008 starts would have yet to throw a nohitter. When we customize this to the Mets, the probability is cut in half, with a .32% chance of no nohitters in 8,008 games. That's just a little more likely than selecting any one of Juan Pierre's 7,660 career plate appearances at random and having it be one of his 16 career home runs.
Each model predicts five as the most likely number of nohitters for the team. Additionally, to match the low probability of zero nohitters, we’d have to go pretty far out on the distribution, with 11 nohitters (.83%) being slightly more likely than zero by the naïve model and 13 nohitters (.38%) being slightly more likely than zero by the “out percentage” model.
The third model, based on former Mets who have thrown a nohitter for other teams, is not a statistically sound one, but it is probably the most fun of the three to discuss. For this model, I went on a pitcherbypitcher basis and calculated specific p(nohitter) for each pitcher as p = (# of nohitters/# of starts.) The most recent addition to this list, Philip Humber, is a perfect example of why this model doesn’t make sense statistically. Having thrown one nohitter in only 36 career starts gives him an extremely high rate of nohitters—one he is not likely to keep up for his career. In fact, one could use the NeyerJames model at the pitcher level and come up with a better estimation. There is one huge timesaving advantage to this weak model, however. It allows you to completely ignore any pitcher who never threw a nohitter in his career, since their p(nohitter) equals zero.
Once we have the p(nohitter) for each of these pitchers, we can raise (1p) to the number of starts he made for the Mets to find the probability that each pitcher would not have thrown a nohitter for the Mets. If we then multiply all of these final values together, we can find the probability that none of these pitchers would have thrown a nohitter for the Mets.
First, the pitchers who threw a nohitter after leaving the Mets (note: Hideo Nomo pitched for the Mets in between his pair of nohitters).
NH 
Mets GS 
NH/GS 
P(no nohitter) 
P(nohitter) 

Doc Gooden 
1 
410 
303 
0.0024 
0.4771 
0.5229 
7 
773 
74 
0.0091 
0.5101 
0.4899 

1 
647 
395 
0.0015 
0.5428 
0.4572 

1 
419 
169 
0.0024 
0.6678 
0.3322 

1 
319 
60 
0.0031 
0.8283 
0.1717 

Hideo Nomo 
2 
318 
16 
0.0063 
0.9040 
0.0960 
Phil Humber 
1 
36 
1 
0.0278 
0.9722 
0.0278 
And the pitchers who threw a nohitter before joining the Mets:
NH 
Mets GS 
NH/GS 
P(no nohitter) 
P(nohitter) 

1 
382 
213 
0.003 
0.572 
0.428 

1 
301 
63 
0.003 
0.811 
0.189 

1 
371 
74 
0.003 
0.819 
0.181 

2 
665 
19 
0.003 
0.944 
0.056 

1 
317 
14 
0.003 
0.957 
0.043 

1 
474 
12 
0.002 
0.975 
0.025 

1 
356 
3 
0.003 
0.992 
0.008 

1 
364 
2 
0.003 
0.995 
0.005 

1 
294 
0 
0.003 
1.000 
0.000 
Multiplying all of these probabilities together gives us a 2.1% probability that none of these pitchers would have thrown a nohitter for the Mets. By any of these three models, it is pretty surprising that the Mets have not thrown a nohitter. The more accurate the model, the lower the probability becomes.
Armed with these numbers, I started putting my presentation together. Presenting for Mets fans ,I wanted to establish the right narrative, to try to avoid telling this as a negative story and to spin it as something amazing and unique about the Mets. I hoped that I could convince my fellow Mets fans that while it might be easier to celebrate the rareness of a nohitter, we could also take some joy from the rareness of no nohitters.
At 8,008 games, the Mets’ no nohitter streak is the longest and the rarest in baseball, even without considering the more accurate NeyerJames model. In my mind, it has become part of the identity of Mets fans. When the Red Sox finally won the World Series after 86 years, their fans were obviously thrilled. Yet some Red Sox fans also feel like they lost a piece of their identity. The torturous wait helped fans bond together and became a key part of the experience of being a fan of the team. With nohitters, the stakes are much lower (I shudder at the thought that any Mets fan would trade one of their two World Series victories for a nohitter), and the probability of not throwing a nohitter in 8,000 games is a factor of magnitude more unlikely than not winning the World Series in 100 years.
I thought that after presenting my data and writing this article, I would go back to wholeheartedly rooting for the Mets to throw a nohitter, but now that it’s done, I’m not so sure. The Mets could throw a nohitter tomorrow, and for a week, we’d have something to celebrate. We’d also lose a piece of our identity—a statistic and a feeling that makes the franchise and its fans unique. We’d become just another team with one nohitter. I’m not sure I can get behind that.
That brings me to the last question I’ll address: How long would the Mets’ streak have to get before it became as unlikely as throwing a nohitter in any individual game? Using the naïve model, the probability of no nohitters in 11,801 games is about the same as the probability of a nohitter in any one of those games. With the NeyerJames model for the Mets, this number drops to 10,079 games. With 8,008 games down, the Mets need to go 3,783 more games by the naïve model, or 2,071 more games by the NeyerJames model, to achieve what I like to call the no nohitter challenge.
In a way, I’m hedging my bets. By anticipating 2,071 more games without throwing a nohitter, I set myself up for success either way. Either they’ll continue their incredibly rare streak long enough to complete the challenge during the 2025 season, or they won’t, and I’ll get to celebrate the nohitter itself. So, will I be rooting for Johan Santana, R.A. Dickey, or Jon Niese to finish it off next time one of them doesn’t allow a hit after seven innings?
Maybe if it’s also a perfect game (p = .00007.)