keyboard_arrow_uptop

Let's not bury the lead: PECOTA spreadsheets will be available on Monday.

We know you're anxious for the release, and there are probably a few of you that will be displeased with any announcement that isn't "today." PECOTA is by far the most anticipated projection system of its kind that I know of, and I just want to say that we don't take that honor lightly. So we want to make sure that when we get PECOTA to you, it is totally worthy of the expectations you have for it.

This will be very nearly the culmination of months of effort for a lot of us here – but it won't be final, of course. PECOTA will continue to update not only through the start of the season, but throughout the season. And these will not be partial updates – we are going to be running the entire PECOTA process and deliver you the same class of PECOTA projections that we've been delivering in the offseason.

But let's not leave you entirely without something to wet your beaks with, shall we? Typically, we offer forecasts of what we think will happen, or at least something that represents the most likely set of outcomes. In this case we offer a forecast simply of what might have been. This is what PECOTA sees Strasburg as having been capable of, before his injury:

STEPHEN STRASBURG Born: 7/20/1988   Age: 22 Bats: R   Throws: R   Height: 6' 4"   Weight: 220  
Breakout: 23%   Improve: 45%   Collapse: 23%   Attrition: 11%   MLB: 87%   Comparables: Mark Prior, Yovani Gallardo, Clayton Kershaw (54)
YEAR TEAM LVL AGE W L SV G GS IP H HR BB SO BB9 SO9 GB% BABIP WHIP ERA FRA WARP
2011 WAS MLB 22 11 4 0 23 23 1222 97 9 35 140 2.5 10.3 48% .296 1.08 2.42 2.63 5.1

I think something like this is as good an example of any as to why PECOTA captures the imagination. PECOTA, as has been pointed out many times before, is nothing but an algorithm (really a set of them). It is capable of nothing more or less than what it's told to do; it possesses no imagination. But it's capable in a very real sense of powering our imagination – of not just projecting the future, but allowing us to visualize it. The comparables represent a lot of tedious number crunching (measuring Euclidean distance in n-th dimensional space, if you want to be precise).

But the PECOTA does something that surprises us – and tells us that Stephen Strasburg is like no other player more than Mark Prior. And we knew that in our hearts, didn't we? (And no, I didn't provide PECOTA with any special hints in this case – like any proud parent, once our children grow up we have to let them make their own decisions, and this was PECOTA's own decision.) And I think there's a very real power in that.

Expect to hear from me again tomorrow, with another little preview of what PECOTA's saying this year. And I hope you'll join me in, for once, looking forward to a Monday.

Thank you for reading

This is a free article. If you enjoyed it, consider subscribing to Baseball Prospectus. Subscriptions support ongoing public baseball research and analysis in an increasingly proprietary environment.

Subscribe now
You need to be logged in to comment. Login or Subscribe
JimmyJack
2/03
Sweet! How soon until the PFM is running?

Thanks Colin & crew.
rawagman
2/03
Looking forward to Monday, Colin!
But, pray tell, what does the "(54)", after Kershaw's name stand for?
cwyers
2/03
That's the average sim score (expressed with 100 as perfectly similar and 0 as not at all similar) of the player's top comps. In this case, what it's telling us is that Strasburg's comps are of about average strength. (Average for all players PECOTA considers, that is. PECOTA will typically see far more minor leaguers than major leaguers, and MLB regulars will typically have better sim scores than that.)
PBSteve
2/03
One of the new things we added to the book this year.
nosybrian
2/03
Colin: there ya go, just as Nate did, attributing a persona to PECOTA. PECOTA "makes decisions." We look forward to what this new generation will do.
markpadden
2/03
I hope you can provide detail at some point about the process you plan to use to generate in-season projections. That is, how much to weight current season stats vs. pre-season projection in coming up with a rest-of-season projection.

Also, would love to see any work you have done to test how well the current PECOTA algorithm has performed in backtesting -- similar to what Swartz did in his article about SIERA.
ccweinmann
2/03
Any idea when PFM will debut this year?
rowenbell
2/03
As a Gallardo owner/fan, I don't know whether to be elated or frightened with his appearance next to the names of Strasburg and Prior....
DLegler21
2/03
I guess its better than March but you missed your "earlier than last year" projection.

Still looking forward to it but will have to go through our auction and the beginning of our draft without it :(
yankeehater32
2/03
When is your auction/draft?
DLegler21
2/03
We meet the first weekend in February every year. I know, its annoyingly early. Typically BP has published a weighted-means spreadsheet with a few days to a week to spare.
Oleoay
2/04
As I remember from last year, PECOTA was out around when the annual came out, but had problems with its depth charts and some relief pitcher projections. I'm assuming those are ironed out for this year, so I don't mind if it's released a bit later.
Michael
2/03
My apology in advance for reacting to last year's PECOTA problems.

If Monday's release is intended to have a lot of errors, please label it "Beta" and give us an estimate of when the "Beta" label will be removed. On the other hand, if it is intended to be the real thing with minimal errors, then don't claim afterward that it should have been labeled "Beta" and that subscribers shouldn't have expected it to be accurate.
jrmayne
2/03
Maybe a tad early to worry? I was handing out pitchforks and torches last year until supplies ran out, but I strongly sense that BP was wounded by the fire and pointy parts and strived mightily to avoid a rerun, as indicated by Colin's involvement.

I'd bet heavily against a 2010 replay. Let's wait until/unless I'm proven wrong to clean the blood off the pitchforks and charge the mound, eh?

Looking forward to the spreadsheet!

--JRM
Michael
2/04
Oh, I feel that Colin's involvement is a good thing.

On the other hand, last year was supposed to be better due to Clay's increased involvement, and that didn't turn out too well.
rawagman
2/03
O don't think that BP ever *intends* to have a lot of errors, even in a BETA run. Anticipate errors, maybe, but intend, never.
dsher84
2/03
Good deal, even it's one day to late to settle my Scoresheet team's Andres Torres v. Brett Gardner debate. Looking forward to seeing it.
laynef
2/03
And one day late for me to decide on Seth Smith vs. Andres Torres.
lucastate
2/03
It is a bit disappointing for them to be one day too late to help the scoresheet folks -- but I'm sure we will be able to make decisions on our own. CAIRO is out already, and there are some partial ZiPS.
yankeehater32
2/04
You can always e-mail the fantasy authors with any questions you have.
luftmich
2/03
Maybe a dumb question, but does "PECOTA spreadsheets" also mean PFM will be up and running on Monday?
Guancous
2/03
Breaking News: CDC to investigate emerging flu epidemic. Thousands of baseball fans simultaneously fall sick on Monday.
pikapp383
2/03
and millions of football fans too.....
zszalewski
2/03
Is eBABIP going to be incorporated into PECOTA?
mwashuc06
2/03
I said that Strasburg was the next Prior when I saw him pitch live in Syracuse in his AAA debut...
steverynear
2/03
How are midseason PECOTA updates made? Do they incorporate "strength of schedule". I remember looking at last year's late season updates and seeing very peculiar results. It would be good to have some clarification.
blcartwright
2/04
I'll have to point out to Tango that Pecota projects Strasburg for a 2.42 ERA (if he would have pitched in 2011) while Oliver says 2.50
TangoTiger1
2/04
I started a thread on my blog.
Oleoay
2/04
2.50 ERA coming back from Tommy John surgery would be impressive... but that being said, I think people are more curious about how many innings he'll throw and how many people he walks as he recovers from TJ.
mgolovcsenko
2/04
Echoing earlier question: when PFM?
o2bnited
2/04
For those interested, there is another related update regarding PFM:
http://www.baseballprospectus.com/article.php?articleid=12859
ajankelo
2/08
Does Pecota only extend to 2011? Or does it project further into the future?
spaulo
3/08
So, the Depth Charts and PECOTA Cards have different numbers than the Weighted Means Spreadsheet. Will we see an update to the WMS any time soon? It's been out a month...
brian206
3/23
I actually had the same questions. I am somewhat new around here but I was expecting the Weighted Means and the 50% PECOTA to be somewhat similiar but they are very different. Is my understanding of the purpose of the 2 totally off or is tehre an update in the works?