
September 16, 2011

Between The Numbers

BP and the Palmer Database

by Colin Wyers

Part of Big September


Baseball fans may occasionally (he said drolly) disagree over the relative importance of various statistics, but all baseball fans agree about the importance of keeping a record of how players have performed throughout history. Whether you care about a player’s batting average or his True Average, the raw numbers are a vital part of how baseball fans engage with the game.

Unfortunately, keeping those records throughout baseball history has been an arduous and incomplete task—records have been lost, mistabulated, or rendered illegible. Reconstructing an accurate history of the game through its numbers is a laborious process.

Very few people have done as much to tabulate an accurate statistical record of the entirety of professional baseball as Pete Palmer. Palmer, along with Bill James, is one of the founding fathers of sabermetrics—he introduced OPS and linear weights, among dozens of other new metrics. But he’s also done invaluable work correcting baseball’s historical record. Joe Hamrahi and Pete had some discussions about licensing the Palmer dataset, and we’re proud to announce that we’ve partnered with Palmer and Gary Gillette to bring you Palmer’s painstakingly compiled database of player statistics.

This is widely considered to be the most authoritative set of player statistics available; these are the same stats used by publications like the ESPN Baseball Encyclopedia and available on sites like Baseball-Reference.

How much of a difference does it make? Let’s compare batting lines from Joe Tinker (of the famous trio of Tinker, Evers, and Chance) during his 1902 season:

Palmer uncovered three games missing from the stats previously reported on our player cards; with the corrected data, Tinker picks up a handful of extra hits and another RBI. More interesting is the total number of strikeouts, a number previously omitted altogether from our records. So we’re not just filling in a few of the blanks here and there; we’re adding significant new information to our evaluation of players.

And for the first time, the entirety of the statistics at our disposal is available through our sortable reports, including the custom sortables. Unfortunately, seasons for which we lack play-by-play accounts will not have all the data that is available for recent years; we’re never going to know how many doubles each and every deadball-era pitcher allowed, as much as we may want to. But now we truly have unified access to the entirety of baseball history through the cards and sortables.

We’ve also worked to improve the accuracy of our biographical data for players. Invaluable help in this effort came from the Society for American Baseball Research, particularly Data Czar Ted Turocy and Geoff Harcourt, who provided their master player register as well as assistance with integrating it into our master player tables. Some biographical details, chiefly names, are even trickier than player stats: figuring out whether a player went by William, Will, or Bill can be difficult at best, and some players were inconsistent, making the task even tougher. Nobody is more devoted to chronicling the history of baseball than the people at SABR, though, and having their records available to us has dramatically improved the scope and the accuracy of our biographical data for historical players.

Our analysis can only be as good as the data that underlies it, so having this data available to us is a great boost to our efforts here at Baseball Prospectus. But having good data doesn’t guarantee good analysis, and BP was founded on the belief that we could provide insight by analyzing the statistical record of baseball and connecting it to how a player provides value to a team. So this is just a building block; we’re gearing up to provide revised formulas for such mainstays as WARP, TAv, and Fair RA that will help illuminate the raw stats we’re showing here by adding context. So consider this a teaser, if you will.

Colin Wyers is an author of Baseball Prospectus. 

7 comments have been left for this article.


I like that you're trying to improve Baseball Prospectus' foundation. Good move.

Sep 16, 2011 08:37 AM
rating: 8



This is good news!

Sep 16, 2011 09:39 AM
rating: 1

I don't think Joe Tinker had too many broken bats in his day; that bat handle looks to be about 2 1/2 inches wide.

Sep 16, 2011 13:25 PM
rating: 1

Awesome news, guys. I love it.

Sep 16, 2011 23:05 PM
rating: 0

All the more reason why I feel so good about canceling ESPN Insider and moving to BP full time. Thank you.

Sep 17, 2011 02:39 AM
rating: 0
Benjamin Harris

Where do the new minor league stats come from?

Sep 17, 2011 18:47 PM
rating: 0
Rob Miller

It would be SUPER helpful if you would provide Player_id as a downloadable field. I ask. I hope. I have yet to see.

Oct 12, 2011 13:09 PM
rating: 0