I finally picked up The Hardball Times Annual 2008 in early January and I breezed through the analysis section in a night or two. I’m writing this now because I wasn’t planning on writing a “review” of it - my writing style, if you will, isn’t one that’s able to sum up a fine, 350 page book or add any additional insights in a short essay. That said, I figured I’d jot down some of my thoughts here on some of the articles, now that I have this site/blog. There probably isn’t much here for anyone who’s already read the book or has even rudimentary knowledge of sabermetrics, but I figured maybe I could coax a few newbies into giving the book a chance. By the way, you can pick it up on HBT’s site or, of course, through Amazon.
Anyway, I was most interested in was the analysis section. Tom Tango has a couple of back-to-back articles using his “with or without you” concept. Basically, there are certain parts of the game that have been tough to evaluate with the available data - first basemen saving bad throws, catcher’s fielding, and so on. Something like passed balls for a catcher is heavily dependent on the pitcher out on the mound, so you can’t just look at raw totals in evaluating a catcher’s performance in this area. As the title (With or Without You) implies, he looks at how a catcher does with a certain pitcher and compares him to all other catchers who caught that pitcher. And then he does this with all possible combinations to get an overall +/- past balls (or whatever the stat in question is). Tango’s next article uses a similar methodology to evaluate fielding, and specifically Derek Jeter. Let’s use a quick example (I’m just making this up):
Pedro Martinez on the mound
Nomar, when on the field, converts 12% of balls in play into outs
The rest of Pedro’s shortstops convert 11.6% of balls in play into outs
So Nomar is better than his peers with Pedro on the mound. Now, with one combination this might not be that valuable, but add them all up and you have quite a few comparisons. He also breaks it down by park, base configuration, etc. (all at Jeter’s request ; ) A must read, indeed.
John Walsh has a great piece entitled “The Origin of the Platoon Advantage.” In the second part of the article, Walsh utilizes the PITCHf/x data, which has been a tremendous boon to many researchers over the past year. Anyway, he uses this new pitch tracking technology to classify pitches and then look at platoon splits for each pitch. For instance, when the batter has the platoon advantage on a slider, his average on balls in play is .340. When the pitcher has the advantage … it’s .304. I’ll let you read the article for all the gory details and the conclusions, but it’s certainly an interesting study and a great area for future research.
Another article I’ll briefly talk about is Greg Rybarczyk’s “Of Home Runs and Free Agents.” The first half of the article is a detailed report on all the home runs of 2007, using Greg’s (yes, I think we’ll just go with Greg) Hit Tracker system, which models the flight of a baseball to find each homer’s “true” distance. The second part, however, is even more interesting, at least to me. He now uses Hit Tracker to not only analyze home runs, but all balls in play (in this case, for Torii Hunter and Andruw Jones). On one table you can see the relationship between BABiP and speed off bat. As you might expect, a higher SOB, in general, produces a higher BABiP. This type of stuff - the PITCHf/x data, Hit Tracker, etc. - is really the future of sabermetrics. If you could model every ball hit in play by a hitter (which he does), you could take out the fielding aspects, the parks, and so on, to get a much clearer picture of how a hitter performed in a given sample of games. It could lead to knowledge of when to play a shift, when a player may be injured, when a hitter got unlucky, and numerous other things.
Just so it doesn’t appear like I’m looking at things through rose colored glasses, I was not particularly impressed with Bill James’ article. For being perhaps most influential sabermetrician ever, James certainly takes enough crap — and I certainly don’t feel like someone who is entitled to dish any out. But his article on clutch hitting is simply not up to the high standards of THT or Bill James (there just isn’t much to it). That said, in his 7 elements of clutch, he does bring up at least 3 interesting, largely unexplored topics (at least to my knowledge) – the opposition, the standings, and the calendar. Stats like WPA (and fangraphs’ clutchiness) have thus far omitted things like that, and there’s certainly lots of room for research there. Take two homers — both are walk off grandslams in the same exact situation (base, score, etc.). However, one is hit in september with the team two games down in the standings. The other is in July for a team 15 games back. Which one is more clutch?
Anyway, this is just 4 or 5 of the 10 analysis essays. There are also sections on history, the 2007 season, and more commentary from some of the best bloggers/writers on the ‘net … a bunch of stuff I haven’t even read yet. Also, the back of the book is filled with statistics, both basic and advanced. They have batted ball statistics on all players, stats like GPA, FIP, RZR, as well as month by month team stats (and a million other things).
I don’t know exactly what all of the old annuals used to look like. I’ve picked up BP’s for the past few years, and this is certainly right there with it, if not a couple of notches better (of course, depending on what you’re looking for). My opinion doesn’t mean much, but I certainly recommend it for anybody who happens to be new to sabermetrics. Even if you don’t think you’ll like sabermetrics or this book, give it a try. When you boil it down, it’s just baseball through a different lens.