HOME

TESTIMONIALS

DISCUSSION FORUM

MEMBERS ONLY

CONTACT US

PRIVACY POLICY

ADVERTISING INFO

LATEST NEWS courtesy of Rotowire

LINKS 

 

New Page 1

The Best of Ask David: Volume 2
published November 17, 2004
by David Luciani

It was a popular suggestion last year that I annually compile some of the "best" Ask David questions from the previous year and last year was the first time I did this.  A year ago then, I drew from the complete history of the Ask David series.  This year, I'm selecting only from the 2004 set and I expect to make this compilation an annual occurrence.  To make it into this collection, a question or responses doesn't necessarily need to be one where I gave a response that proved to be correct - that would be too selective and biased in my favor as I'm picking what makes it into the set, though admittedly I do pick a couple that were fun in how they turned out but I'm trying to avoid doing that too much here.  Rather, I'm trying to select the questions and/or answers that either seem to constantly come up again (such as questions where I respond by saying "please refer back to what I said about Ron Belliard in April to see how that sort of thinking applies to this other player) or questions/answers that readers frequently write member support asking to help find because they found them informative about our methods or ideas.  For some reason, a question or response can become so popular that it eventually spins off into its own essay idea and such questions will typically also find their way into this collection.

If a player went out and surprised by hitting 40 home runs for the first time and I had predicted it, I think the reader would benefit from looking back at the reasoning.  Moreover, if I was predicting a breakout that didn't end up occurring, those can be interesting too because we can learn from our mistakes by sorting through faulty reasoning on the projection.

In some cases I've added comments here to put the questions in context or to follow up on the result.  One other change this year is that I also have my "David's Notebook" column to draw on.  Though that feature won't be returning in 2005 in the same format (to be replaced by me focusing on individual issues in essay format more often than in past years), it did offer some interesting or useful comments.  Enjoy!

From January 1, 2004

Q.  This is what has become a "Catch-22" for me in using your projections when preparing for draft day.  I rely heavily on your forecasts but I still like to do my own research.  Let's say you have Rafael Furcal forecasted to have 52 steals.  I do some of my own research and find that Furcal is feeling great and Bobby Cox wants to run him more, etc.  Do I assume my own research is already (or soon will be figured) into your forecasts or do I bump him up from, say, 52 steals to 58 steals?

A.  Firstly, I need to comment on your example in case readers get confused about our Furcal forecast.  We're only forecasting Furcal to steal 28 bases in 2004 in the latest forecast set so I assume you're just writing hypothetically.  I would just hate readers to read your question and then assume we're forecasting Furcal to do something he isn't.

I actually addressed a similar question about manager's comments in previous Ask David's, when a few years ago, Don Baylor said he was going to run Sammy Sosa more and then more recently when Joe Kerrigan, soon to be the ex-Red Sox manager at the time, was going to run Nomar Garciaparra more.  I've said it before and will again:  Do not put much emphasis on what managers say they are going to do unless it relates to the role of a player.  In other words, if Bobby Cox said he were going to move John Smoltz back to the starting rotation, I would have no choice but to consider that but if he said he wanted to see Furcal run more, I would put little weight on that with rare exceptions.  I do remember back in 1998 when Tim Johnson proved in spring training that he was going to have the entire Blue Jays team running more and so I bumped up every appropriate Blue Jay's steal forecast accordingly but it is so rare for something like that to happen.

To answer your more general question, we already do consider all available information when we make forecasts and I can only say that our forecasts are the best we can make based on the information we have available.  I don't know everything there is to know about every player - like whether he had a fight with his wife that will affect his performance - and so it's completely possible if not likely that you could theoretically have access to information that no one else has.

In general, we consider everything that is being said and make our own assessments of what to believe, what to put weight on and so on.  Whenever I get messages from readers who don't agree on a particular forecast for a player, I often remind them that there's nothing stopping them from modifying that particular forecast.  If research is truly to be information-gathering, each reader has to combine all the info they get from various sources, including ours, to form their own opinions about players.  Nothing is set in stone and by all means if you think we're wrong, I recommend modifying the forecast.  However, I do want to respond by saying we consider all such comments when making our forecasts and I already believe that the forecast we publish is at the midpoint of expectations, a phrase I often use to emphasize the type of forecasting we do.

From January 14, 2004

Q. How much does post-season performance affect your evaluation of individual ability?

A. I view post-season performance about the same way as I view regular season performance in that it gives me an opportunity to see an increased sample of a player's skills. An obvious adjustment has to be made not only for the way players are used but for the quality of competition in post-season being higher. It is unlikely to see starting rotations of disastrous quality or pathetic offenses at that stage because such teams rarely make it to the post-season. Therefore, since I tend to evaluate statistics in the context of what sort of competition a player was facing, post-season adjustments are rather difficult to make and are done on a case-by-case basis.

Though the post-season seems long, at most it's only hypothetically possible for a player to play in a total of nineteen post-season games in a given season and that's if they make it all the way to Game Seven of the World Series and if every series went its maximum length. Therefore, it's rare that a post-season offers enough data for me to upgrade or downgrade a player for the subsequent season.

If you're asking whether my Miguel Cabrera forecast is based on the post-season, the answer is that I would have projected him for 2004 almost exactly the same way if he hadn't appeared in the post-season. He's a great player and even with what he did, he's still undervalued by many.

Q. I heard that you once commented about the Pete Rose affair and explained why it can hurt the integrity of the game even if he only bet on his team to win. Can you explain this again? It would seem to me that if you're betting on your team, it can't possibly hurt your team because you want to win that much more.

A. Yes, I commented about this several years ago around the time broadcaster Jim Gray confronted Rose at the World Series and ended up having to issue a public apology to viewers for making Rose uncomfortable at the festive event. At that time, Rose was still denying that he had bet on baseball and a reader wrote me then to ask if it would be acceptable if Rose admitted betting if he had limited his bets to betting on his team to win.

My response then remains about the same. The game of baseball, particularly for a field manager, is about long runs. It's about sometimes juggling your rotation so that the best pitchers will pitch in key games relevant to your pennant success. It's about resting players from time to time so that they perform better for 162 games, even at the expense of today's win. While a manager could theoretically argue that betting on his team does not influence the outcome, the example I used then (and I've heard this was quoted by a reader on our website discussion forum this past week) is that if you have a huge bet on a specific game and you're leading 8-3 in the ninth inning, is it possible that you might bring in your closer to lock down the game, thus jeopardizing his availability for the next day's game? I'm not in any way suggesting that Rose did this and in fact, I suspect that he wanted to win for Cincinnati more than he wanted to win his wagers and so I would doubt that he did manage a different way. The problem is that it does create a reasonable perception of bias in terms of how you approach that individual game, even perhaps at the expense of your team's long run. I believe there are managers who correctly presume that though they could make adjustments in the way they use players to win a particular game today, they are more interested in the long run. I believe that when you bet on a particular game, even betting on your team to win, you jeopardize your own ability to manage a game as though you didn't have a bet on it, even if you have the best of intentions to do otherwise.

On a related note, since Pete Rose was an extremely popular topic this past week in emails to this column, for those of you who have asked whether the Dowd Report was ever published, you'll be pleased to know that the full report, including scans of the alleged betting slips and transcripts of interviews of Pete Rose, is now available online and can be found at http://www.dowdreport.com which is a site published by the former special counsel John Dowd. If you want to get a fascinating look inside this issue, I strongly encourage you to check out this website.

Q. Sometimes I've made some really dumb moves in fantasy baseball and yet other times, it seems I can do no wrong. I just wonder if even you ever make fantasy moves that look really stupid and if so, what's your worst fantasy move ever?

A. We all will make moves that don't work and if we're playing the percentages, we accept the really bad moves with the really good ones with an eye on long run success. I've made some terrible moves over the years but I like to think I've made many more winners. Most of the bad choices that I've made in fantasy baseball at least fall into a category where I can tell you the reason why I made the choice at the time and why I then expected a certain type of outcome. There is a single exception that comes to mind, back in the days when I first started playing in more competitive statistics-based leagues after making the shift from Strat-O-Matic style simulation games. The error came in a published invitational league back in 1994, before I had made the transition from being a "regular" baseball writer to one who also dabbled in fantasy baseball. That year, I had been invited to play in a variation 5X5 league with many better-known baseball writers than myself and celebrities including Bill James, John Hunt of USA Today, Robert Wuhl (the actor who plays Arliss on the HBO series of the same name), John Benson and others. As luck would have it, I had won the first pick overall in the draft and with my pick, and for the life of me I can't possibly explain why, I took pitcher John Burkett, who was coming off a 22-win season with the Giants in 1993. To this day, I don't know what I was thinking at the time as Burkett, even with the wins, had never had an ERA below 3.50 at that point of his career, had only once before topped 12 wins and ended up going 6-8 with a 3.62 ERA in the strike-shortened 1994 season. I can't even tell you what my expectations were at the time because I can't conceive of a single reason that I would ever make such a pick now, even if he had repeated or exceeded his 1993 season. I can only speculate that as the most unknown, at the time, of anyone in the league, I must have been trying to make an impression for being unconventional!

I can't think of a single example that comes close to that miscalculated (or possibly uncalculated) move in the 1994 season but so you know, we all make decisions that later we look back upon and wonder what we were thinking. I'm almost embarrassed to tell you about this one but since I often mention some of my best choices it's only fair to mention one like this.  It's worth a laugh now but I can't explain what I was possibly thinking at the time.

Q. In strong keeper league situations, is it generally better to build up prospects or just use them to trade for established major league players who may cost more budget-wise?

A. I tend to believe in the idea that top prospects are so over-hyped in this era that it is an excellent strategy to stock up on them for the purpose of trading them but it depends on the league. In leagues where you are evaluated based on an active roster and have a second roster or farm system that allows you to store top prospects, I think you should go after the big name prospects that interest you. However, in leagues that require you to keep those prospects active as soon as they make the majors, I am almost always opposed to hanging on to such prospects because there is usually a development time between the day the player arrives and the day he becomes what everyone, including you, expects him to become. By the time he does, especially in leagues that inflate salaries, you're paying for him at the rate you would have had you been forced to buy him in an auction.

There are exceptions such as a Mark McGwire or Albert Pujols but in general, a player arrives in the majors before he is ready. Consider Eric Gagne. Let's say you had known precisely what Eric Gagne was going to do when he eventually became a great pitcher. So, you stored him on your roster. That means you had to endure the disastrous 2000-2001 seasons just to keep him long enough to wait for his great 2002-2003 seasons. I don't find that worth it. Heck, let's say you could go back in time and know exactly what Roy Halladay would be destined to do. Could your roster really have endured his 2000 season? In just 67.2 innings, he allowed as many earned runs that year (80) as it would take to raise a fantasy team that has 1000 innings with an ERA of 3.50 to a fantasy team that has 1067.2 innings with an ERA of about 4.00.

I know this isn't your question but based on other similar questions I receive, I want to caution all readers that our goal should never be to impress our opponents with our vast knowledge of prospects. In other words, many competitors want to be the guy who drafted Eric Gagne when he was no good so that when he becomes good they can look intelligent but that isn't the goal. We're trying to win… we're trying to dominate our opponents. We don't care if they think we're smart or hopelessly lucky or complete idiots as long as we keep stumbling our way into first place, looking as clumsy as possible doing it. We just want to win and I believe that in general, with rare exceptions, these top prospects take time to develop and except for fantasy leagues that give us a place to store them, and I mean store them even while they're active in the majors, I'm not interested in spending too much time stocking up on such players.

So, the short answer is if you can store prospects in a place where they don't hurt your active team, then do that. If you have to put them as active until the day they become great players, I recommend drafting prospects that other people care about and trading them or drafting prospects that you think are ready from square one. I'm curious how many are still waiting for "can't miss" prospect Ruben Rivera to win them their fantasy league…

Q. Do your projected pitcher wins take into account the quality of a team?

A. Yes they do and it's not only based on a consideration of a team's offense but also the estimated quality of defense and the role the pitcher will fill on that team. That's why sometimes you'll see me forecast a mediocre pitcher to have a terrible win/loss record, such as is the case with Nate Cornejo projected to go 8-13 and in other cases, a pitcher who has subpar statistical numbers can have a good projected win/loss record, such as with Tim Wakefield and his projected 16-10 record. If one of these pitchers changed teams, their forecasted wins and losses could move dramatically and instantly.

Q. I have a question concerning your theory about maximizing value with keeper lists. I think the theory makes a lot of sense but how do you factor in draft inflation?

A. What's great about the 78% rule (i.e. you want to have a group of players that costs 78% of their projected value) is that the discount accounts for draft inflation before the draft begins. In other words, we're banking on a bunch of opponents overpaying for players and the more they overpay, the more we can swoop in late in the draft and grab all the bargains. Draft inflation correctly reflects that in the middle to late parts of an auction, there is too much money to spend and not enough value to buy. This is because everyone's keeper list, on average, will probably be getting good players at a discount. Then, in the final stages of the draft, the bargains are everywhere and only those who have managed their resources well are ready to pounce.

As I've argued, we have a specific goal in mind and that's to build a roster that costs about 78% of its real value. We don't care about overpaying because if you track things during the draft as I have outlined before (if you're not sure what I'm talking about, you can read the essay at our website called "On Paying 80% of Projected Value"), what happens late is that you hit a point where yoWanu can now afford to overpay for players. In auctions that start with $260 per team, I have found that this begins happening around the time you get down to $9 bids as the current "high end" of bids. If you've followed the draft diligently and with the methods I've described, you hit a point where now you've got all sorts of money and can outbid everyone by $1 for all the players you're really interested in. I have cautioned before that you do not want to end up in a position where you have any money left over but what's nice is that if you have an exceptionally strong keeper list, you can afford to keep a few players above cost (I explained this in the essay entitled "Perpetual Leagues: Making Your Keeper List" at the site) and if your keeper list was weak, you've probably already spent a lot of your money and are waiting for bargains anyway.

From January 24, 2004:

Q. When you have too many players on one team that played the same position, how does that affect your projections?  Do you deliberately project too many at bats on a team or do you consider where everyone's going to play?

A. When putting together the forecasts I have to consider many factors, such as whether players can move around and play other positions, who's going to play well and who's going to lose their jobs, who might inherit playing time as a result of another player's ineffectiveness and so on. We do consider that there might not be enough at bats to go around at a position and thus, some of the time, you'll see situations where we downgrade a players' projected playing time because there's no room for the player in question to play. For example, Deivi Cruz of the Devil Rays has a very low number of project at bats for 2004 compared to what he had a year ago with the Orioles. This is because, quite simply, he has no place to play as of now.

Projections can go the other way as well. Sometimes, there's no obvious candidate to play a position on a team and sometimes I project that the incumbent will lose his job and another player will step forward and take his place.  A great example of this occurred last year when I projected Desi Relaford to get as many at bats as a full-time player even though he did not start the season with the apparent regular job.  The same situation occurred to two years ago when I projected Geoff Blum to play quite a bit because I was, at the same time, projecting Morgan Ensberg to lose his job in 2002.  This is similar, in some ways, to this year's projection for So Taguchi of the Cardinals, a player whom I don't project to play well but also one whom I expect to find full-time duty out of a utility role, a projection which would be precarious at best as other transactions take place to change the nature of the Cardinals' roster.

Q. Do you think Ken Griffey Jr. can stay healthy this year?

A. It seems that I get asked this question every year in recent years and every year Griffey eventually suffers from an injury that enhances his newfound reputation as an injury prone player.  I'm not going to be the one to project him to return to the days of playing 160 games and when one considers that he hasn't even played in more than 111 games since the 2000 season, even my own forecast of 120 to 130 games could understandably seem optimistic.  That's about what I expect him to play, which would obviously represent an injury-shortened season. It seems hard to believe but the player once considered to be "the kid" turned thirty-four years old this winter. My short answer, and I don't think I'm out on a limb here, is no, I don't think he can stay healthy for the full 2004 season.

Post-2004 comment: Griffey played just 83 games in 2004, even fewer games than I was forecasting.

From February 9, 2004:

Q.  Who are ten players that you foresee having a better year in 2004 than most other prognosticators might predict?  I think Mark Teixeira and Scott Rolen are prime candidates to perform above their expected levels and I would like to know who else to add to this list.

A.  I recall answering a similar question last year and I'll answer it as I did then, not necessarily scientifically but by telling you which players come to mind so far, based on reaction to the forecasts:

1. Byung-Hyun Kim
2. Johan Santana
3. Laynce Nix
4. Brad Fullmer
5. Adam Dunn
6. Jose Contreras
7. J.J. Davis
8. Jason Bay
9. Albert Pujols (I listed him a year ago and still do - hard as it is to believe, he's STILL underrated)
10. Miguel Cabrera (unfortunately everyone knows about him because of the playoffs)

Post-2004 comment: I'm not entirely disappointed with this list.  No doubt, my #1 in Kim proved to be a complete bust but the #2 performance speaks for itself and I was happy with performances from Dunn, Jason Bay, Pujols and Cabrera.  The reader who asked the question did exceptionally well in that both of his/her picks outperformed 2003.

From February 29, 2004:

Q.  My league uses quality starts as a scoring category in place of wins but I don't know of any analysts who do projections for this statistic.  However, since you do project the individual components of a quality start (games started, innings pitched, earned runs), it would be easy for me to calculate the projected IP/GS and ER/GS for each starting pitcher.  Would there be a way to take this information to come up with a forecast for quality starts?

A.  Let me give you some good news and that is that we are considering adding quality starts for the 2005 season.  As for estimating it, here's a rough way to estimate them based on our forecasts.  This isn't perfect and I've already been working on the methodology for forecasting QS for some time but it should at least give you a better estimate that incorporates the way we expect a pitcher to pitch.  As I say, it's not perfect but try this and make sure you use it only for pitchers who have at least one start forecasted:

Estimated QS = -0.24 + (.5413 * GS) + (.1074 * W) + (.0452 * IP) + (-.1094 * ER)

The problems with the formula are fairly obvious.  Certainly, it's not possible to have a "negative" amount of quality starts but we need that modifier at the beginning to get a more reliable result.  As you can see, the formula is much simpler than we could make it.  For example, we might have started by presuming that all shutouts are quality starts to improve the formula.  Basically, the formula is giving you a base that considers games started, wins and innings and then applies a penalty based on the number of earned runs allowed.  Applying this formula to the 2003 pitching population who started at least one game, it had a standard deviation of +/- 1.75, meaning that for about 95% of the pitching population, it estimates the number of quality starts they had within about 3.5 quality starts, which I think is more than good enough for you to use in your league.  One other obvious flaw in the formula is that, theoretically, a reliever who starts the occasional game could end up with more quality starts than actual starts so I suggest you limit the result to be the lesser of games started or the result of the equation.  Undoubtedly, the formula is far too basic and needs to be refined before it could be applied to a successful long-term forecasting model but it's the best I can give you for right now until I focus my attention more on this category.

Trying it on a real life example from our 2004 forecast set (Randy Johnson), Johnson is projected to start 30 games, win 14, pitch 214 innings and allow 97 earned runs.  Inserting all this into the formula, we get:

Estimated QS = -0.24 + (.5413 * 30 Starts) + (.1074 * 14 Wins) + (.0452 * 214 Innings) + (-.1094 * 97 Earned Runs)

And So, Estimated QS = -0.24 + (16.24) + (1.50) + (9.67) + (-10.61) = 16.56 Quality Starts or 17 if we round it to the nearest integer

Post-2004 comment: As suggested then, quality starts will be added to our forecasts in 2005, based on votes taken in the annual reader survey at the end of the season.

From  March 7, 2004:

Q.  My online draft snakes 1-10 and then 10-1.  Is there any benefit to being at the beginning (picks 1-3), the middle (picks 4-7) or the end (picks 8-10)?  The guys that pick at the end always complain about their bad luck so I wondered if it made a difference.

A.  I could see mild complaining with the final pick if you had confidence that everyone drafting ahead of you knew the precise actual eventual value of the players but because we have confidence that we can forecast performance better than our competitors (or we would like to think we do), we shouldn't get too wrapped up in worrying about our draft position.  In fact, for fun, let's say that there are about 230 players to be drafted in a ten team draft and let's also pretend that somehow everyone in the league knew precisely which players would be the most valuable.  In terms of the average quality of player (with #1 being the best and #230 being the worst player drafted), check out the differences between a straight draft and a snake draft, again presuming that everyone knew exactly who would be the best player on the day of the draft:

Average Quality/Rank of Players Drafted in a Perfect 230 Player Draft

 Draft Position

Straight Draft

Snake Draft

1

120

115.6957

2

119

115.6522

3

118

115.6087

4

117

115.5652

5

116

115.5217

6

115

115.4783

7

114

115.4348

8

113

115.3913

9

112

115.3478

10

111

115.3043

As you can see, in a straight draft, it can make a moderate difference in the average quality of player that a team is drafting but in a snake draft, the difference is almost negligible and is certainly offset if a team owner has confidence in his own drafting skills.  The only way there might be an argument about unfairness is if the gap between players is so wide that the top few players (e.g. Alex Rodriguez) make so much difference on their own that they can clinch a league.  I don't believe that's the case in just about any league.  By the way, snake drafts are not only my preference but also my strong recommendation for leagues that don't retain players from year-to-year. 

Q.  One thing I noticed about your forecasts is that, compared to other publications, you seem to put a lot of weight on the previous season, particularly for stolen bases.  For example, I'm confused as to why you have Vladimir Guerrero stealing only 13 bases and Luis Castillo stealing only 26.  Guerrero stole 40 bases just two seasons ago and Castillo stole 48 bases two years ago.

A.  You are astute to notice that for speed, I do put a bit more emphasis on recent performance than I might for other categories but in this case, the Guerrero forecast is more related to how I expect his back to impact his running game.  I have noticed what you're talking about with some other publications (though not all), who seem to use something like a three year average to project steals.  That is extremely dangerous.  As our 2003 projection wrap-up showed, stolen bases are better predicted by the previous season than they are by using a two-year or three-year average but I must clarify that our forecasts aren't just based on averages but rather my own assessment of a player's skills, which are subsequently translated to a statistical forecast.  In the case of Castillo, I quite simply don't believe he runs as quickly as he did two years ago.

Post-2004 comment:  Guerrero stole 15 bases and Castillo stole 21.

From March 21, 2004:

Q.  I checked a sports site which has many proposition bets.  Some of your projections for individual players are significantly different from their over/under marks.  If your projections are correct, wouldn't you be theoretically able to take advantage of those odds?

A.  It's difficult for me to respond to such questions because firstly, I do not encourage that members use our forecasts for gambling purposes because that's not what they're designed for.  There's a pure love of baseball and forecasting behind them.  Having said that, your question is a good enough one that it deserves answering in a public response.  Quite simply, most such bets are based on minimum standards that players have to meet.  For example, they might have an "over/under" on wins for a pitcher but require that the pitcher make at least 30 starts.  They might have an "over/under" for hits but require a batter to have played a certain amount.  Our forecasts are at the midpoint of expectations for not just performance but playing time as well.  In other words, our forecasts account for the percentage chance that a player will have a lengthy injury which is why so few of our projected players will ever get more than about 155-160 games projected.  We have pitchers like Josh Beckett, who are forecasted to get just 10 wins in the latest forecast set but are also forecasted to start only 27 games.  Many such odds would simply call it a non-bet if Beckett started only 27 games.

Anyway, even with my strong recommendation that readers don't use our forecasts for such purposes, we can't control what they do with them so the answer to your question is that anyone who uses a solid statistical foundation and who knows how to recognize overlays in a betting forum does have a betting advantage in the long run.  I just hope readers understand that this isn't horse racing and the forecasts we published are based on our estimation of not only how players will perform but whether they will even play.  Oddsmakers tend to avoid consideration of the second and simply call it a non-bet if the player gets injured or benched.  Beyond that, I really don't want to comment more because it is my sincere hope that people subscribe to our forecasts because they love them or they apply them to more friendly and less stressful forms of entertainment, such as fantasy baseball.

Post-2004 comment: I had used Beckett as the example here and he went on to start 26 games and win 9 of them (at the time we were forecasting 27 starts and 10 wins).

From April 4, 2004 (Opening Day Commentary - was in the form of narrative rather than question and answer):

"Adam Dunn:  I have to start by conceding that a few years ago, I was quite tough on Dunn when everyone else had him as a great prospect.  In fact, back of January, 2002, I wrote the following: "Everyone knows about this guy and as we finalize projections for 2002, we're looking at projecting a struggle for the future superstar.  The Reds would stick with him for at least a half a year, which could be devastating to a fantasy roster if he goes out and hits .230.  He already has the major league power but he strikes out far too often to be an immediate success.  Even if he lives up to expectations, he's not worth what many fantasy leaguers are willing to pay for him.  If you trade him now in your fantasy league, you'll get a player worth what Dunn would do even if he does well."

I quote the Dunn comment because I've done a turnabout of sorts at this stage.  I think the growing pains are over and the player who hit just .215 last year is finally going to be ready to go out and hit in the .260s with 35-40 home run power and plenty of runs scored and RBI, not to mention 15-20 steals."

Post-2004 comment: Dunn hit .266 with 46 home runs, 105 runs scored, 102 RBI but only 6 steals.

"Johan Santana:  By now, everyone knows how high I am on him and while he's coming off elbow surgery during the winter, his stellar performance in 18 starts last year was no fluke.  Even with the team he plays for, I'm expecting he's going to be a bigger winner than most expect and though I don't expect his ERA to be as low as it was, partially because I'm expecting a minor control lapse, he's going to be a huge strikeout guy and even with big expectations, still a bargain."

Post-2004 comment: Santana exceeded my expectations, obviously.

"Jeff Weaver:  His was a very tough forecast to make because you don't know which of two pitchers will show up.  He was disastrous with the Yankees last year but he seems to have a solid grip on his spot in the Los Angeles rotation and because I'm expecting the Dodgers to be good, he's a solid bet for double digit wins.  While I don't see him being an ace pitcher or anything, I could see him landing somewhere in the range he established in 2000-2001 with Detroit, which would be much better than everyone else is expecting and certainly worth having on a fantasy team for cheap.  He strikes me as the kind of pitcher who should fit in well to the National League and become a solid #3-4 type starter for quite a few years."

Post-2004 comment: Weaver went 13-13 with a 4.01 ERA in 34 starts.

"In the American League East, one flip for me is a tough one to make.  Most readers know that I grew up watching the Toronto Blue Jays and though I tend to root for them in the division, and still will in 2004, I can't let my love of the team cloud my interpretation of the player forecasts.  Where back in February, I continued to project a seventh straight year of the American League East coming out in the same order, I now believe the Orioles have moved ahead of the Blue Jays in the division and that Toronto will disappoint, compared to the expectations of many pre-season publications and prognosticators.

Believe me, I'd love to be picking Toronto to do better (and as a fan I hope I'm wrong here) but in particular, I'm forecasting disappointing seasons for Pat Hentgen and Josh Towers and while I like the new-look bullpen here, I'm concerned about the potential everyday contributions from the likes of Reed Johnson, Orlando Hudson and Chris Woodward and I don't believe that either Carlos Delgado or Vernon Wells can repeat the level they set in 2003."

Post-2004 comment:  I have looked back at all the prominent publications to see how they picked the American League East.  While no doubt I got some divisions wrong (e.g. the NL Central), I cannot find any other publication who picked Toronto to finish below third place.  As it turns out, they were even worse than I expected but my reasoning here seems to have been on track.

From April 7, 2004:

"- Thought it was interesting - A reader of my projected list called me a "moron" in reference to my top 200 prospects list.  Leaving Jesse Foppert off the list in 2003 generated similar reaction.  It occurs to me that something seemingly unique to sports, if not baseball in particular, is that it is one of the only fields in which you can be called a "moron" for your views on what will happen in the future, even before anyone knows whether you're right."

From April 11, 2004:

Q.  You have Paul Konerko forecasted to steal no bases but he already has one.  Doesn't that mean your projection for Konerko should have at least one steal in it?

A.  No.  I answer this question every year and it deserves to be answered in this space again because it's important if you're to get the most out of our forecasts.  The projection posted, after the season is already in progress, is what the player is forecasted to do for the remainder of the season.  That means that despite Konerko getting a stolen base, the forecast is saying that we don't project him to steal another one the rest of the way.  It may prove to be incorrect but that's how you should be reading the forecast anyway.  The first steal is in the books.  Our focus is on forecasting the future.

Post-2004 comment:  Konerko did not steal another base the rest of the season.

Q.  I happened to notice that Ron Belliard is leading the league in hits, so I plugged in some numbers and it looks like he's already broken through the upper 95% confidence level for his .257 forecasted average, after only 20 at bats.  Is my math right?  Too early to get excited but I'd be interested if he'd caught your eye as well.

A.  Unfortunately this isn't the case and though things have changed slightly since you wrote the question, I would like to respond.  As of games completed through this past Thursday, Belliard had 10 hits in 20 at bats, which at first glance would seem like it's outside the margin of error but there are several problems.

Firstly, even if at bats were the "denominator" of the equation (they aren't in how I look at statistics), 20 at bats would be too small a sample.  As I explained briefly in the debut of "David's Notebook" this past week, I tend to require at least 30 before I can even begin to consider a sample as meaningful and usually I prefer much larger sample sizes to make meaningful conclusions.  Remember that someone's going to be outside of the margin of error and it's always possible that a player is simply among the small percentage of players who, by luck alone, could be performing outside expectations.

But 30 at bats wouldn't be enough anyway.  In fact, here's how I break things down.  Firstly, I want to know what percentage of time a hitter is making contact.  In Belliard's case, through games completed on Thursday, he had struck out twice in twenty at bats or made contact 18 out of 20 at bats (i.e. 90% of the time).  In the latest projection update published this morning, I'm forecasting his contact rate to be about 84% which, on average over a 20 game span, would be 17 times making contact out of 20.  Therefore, even if the sample size for times making contact was large enough, and it isn't, it would mean that he's performing almost exactly where he should be in terms of making contact.

Then, within times making contact, I consider how many singles, doubles, triples and home runs a hitter gets separately.  The latest forecast set has Belliard expected to get a single about 20% of the time he makes contact, a double about 8% of the time, a triple only about 0.3% of the time and a home run about 2% of the time.

Examining what he's done so far, he has no triples and no home runs.  He has doubled about 11% of the time he's made contact (and he couldn't be closer to my expectations based on the number of times he's made contact).  He's singled about 44% of the time he's made contact, about twice what I would have expected.  The bad news is that in the history of baseball, no one has ever singled at that rate per time making contact over a full season of at least 500 at bats.  In fact, the best mark in history at making contact over a full season was back in 1898 when Wee Willie Keeler singled about 37% of the time he made contact, in an era when distinguishing between singles and errors was still unclear.  In modern baseball, the best rate actually came in the year 2000 when Luis Castillo singled about 35% of the time he made contact and went on to hit .334.  The subsequent season, he hit .263, largely because his singles rate fell back to Earth to about 25%.  In fact, there have only been 26 seasons since 1969 when a hitter has sustained a singles rate per ball in play above even 30% and most of them have been batting title winners and perennial contenders and with the exception of Rod Carew, even the best hitters only did it once or twice in their career (i.e. Wade Boggs only had a rate even above 30% once in 1985, Tony Gwynn only did this once in 1984 and so on).

I say all of this because in fact, the singles per ball in play percentage I'm forecasting for Belliard of 20% is just about at his career mark of 21%, some of which came in Colorado last year when had a 22% rate.  He's made contact 1808 times in his career before this year and I'm not prepared to set aside the weight of that against 18 times making contact this year.  It's virtually impossible that a hitter can maintain a 44% singles rate per ball in play over the long run and last year's leader, Derek Jeter, hit singles "just" 30% of the time he put a ball in play.

The short answer is that so far, most of Belliard's batting average comes from his singles and most of those will not be coming at the same rate in the long run.  I'm hoping that clears up the math of the situation.

Post-2004 comment:  Belliard hit .282 for the full season, including his hot start but he hit just .260 from May 1st on.  We were continuing to forecast a .257 hitter the rest of the way.

From April 21, 2004:

- A reader wrote to say that he does the opposite of panicking by trading for guys that are off to a cold start, which is often a pretty good strategy, but he then added something that was a bit of a fallacy or misunderstanding of the law averages.  This is worthy of either a note in the weekly notebook on in AskDL.

I'll clarify here.  What the reader had said was essentially true and that is that cold starts by players afford smart fantasy GMs to trade for slumping players but he then followed it up with "if a guy is hitting .150 at the end of April who always hits .300 every year, I know he's going to finish with a .300 average by hitting even way better than .300 from May to the end of season because of the law of averages."  It's a common misunderstanding of the law of averages.  The law of averages does not say that probabilities will balance out in the long run by something doing more than it is capable of to make up for a slow start.  It simply clarifies that the larger the sample size, the more a player or a coin flip or whatever will come closer to its real ability than in a shorter sample.  It does not mean that something is "due" to occur because it hasn't yet.  If I really believed that a player is a true .300 hitter and he starts by going 0 for April, I'm going to be projecting a .300 hitter from May on, if that's my belief, and it means that this, combined with his season-to-date miserable April, puts him on target to finish under .300 for the season, even if his real ability is to hit .300.  The best example I can make is that let's say you flip a coin ten times and you get the unlikely result of one head and nine tails.  Now, you're asking me to forecast the next fifty flips of the coin.  I'm still going to be projecting a 50/50 split the rest of the way, if I believe the coin is fair, and what that means is that the implication of my remaining forecast is that the coin will finish with more tails than heads.  You may not agree with the application of the principle to baseball forecasting but I do want to clarify that the "law of averages" does not mean that things balance out in the long run by "making up for" a strong or slumping start.  I don't think I can explain this as well as the many statistical books out there that do (and particularly under the topic of the so-called "gambler's fallacy" which is covered in many books that talk about betting schemes and so on).   Readers who want more info immediately should look up the term "gambler's fallacy" in any of the online search engines to get a great variety of explanations.

- Before the season started, many readers questioned why I was forecasting Morgan Ensberg to play "only" about 130 games.  I have to admit that it's not entirely a belief about what he'll do when healthy because I often tend to discount lesser experienced players more than veterans.  On average, it's the statistically prudent thing to do and history has shown it.  It accounts for the potential for slumps in inexperienced players and/or the potential for injury the first time a player goes through the long haul of a 162 game season.  In so doing, the margin of error on such players is significantly lowered.  But in Ensberg's case, there is another factor and that's the Jimy Williams factor.  I think Jimy is actually a better manager than some give him credit for but he does seem to rotate players through the lineup when there's no one present at a position who has several established years in the books.  Ensberg had a great year last year and he could easily exceed the number of games I've forecasted but thus far, his struggles so far may be just the sort of thing that will cause him to achieve the discounted number for inexperienced players.  Of course, he's a far superior player than the one we've seen so far and in a limited sample, he's showing much more of an ability to take walks than we're used to.  I just couldn't bring myself to forecast more games than I did because I thought to do so would be a disservice to our readers.  Job security plays a big role in the forecasts, particularly under the games projected, and despite a great season last year, I don't see Ensberg's job security as absolute here if he continues to hit to the tune of a sub-.200 average.

From April 25, 2004:

Q.  I was looking at your projections and specifically, the relative confidence ratings you publish.  Instead of doing an awesome season projected for Albert Pujols, with a relative confidence of only 25%, why not publish something lower (like a .330 season with 35 home runs) and a much higher relative confidence that is less of a gamble?

A.  I've done my best to explain relative confidence in the past but I guess I do no need to return to the topic as I've faced several of these questions since the last time I talked about the RC rating.  Firstly, I can't publish a forecast with any higher confidence.  I've used all the data that's available and based on the amount of the data, the consistency of it and other factors that influence my confidence, I assign a relative confidence rating after the forecast is created.  For example, my Opening Day forecast for Jose Contreras, like last year, has a very low confidence rating but I still made the best possible forecast I could make and then assigned a confidence rating to it after that.

The mistake many readers seem to make is that they look at great projected seasons like the one for Pujols, Contreras or Byung-Hyun Kim and they falsely believe that I could assign greater confidence to it by bringing the projected numbers closer to the typical or average player, making it (as you well put it) less of a "gamble."  This is not the case.  What I'm publishing really is what I believe the midpoint of expectations is for the player.  The confidence level does not mean that the player is less likely to achieve a high level.  Rather, it means that the margin of error is much wider.  That is to say that if I publish Pujols to hit in the .350s, it's an inconsistent .350.  In fact, on Opening Day of 2003, I published a low confidence forecast for Pujols - with an RC rating of just 26% - that had him forecasted to hit a career-best .334 with career highs in home runs (38) and walks (73).  Interestingly, I addressed a similar question then but in particular, a reader with whom I carried on a lengthy email exchange said he couldn't understand how that could be the midpoint.  He suggested, as you do, that I could create a higher confidence forecast by lowering the season more to the levels Pujols had established and I recall explaining to the reader that the margin of error was wider and that Pujols could actually exceed the expectations by a wider margin that a low-confidence player would.  Of course, Pujols actually did end up finishing on the high side of my expectations, hitting .359 with a career-high 43 home runs and 79 walks, all even higher than the career-best marks I had forecast.

I really do believe when I publish a forecast that a player is as likely to exceed the projected mark as he is to under-achieve it but to demonstrate this, let's take a look at the history of "high performance" forecasts.  I'll use the hitting side here of players forecasted to hit better than .300 as an example.  Let me show you the typical deviation.  I started publishing confidence forecasts here only in the past few years but as we were building up the system, we maintained them for two full years prior and so I'm including those in the analysis as we ended up using the same methodology as we use now.  I am not including a confidence model that I rejected after a test-run (these weren't published anyway) in the 1998 season.  Even including every possible season, the data remains quite limited here as there aren't that many plus-.300 seasons projected each season but this should make my point anyway:

Relative Confidence Rating 1999-2003 Combined Average >=.300 Projection Combined Average Eventual Result Standard Deviation +/-
80+ .316 .319 +/- .007
60-79 .315 .314 +/- .012
40-59 .308 .311 +/- .019
20-39 .317 .324 +/- .023
0-19 .309 .311 +/- .031

Several things jump out when you look at the table.  Firstly, notice how all of the averages are right about where they should be and in fact, the lowest confidence forecasts of plus-.300 hitters, on average, actually slightly exceeded the expectations on average, though the difference is statistically insignificant.  What's more interesting, as I've been saying, is to look at the standard deviation.  I don't recommend you use this history of standard deviations just yet to make statistical conclusions about "low and high end" expectations for players but look at how neatly the standard deviation increases as the confidence value gets lower.  In the case of the relative confidence forecasts with a rating of 80 or better, the standard deviation is just +/- .007, meaning that about 95% of these high-confidence but all too rarely published forecasts, historically, have landed within about 14 points in either direction of the projected batting average.  On the bottom confidence group here, the standard deviation is extremely wide of +/- .031, meaning that 95% of the forecasts, historically, have landed within about +/- 62 points of the batting average, wide enough that you can run into real problems if you hit one of the bad ones.

I think the results here would have been even more consistent if the relative confidence rating was not assigned to the whole forecast.  In other words, I have contemplated publishing ranges or RC ratings for each category because sometimes, I'm much more confident of a player's stolen base projection than I am of his batting average projection and thus far, the RC rating hasn't captured that.

Anyway, the short response to your question is that I can't raise the confidence any higher than you see.  I've made the best possible forecast that I can for each player and while some low confidence forecasts may look extraordinarily good or bad, they really are at the midpoint of my expectations and statistically, the case has been that the players perform, on average, as the forecasts say they will.  Unfortunately, the lower confidence ones have a much wider margin of error, as you can see from the chart above.

Q.  I'm in a 5X5 keeper league and from what I remember, you are not a big Jeremy Affeldt fan.  I've been offered John Smoltz for Affeldt.  What do you think of the trade for the long-term and the short-term?

A.  For the short-term, my own forecasts give the obvious answer that I like this trade as I continue to forecast Affeldt as a pitcher who will perform below expectations this season and who will eventually lose his rotation spot.  Even for the long-term I like it because if Affeldt struggles as much as I expect him to in 2004, then he'll be easy to re-acquire in next year's draft and then you'll have Smoltz on your roster too.

Sometimes, it's tough to trade away a prospect and I know that for about one in fifteen of them, I'm going to be way off and they'll turn out to be something extraordinary that I didn't expect.  In Affeldt's case, I don't see this happening and I think that if you got Smoltz this year, you'd be a much stronger team and even if Affeldt turns out to be as great as everyone expects, you would have gained substantially more in this deal for the current season, enough to offset the unlikely scenario that has Affeldt going on to be Cy Young a few years down the road.

I might be wrong about Affeldt but I just don't see what everyone else sees.  His perceived trade value will have gained a boost as he threw seven innings of two-hit ball the other day.  Even if he goes out and has an ERA in the 3's over a full season, which I'm not projecting, I still like Smoltz better for the 5X5 league anyway which means there's almost no chance you'll be kicking yourself unless Affeldt wins 20 games or something surprising.  Heck, I'd probably trade the imagined/projected "living up to his potential" version of Affeldt for Smoltz so that should make your decision a bit easier.

Post-2004 comment:  Affeldt ended up hurt but did get a brief chance to be the closer for Kansas City.  Regardless, Smoltz obviously ended up having the superior season.

From May 2, 2004:

Q.  I was wondering if minor league performance to date influences your major league projections?

A.  Yes it does.  As you probably know, we publish weekly minor league translations for every player from Single-A on up and we are always watching for players who could get a call-up.  Also, the actual performance itself, just like major league performance, influences the direction our forecast takes.  That's why a player such as Edwin Jackson (who was downgraded to a projected 4.47 remaining ERA, almost a half run higher than in the previous issue) can see such a change in his forecast even though we haven't got any new big league data.  Basically, his minor league performance has caused us to bump up his forecasted ERA in the majors.

Q.  A trade just went down in our league that was completely one-sided.  An owner took advantage of another by trading away a couple of fast starters for a superstar that is starting slow.  I officially protested this trade.  Our league voted on it and approved it because they saw no collusion and saw it as only taking advantage of "selling high."  There is a difference between a savvy deal and one that is outright ridiculous.  Don't you think that protests should not only prevent cheating but completely lopsided deals?  That was the real point behind my protest.

A.  It's tough for me to answer this because I know you want me to agree with you here but actually, I believe that as long as cheating doesn't occur, if an owner is capable of "taking advantage" of another owner, then that's part of the game.  Moreover, and I know you say it's lopsided but consider that the owner who's being exploited does have the right to make seemingly lopsided trades against him because maybe he's sincerely convinced he made the right deal.  Part of the great joy of fantasy baseball and clearly part of the challenge is that everyone has varying opinions about players and it's the gaps between those opinions that generates trade activity and ultimately determines who wins.

Every league is different and my only recommendation to you is that if you think this owner is, to steal a term from poker, a so-called "fish" then by all means you need to go fishing yourself.  Obviously if collusion is the only grounds upon which your league will overturn a trade, then you need to get on the phone with this inferior owner and start hammering out a series of deals.  You should be able to learn how he thinks by the deal you mentioned - start offering him all our fast starters for guys whom you see as valuable and who are off to a slumping start.

I've been in this boat several times where other owners thought that I was the one being taking advantage of.  For one example and a lengthier answer to your question, I encourage you to refer to my essay called "On Fantasy Trading Rules" (republished February 14, 2004) that's available from the home page of our site.  Another example that was not mentioned in that essay but is similar was where I helped mediate a trade in another league.

One owner had tried to trade Randy Myers in the middle of the 1998 season when he had about 25 saves for Toronto at the time, if I recall correctly.  He was trying to deal Myers for Phil Nevin.  At the time, Nevin had four part-seasons in the books and again if I recall properly, was hitting around .220.  He had never shown any real power and was not getting any playing time.  The owner who protested the trade had presented his case that it was "obvious" that it was a lopsided deal and said something along the lines of "Nevin is already twenty-seven or twenty-eight years old" (I don't recall which) and that "therefore he has no prospect upside."  I recall that he also pointed out that Nevin, who should already be in his prime, was a career .230 or .240 hitter at the time and didn't even seem to have 20 home run potential.  The owner acquiring Nevin made his case to me that he saw Nevin as a future 30+ home run man and this same owner was forced to publicly state that he saw Myers as a guy on the fast decline.  After reviewing the two team cases, I allowed the trade.  I believe that Myers went on to have only three or four more saves in his entire career and within two years, Nevin became everything this astute owner had said he would be and more.  In terms of what makes a deal "ridiculous" (as you called it), where we do draw the line?  What criteria should we use to determine whether a trade meets that standard as opposed to just being "apparently lopsided but actually not?"  I won't pretend to be able to offer the answer to that question so perhaps I can't help you but I do think if you're to come to peace with this issue, you may want to pose these questions.

I know it doesn't always work out like this but deciding what's a good trade is not always black and white and fantasy GMs have the right to make "dumb" moves because many of them turn out not to be so dumb when we look back at them years later.  I understand your frustration and have shared it, on occasion, in my own leagues.  But you asked my opinion and that is that quite simply, we have the right to be foolish as long as we don't cheat.  No one has the right to tell another owner that they are better at assessing how fair a trade is because the gaps in perceptions about players or their eventual value to your fantasy team is exactly what this game is all about.

Q.  You have Ichiro projected to have a .329 average the rest of the way.  Consider that since June 16th of last year, he's hit .287 and since July 9th, he's hit just .265.  Any way you look at it, if you're using a 95% confidence model, your projection would seem to be a bit too optimistic.  What confidence level are you using?

A.  There are several aspects to your question I need to address.  We "average" out at a 95% confidence model but that doesn't mean we use a fixed statistical model only.  My own scouting observations of a player do come into play and in some cases, I use as high as a 99% confidence level and in other cases, I could use less than 95%, depending on the category and how specific I need to be.  For something like triples, a 95% confidence level works extremely well but for something like singles, I need 99% confidence or even more.  I rarely go below 95% in terms of margin of error analysis because too many players would be falsely flagged as a concern but I have done it, on occasion.

It seems that you are looking at the statistics on a "per at bat" basis, which I don't agree with doing.  First, I'm forecasting a player's ability to put the ball in play and then within that model, I'm looking at the percentage of times he gets a single, double, triple and home run per the number of times he puts a ball in play.  I refer you to my answer in this space re: Ronnie Belliard of a couple of weeks ago if you want to see the actual breakdown of the sort of way I take apart the performance.  In fact, Suzuki's projected remaining projected average is well within the margin of error when you look at it that way.

Beyond that, I don't simply limit myself to the most recent data, even though it carries greater weight and it seems that starting at June 16th of 2003 would be deliberately picking the most pessimistic starting point for Suzuki that we can find.  For example, if I start at June 1, 2003, Suzuki was a .309 hitter since then before the start of this season.  If you look at the rate he was making contact even as late as starting after the All Star break, he put the ball in play about 89% of the time after the break compared to about 91% of the time before the break, which is clearly the same player contact-wise.  On a per ball in play basis, his doubles rate was the same after the All Star break (about 5%), his triples rate was even better (about 2% compared to 1% before the break) and his home run rate was about the same (about 2% in both halves of the season).  His fall-off in average was due to the number of singles he was getting per ball in play which was an incredible drop from 42% per BIP to 21% after the break.

What's important to realize is that a 21% singles rate per ball in play is simply not indicative of the type of player we're dealing with here.  In fact, over his career including 2003, Suzuki's rate of singles per ball in play prior to 2003 is about 28% and so, a 21% performance over a half-season of just over 250 balls in play is entirely within the margin of error, though clearly on the low end of the margin.  Using a 99% margin of error (which is what I tend to use for the singles category), if he were really a 28% singles player, Suzuki could be expected to have a singles rate over that span of anywhere between about 20.8% and 35.2% and so, though he's on the extreme outer edge of the margin for singles, he isn't enough outside of it that it forces me to make an adjustment, particularly since I'm not going to confine myself to using only the second half of last year (in which he hit just .259) to make my conclusion.  I'm just saying that if I did do that, he would still be within the margin of error for the type of player I forecast that he still is.

The way I look at it, his slow start did cause me to downgrade him a few percentage points but to answer your question, we continue to use a confidence level of at least 95% on most statistical factors but I tend to use 99% on categories such as singles and walks and Suzuki's projection is within the margin of error there, even considering what he's done since the dates you picked.  Quite simply, I see the same player getting different results and so I'm sticking with my expectation that his singles rate per ball in play remains around 28%.  His contact skills haven't shown any decline at all, even in the period you mentioned.

Post-2004 comment:  Ichiro even exceeded expectations but did end up justifying my continued faith in him, even when he had gone a year without looking like a plus-.300 hitter. 

From May 12, 2004:

"I finally downgraded Barry Zito and though I've never been as high on him as many others, I thought he was a much better pitcher than this.  In particular, there are two primary areas of concern and then a few lesser ones.  The first, and most obvious, is how slow he's looked compared to the pitcher we saw in 2002.  Yesterday, I compared some tapes of a few games from 2002 to two games in 2004 and his fastball is straighter and doesn't have that same pop it used to have.  Admittedly, his curve ball has been right where it always was but what I've noticed, in these 2004 starts I've watched, is that Zito always seems to pitch as if he's tired, even at the start of the game.  More importantly, and what started my downgrade even a week ago, is that Zito is consistently leaving the ball up in the strike zone and even when he's at his best, any of his pitches can be hammered there, which they have been so far.  He's already allowed about half as many home runs as he allowed all of last year and he's pitched only about 15% of the innings.  I think more of these balls are going to start staying in the park for doubles and such, as Zito irons things out, but it's passed the point of being coincidence or bad luck.  Where I think we're going to see a shift is that he may start allowing fewer home runs but he'll continue to give up hits at a an alarming pace, which is where readers, no doubt, will recognize the shift in my latest forecasts that caused the projected ERA to balloon.  Zito, so far, isn't the same pitcher.  What I'd have to see for him to turn it around is not just a couple of good games but individual efforts that show that his pitches are hard to pull and that he can return to pitching inside effectively.  Particularly since he's been missing high and away, it's been tough to tell because even at his best, he's never been a true control pitcher and so it's difficult to determine whether he has a new approach or if his typical wildness is causing him to miss away.  I do think he's overusing the curve, even though he commented after one game that he hadn't thrown that many."

Post-2004 comment:  I list this to show the reader what I was thinking when I make a serious downgrade, as I did with Zito.  Zito didn't end up rebounding and if anything, I was a bit slow to react here but he deserved a long look.

From June 6, 2004:

Q.  I made a trade about three weeks ago in which I thought I was making out greatly by taking advantage of an owner who was down on Carlos Delgado.  Now I'm not so sure.  Should I be trying to unload Delgado or is he likely to have a strong second half?

A.  I'm not sure if in your question, you're asking about Delgado's ability or is health.  In terms of his ability, there's no way he suddenly fell into a decline and that means that the .227 average he boasts as of the time of my response here isn't a real representation of his ability.  Even his power numbers are off and in cases like his, where there's no reason to believe that he'll sustain such a low performance, you can never get fair value for a player like him because his market value drops in line with his performance.  If nothing else, if you feel forced to deal him away, at least wait for Delgado's next inevitable "hot streak" so that the receiving owner believes he's turned the corner, which the next hot streak may very well represent.  Delgado this season has played through rib cage problems and knee problems and a healthy Delgado shows a consistent track record.  I don't believe his MVP-type 2003 season was representative of his real ability as it was a bit on the high end of expectations but his 2001-02 seasons show us a consistent 30-40 home run hitter with 100+ RBI and 100+ run scoring ability and a decent average much higher than we've seen this season.  Look for him to turn things around soon as long as he can get healthy and I always discourage readers from dealing away players when they've been playing through injuries because you can never get fair value for such players in a trade.  My latest forecast has him hitting .291 the rest of the way with 27 home runs and 67 RBI.  No one's going to give you that kind of value for him if you try to move him now.

Post-2004 comment:  Delgado did have a great second half and recovered to hit .294 the rest of the way with 24 home runs and 67 RBI, almost exactly what we told the reader to expect here.

From August 25, 2004:

- On the baseball analysis side of things, I've spent a great deal of time lately revising our age charts to account for the effects of age on each pitching category.  It wasn't intentional but in previous years, I know I gave hitting more attention when it comes to discussing the effects of age and pitching has been interesting me much more lately because some call it volatile where I argue that, particularly in the case of relief pitchers, it's just that the sample size in a season is so small that we can't know whether most relievers even performed up to their abilities, even though there are some so remarkably consistent that we can't say anything but how good they are.  It does seem that the better a pitcher is, the more consistent he will be, which goes in hand with statistical theory.  An event that has a 10% chance of being successful will consistently give you failures try after try whereas an event that has a 40% chance of being successful can give you an assortment for a long run before you have any sense of certainty that you were right in the first place.  In other words, Eric Gagne and Pedro Martinez would have trouble having a bad season even over a limited number of innings but Esteban Loaiza could easily go out and do something entirely the opposite of what you expect over 50 or 60 innings.

In particular, I've focused recently on strikeouts as an age analysis category because not only are they pivotal to a projection but each time we do the analysis, it continues to support the idea of a constantly-declining ability in strikeouts as a player ages and not the more conventional peak that comes in a player's mid twenties.  It seems counter-intuitive to common sense that strikeouts would be in constant decline as most would expect that a pitcher becomes stronger in his mid-twenties and thus throws harder.

Because I've been so interested in this, I returned to update an old study that had considered data from the 1969 season through to the end of 1997 and I recently updated it to include consideration of new data through the completion of the 2003 season and also for the first time ever to incorporate translated performances for the minor leagues.  The analysis now also reaches back into the past of baseball history by adjusting for yearly strikeout totals.  All of the stats in the analysis are park-neutralized and league-neutralized and only players who have played a comparable amount in each of two seasons get considered.

What surprised me is just how straight the line is getting on the strikeouts analysis.  The declining line didn't change much from before but it's straightened quite tightly now as more data gets added to the set.  Including minor leaguers and big leaguers, there are now about 15,000 player performances considered for which there was plenty of data for two seasons and where the data remains limited now is only around the ages of 18-21.

The new analysis yielded an even more level line than we saw a few years ago and basically, and unlike any other pitching category, the pitcher's strikeout rate per batter faced (when we exclude walks from batters faced, which was something I've explained previously) was in constant decline with the exception of around his late teens.  What we saw was that a player would see a significant increase from turning eighteen to nineteen and that he would then begin a constant and steady decline which becomes much sharper as he reaches his late forties.  Here's a graph of the results which compares strikeouts per batter faced (excluding walks) of more than 15,000 neutralized performances compared to their rate of a year earlier (accounting for league, park, batters faced and level of competition):

The starting point in the graph has been selected simply so that we achieve an "average" of 1 by the time the player has reached 29-years-old but it has absolutely no bearing on the shape of the line and we could have started anywhere.  Basically, to read the chart, it's showing you relative skill in this category for an age.  That means that at the age of 18, a pitcher strikes out about 1.2 batters (that he does not walk) for every 1 he would strike out at the age of 29.

The line is so straight that it's remarkable, particularly where there are plenty of players in the analysis from the ages of 22 to 45 and this is the only such declining skill among all the pitching categories analyzed.  In the case of walks per batter faced, we see constant and steady improvement throughout a pitcher's career until his forties and we see such improvements in other categories such as the pitcher's ability to keep the ball down in the strike zone and so on.

Just for fun, I took a few well known pitchers who are perceived to have achieved their prime later in their career to see what they were doing differently.  There were some clear exceptions, such as Randy Johnson, whose strikeout ability per batter faced (when we exclude walks) has risen steadily since his mid twenties from a rate of about 30% around the time he was twenty-eight to a peak of about 40% by the time he was thirty-seven.  His is a particularly tough one to examine because of the effect of moving to the National League but unadjusted, it's clear he's an exception to the rule even though his data is included in the analysis.  What I did find, though, is that many pitchers who became successful later in their career can and do see the typical drop in strikeouts and what's moving them into the success zone is the sudden arrival of their control and in turn likely as a result, an ability to keep the ball in the park.

I'm not a complete believer in DIPS theory because I feel that pitchers can and do control what type of ball gets hit against them (e.g. Eric Gagne is excellent at making batters hit a ball weakly because they don't make square contact against him and thus don't hit a level ball) but for those who put a lot of weight on strikeouts, the history of baseball pushes us in the direction of speculating that strikeouts may be in constant decline.  This goes against both my instinct and conventional theory so I'm not going to be quick to accept the results of this new analysis though the line is so clearly straight that it makes one wonder how this could be possible.  I'm just noting it here to share with readers and am not making a conclusion at this point.

 

Register for our 2010 projections and get instant access!  Click here for details.

Fill out your e-mail address
to receive our FREE newsletter!

Baseball Notebook is an online editorial publication that actively reports on baseball at all levels.  All materials, unless otherwise stated, are © 1998-2010 Baseball Notebook.  All historical statistics displayed are obtained through licensed affiliation with Baseball Info Solutions.  As an independent editorial publication, Baseball Notebook is not sponsored by, authorized by, affiliated with or associated with any other mentioned entities, including those who own reserved trademarks that may be used for descriptive purposes only.  The terms Major League Baseball, World Series, National League, American League, All-Star Game and the names of the Major League Baseball teams are trademarks of Major League Baseball Entities.  Baseball Notebook does not offer, create, administer or endorse any fantasy baseball game or competition.