Special note: this post doesn’t have a lot of jokes. In exchange for your forgiveness, please accept this photo of Padres prospect Blake Tekotte. Thank you.
When looking at statistics, there are two major pieces of information to learn.
- How much has a player contributed to his team in the past?
- How much will a player contribute to his team in the future?
Often times, the ability of a player to contribute to his team in the future is called “true talent level”. This is a player’s raw ability, with other factors such as luck and the ballpark environment in which he plays stripped from the conversation. This is where the concept of sample size is most important. Without using an adequate sample size in measurement, all the stuff that doesn’t affect a player’s future performance might mess up our opinions. Sample size, among other things, is what gets us there.
While fun and interesting, when talking about things a particular baseball squadron should or should not do, a player’s contributions in the past generally aren’t relevant. Sure there are exceptions–when a lifelong Padre player is negotiating his final contract–for instance. But those are rare.
My stupid example: flipping a coin
Suppose you ask me to call heads or tails as you flip a quarter in the air. I choose heads, and wouldn’t you know it, the quarter lands heads up! Does this mean I will know the result of all future coin flips when asked? In other words, do I have a perfect “true talent level” of calling coin flips? Of course not, and we all understand why. Because of luck.
Along these lines, each measurement (or statistic) has its own requirements for sample size. If you flipped a second coin, and I guess correctly a second time, that still doesn’t prove my coin guessing. We simply haven’t reached the number of coin flips necessary to filter out the luck. As you approach 50 coin flips and calls, my successful calling rate will likely be pretty close to my true talent level of 50%.
Back to baseball: wOBA and UZR
The same applies for baseball measurements. Different stats require different amounts of trial before they eliminate noise. I’m not a stat expert, so I can’t expressly say exactly how many tries one should use for each stat. For me, 3-5 years of wOBA (my favorite hitting stat) is what I want to see when looking at a player. 500 plate appearances at minimum.
When measuring defense with UZR, however, things are different. 3 years of UZR data is worth about 1 year of hitting data. That means when determining a defender’s true talent level, as I understand it, you really ought to look at 9 years of data. I’m completely serial. 3 years of UZR at minimum.
So please, everyone from message board posters to SDUT staff writers, be careful when making judgement about a player’s future potential using statistics. Especially UZR.