Corrected Exit Velocity Data & Leaderboards
Statcast data is now everywhere and everyone seems to be using it in some form. While detailed pitch information has been available via Pitchf/x, full season batted ball data was missing. Now the batted ball data is leading to some interesting findings, but it’s not a true answer. So far, 12.6% of the batted balls is missing data. I wouldn’t see this as an issue if the missing data was evenly distrusted, but it is biased. I have made a simple correction to the data and now how have available corrected overall data and leaderboards.
I went over the procedure I used to correct the data in this previous article. Here is a quick review of the problem and corrective procedure:
- 12.6% of all the batted balls are missed by Statcast. No bunts or foul balls were counted though.
- Most of the missing data are weak infield popups and groundballs. As a general rule, weak, groundball hitters are missing the most data. For pitchers, groundball pitchers are obviously the ones with more data.
- I found the average value for all detected batted balls fielded by each position.
- If the data is missing, I replaced it with the calculated league average values.