Estimating Pitch Results from a Small Sample

A few days ago, I wrote an article examining Reid Detmers. For Detmers, I posted the following table on comparable curveballs and the lack of results.

Simply, I shouldn’t be posting a new tool until it is nice and shiny. The above results were garbage and reader Joe Wilkey called me out.

My goal when creating the comps was to take small samples from last season and find comparable pitches. The problem is that I shouldn’t have included any small samples in the comparable list itself. I’m going to give some background on the tool and show the corrected results.

The end game for this tool is to take a small sample of a pitch and find comparable pitches. The assumption is that comparable pitches will have similar results. Wilkey does utilize some results-based values (e.g., wOBA). I use none of these for finding the comps. Once the comparables are found, I average their results (GB% and SwStr%).


The parameters I used to find similar pitches are:

  • Velocity
  • Spin
  • Horizontal Movement
  • Vertical Movement
  • Horizontal Release Point
  • Vertical Release Point
  • Release Extension
  • Effective Velocity

Once I find the comparable pitches, I average their groundball and swinging-strike rates to get the new pitch’s expected results. From these averaged values, I calculate the pitch’s pERA.
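To make the comp-and-average step concrete, here is a minimal sketch, not the actual code behind the tool. It assumes each pitch is a dict holding its traits, a pitch count, and its GB% and SwStr%. It drops small samples from the candidate list, scales each trait by its standard deviation so mph and rpm are comparable, ranks candidates by Euclidean distance, and averages the results of the closest few. The field names, the `find_comps` and `expected_results` helpers, the choice of Euclidean distance, the value of `k`, and every number in the demo are assumptions for illustration. The pERA step is left out because the formula isn't given here.

```python
import math
from statistics import mean, pstdev

# Hypothetical field names for the eight pitch traits listed above.
TRAITS = ["velo", "spin", "h_mov", "v_mov", "h_rel", "v_rel", "ext", "eff_velo"]


def find_comps(target, pool, k=3, traits=TRAITS, min_pitches=100):
    """Return the k pitches in pool closest to target on the given traits."""
    # Small samples may be the target, but never a comp.
    eligible = [p for p in pool if p["n_pitches"] > min_pitches]

    # Scale each trait by its spread so no single unit dominates the distance.
    scale = {}
    for t in traits:
        sd = pstdev([p[t] for p in eligible])
        scale[t] = sd if sd > 0 else 1.0

    def distance(pitch):
        return math.sqrt(sum(
            ((pitch[t] - target[t]) / scale[t]) ** 2 for t in traits
        ))

    return sorted(eligible, key=distance)[:k]


def expected_results(comps):
    """Average the comps' results; results play no part in picking comps."""
    return {
        "gb_pct": mean(c["gb_pct"] for c in comps),
        "swstr_pct": mean(c["swstr_pct"] for c in comps),
    }


# Made-up pitches, two traits only, to keep the demo short.
pool = [
    {"name": "A", "velo": 75.0, "spin": 2500, "n_pitches": 400,
     "gb_pct": 50.0, "swstr_pct": 12.0},
    {"name": "B", "velo": 76.0, "spin": 2600, "n_pitches": 350,
     "gb_pct": 40.0, "swstr_pct": 14.0},
    {"name": "C", "velo": 85.0, "spin": 2000, "n_pitches": 300,
     "gb_pct": 30.0, "swstr_pct": 8.0},
    # Identical to the target but a small sample, so it is filtered out.
    {"name": "D", "velo": 75.2, "spin": 2520, "n_pitches": 40,
     "gb_pct": 80.0, "swstr_pct": 25.0},
]
target = {"velo": 75.2, "spin": 2520}

comps = find_comps(target, pool, k=2, traits=["velo", "spin"])
results = expected_results(comps)
print([c["name"] for c in comps], results)
```

In the demo, pitch D would be a perfect match on traits, but its 40-pitch sample keeps it out of the comp list, which is the mistake the original table made.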

I wasn’t really finding great comparable pitchers, so I had to change the code around a bit. I can still start from a small sample, but I only compare it against pitches that were thrown over 100 times in a season. Here are the new, improved results for Detmers.

The results line up a little better with Wilkey’s, with Kershaw(s) and Pannone making both lists. A pERA of ~4.00 isn’t elite, but it is average (link and link).

The tool wasn’t exactly created for Detmers, who made several starts, but more for someone like Luis Severino. His fastball averaged 97.6 mph in 2018 and 96.1 mph over 12 IP in 2019. This past season, it was at 95.3 mph in 6 IP. Here are the comps from 2018 and this past season.

A pERA of ~4.00 is pretty decent for a fastball since the rest of the pitches generate all the swings and misses. The small sample of results this past season was decent, but the comps show that his swinging-strike rate should drop by ~3 percentage points. The pitch goes from above average to average. He will need to pick up some velocity in Spring Training to keep up the production.

The overall plan is to find some comps for new pitchers and/or pitches once the season starts. I’d like the process to be as smooth as possible so I can provide quick and easy answers. So, let me know if you have any questions and/or suggestions on the information.





Jeff, one of the authors of the fantasy baseball guide, The Process, writes for RotoGraphs, The Hardball Times, Rotowire, Baseball America, and BaseballHQ. He has been nominated for two SABR Analytics Research Awards for Contemporary Analysis and won it in 2013 in tandem with Bill Petti. He has won four FSWA Awards, including one for his Mining the News series. He's won Tout Wars three times, LABR twice, and got his first NFBC Main Event win in 2021. Follow him on Twitter @jeffwzimmerman.

2 Comments
Joe Wilkey
3 years ago

Glad I could help Jeff! 😉

I really didn’t like using wOBA allowed as a metric, but I was trying to get some quick results. To be clear, I didn’t use it as an input, but I did use it incorrectly in the summary, as it is wOBA allowed on all pitches, not only wOBA on contact.

I have one question on this update, however. Depending on how you do the comparison, is it a good idea to include both velocity and effective velocity? The correlation between velocity and effective velocity in the population of pitchers with 100 curveballs is nearly 0.95, so it seems like you’re double-dipping on velo, unless you’re using a method that accounts for covariance.