pERA Update From SABR Analytics Presentation
This past Thursday, I spoke at the SABR Analytics conference on my per pitch valuations (pERA). I originally created them to form an understandable framework for comparing prospect pitching grades and major league results. Some byproducts of the work became useful like the effects of dropping a pitch. Today, I will make available new information I provided at the conference.
For the readers who aren’t familiar with the original work, it can be read in its 2500 word entirety in this previous article. Here is a summary.
- The key is to give each pitch an ERA value (pERA) based on the pitch’s swinging strike and groundball rates. All the values are based on the average values for starting pitcher. Closers will have higher grades because their stuff plays better coming out of the bullpen.
- The pitcher’s control is determined from their walk rate which is separate from the pitch grades.
- Each pitch is placed on the 20-80 scale with 50 being average, 80 great, and 20 horrible.
After creating the framework, I attempted to line up pitching prospect grades with the per-pitch-results. I wasn’t able to get the prospect and pitch data to match. I assumed the prospect grades would be bullish. People creating prospect lists are going to insert the best reports they collect and therefore be optimistic. Even adjusting for the expected bullish values, the grades didn’t meld.
The biggest cause for the disconnect is that pitchers quickly change. Most of the scouting publications collect their data after the season ends until they are published just after January 1st. Pitchers aren’t stable. They add a new pitch. Lose some velocity. Find control. These changes can be known as soon as their first spring training game but fantasy owners following prospects are stuck with dated data.
The other main issue is fastball grading. Grades are about 100% based on velocity readings, not their movement. I’ve found context needs to be added to the velocity measurements. For reference, this table contains the standard fastball grading scale.
Grade | Tool Is Called | Fastball Velo |
---|---|---|
80 | 80 | 97 |
75 | 96 | |
70 | Plus Plus | 95 |
65 | 94 | |
60 | Plus | 93 |
55 | Above Avg | 92 |
50 | Avg | 90-91 |
45 | Below Avg | 89 |
40 | 88 | |
35 | 87 | |
30 | 86 |
To show the disconnect, here are two examples of the scale failing. Kyle Hendricks fastball averages 88 mph. This puts his fastball at standard 40 grade and should barely get any MLB consideration. Last season, his fastball results were amazing with a 57% GB% and an 12% swinging strike rate. These values put his pitch grade at 67 or in the 94-95 mph fastball bucket.
On the other hand, Wily Peralta averaged 95 mph on his fastball. His groundball rate was 53% and he only generated a 6% SwStr%. These results are equivalent to a 46 grade or at 89 mph on the standard scale. He’s supposed to have above average results but the pitch is barely playable.
For these reasons, I needed to find out the characteristics of a good pitch. For this analysis, I looked at pitch break, regular spin, effective spin, velocity, and velocity difference from fastball for change. Manipulating various inputs, I came up with some characteristics for individual pitch quality. Here are the results so far.
Four-seam fastball
Most of this pitch’s results could be determined by spin and velocity.
RPM/MPH | 87 | 88 | 89 | 90 | 91 | 92 | 93 | 94 | 95 | 96 | 97 | 98 | 99 |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1500 | 31 | 33 | 36 | 38 | 40 | 43 | 45 | 47 | 49 | 50 | 52 | 54 | 56 |
1600 | 33 | 35 | 37 | 40 | 42 | 44 | 46 | 48 | 50 | 52 | 54 | 55 | 57 |
1700 | 34 | 37 | 39 | 41 | 43 | 45 | 47 | 49 | 51 | 53 | 55 | 56 | 58 |
1800 | 36 | 38 | 41 | 43 | 45 | 47 | 49 | 51 | 52 | 54 | 56 | 57 | 59 |
1900 | 38 | 40 | 42 | 44 | 46 | 48 | 50 | 52 | 54 | 55 | 57 | 59 | 60 |
2000 | 39 | 42 | 44 | 46 | 48 | 50 | 51 | 53 | 55 | 57 | 58 | 60 | 61 |
2100 | 41 | 43 | 45 | 47 | 49 | 51 | 53 | 54 | 56 | 58 | 59 | 61 | 62 |
2200 | 42 | 45 | 47 | 48 | 50 | 52 | 54 | 56 | 57 | 59 | 60 | 62 | 63 |
2300 | 44 | 46 | 48 | 50 | 52 | 53 | 55 | 57 | 58 | 60 | 61 | 63 | 64 |
2400 | 45 | 47 | 49 | 51 | 53 | 55 | 56 | 58 | 59 | 61 | 62 | 64 | 65 |
2500 | 47 | 49 | 51 | 52 | 54 | 56 | 57 | 59 | 60 | 62 | 63 | 65 | 66 |
2600 | 48 | 50 | 52 | 54 | 55 | 57 | 58 | 60 | 61 | 63 | 64 | 66 | 67 |
2700 | 49 | 51 | 53 | 55 | 56 | 58 | 60 | 61 | 62 | 64 | 65 | 66 | 68 |
Pitchers can get average results (50 grade) with a 88-mph fastball up to a 96-mph fastball.
Two-seam fastball
I found a small correlation with velocity but that was all.
Cutter
I found no consistent traits.
Curveball
Along with the four-seamer, the curve was the only pitch in which the results could be accurately measured.
RPM/MPH | 66 | 68 | 70 | 72 | 74 | 76 | 78 | 80 | 82 | 84 | 86 |
---|---|---|---|---|---|---|---|---|---|---|---|
2000 | 30 | 33 | 36 | 39 | 42 | 44 | 47 | 50 | 53 | 56 | 58 |
2100 | 31 | 34 | 36 | 39 | 42 | 45 | 48 | 50 | 53 | 56 | 59 |
2200 | 31 | 34 | 37 | 40 | 42 | 45 | 48 | 51 | 54 | 56 | 59 |
2300 | 32 | 35 | 37 | 40 | 43 | 46 | 49 | 51 | 54 | 57 | 60 |
2400 | 32 | 35 | 38 | 41 | 43 | 46 | 49 | 52 | 55 | 57 | 60 |
2500 | 33 | 35 | 38 | 41 | 44 | 47 | 49 | 52 | 55 | 58 | 61 |
2600 | 33 | 36 | 39 | 41 | 44 | 47 | 50 | 53 | 55 | 58 | 61 |
2700 | 34 | 36 | 39 | 42 | 45 | 47 | 50 | 53 | 56 | 59 | 61 |
2800 | 34 | 37 | 40 | 42 | 45 | 48 | 51 | 53 | 56 | 59 | 62 |
2900 | 34 | 37 | 40 | 43 | 46 | 48 | 51 | 54 | 57 | 60 | 62 |
3000 | 35 | 38 | 40 | 43 | 46 | 49 | 52 | 54 | 57 | 60 | 63 |
The harder the curve is thrown, the better the results.
Slider
A higher velocity gives the pitch some improved results.
Change
I found the bigger the difference in velocity from the pitcher’s fastball, the more effective the change.
—
While informative, the pitch characteristics are only useful for four-seam/curveball pitchers. I need to investigate more on what makes the other pitches work. I acquired a few ideas from people after the speech. I was informed I need to add the difference in pitch movement, break, and velocity from the starter’s fastball. Before, I only included the velocity difference with change ups. At some point in the near future, I will reexamine the data to see if I can improve picking out good pitches.
Another area I can improve is adding in more outside factors besides pitch control like deception, repeatability, and tunneling.
I am not giving up hope using preexisting prospect pitch grades to project a pitcher. For now, I am still working on the disconnect between what scouting publications report and major league results. It has taken me a few steps more than I would have liked but I feel I am getting closer. Hopefully soon, I can find a way to use published scouting reports or assume they are completely useless in their current format.
Jeff, one of the authors of the fantasy baseball guide,The Process, writes for RotoGraphs, The Hardball Times, Rotowire, Baseball America, and BaseballHQ. He has been nominated for two SABR Analytics Research Award for Contemporary Analysis and won it in 2013 in tandem with Bill Petti. He has won four FSWA Awards including on for his Mining the News series. He's won Tout Wars three times, LABR twice, and got his first NFBC Main Event win in 2021. Follow him on Twitter @jeffwzimmerman.
I was playing around on Baseball Savant and was seeing some of the same results. It seems weird just based on the eye-test. Is this the beginning of the argument for sequencing?
I have thought about sequencing and it just makes my brain hurt.
Markov Chains could simplify the sequencing problem considerably, allowing you to tractably calculate the stochastic value of the sequence distributions based on prior knowledge of the pitch values. The edge weights would presumably be derived from a linear weighting of league-wide results (e.g. a high fastball followed by a 12-6 curve is probably good for almost every pitcher) paired with pitcher-specific results as they deviate from the predicted model.
This would be straight machine learning, but I think it would produce a relatively useful model. It’s also possibly more general than it needs to be. Over time, it may become apparent that certain pitch sequences are simply always good, strengthening the league weights and reducing the reliance on individual factors (and thus, sample size problems).