Just a moment...

Archive for Statcast

Regressing StatCast: A Fable

This fable is inspired by Eric Hosmer and Dansby Swanson. In my last article, I noticed both improved in some of their StatCast metrics. So, I wondered how much should the values regress? After wasting more time than I’ll admit roaming for the answers, I remembered I already discovered the answers when creating my StatCast projections. Magically the season weightings work out to simply splitting the difference between 2019 and 2020 value. Normally, the most recent season would carry more weight, but with 2020’s limited sample, the two StatCast metrics (launch angle and max exit velocity) carry equal weight. Now, I can continue drudging through life. The end.


Blasts: A Subset of Barrels (Not the Same as Statcast Blasts)

Edit (5/13/24): Hi! Alex here, writing to you from the future. MLBAM just published swing data (it’ll be exciting, I promise you), and one of the marquee metrics for swings on Statcast is called a “Blast.” It has nothing to do with the Blasts herein. As far as I’m concerned, Statcast is the ledger of record, so I will brainstorm new terminology for this so it doesn’t create confusion… although changing the name of my metric might create confusion for the four people or whatever who use it. Anyway, just a PSA for y’all. These are not the same blasts!

I’ve heard (read) a lot of hullabaloo about “not all barrels are equal.” Hullabaloo or not, it’s true; although barrels capture exit velocity (EV) and launch angle (LA) combinations that produce optimal wOBAcon (weighted on-base average on contact) results, the Statcast metric is defined broadly enough to include absolute blasts alongside somewhat-pedestrian hard hits within the same grouping.

The algorithm used to classify barrels is not publicly available (edit: an anonymous tipster alerted me that it, indeed, is available! I think I reverse-engineered it correctly just by sight…), but one can reverse-engineer it easily enough. Here’s a plot of all barrels since the start of the 2017 season.

Given the scatterplot, the formula is most likely as follows:

if EV < 97.5 mph, then barrel = no
if LA > 25.5° and LA < 30.5°, then barrel = yes
if LA < 25.5° and (25.5 – LA) < (EV – 97.5), then barrel = yes
if LA > 30.5° and ((LA – 30.5) * 2) < ((EV – 97.5) * 3), then barrel = yes
if EV > 97.5 mph but none of these apply, then barrel = no

“Not all barrels are equal” takes on its meaning once you convert the above scatterplot to a heatmap. I set the low end of the color legend artificially high to show the contrast between barrels that are relatively productive versus those that are massively productive:

Read the rest of this entry »


A Needed Update on Launch Angle Tightness

As the de facto purveyor of launch angle tightness (or launch angle consistency, both terms that I use interchangeably), it is important I relay to you significant developments related to launch angles in general in 2020. Let the record show I am merely the messenger and Connor Kurcon, whose name graces these pages (or at least my pages) quite often these days, is forever my muse.

In 2020, Major League Baseball instituted its new pitch-tracking (and also ball- and player-tracking) system, Hawk-Eye. You can read about its merits here, among them being its alleged ability to “more comprehensively [track] the full flight of the ball”:

Furthermore, if the ball leaves the field of view of all 12 cameras (as can happen on high pop-ups and fly balls), the system can then reacquire the ball later in its trajectory as gravity pulls it back into the view of one or more cameras.

Hawk-Eye was expected to track more than 99% of all BBE, a significant upgrade from the previous system. Many approached the claim with skepticism. Turns out, the claim may be legit.

Kurcon noticed Hawk-Eye all but ruined the year-to-year consistency of launch angle tightness. Consistency is now inconsistent! Specifically, launch angle consistency values (calculated as the standard deviation of launch angle) have nearly universally grown larger in 2020. For the purposes of launch angle consistency, higher is worse, so it gives the appearance (if you’re looking at players individually and not at the larger picture) that a lot of players cratered a bit during the spring season. And it’s not only because of the shortened season, although the season’s length does contribute partly to the discrepancy:

Read the rest of this entry »


The Near-Immediate Usefulness of Max EV

Maximum exit velocity (max EV) measures a player’s hardest-hit ball, typically measured within a single season and compared against other players. Our Mike Podhorzer has documented its leaders and laggards. Rob Arthur, one of baseball’s best public analysts and whom I admire greatly, wrote intelligently on the importance of max EV as a projection-buster back in 2018. Max Freeze (real name) blends extremely hard hits (114+ mph) with launch angle to look for possible power breakouts ahead of 2020.

It has been established (by Al Melchior and me, in fact) that max EV, while an effective indicator, is not the or even a superior indicator of hitter power.

That’s not to say max EV is useless, by any means. It is altogether a different breed of metric than, say, barrel rate (Barrel%, either per plate appearance [PA] or per batted ball event [BBE]) or average exit velocity (EV), both to which fantasy baseball analysts refer much more often. The latter two, and many others, are rate metrics that need large sample sizes to become reliable — or, in common parlance, to “stabilize.” (More on that here, from our former and beloved Eno Sarris.)

Meanwhile, max EV is not a rate or average but a singular data point. It can happen at any moment in time — including the very first batted ball of a hitter’s season. This makes it an intriguing addition to the ol’ tool belt insofar as it could become “reliable” (not necessarily in the statistical sense) much sooner than would barrels or EV. Potentially, we could use max EV loosely as a leading indicator of where a hitter’s barrel rate, average EV, or even weighted on-base average on contact (wOBAcon) might eventually settle.

So: what are the merits of max EV?

Read the rest of this entry »


wPDI & CSW: Strikeout Rate

Introduction

This is the fourth article in my wPDI vs. CSW series. You can catch up by reading the first three articles – on called strikes, whiffs and residuals.

Here is a quick summary of some of the basics of wPDI & CSW from this series:

Last year, I developed the Weighted Plate Discipline Index (wPDI) framework, whereby all pitches can be classified into six different outcomes as follows:

wPDI: Classifying the 6 Pitching Outcomes
Outcome Outcome Outcome Outcome Outcome Outcome
A B C D E F
Zone? Out of Zone Out of Zone Out of Zone In Zone In Zone In Zone
Swing? Swung On Swung On No Swing Swung On Swung On No Swing
Contact? No Contact Contact Made No Swing No Contact Contact Made No Swing

Each outcome is then assigned a weight, or an index. A% through F% are the percent of pitches thrown in each outcome. The general formula for wPDI, the Weighted Plate Discipline Index is given as:

wPDI = IndexA * A% + IndexB * B% + IndexC * C% + IndexD * D% + IndexE * E% + IndexF * F%

wPDI can generate an all-in-one sortable metric used to evaluate pitchers. The plate discipline framework may be tailored to mimic (or to correlate to) various measures of deception or effectiveness.

In the first three articles of this series, we developed indices for wPDI to approximate the PitcherList metric, CSW. The Called Strikes + Whiffs (CSW) statistic was featured in last year’s FSWA Research Article of the Year by Alex Fast, and is defined as:

Called Strikes + Whiffs
Total Pitches

We separately tacked the called strikes and whiffs components, and landed on the following wPDI equation to represent CSW: Read the rest of this entry »


wPDI & CSW: Residuals

Introduction

This is the third article in my series – wPDI & CSW. You can catch up by reading the first two articles – on called strikes and whiffs – found here and here.

Here is a quick recap of what we have covered so far:

In this series, we are looking at the PitcherList metric, CSW and how it relates to my plate discipline framework, wPDI. Last year’s FSWA Research Article of the Year by Alex Fast featured CSW, which is defined as:

Called Strikes + Whiffs
Total Pitches

With the Weighted Plate Discipline Index (wPDI) framework, all pitches are classified into six different outcomes as follows:

wPDI: Classifying the 6 Pitching Outcomes
Outcome Outcome Outcome Outcome Outcome Outcome
A B C D E F
Zone? Out of Zone Out of Zone Out of Zone In Zone In Zone In Zone
Swing? Swung On Swung On No Swing Swung On Swung On No Swing
Contact? No Contact Contact Made No Swing No Contact Contact Made No Swing

Each outcome is then assigned a weight, or an index. A% through F% are the percent of pitches thrown in each outcome. The general formula for wPDI, the Weighted Plate Discipline Index is given as:

wPDI = IndexA * A% + IndexB * B% + IndexC * C% + IndexD * D% + IndexE * E% + IndexF * F%

Read the rest of this entry »


wPDI & CSW: Whiffs

This is the second article of my series – wPDI vs. CSW. For those new to either metric, I will quickly catch you up. [The opening article can be found here.]

In last year’s FSWA Research Article of the Year, CSW Rate: An Intro to an Important New Metric, Alex Fast of PitcherList examines his site’s pitching statistic, CSW. The short and simple formula for CSW is defined as follows:

Called Strikes + Whiffs
Total Pitches

Independently, I came up with the concept of Weighted Plate Discipline Index (wPDI). With wPDI, we ask just three questions, or three binary events for every pitch:

  1. Was the ball thrown in the strike zone?
  2. Was the ball swung on?
  3. Did the batter make contact with the ball?

Every pitch can then be classified into 6 possible pitching outcomes based on the above. The definition of each outcome is as follows:

wPDI: Classifying the 6 Pitching Outcomes
Outcome Outcome Outcome Outcome Outcome Outcome
A B C D E F
Zone? Out of Zone Out of Zone Out of Zone In Zone In Zone In Zone
Swing? Swung On Swung On No Swing Swung On Swung On No Swing
Contact? No Contact Contact Made No Swing No Contact Contact Made No Swing

Each outcome is then assigned a weight, or an index. The formula for wPDI, the Weighted Plate Discipline Index is then given as:

wPDI = IndexA * A% + IndexB * B% + IndexC * C% + IndexD * D% + IndexE * E% + IndexF * F%

A% through F% are the percent of pitches thrown in each outcome, and the indexes are linear multipliers to obtain the aggregated, sortable metric.

What CSW has most in common with wPDI, is that it shares the same denominator – Total Pitches. That being the case, we can attempt to use the wPDI framework to express the PitcherList metric. CSW is rooted in Baseball Savant data, while wPDI is fed by FanGraphs figures. By exploring the similarities and differences between the metrics, we can also uncover some great nuggets of understanding.

Read the rest of this entry »


wPDI & CSW: Called Strikes

Introduction

Last year’s FSWA Research Article of the Year, CSW Rate: An Intro to an Important New Metric, was awarded to Alex Fast of PitcherList. In his article, Alex presents the pitching statistic, CSW – a metric which was originally coined and created by Nick Pollack in 2018. As cited in the author’s article summary, CSW is more predictive than Swinging Strike Rate (SwStr%), and is more descriptive than Whiff Rate (Whiff%).

The short and simple formula for CSW is defined as follows:

Called Strikes + Whiffs
Total Pitches

I enjoy elegant formulae. Sure – wOBA, wRC+ and the like are extraordinary metrics in their own right, but they are not the simplest to jot down. CSW is plain, simple, easy to understand, and nicely predictive.

Coincidentally, and unknowing of CSW, I came up with the concept of wPDI back in 2018. I then published my first works of the plate discipline framework on April 2, 2019. The original article was entitled Introducing: Weighted Plate Discipline Index (wPDI) for Pitchers, and can be found here.

What jumped out to me immediately upon reading Fasts’s article – was that the two metrics have something very in common. CSW and wPDI both share the very same denominator – Total Pitches. The base of both of our metrics are identical. Both utilize the very same sample size, both stabilize just as quickly, and both describe baseball through the very same lens – the pitch.

As a quick reminder of how wPDI works, every pitch can be classified into 6 possible pitching outcomes.

Read the rest of this entry »


Mining for … Hell, Anything

Playing Time Leaders

Mookie Betts leads the league with 21 plate appearances, but here are some available hitters who have been racking up the playing time.

  • David Fletcher (18 PA): He leads the league in hits (8) and is hitting (.533/.556/.667), so he’s not moving off the top spot for a while. Also, it helps that he’s qualified at four positions (SS/OF/2B/3B)
  • Niko Goodrum (18 PA): He’s leading off, hitting OK (.250/.333/.563), and qualified at three positions (2B/SS/OF). He should at least be a bench bat in all formats.
  • Shed Long Jr. (18 PA), Evan White (18 PA), J.P. Crawford (17 PA), and Kyle Lewis (18): The Mariners offense has been turning itself over and all the hitters have benefited. There are some interesting extreme small samples happening with 13 of 18 Kyle Lewis’s plate appearances ending with a strikeout, walk, or home run. Also, Crawford leads the league in walks (5) and triples.
  • Freddy Galvis (17 PA): The Reds middle-infield options are drying up. Galvis is being forced into everyday at-bats with a .929 OPS to start the season.

Read the rest of this entry »


League-wide Batted Ball Changes

About every month, podcast mate Rob Silver pours out his undying love and affection for the Rockies Charlie Blackmon. I’m not as much of Blackmon fan but this comment got me thinking.

Blackmon definitely hit the ball harder last year but so did everyone else with MLB’s juiced ball. Even with noisy data, Rob was right and Blackmon exceeded expectations.
Read the rest of this entry »


Just a moment...