Home And Road Exit Velocity

This topic is an iffy area, and I want to say that up front. It is complicated and controversial.  I don’t have all of the facts, and I don’t have access to the best available data.  I’m just going to share, as best I can, the situation as I currently understand it. All of the numbers taken here are from Baseball Savant.  I’ve done nothing to manipulate the numbers in any way.  I say this because, as some of you may know, I generally run an algorithm to clean up and manipulate numbers from Baseball Savant prior to using them for analysis. Today is not one of those days, I’m just going with the raw information.

Trackman Radar, as I understand it, specializes in measuring velocity.  That is its bread and butter, and other metrics measured by Trackman are added on top of, or derived from, this most basic starting point. By all accounts, Trackman is very good radar, and these velocity measurements are very reliable.

Velocity Fluctuations

That said, there are a number of pretty large home/road exit velocity fluctuations worth pointing out.  I’m not exactly sure what may cause these differences. It could be quality of pitching or batting. If your team has great pitchers, you’d expect them to give up fewer hard hit balls, so maybe your home park exit velocity numbers would be suppressed.

Your batters could provide a similar input.  A good offensive team may have overall higher exit velocity contributions, or a bad offense may have below average. I stress *may*. A good offense isn’t necessarily defined by high exit velocity.  Look at guys like Mike Trout and Joey Votto.  Neither of these guys produce particularly high exit velocities, but both are great hitters. Likewise, Manny Machado has had elite exit velocity this season, but with pedestrian numbers.

These things are complicated, there are a lot of moving parts. A great pitching staff and a great offense may cancel each other out, for example. However, we need to talk about these home/away exit velocity numbers.

Team Home/Away Exit Velocity
2015 2016 2017
Team Home Away Δ Home Away Δ Home Away Δ
ARI 87.2 86.0 1.2 88.1 86.8 1.3 88.3 85.8 2.5
ATL 85.0 85.2 -0.2 87.0 85.7 1.3 86.3 86.0 0.3
BAL 88.3 86.8 1.6 88.5 88.4 0.1 88.6 87.5 1.1
BOS 87.8 87.3 0.5 88.4 88.2 0.3 87.7 86.4 1.3
CHC 86.8 86.4 0.4 86.9 86.4 0.5 86.8 85.8 1.0
CIN 85.1 85.5 -0.4 84.8 85.2 -0.4 83.9 84.3 -0.4
CLE 87.0 86.6 0.5 87.7 87.1 0.6 86.7 88.2 -1.5
COL 87.1 86.5 0.7 87.3 86.2 1.1 85.3 85.6 -0.4
CWS 86.5 86.5 -0.1 86.9 87.5 -0.6 86.0 86.7 -0.7
DET 87.4 86.8 0.6 88.5 87.6 0.9 88.7 87.5 1.2
HOU 87.1 87.8 -0.7 86.8 87.5 -0.7 86.2 87.7 -1.4
KC 86.9 86.6 0.3 86.9 86.7 0.2 87.2 86.8 0.4
LAA 86.7 86.1 0.6 87.2 87.1 0.1 85.9 86.4 -0.5
LAD 87.2 86.7 0.5 88.5 87.4 1.0 87.5 87.1 0.5
MIA 86.4 86.1 0.3 86.9 86.6 0.3 85.1 86.5 -1.4
MIL 86.4 86.2 0.2 86.7 86.9 -0.1 86.3 85.3 1.0
MIN 86.8 86.5 0.3 87.1 86.9 0.2 87.5 87.1 0.4
NYM 86.9 88.1 -1.2 86.3 87.4 -1.1 86.2 86.3 -0.2
NYY 87.1 87.0 0.2 86.9 86.9 0.0 87.3 87.2 0.1
OAK 86.5 86.4 0.1 87.0 86.7 0.3 88.1 86.9 1.1
PHI 85.1 85.8 -0.7 86.4 86.5 -0.1 87.1 85.9 1.2
PIT 87.1 86.4 0.7 87.0 86.3 0.7 85.4 84.5 0.9
SD 85.8 86.4 -0.6 85.5 86.3 -0.8 83.6 85.1 -1.5
SEA 88.5 88.5 0.1 88.2 87.7 0.5 86.9 86.9 0.0
SF 86.8 86.1 0.7 86.5 85.9 0.5 85.5 85.8 -0.3
STL 86.5 86.9 -0.5 87.9 86.8 1.1 85.3 86.0 -0.8
TB 86.7 86.9 -0.2 88.2 87.9 0.4 86.1 87.0 -0.9
TEX 87.0 86.4 0.6 87.1 88.1 -1.0 86.5 86.5 0.1
TOR 87.8 87.1 0.8 88.2 87.9 0.3 86.5 86.9 -0.3
WSH 87.3 87.7 -0.5 87.9 87.7 0.2 88.0 87.0 1.0

 

This is a big table, I’m sorry about that.  Especially for mobile users. These are exit velocities generated by the team’s offense in home games versus away games. I’ve highlighted the seasons where road exit velocity is greater than home exit velocity for a given team.

First to note, Arizona. A few weeks ago I wrote an article addressing the addition of the Humidor to Chase Field, and in doing so I addressed a physics based model that claims the low air density and humidity in Arizona should net a roughly 4mph increase in exit velocity in Chase Field when compared to road games. However, in 2015 and 2016 we see a difference of 1.2 and 1.3 respectively. This season, the difference is up to 2.5, much closer to what we would expect.

Arizona is a good standard candle for this sort of analysis, since we expect to see a certain difference between home and away numbers. Unfortunately, that difference is unrealized. Which is unsatisfactory. There are possible explanations.

  1. The physics based model is wrong.
  2. Unaccounted for variables.
  3. Exit velocity numbers are wrong.
  4. Small sample size problem, and the differences haven’t had enough time to emerge.

Personally, I feel number 4 is the least likely possibility. I find it hard to believe that an expected difference this large would go unnoticed to this degree after two and a half seasons. Exit velocity seems to sort itself out after a few hundred batted balls, so I find it unlikely that this is due to some statistical anomaly.

The other three options each have their own roadblocks and challenges. The physics model could be wrong, but it has withstood testing over time and it has made accurate predictions in the past. Trackman appears to be very good at measuring velocity numbers, and it has also withstood its own testing. Unaccounted for variables may be the most alluring, if you were to believe both the physics model and velocity numbers, but what could these factors be? The batters eye? Is the pitching somehow superior in Arizona? It is hard to justify.

One thing for sure, though, our standard candle is throwing up a smoke sign. It is warning us about something, but we don’t know exactly what it might be.

Ballpark Bias?

It could be, perhaps, that there are some sort of velocity measuring bias in each stadium. This is very controversial, because there is little evidence to support case. And if it were true, what mechanisms would drive the bias?

In the past, people have pointed towards slight differences in the location of the radar within the stadium. This explanation seems plausible, but there isn’t much evidence to support the conclusion. Think of it this way, if there is a certain location within each field that is ideal for measuring batted balls, don’t you think the engineers installing the radar would do their best to place the radar within that ideal region of the park?

Perhaps your stadium cannot place the radar in the ideal location for whatever reason, and the engineers would have to push towards the extreme edges of the acceptable locations within the ballpark. Maybe it should be in a given location give or take 30 feet, so you place it 25 feet away. That 25 feet is within your 30 foot acceptable radius, but it is towards the extreme end.  In this case, if you knew your radar weren’t ideal, wouldn’t you keep an eye on the numbers over time, and try to correct issues that arise?

This sort of inherent stadium bias seems a bit farfetched, but it may also be the best explanation for the observed differences to date. In my analysis of Statcast data, I have included a stadium based velocity correction for each batted ball, and recently I have gone through and updated these definitions.

Park Adjustments
Park Adjustment Park Adjustment
ARI 1.36 MIL -0.13
ATL 0.28 MIN -0.28
BAL 0.33 NYM 0.37
BOS 0.59 NYY 0.04
CHC -0.03 OAK 0.02
CIN -0.10 PHI -0.33
CLE -0.40 PIT 0.51
COL -0.03 SD 1.28
CWS 0.12 SEA -0.12
DET -0.73 SF -0.34
HOU 0.98 STL 0.10
KC -0.33 TB 0.41
LAA 0.26 TEX -0.20
LAD 0.19 TOR 0.14
MIA 0.03 WSH 0.10

 

You may notice that the Diamondbacks, Padres and Astros have the largest adjustments. I’ve already addressed the Diamondbacks as the standard candle here, and after this adjustment their home/away difference increases to 3.87, roughly around where we expected to see it. The Astros and Padres are a little bit more difficult to explain. I don’t know why they require such a large adjustment to equalize their home and road performance.

Again, I want to stress that I am not fully convinced these sorts of adjustments are truly necessary. Or correct. I am sure there are people who can devise far more rigorous examinations of the data and come to better conclusions and probably better adjustments.

My goal here isn’t to provide some be all, end all explanation and solution. Far from it, the more I think about this problem, the more confused I feel.  But, as I have read articles about the increasing home run rates, I haven’t seen very much addressing this particular issue.

Um, Why Is This In Fantasy?

I am sure you may be wondering why I’ve written this in the fantasy section of the website. There are two primary reasons. First, I feel fantasy players stand to disproportionately benefit from an understanding of the Statcast system. Second, I know many of you are using Statcast derived stats on a regular basis.

Here’s an example for how these velocity adjustments could be helpful. I have run the exact same algorithm for predicting home runs two times. First, the it was run on the raw data, the second time it was run on the velocity adjusted data. Check out the results below.

Home Run Prediction
Team xHR Before xHR After HR Δ Before Δ After
ARI 83.5 85.5 90 -6.5 -4.5
ATL 61.0 61.4 66 -5.0 -4.6
BAL 97.8 91.8 88 9.8 3.8
BOS 72.4 65.8 63 9.4 2.8
CHC 92.3 91.9 90 2.3 1.9
CIN 87.1 91.0 94 -6.9 -3.0
CLE 74.0 75.3 74 0.0 1.3
COL 79.1 80.1 82 -2.9 -1.9
CWS 77.5 76.2 71 6.5 5.2
DET 99.2 88.6 79 20.2 9.6
HOU 84.3 92.7 103 -18.7 -10.3
KC 75.1 77.2 77 -1.9 0.2
LAA 68.2 69.2 75 -6.8 -5.8
LAD 80.8 77.8 77 3.8 0.8
MIA 69.7 74.1 79 -9.3 -4.9
MIL 88.1 88.8 92 -3.9 -3.2
MIN 87.2 79.8 79 8.2 0.8
NYM 80.1 86.2 94 -13.9 -7.8
NYY 105.7 103.3 105 0.7 -1.7
OAK 90.9 89.6 92 -1.1 -2.4
PHI 69.1 68.9 65 4.1 3.9
PIT 63.9 60.8 61 2.9 -0.2
SD 63.9 75.6 80 -16.1 -4.4
SEA 74.4 71.8 70 4.4 1.8
SF 48.8 49.7 49 -0.2 0.7
STL 78.7 77.4 64 14.7 13.4
TB 93.2 93.0 103 -9.8 -10.0
TEX 88.6 91.5 86 2.6 5.5
TOR 87.7 86.4 90 -2.3 -3.6
WSH 93.2 91.0 98 -4.8 -7.0
RME 72.9 28.5

 

The Root Mean Square error for individual batters, as opposed to teams, is 1.34 prior to the adjustment, and 1.09 after. The velocity adjustments help these algorithms create a more descriptive stat. Whether this may be more predictive is another story, and I do not have an answer for you.

Of course, this runs into the issue of overfitting data. That is a problem we’re going to wrestle with for a long time now that Statcast has become a mainstream tool in baseball. There are two camps that I know of regarding this problem.

First, there are those people who firmly believe Statcast should be used to describe what has happened in games to the best of its ability. All efforts should be spent increasing the accuracy of the system, and efforts to normalize data could wash out real changes and differences throughout the course of a game, season, or career.

Second, there are those who wish to normalize the data across the league and focus on the predictive nature of these stats.

None of This Makes Me Comfortable

I feel like I am hopelessly lost between the two extremes, and maybe that is the worst place to be. Maybe it would be better to be more firmly in one camp over the other. The first camp will adamantly poo-poo all of these velocity adjustments, and I can already hear them yelling at me. I want to say to those people:  I know, I agree, this makes me uncomfortable, too.  But, at the same time, our standard candle in Arizona tells me something needs to be adjusted. And when I make the adjustment, my expected stat algorithms are more descriptive.

But, at the same time, these changes make my skin crawl. I know they aren’t firmly rooted in a logical argument. It feels more like a ‘just make it work’ jerry rig. The academic rigor is stripped away and replaced with a hunch.

Of course, this hunch isn’t new.  Baseball Prospectus wrote about this last year, and they released their own park adjustments. Their adjustments are much more rigorous than mine, but they are also out of date and obsolete. They were calculated using only the first few months of the 2016 season, and a lot has changed since then. The summer of 2016, for one. We know the summer heat makes the ball fly.

At the end of the day, this is meant to be a warning. There is a problem here, and I don’t know the solution. I don’t think anyone knows the solution, and I would hesitate to trust any claims otherwise. Just keep it in the back of your mind, every time you quote an exit velocity, or an exit velocity derived stat like xwOBA, xOBA, Barrels, VH%, xBABIP, or whatever else. Ask yourself, what role could these park factors play in the result? As you can see in the home run totals I showed before, very small changes in exit velocity can create rather large changes in team and player totals. These are just home runs, too. Singles, doubles and triples are impacted as well. Batting average, slugging, everything.





Andrew Perpetua is the creator of CitiFieldHR.com and xStats.org, and plays around with Statcast data for fun. Follow him on Twitter @AndrewPerpetua.

11 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
Pig.Pen
6 years ago

With regards to Arizona, wouldn’t the division they play in be a factor? They play a lot of road games out West, where humidity is typically lower, including a decent number of games each year in Colorado. Obviously, we need more data to help with the SSS here, but perhaps a team that plays most of its away games in markedly different climates would be a better barometer for humidity’s effect on the ball. Fascinating article though.