This data contains batting statistics over almost the entire history of baseball for the MLB. This was found using the Lahman database. Lahman is a reporter who publishes raw baseball data every season. The variables used in the script are ABs and Hits, the two variables needed to calculate batting average.
This is for players with over 500 ABs and who played past the year 1930.
We see that it is pretty normal, which is about what I was expecting in terms of the distribution of hitters.
Note that the echo = FALSE parameter was added to the code chunk to prevent printing of the R code that generated the plot.