1. Introduction

1.1 Brief Summary of Project Proposal

Currently I am doing a part-time internship with Archbishop Moeller High School’s baseball team. In this internship I am a part of the baseball data analytics team. Me and other interns show up to games and practice collecting data about pitching and hitting. The data set that I will be using is all the pitching data we have collected this year. It is called the “PreseasonPens.csv.” This data set has 17 total columns, and 4,741 pitches recorded. There aren’t many missing values because we are tasked to clean the sheet every single week. In my final project I will investigate how successful hitters are in different pitch counts. Below I explain the different variables and values in this CSV.

1.2 Dataset Name

  1. Dataset Structure

2.1 Rows

2.2 Columns

  1. Variable Details

3.1 Player Information

3.2 Pitch Details

3.3 Count & Impact Metrics

3.4 Play Outcome

3.5 Additional Information