I decided to pursue a Masters in Data Science to improve my hore picking abilities. This decision was based largely on the belief that data science techniques could provide me an information edge. Accordingly, my final project will seek to gain insight into the role distance run by each horse plays in the ultimate order of finish in a horse race.
My primary source of data for this project will be the Trakus website. Trakus provides comprehensive horse racing performance analysis. Specifically, Trakus T-charts provide horse racing enthusist segment times, distance run, distance from the rail, average velocity and beaten lengths for each sixteenth of a mile for a particular horse race. Table 1 below sets forth the 1/16th T-chart for Race 1 at Aqueduct race track on November 15, 2019.
Table 1. Sample T-Chart
My anticipated workflow for this project is consistent with Hadley Wickam’s Grammar of Data Science and is summarized below:
Potential challenges I could encounter during this project include: