If you peruse the Kaggle March Marchine Learning Forum, you will see that a numer of additional data sets have been attached. I’ve downloaded them from Kaggle and uploaded them to our server. You can load them into your project using:

load('/home/rstudioshared/shared_files/data/march_madness/MM_extra_data.RData')

This will load several object into your workspace: SBRLines, TeamCoaches, TeamSpellings, TeamGeog, TourneyGeog, massey_ordinals, SeedRoundSlots, PredictionTracker and TeamConferences. You may want to explore these using the View() and summary() functions, read about them on the Kaggle Fourm and think of ways that they might be of use in building your prediction model.