1 pt for correct reasoning for generalizability – Answer should discuss whether random sampling was used. Learners might discuss any reservations, those should be well justified.
1 pt for correct reasoning for causality – Answer should discuss whether random assignment was used.
title_type: New variable should be called feature_film with levels yes (movies that are feature films) and no (2 pt)genre: New variable should be called drama with levels yes (movies that are dramas) and no (2 pt)mpaa_rating: New variable should be called mpaa_rating_R with levels yes (movies that are R rated) and no (2 pt)thtr_rel_month:
oscar_season with levels yes (if movie is released in November, October, or December) and no (2 pt)summer_season with levels yes (if movie is released in May, June, July, or August) and no (2 pt)Conduct exploratory data analysis of the relationship between audience_score and the new variables constructed in the previous part
Develop a Bayesian regression model to predict audience_score from the following explanatory variables. Note that some of these variables are in the original dataset provided, and others are new variables you constructed earlier:
feature_filmdramaruntimempaa_rating_Rthtr_rel_yearoscar_seasonsummer_seasonimdb_ratingimdb_num_votescritics_scorebest_pic_nombest_pic_winbest_actor_winbest_actress_winbest_dir_wintop200_boxComplete Bayesian model selection and report the final model.
Pick a movie from 2016 (a new movie that is not in the sample) and do a prediction for this movie using your the model you developed and the predict function in R.
A brief summary of your findings from the previous sections without repeating your statements from earlier as well as a discussion of what you have learned about the data and your research question. You should also discuss any shortcomings of your current study (either due to data collection or methodology) and include ideas for possible future research.