Choose one of David Robinson’s tidytuesday screencasts, watch the video, and summarise. https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ
You must follow the instructions below to get credits for this assignment.
Predicting Horror Movie Ratings
October 22, 2019
Hint: What’s the source of the data; what does the row represent; how many observations?; what are the variables; and what do they mean?
The data being analyzed consists of 3,328 horror movies. These movies all contain factors such as genre which includes Horror/Comedy, Horror/Drama, etc. Other factors include cast, rating, director and release date.
Hint: For example, importing data, understanding the data, data exploration, etc.
Dave approached the data first by extracting things such as release date to make the data significant. He found the ratings are most significant after 2012. Then Dave tested the hypothesis to see if two factors were correlated in effecting movie rating. He found weak correlations between movie budget and movie rating. Movie ratings and reviews showed little variation. Boxplot shows in categorical variables. Drama/Horror & Drama/Mystery are a little higher rated. The lasso regression model used all in one table. Lasso-model uses “lambda.min” to show where lamda is and what is effected when lamda is at this value.
I find this video pretty relevant in some senses to the star wars movie activity that we did in class at the beginning of the semester. I believe that activity definitely has some relation to the video I watched for this assignment.
What Dave found significant is that the lasso regression model shows movies with genres animation, mystery, and drama to have a positive correlation with movie rating. However, overall David expressed that the findings are not signifcantly reliable and it is not the best way to predict movie ratings.
Something interesting I liked about the analysis was seeing how all factors could tie into the lasso regression model and potentially have a correlation with movie rating. I found that the Southwestern/Indian term “Kannada” had a positive correlation with horror movie ratings, which I thought was intriguing.