Choose one of David Robinson’s tidytuesday screencasts, watch the video, and summarise. https://www.youtube.com/channel/UCeiiqmVK07qhY-wvg3IZiZQ

Instructions

You must follow the instructions below to get credits for this assignment.

Q1 What is the title of the screencast?

Predicting Horror Movie Ratings

Q2 When was it published?

October 22, 2019

Q3 Describe the data

Hint: What’s the source of the data; what does the row represent; how many observations?; what are the variables; and what do they mean?

The data set contains 3,328 horror movies, one per row. Each movie has variables such as genre (for example Horror/Comedy or Horror/Drama), cast, rating, director and release date.

Q4-Q5 Describe how Dave approached the analysis each step.

Hint: For example, importing data, understanding the data, data exploration, etc.

Dave first cleaned the data, for example by extracting the release date into a usable form, and found that the ratings are most informative for movies released after 2012. He then checked whether individual variables were correlated with movie rating. He found only a weak correlation between movie budget and movie rating, and movie ratings and reviews showed little variation. He used boxplots to compare ratings across categorical variables; Drama/Horror and Drama/Mystery movies were rated slightly higher. Finally, he combined all of the variables into one table and fit a lasso regression model. The model reports “lambda.min”, the value of lambda with the lowest cross-validated error, and Dave looked at which terms affect the rating when lambda is set to this value.
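To make the lasso step more concrete, here is a minimal sketch of the idea: fit a lasso model for several values of lambda, score each with cross-validation, and keep the lambda with the lowest error (the role that “lambda.min” plays). This is not Dave's code. The screencast works in R, while this sketch uses plain Python, and the feature names, effect sizes, lambda grid and data are all made up for illustration.

```python
import random

def soft_threshold(value, lam):
    """Shrink value toward zero by lam; this is what sets lasso terms to exactly 0."""
    if value > lam:
        return value - lam
    if value < -lam:
        return value + lam
    return 0.0

def fit_lasso(X, y, lam, n_iter=50):
    """Coordinate descent for (1/2n) * sum(residual^2) + lam * sum(|coef|)."""
    n, p = len(X), len(X[0])
    intercept = sum(y) / n
    coefs = [0.0] * p
    resid = [yi - intercept for yi in y]  # y - intercept - X @ coefs
    for _ in range(n_iter):
        for j in range(p):
            col = [row[j] for row in X]
            z = sum(x * x for x in col) / n
            if z == 0:
                continue
            # Correlation of feature j with the residual that ignores feature j.
            rho = sum(x * (r + x * coefs[j]) for x, r in zip(col, resid)) / n
            new_coef = soft_threshold(rho, lam) / z
            delta = new_coef - coefs[j]
            resid = [r - x * delta for x, r in zip(col, resid)]
            coefs[j] = new_coef
        # The intercept is not penalized: move it so the residuals average to 0.
        shift = sum(resid) / n
        intercept += shift
        resid = [r - shift for r in resid]
    return intercept, coefs

def cv_mse(X, y, lam, k=5):
    """Mean squared prediction error of the lasso under k-fold cross-validation."""
    n = len(X)
    total = 0.0
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        test = [i for i in range(n) if i % k == fold]
        b0, b = fit_lasso([X[i] for i in train], [y[i] for i in train], lam)
        for i in test:
            pred = b0 + sum(x * c for x, c in zip(X[i], b))
            total += (y[i] - pred) ** 2
    return total / n

# Hypothetical data: 0/1 genre flags and a rating built from assumed effects.
rng = random.Random(0)
features = ["drama", "mystery", "comedy", "noise"]
true_effects = [0.5, 0.4, 0.0, 0.0]  # assumption for illustration only
X, y = [], []
for _ in range(150):
    row = [float(rng.random() < 0.5) for _ in features]
    rating = 5.0 + sum(e * x for e, x in zip(true_effects, row)) + rng.gauss(0, 0.3)
    X.append(row)
    y.append(rating)

# Try a small grid of lambdas and keep the one with the lowest CV error.
lambdas = [0.001, 0.01, 0.05, 0.1, 0.5]
cv_errors = {lam: cv_mse(X, y, lam) for lam in lambdas}
best_lam = min(cv_errors, key=cv_errors.get)
intercept, coefs = fit_lasso(X, y, best_lam)

print("lambda with lowest CV error:", best_lam)
for name, coef in zip(features, coefs):
    print(name, round(coef, 3))
```

A larger lambda shrinks more coefficients to exactly zero, so the terms that survive at the chosen lambda are the ones the model treats as most useful for predicting the rating.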

Q6 Did you see anything in the video that you learned in class? Describe.

This video is relevant to the Star Wars movie activity that we did in class at the beginning of the semester. That activity relates closely to the analysis in the video I watched for this assignment.

Q7 What is a major finding from the analysis?

The main finding is that, in the lasso regression model, the genres animation, mystery, and drama are positively associated with movie rating. Overall, however, Dave noted that the findings are not very reliable and that this is not the best way to predict movie ratings.

Q8 What is the most interesting thing you really liked about the analysis?

What I liked about the analysis was seeing how all of the variables could feed into the lasso regression model and potentially correlate with movie rating. I found it intriguing that the term “Kannada”, a language spoken in southwestern India, had a positive correlation with horror movie ratings.

Q9 Display the title and your name correctly at the top of the webpage.

Q10 Use the correct slug.