Part 1: DATA DESCRIPTION
Total Movies: 749
Non-Sequel: 675
Sequel: 74
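The split above can be reproduced with `table()`. This is a minimal sketch, assuming the 0/1 coding of `Sequel` used in this section. The vector is rebuilt from the reported counts rather than read from `sequel.df`, which is not included here.

```r
# Rebuild the Sequel indicator from the reported counts:
# 675 non-sequels coded 0 and 74 sequels coded 1.
# This vector stands in for sequel.df$Sequel.
sequel <- rep(c(0, 1), times = c(675, 74))

# Counts per category, labelled for readability
counts <- table(factor(sequel, levels = c(0, 1),
                       labels = c("Non-Sequel", "Sequel")))
print(counts)

# Share of each category among all movies
shares <- round(prop.table(counts), 3)
print(shares)
```

On the real data, the same call would be applied to `sequel.df$Sequel` directly.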
BoxOffice: The total box office collection of a movie, measured in millions of INR.
Sequel: Dummy variable used to indicate whether a movie is a sequel or not.
# Box plot of box office collection, split by sequel status
par(cex = 1.5, mar = c(4, 4, 1, 1))
boxplot(BoxOffice ~ Sequel, data = sequel.df,
        main = "Box Plot of Box Office Collection",
        xlab = "Sequel (No = 0, Yes = 1)",
        ylab = "Box Office Collection (Millions of INR)")
   Sequel BoxOffice_Average
1:      0             357.9
2:      1             740.9
# Mean box office collection (millions of INR) by sequel status
library(data.table)
dt <- as.data.table(sequel.df)
dt[, .(BoxOffice_Average = round(mean(BoxOffice), 1)), by = Sequel]
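For a sense of scale, the two group means reported above can be compared directly. The sketch below uses only the rounded figures from this section (357.9 and 740.9 million INR, with 675 and 74 movies), so the overall mean it recovers is approximate. The variable names are illustrative.

```r
# Rounded group means (millions of INR) and group sizes reported in this section
mean_box <- c(non_sequel = 357.9, sequel = 740.9)
n_movies <- c(non_sequel = 675, sequel = 74)

# Absolute and relative gap between sequels and non-sequels
gap   <- mean_box[["sequel"]] - mean_box[["non_sequel"]]
ratio <- mean_box[["sequel"]] / mean_box[["non_sequel"]]

# Overall mean as a size-weighted average of the two group means
overall <- weighted.mean(mean_box, w = n_movies)

cat("Gap (millions of INR):", round(gap, 1), "\n")
cat("Ratio:", round(ratio, 2), "\n")
cat("Approximate overall mean:", round(overall, 1), "\n")
```

With these inputs the ratio is about 2.07, so sequels in this sample average roughly twice the box office collection of non-sequels.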
Actor: Dummy variable indicating the popularity of the male actor.
Actress: Dummy variable indicating the popularity of the female actor.
Producer: Dummy variable indicating whether the producer is experienced.
Producer = 1 if the producer had produced more than 10 movies before that movie.
Producer = 0 otherwise.
He produced “Dhoom” and its sequels, Dhoom 2 and Dhoom 3. Before these, he had produced more than 10 films, such as Mohabbatein, Dilwale Dulhania Le Jayenge, Dil To Pagal Hai, and Mere Yaar Ki Shaadi Hai. Hence we consider him an experienced producer.
Director: Dummy variable indicating whether the director is experienced.
Director = 1 if the director had directed at least 10 movies before that movie.
Director = 0 otherwise.
He directed the movie “Krrish”, which was a sequel to “Koi Mil Gaya”. Before directing Krrish, he had directed more than 10 movies, such as Kaho Na Pyar Hai, Khoon Bhari Maang, Karan Arjun, Khudgarz, and Koi Mil Gaya. Hence we consider him an experienced director according to the above criterion.
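The two experience dummies follow the same pattern: a count of prior films is compared against a threshold. The sketch below assumes hypothetical columns `PriorProduced` and `PriorDirected` (the source does not name them) and uses invented counts. As defined above, the producer rule is strict (more than 10) while the director rule includes 10.

```r
# Toy data: the prior film counts are invented for illustration only
movies <- data.frame(
  Movie         = c("A", "B", "C"),
  PriorProduced = c(3, 10, 12),   # hypothetical column
  PriorDirected = c(3, 10, 12)    # hypothetical column
)

# Producer = 1 only when strictly more than 10 prior films were produced
movies$Producer <- as.integer(movies$PriorProduced > 10)

# Director = 1 when 10 or more prior films were directed
movies$Director <- as.integer(movies$PriorDirected >= 10)

print(movies)
```

In this toy data, movie "B" has exactly 10 prior films on both counts, so it gets Producer = 0 but Director = 1.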
Rating: Mean rating of the movie out of 5, given by the audience.