Video Game Sales This dataset contains data on 16720 games released from the 80s up until 2016.
Data Source Kaggle: https://www.kaggle.com/datasets/sidtwr/videogames-sales-dataset/data
2026-03-28
Video Game Sales This dataset contains data on 16720 games released from the 80s up until 2016.
Data Source Kaggle: https://www.kaggle.com/datasets/sidtwr/videogames-sales-dataset/data
#Load the data
gameSales <- read.csv("Video_Games_Sales_as_at_22_Dec_2016.csv")
#Remove all rows with NA values
gameSales <- na.omit(gameSales)
Major Observations
Ratings: A majority of the games seem to be between 80 and 100 Metacritic score.
Sales: Games with higher sales mostly have high ratings, suggesting that there is a relation between high ratings and sales.
Release Year: Some of the highest sales are in the later years, which is interesting as the Xbox One and PS4 both launched in 2013.
gameSales2 %>%
group_by(Platform) %>%
summarise(
Count = n(),
Mean_US = round(mean(NA_Sales),3),
Mean_EU = round(mean(EU_Sales),3),
Mean_JP = round(mean(JP_Sales),3),
Mean_Global = round(mean(Global_Sales),3),
Mean_Critic = round(mean(Critic_Score),3)
)
## # A tibble: 2 × 7 ## Platform Count Mean_US Mean_EU Mean_JP Mean_Global Mean_Critic ## <chr> <int> <dbl> <dbl> <dbl> <dbl> <dbl> ## 1 PS3 790 0.437 0.35 0.07 1.00 70.8 ## 2 X360 881 0.613 0.269 0.012 0.98 69.0
Key Points:
-Data Size: The Xbox has nearly 100 more entries than the PS3, although this does not seem to have a large effect on the means.
-US Sales: The Xbox has a higher sale count than the PS3 by about 200,000 sales, this is probably because the Xbox had a stronger presence in the US.
-JP Sales: In no surprise to me, the PS3 has a much higher game sale count, as it’s a much more popular console in Japan compared to the Xbox.
-Critics: Both consoles’ ratings hover around a 70 rating on Metacritic, showing that the game quality is about the same on average between consoles.