Code
library(tidyverse)
top_albums <- read_csv(
"https://jsuleiman.com/datasets/Rolling_Stones_Top_500_Albums.csv",
locale = locale(encoding = "ISO-8859-2", asciify = TRUE))The dataset I used comes from Rolling Stone Magazine’s list of the 500 Greatest Albums. This dataset contains information on 500 albums and includes 7 variables: Number (rank of the album), Year (year the album was released), Album (name of the album), Artist (the artist who created the album), Genre (main genre of the album), and Subgenre (additional genres the album fits into). These variables give us a comprehensive look at which albums have made an impact over the years and the characteristics they share.
For my analysis, I decided to focus on two main aspects of the dataset: the distribution of albums by genre and how the number of albums has changed over different decades. I created two visualizations to help make these comparisons. The first visualization is a bar chart to compare the number of albums in each genre, and the second is a dot plot to show how the number of albums changed across different decades. To create these visualizations, I used Gestalt principles to make the data easier to understand and highlight important patterns. These visualizations help us understand trends in music history, such as which genres were most popular and how influential albums have shifted over time.
library(tidyverse)
top_albums <- read_csv(
"https://jsuleiman.com/datasets/Rolling_Stones_Top_500_Albums.csv",
locale = locale(encoding = "ISO-8859-2", asciify = TRUE))library(tidyverse)
library(ggplot2)
library(readr)glimpse(top_albums)Rows: 500
Columns: 6
$ Number <dbl> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18…
$ Year <dbl> 1967, 1966, 1966, 1965, 1965, 1971, 1972, 1979, 1966, 1968, 1…
$ Album <chr> "Sgt. Pepper's Lonely Hearts Club Band", "Pet Sounds", "Revol…
$ Artist <chr> "The Beatles", "The Beach Boys", "The Beatles", "Bob Dylan", …
$ Genre <chr> "Rock", "Rock", "Rock", "Rock", "Rock, Pop", "Funk / Soul", "…
$ Subgenre <chr> "Rock & Roll, Psychedelic Rock", "Pop Rock, Psychedelic Rock"…
genre_count <- dplyr::count(top_albums, Genre, sort = TRUE)
ggplot(genre_count, aes(x = reorder(Genre, n), y = n, fill = Genre)) +
geom_bar(stat = "identity") +
coord_flip() +
labs(
title = "Number of Albums by Genre",
x = "Genre",
y = "Count of Albums"
) +
theme_minimal() +
theme(legend.position = "none")The bar chart shows that Rock is the most common genre with 249 albums in the dataset. This is much higher than any other genre. The second most popular genre is Funk/Soul which has 38 albums, followed by Hip Hop with 29 albums. Other genres like Electronic, Blues, Folk, and Jazz, have fewer than 20 albums each.
Another interesting pattern is that some albums belong to more than one genre. For example there are albums labeled as Rock and Pop or Electronic and Rock. However these mixed genres are much less common than single genres like Rock or Hip Hop.
I chose a bar chart because it is one of the best ways to compare different categories. Since the goal is to show how many albums each genre has, a bar chart makes it easy to see which genres are the most and least common.
I also sorted the bars from largest to smallest. This helps the reader quickly see that Rock is the most popular genre, while others like Jazz or Classical have very few albums. The bars are also flipped sideways to make the genre names easier to read.
I used different colors for the bars to help separate the categories. The design is simple, with no extra lines or labels, so it is easy to focus on the most important information.
One of the strengths of this chart is that it clearly shows the biggest trends. It is very easy to see that Rock is the most common genre, while others are much smaller. The simple design also makes it easy to understand at a glance.
However, there are some areas for improvement. Some genres are combined, like “Rock and Pop” or “Funk/Soul and Blues.” This can make it hard to know if an album is counted in both genres or just one. A stacked bar chart could help show how many albums belong to more than one genre.
Another small problem is that some genre names are very long, which can make them harder to read. A possible solution would be to shorten some names or use abbreviations. Also, adding percentages next to the bars could help people understand the data better.
The bar chart does a good job of showing how different genres compare. It clearly shows that Rock is the most popular genre, while other genres have much fewer albums. Some small changes, like grouping genres better and shortening labels, could make it even better.
top_albums <- top_albums %>%
mutate(Decade = floor(Year / 10) * 10)
decade_count <- top_albums %>%
count(Decade)
ggplot(decade_count, aes(x = Decade, y = n)) +
geom_point(size = 4, color = "blue") + # Dots for each decade
geom_line(group = 1, color = "blue") + # Connect the dots with a line
labs(
title = "Number of Top Albums by Decade",
x = "Decade",
y = "Number of Albums"
) +
theme_minimal()The dot plot shows that most of the top albums came from the 1960s and 1970s. The 1970s had the highest number of albums (186 albums), followed by the 1960s with 105 albums. This means that a lot of the most important and famous music came from these two decades.
After the 1970s, the number of top albums started to drop. The 1980s had 85 albums, and the numbers kept getting lower in the 1990s (72 albums) and 2000s (40 albums). The 2010s only had 2 albums, showing that fewer recent albums have been included in this ranking.
The 1950s had only 10 albums, which makes sense because rock and pop music were just starting to become popular back then.
I picked a dot plot because it makes it easy to compare the number of albums across different decades. The dots help show patterns over time, and the line connecting them makes the trend clear. The Gestalt principle of proximity helps because decades with similar numbers of albums are placed closer together, making it easy to see the changes.
A bar chart could have worked too, but a dot plot looks simpler and less cluttered. It also helps show the smooth rise and fall of albums across decades. The minimalist style keeps the focus on the data instead of adding too many extra details.
This dot plot is a good way to show how the 1960s and 1970s were the most important decades for music. The trend is very clear, and it is easy to see how the number of top albums has changed over time.
However, there are a few problems. The 2010s have very few albums, which might be because the dataset focuses on older, well-known albums. A small note explaining this could help make it clearer. Also, the chart only looks at decades, so it does not show smaller changes that might have happened within each decade. If the data were grouped by 5-year periods, it might show even more interesting details.
This dot plot does a great job of showing how music changed over time. It helps explain why the 1960s and 1970s were the most important decades, and it makes the drop in later years very clear. Some extra details could make it even better, but the main message is easy to understand.
The two visualizations work together to tell a complete story about the top 500 albums. The bar chart shows which music genres had the most albums, while the dot plot shows how the number of albums changed over time. Together, they help explain not only what kinds of music were most popular, but also when the most influential albums were made.
From these charts, we can see that rock music dominated the list, and most of the top albums came from the 1960s and 1970s. This suggests that those decades were the most important for music history. However, it also raises some new questions. Why did the number of top albums drop after the 1970s? Is it because music styles changed, or because newer albums haven’t had enough time to be considered classics? Also, since rock is the most common genre, does that mean other genres were underrated?
These visualizations help us understand music history, but they also show that there is more to explore. We could look at other factors, like which artists had the most albums on the list or how music trends changed within each decade.