Assignment Instructions:
I am going to create a visualization from Rolling Stone Magazine’s 500 greatest albums of all time (1955-2011). The columns/variables in the data set include Number (The rank of the album 1-500), Year (Release Year), Album (Album name), Artist (Recording artist), Genre (e.g., Rock, Funk, Jazz, Country, etc.), & Subgenre (e.g., Pop Rock, Folk Rock, Psychedelic Rock, etc.). I need to create a visualization in ggplot2 that is a single chart that shows the relationship of at least two categorical variables and a numeric summary.
Information on Creating the Chart:
After I loaded the top_albums dataset into RStudio, I filtered the dataset by only the Artist/Group “The Beatles”. I chose “The Beatles” for this assignment because most of their songs are my favorites of all time. I wanted to see which of their albums are in the top 500 and what genre they are classified under. The graph below makes a lot of sense to me because The Beatles are known for being in the Rock genre. It was fascinating to note that some of their albums are also considered to be “Pop” and “Stage & Screen.”
I like how the graph came out, but I wish that I could have changed the y-scale so it wasn’t as misleading because some could view the albums with the larger boxes on the graph to be better in the Top 500, but instead they are actually worse. I tried to change this with multiple techniques and referenced good sources, but in the end, it made the chart look worse. So, I stuck with the original chart, which is shown below:
Chart: