Taylor Swift and Billboard Comparison

Final Project

Author

Meredith Briggs

Intro to the Project

With the popularity of the Eras Tour, I wanted to take a deeper look into Taylor Swift’s music and look for trends over the years. I also wanted to see how Taylor’s popularity was reflected on the Billboard Top 100.

Below are the packages I will use to manipulate the data throughout this document.

Taylor Swift

Taylor Swift is a pop artist who has 14 albums. She recently ended her Eras Tour, which honored the music from all of her albums. I wanted to take a deeper look into her discography to see how her music might have changed over time.

The Spotify API was used to gather data on every Taylor Swift song. Below is a data dictionary of the Spotify variables for each song.

Data Dictionary

Variable         Description
track_id         Unique track identifier
track_name       Name of the track or song
album_id         Unique album identifier
album_name       Name of the album
duration_ms      Length of the track in milliseconds
time_signature   Indicates beats per measure
danceability     Measure from 0.0 to 1.0 representing likelihood to dance based on musical components
energy           Measure from 0.0 to 1.0 representing a perceptual measure of intensity and activity
loudness         Measure of how loud a song is, in Loudness Units relative to Full Scale (LUFS), adjusted down to -14
explicit         Use of explicit words (TRUE/FALSE)
key_mode         Combination of key_name and mode_name
popularity       Measure of song popularity from 0 to 100, with larger numbers being more popular
artist_id        Unique artist identifier
artist_name      Name of the artist

The Billboard Top 100

The Billboard Top 100 is a list of the top 100 most popular songs on the radio. I want to see how Taylor Swift has appeared on this chart over the past year.

To scrape the data, I used the Billboard Top 100 website (https://www.billboard.com/charts/hot-100/) and set my user agent to a personal identifier. For each entry, I scraped the song, the artist, the current position on the chart, the position in the previous week, the number of weeks on the chart, the peak position, and the week of the chart.
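The scraping step described above might look roughly like the sketch below. It assumes the rvest package (the text does not say which scraping package was used), and the HTML snippet, the class names (`chart-row`, `position`, `title`, `artist`), and the helper names `parse_chart` and `fetch_chart` are all hypothetical stand-ins; the real page uses different markup, so the selectors would have to be adapted after inspecting it.

```r
library(rvest)

# Hypothetical markup standing in for two chart rows. The real Billboard
# page is structured differently; these class names are placeholders.
sample_page <- minimal_html('
  <ul class="chart-row">
    <li class="position">1</li>
    <li class="title">Song A</li>
    <li class="artist">Artist A</li>
  </ul>
  <ul class="chart-row">
    <li class="position">2</li>
    <li class="title">Song B</li>
    <li class="artist">Artist B</li>
  </ul>
')

# Turn one parsed page into a data frame with one row per chart entry.
parse_chart <- function(page) {
  rows <- html_elements(page, ".chart-row")
  data.frame(
    position = as.integer(html_text2(html_element(rows, ".position"))),
    song     = html_text2(html_element(rows, ".title")),
    artist   = html_text2(html_element(rows, ".artist")),
    stringsAsFactors = FALSE
  )
}

# Fetch a live page while identifying yourself through the user agent.
# Defined here for illustration only; it is not called in this sketch.
fetch_chart <- function(url, contact) {
  chart_session <- session(url, httr::user_agent(contact))
  parse_chart(read_html(chart_session))
}

chart <- parse_chart(sample_page)
print(chart)
```

Keeping the parsing separate from the fetching makes it possible to check the selectors against a saved copy of the page before sending any requests to the site.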

Data Preparation

After scraping the data, I want to be able to use billboard_week as an actual date with the lubridate package. This means I have to clean the data by removing the “week of” string and converting the rest of the string into a date.
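A minimal sketch of this cleaning step is shown below. The two example strings are made up for illustration, and the month-day-year layout is an assumption about how the scraped text looks; if the real strings use another order, a different lubridate parser such as `ymd()` or `dmy()` would be needed.

```r
library(lubridate)

# Illustrative values only; the real scraped strings may be formatted differently.
billboard_week <- c("Week of January 4, 2025", "Week of December 28, 2024")

# Remove the leading "week of" text (any capitalization) and extra spaces.
week_text <- trimws(sub("^\\s*week of\\s*", "", billboard_week, ignore.case = TRUE))

# Parse the remaining month-day-year text into Date values.
chart_date <- mdy(week_text)

print(chart_date)
```

Once the column is a Date, it can be sorted, filtered to the past year, or used directly on a plot axis.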

Comparison of Top 10 Songs

From the Billboard analysis, I found the top 10 Taylor Swift songs among radio listeners. Keep in mind that the people who listen to the radio can differ demographically from those who pay for and exclusively use a Spotify Premium account. Therefore, I will compare this list to Taylor’s top 10 songs on Spotify. This comparison will show which of her songs are currently the most popular with each of these audiences.

Data Preparation

First, I will assign the track ID from the Taylor data set to the matching songs in the Billboard Top 100 data set.
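One way to do this matching is sketched below in base R. The data frames hold placeholder rows, the Billboard column names `song` and `chart_position` are assumptions, and `normalize_title` is a hypothetical helper. Matching on lowercased, trimmed titles is a simplification; titles that differ in punctuation or version labels would still need manual attention.

```r
# Placeholder data standing in for the two real data sets.
taylor <- data.frame(
  track_id   = c("id_a", "id_b", "id_c"),
  track_name = c("Song A", "Song B", "Song C"),
  stringsAsFactors = FALSE
)
billboard <- data.frame(
  song           = c("song b ", "Song C", "Song D"),
  chart_position = c(3L, 7L, 12L),
  stringsAsFactors = FALSE
)

# Hypothetical helper: compare titles without case or stray whitespace.
normalize_title <- function(x) tolower(trimws(x))

# For each Billboard song, find the row of the matching Taylor track (NA if none).
match_index <- match(normalize_title(billboard$song),
                     normalize_title(taylor$track_name))

# Copy the track ID across; unmatched songs keep NA.
billboard$track_id <- taylor$track_id[match_index]

print(billboard)
```

Rows left with a missing `track_id` are worth checking by hand, since they are either songs by other artists or titles spelled differently in the two sources.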

Then, I will create variables for the numeric ranking and for the origin of each Top 10 position (Billboard or Spotify).
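The ranking and origin variables could be built along the lines below. The helper `add_rank`, the placeholder rows, and the choice to rank Billboard songs by a `peak_position` column are assumptions for illustration; the text does not state which measure defines the Billboard top 10.

```r
# Hypothetical helper: keep the top n rows by a score column, then add a
# numeric rank and a label saying which source the ranking came from.
add_rank <- function(df, score_col, origin, decreasing = TRUE, n = 10) {
  ordered <- df[order(df[[score_col]], decreasing = decreasing), , drop = FALSE]
  ordered <- head(ordered, n)
  ordered$rank <- seq_len(nrow(ordered))
  ordered$origin <- origin
  rownames(ordered) <- NULL
  ordered
}

# Placeholder rows; the real data would contain many more songs.
spotify_songs <- data.frame(
  track_name = c("Song A", "Song B", "Song C"),
  popularity = c(80, 95, 88),
  stringsAsFactors = FALSE
)
billboard_songs <- data.frame(
  track_name    = c("Song B", "Song C"),
  peak_position = c(5L, 2L),
  stringsAsFactors = FALSE
)

# Higher popularity is better on Spotify; a lower peak position is better on the chart.
spotify_top   <- add_rank(spotify_songs, "popularity", "Spotify", decreasing = TRUE)
billboard_top <- add_rank(billboard_songs, "peak_position", "Billboard", decreasing = FALSE)

print(spotify_top)
print(billboard_top)
```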

Finally, I will join the tables so that the two rankings are easier to compare.
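A sketch of the join is shown below, using base R's `merge()` with placeholder rows. The column names `spotify_rank` and `billboard_rank` are assumptions, and the choice of a full join (`all = TRUE`) is one option among several; an inner join would keep only the songs that appear in both top 10 lists.

```r
# Placeholder top-10 tables, already ranked and sharing track_id and track_name.
spotify_top <- data.frame(
  track_id     = c("id_a", "id_b"),
  track_name   = c("Song A", "Song B"),
  spotify_rank = 1:2,
  stringsAsFactors = FALSE
)
billboard_top <- data.frame(
  track_id       = c("id_b", "id_c"),
  track_name     = c("Song B", "Song C"),
  billboard_rank = 1:2,
  stringsAsFactors = FALSE
)

# Full join: keep every song from either list, with NA where it is absent.
comparison <- merge(spotify_top, billboard_top,
                    by = c("track_id", "track_name"), all = TRUE)

# Order by track_id so the output is stable and easy to read.
comparison <- comparison[order(comparison$track_id), ]

print(comparison)
```

With both ranks side by side, a song that is popular with one audience but missing from the other list shows up as a row with an NA in one of the rank columns.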