Understanding the Elements of Spotify Song Success

Author

Perry Lampertius

Introduction

Music plays a huge role in culture and daily life, and with platforms like Spotify and Apple Music, it’s more accessible than ever. Millions of people stream music every day, but not every song gets the same amount of attention. Some tracks go viral and hit the top of the charts, while others stay under the radar. This made me wonder—what actually makes a song successful on a streaming platform like Spotify? Is it the tempo, how danceable it is, how energetic it sounds, or maybe even what day it was released? Do these kinds of musical and platform-related features really affect how popular a song becomes?

As someone who enjoys music and regularly follows what’s trending on charts and playlists, I wanted to look into which features make a song more likely to become a hit. I think understanding these patterns could be helpful not only for music fans but also for artists, producers, and marketers who want to know what drives a song’s success in today’s streaming world.

To explore this, I chose a dataset that tracks the performance and characteristics of songs across major streaming platforms, with a focus on Spotify. This dataset includes information like stream counts, playlist and chart appearances, and audio features such as tempo, energy, and mood. My goal is to analyze these variables and see which ones are most strongly connected to popularity. Can we find any trends or clues that explain why some songs take off while others don’t? That’s what I’ll be exploring in this project.

Dataset Overview

This dataset, (click here to download), offers a detailed snapshot of what drives song popularity across major music streaming platforms, including Spotify, Apple Music, Deezer, and Shazam. It brings together both quantitative performance metrics—such as total streams, playlist appearances, and chart positions—and a wide range of musical features like tempo (BPM), energy, danceability, valence (mood), and more technical aspects such as key, mode, and acousticness. Additionally, it includes contextual information like release dates and artist count, allowing for analysis over time or by artist collaboration scale.

By combining musical characteristics with platform-specific performance data, the dataset provides a unique opportunity to explore not only which types of songs become popular, but also why they might succeed. For example, are high-energy songs more likely to be featured in popular playlists? Do slower tempos dominate certain seasons or genres? Does the number of artists on a track correlate with higher chart performance?

This dataset can be used to examine trends in music consumption, identify the traits of viral or commercially successful tracks, and even inform recommendations for artists and producers looking to optimize their sound for streaming platforms. It’s also valuable for academic research, marketing insights, and anyone interested in the intersection of music, data science, and digital media trends. In an era where streaming data plays a significant role in shaping the music industry, datasets like this one offer a window into the mechanisms of musical success in the digital age.

Key Variables in the Dataset

Basic Track Info

  • track_name: Title of the song
  • artist(s)_name: Name(s) of the performer(s)
  • artist_count: Number of credited artists
  • released_year, released_month, released_day: Release date (year, month, day)

Platform Performance

  • streams: Total number of streams (Spotify)
  • in_spotify_playlists, in_apple_playlists, in_deezer_playlists: Playlist count by platform
  • in_spotify_charts, in_apple_charts, in_deezer_charts, in_shazam_charts: Chart appearances per platform

Musical Features

  • bpm: Tempo (beats per minute)
  • key: Musical key (e.g., C, D#)
  • mode: major key and minor key

Audio Characteristics

  • danceability_%: Danceability score
  • valence_%: Positivity or happiness of the song
  • energy_%: Intensity level
  • acousticness_%: Degree of acoustic sound
  • instrumentalness_%: Likelihood of having no vocals
  • liveness_%: Likelihood the song was performed live
  • speechiness_%: Amount of spoken words

Summary Table of Statistics

Summary Table of Song Statistics
Variable Statistic Value
streams Mean 12,112,503,461.40
streams Min 2,762.00
streams Max 11,053,756,970,173.00
streams SD 358,050,104,389.57
bpm Mean 122.54
bpm Min 65.00
bpm Max 206.00
bpm SD 28.06
danceability_% Mean 66.97
danceability_% Min 23.00
danceability_% Max 96.00
danceability_% SD 14.63
valence_% Mean 51.43
valence_% Min 4.00
valence_% Max 97.00
valence_% SD 23.48
energy_% Mean 64.28
energy_% Min 9.00
energy_% Max 97.00
energy_% SD 16.55
acousticness_% Mean 27.06
acousticness_% Min 0.00
acousticness_% Max 97.00
acousticness_% SD 26.00
instrumentalness_% Mean 1.58
instrumentalness_% Min 0.00
instrumentalness_% Max 91.00
instrumentalness_% SD 8.41
liveness_% Mean 18.21
liveness_% Min 3.00
liveness_% Max 97.00
liveness_% SD 13.71
speechiness_% Mean 10.13
speechiness_% Min 2.00
speechiness_% Max 64.00
speechiness_% SD 9.91

The dataset features a wide mix of music, covering a broad spectrum of styles, tempos, and moods. Some tracks are fast-paced and high-energy, clearly designed for dancing or workout playlists, while others lean more acoustic and relaxed, likely intended for quieter, more introspective listening. This musical diversity makes the dataset rich for analysis, allowing comparisons across different genres and emotional tones.

One of the most striking aspects is the dramatic difference in stream counts. While some songs have only a few thousand streams, others have reached into the billions. This shows that music streaming success is far from evenly distributed—a small group of massively popular songs captures the vast majority of listener attention. The high standard deviation in stream counts further confirms this imbalance, highlighting how just a few viral or globally successful tracks can heavily skew the data. These outliers are likely boosted by factors like playlist placements, artist fame, and algorithmic promotion.

Looking at the musical features, there’s significant variation in metrics like BPM (tempo), energy percentage, and danceability. This opens the door for deeper exploration—such as clustering songs by genre, mood, or even by their likelihood of appearing on curated playlists. Despite the overall diversity, a common pattern emerges: most songs in the dataset are vocal-heavy and high in energy, while purely instrumental tracks are quite rare. This suggests that the dataset is tilted toward modern, mainstream pop music that’s crafted to be emotionally engaging and instantly catchy—music made to grab attention quickly in a fast-moving digital landscape.

In short, the dataset offers a vivid snapshot of today’s popular music culture: bold, loud, energetic, and emotionally charged, with a strong commercial focus on what resonates with large audiences on streaming platforms.

Comparison of Streaming Data with Billboard Top 100 Rankings

After analyzing the dataset and identifying key variables that influence a song’s popularity based on stream count, I would like to validate whether the current Top 100 Billboard rankings on Spotify align with the insights from the graphs created above. To do this, I will scrape data from the website https://kworb.net/spotify/, specifically collecting the daily Top 5 rankings from several countries over a two-week period. This timeframe will provide sufficient data to analyze which artists appear most frequently and how their popularity varies across different countries.

I have chosen the following countries for analysis: United States, United Kingdom, Brazil, Germany, and France. These countries were selected because they represent large Spotify audiences with diverse musical tastes, providing a broad perspective on global streaming trends.

Artist Distribution in Top 10

# A tibble: 7 × 2
  Artist                                       appearances
  <chr>                                              <int>
1 Alex Warren - Ordinary                                 5
2 Benson Boone - Beautiful Things                        5
3 Billie Eilish - BIRDS OF A FEATHER                     5
4 Lady Gaga - Abracadabra                                5
5 Lady Gaga - Die With A Smile (w/ Bruno Mars)           5
6 ROSÉ - APT. (w/ Bruno Mars)                            5
7 Teddy Swims - Lose Control                             5

The table shows 11 different songs and artists, and each one shows up exactly five times in the dataset. That means every song was popular in five different countries, but no one song or artist appeared more than the others. Even artists like Billie Eilish and Lady Gaga, who had more than one song on the list, still only showed up five times each. So, no one really stood out as being the most popular overall. This tells me that music tastes were pretty evenly spread out, and people in different countries were listening to a good mix of songs—not just a few big global hits.

Total Streams for Top 5 Artists

The chart shows the total streams for the top five artists, and Jennie’s song “Like Jennie” is way ahead of the rest—almost three times more streams than the second artist. That kind of lead really makes her stand out in the dataset. The other artists still did well, but their numbers are a lot lower, which shows that Jennie’s song got way more attention overall.

This big gap could mean Jennie’s track had a stronger global reach—maybe it got more playlist placements, more promotion, or just really clicked with people across different countries. Whatever the reason, “Like Jennie” clearly comes out on top and shows a level of popularity that the others didn’t quite match.

Conclusion

After going through all the data, one thing is super clear—there’s no exact recipe for what makes a song go viral on Spotify. Features like energy, BPM, or even the month it drops definitely matter, but they’re not a guarantee. A track can check all the boxes and still flop, while another song that breaks all the “rules” might end up everywhere. That tells me it’s not just about how a song sounds—it’s also about timing, marketing, who the artist is, and maybe even just being in the right place at the right time.

One of the biggest things that stood out to me was how uneven streaming success is. A few songs rake in billions of streams, while most barely scratch the surface. Jennie’s “Like Jennie” was a perfect example of this—it totally dominated the top 5 and showed how one song can take off way more than the rest. But on the flip side, there were also artists like Billie Eilish and Lady Gaga whose songs were more evenly spread out across countries, showing that music taste can be really diverse depending on where you are.

At the end of the day, the data shows that streaming trends aren’t just about catchy beats or technical features. They’re shaped by a mix of cultural trends, platform algorithms, listener habits, and how well a song connects with people across different places. This project helped me look past the charts and really understand what’s driving the music we see (and hear) on repeat.