Introduction

A recommendation system is a type of artificial intelligence algorithm that uses information about users’ interests and history of consumption (material or digital goods) to suggest new items of interest. This type of system is most useful when the total number of items is too large for users to find by other means, such as browsing or using search engines, in a reasonable amount of time.

Below, I analyze the YouTube video recommendation system using the Scenario Design method.1

Scenario design analysis

How the YouTube recommendation system works

I didn’t reverse engineer YouTube because the methodology of its recommendation system has been described in detail.4 As shown below, YouTube’s recommendation system uses a two-stage neural network that first generates a set of candidate videos from YouTube’s entire video collection, and then narrows the candidates by assigning each video a score and ranking them.

Candidate generation uses a user’s history of activity to identify a subset of several hundred videos that are considered relevant to the user. Relevance is assessed by item-item collaborative filtering, a filtering technique that uses similarities between items (videos) that similar customers/users have selected to provide recommendations. The mathematical basis and computational implementation of the filtering are quite complex. Briefly, the filtering is a type of classification problem, which is solved using a Softmax classifier to determine the optimal probabilities for each class (video) given the user’s features (eg, search/watch history, age, location); the classes with the highest probabilities are the most relevant videos to the user.

The ranking step uses a weighted logistic regression model to assign a score to each candidate video using a set of features that describe the user and the video (eg, how old it is, its title). This model is trained on “positive” videos (those that are clicked), which are weighted by observed watch time, and “negative” videos (those that are not clicked), which have unit weights (ie, weight = 1). In test or real-world datasets, several dozen of the highest scoring videos are recommended to the user.

In addition to these models, YouTube uses efficient algorithms and other computational “tricks” to enable very fast calculations (milliseconds) at extremely large scale (billions of users and videos).

Ideas to improve YouTube’s recommendation capabilities

Conclusions

YouTube’s recommendation system uses a two-stage neural network with a combination of item-item collaborative filtering and logistic regression. The ability of this system to generate recommendations from billions of videos in only a few milliseconds is quite impressive. Nevertheless, it could be improved by providing users the ability to control the amount of video churn and by balancing the proportion of videos with positive and negative emotions in its recommendations.

References

  1. Temkin BD. Scenario Design: A Disciplined Approach to Customer Experience. Forrester Research, 2004. https://www.scribd.com/document/109502765/An-Approach-to-Customer-Experience

  2. Larsen R. Understanding the YouTube business model and revenue streams. Untaylored, February 19, 2024. https://www.untaylored.com/post/understanding-the-youtube-business-model-and-revenue-streams

  3. Zote, J. 25 YouTube stats marketers should know in 2024 [Updated]. Sprout Social, March 20, 2024. https://sproutsocial.com/insights/youtube-stats/

  4. Covington P, Adams J, Sargin E. Deep neural networks for YouTube recommendations. RecSys ’16: Proceedings of the 10th ACM Conference on Recommender Systems. September 2016. http://dx.doi.org/10.1145/2959100.2959190

  5. Erdvin et al. Level of user satisfaction with the current YouTube recommendation system. Procedia Comp Sci. 2023;216:442-452. https://doi.org/10.1016/j.procs.2022.12.156