Yadir Lakehal Applied Data Science Project 1 : Happiness Data Story
Happiness is a life-long pursuit. Everyone experiences some degree of short term “happy moments” in their lives and continue to seek happiness. Everyone has different ways of nurturing, feeding and growing their happiness, but how do you quantify that growth? Is there a general equation or rule for happiness that applies to everyone?
Some people have tried to find out.
Mo Gawdat, chief business officer of Google X and author of “Solve for Happy” came up with a Happiness equation that states that: “your happiness is equal to or greater than the difference between the events of your life and your expectations of how life should behave”.
Simply looking at the mathematics of this equation, you can determine that one can easily be happy through life if they have low expectations.
According to Mo, all you need to be happy is to have low expectations, but now the question becomes: how do you determine and quantify expectations? He tries to answer these questions in “Solve For Happy” in different abstract ways, but it seems like this problem needs more of a concrete approach, so I looked elsewhere.
I decided to look into a more data-driven attempt at solving for happiness:
HappyDB is a corpus of 100,000 crowd-sourced happy moments. The goal of the project is to gain a deeper understanding of the causes of happiness that can be gathered from text.
It is simply a collection of happy moments described by individuals in the past 24 hours, as seen in an example below:
In this project, I will use the HappyDB corpus dataset to get a more meaningful understanding of the moments that grow happiness.
A logical first step is to find out what people are talking about in their happy moments.
Let’s take a look at the HappyDB moments data!
After cleaning the happy moments data, combining it with demographic data and filtering out stop words that don’t hold any relevant information, I created a word cloud that displays the 200 most mentioned words in all the happy moments
FRIEND
DAY
TIME
FAMILY
HOME
These are just some of the most mentioned words in people’s happy moments.
This suggests that more than everything else, people find happiness in:
each other
their time
their families
Based on their crowd-sourced moments, scientists who collaborated on the HappyDB project have separated happiness into potential categories:
Achievement
Affection
Bonding
Enjoying Moments
Exercise
Leisure
Nature
What do we think the most popular category is?
Based on the word cloud and simple intuition, I would guess that affection and bonding would be the most popular categories! Let’s find out!
Just like I predicted, Affection is the most popular category, but it is almost equivalent to Achievement and even though my second guess bonding is mentioned about 3 times as little, it still stands at third place.
Even though it wasn’t particularly illustrated in the word cloud, this graphic indicates that people’s happiness is primarily caused by both giving (affection) and accomplishing (achievement)
It would be interesting to see if that were the truth for everyone?
The HappyDB project includes demographic data on people’s age, let’s find out what makes different age groups happy!
Before we can categorize ages, let’s look at how old everyone who contributed to the happydb project is!
Hmm..looks like most people are in their 30s. To confirm that thought, I calculated the mean and found it to be 31.8.
The International Standardized Survey Classifications group classifies age groups as:
15-24: Teens/Mid Twenties
25-34: Young Adults
35-64: Adults
65 + : Seniors
which is how I decided to separate age groups for this data analysis project.
Now, let’s plot what category is the most common amongst each age group!
You can notice from these plots that when you are younger, your “happy moments” are more focused on achievements and as you get older, they tend to shift towards affection.
This seems logical because people generally tend to either achieve what they have wanted to achieve at an older age, or they start caring less about achieving and more about connecting with the people in their lives.
Another detail you can notice from these plots is that there is a slight decrease in the mention of exercise and a slight increase in the mention of nature as you get older!
As you grow older, you tend to accept that your body cannot perform as well and you focus less on exercising but maybe as a trade off, you are more connected with your surroundings and nature. Older people who are retired also have more time to connect with nature!
There seems to be an obvious shift in what makes people happy as you get older, but let’s see if that’s illustrated through word clouds.
What do the youngest participants say the most?
What do the oldest participants say the most?
Below, you can look at the most mentioned words for teenagers to mid-twenties as well as seniors in that order
It makes sense that the words son, daughter, and husband are much more common for the senior category and that the words girlfriend and boyfriend are more common in the mid-twenties category.
You can also notice that the seniors do not necessarily have a much stronger appreciation for connections. Words that stand out are time, life, love,nice, mom,friend, time for both the young and the old!
We’ve looked at how age affects happy moments, what about marriage? Let’s look at what married and single people are saying!
Let’s plot what category is the most common amongst married couples and single participants
At first this was surprising, but single people care much more about affection and bonding!
Put a little thought into it, and it made sense! Married couples have found the affection and bonding they’ve needed, so when single participants engage in affectionate or bonding moments, they tend to associate them with happiness more!
We’ve all heard the same funny stories about marriage getting boring after a while, but what about parenthood?
Does having children affect what makes you happy?
Now, let’s plot what category is the most common amongst each age group!
This result made absolute sense – parents will often associate happiness with their children, so affection was the logical popular category for parents, and achievement remained the most popular category by far for non parents.
It was also interesting to see that non parents seem happier about bonding than parents, especially because non parents are generally younger!
This is maybe because non parents seek out bonding moments as they do not have children or (probably) a spouse yet!
As a single young adult with no children, this made me think of what would personally drive my happy moments, and more often that not, I would think of my environment affecting my happiness. The weather, the season, and the general atmosphere of where I tend to be drastically affects my mood.
Naturally, I decided to take a look at what seasons people brought up the most!
What season of the year do people associate the most with happiness?
Parsing through all the happy moments text data, I looked for which season of the year was brought up the most.
From my own experience, I tend to think of happier moments throughout the summer because the weather is generally nicer, and I take on more vacations.
Will that be the case for everyone else?
I plotted the occurence of all seasons in the text data to find out!
Just as I guessed, summer is the happiest season by a large margin. You can notice that spring, autumn, and winter are significantly less brought up, but in the order in which they are brought up makes sense, seems like the warmer the season the more it is brought up in happy moments.
This made me wonder..how does geography affect happiness?
Where are people happy?
What makes people of different parts of the world happy?
Asking myself these questions reminded me of World Happiness Report, a project ran by Columbia University’s Earth Institute, while I was an undergraduate student there. It is a project that studied happiness worldwide and ranked 156 countries by their happiness levels.
Their research, focusing on six key variables that have been found to support well-being: income, healthy life expectancy,social support,freedom, trust and generosity attempted to map out and rank happiness worldwide, as shown below.
After having read the World Happiness Report, I decided to look into what the happiest countries were happy about.
I only looked at the data submitted by participants from the top 10 happiest countries, according to the above report and analyzed what made them happy!
Additionally, I was interested in what they were saying. Using the same data, I created a word cloud illustrating the most common words mentioned by participants from the happiest countries!
#### Countries with high happiness index care about achivement!
Interesting!
Participants who are from highly ranked happy countries are largely more focused on achieving, rather than affection.
My first thought is that maybe this has something to do with personal success, income, and a wealthier lifestyle because the happiest countries all seem to be countries with high-earning populations.
To confirm that, I compared the top happy countries with the top countries with the highest median salaries and to my surprise, there were only 3 countries in common: Luxembourg, Canada, and Switzerland.
or..does it?
We don’t have any info on participant’s incomes, but we do know what they’re saying!
Money, materialism, and owning things are often associated with both happiness and unhappiness.
Similarly, generosity, sharing and giving back are often associated with happiness.
I decided to do another analysis on the happy moments data to check whether people associated owning things or giving things more with their happiness!
I gathered a list of words associated with owning such as :
“money”,“job”“bought”,“found”,“car”,“business”,“received”,“product”,“promotion” “phone” and “job”
and did the same for words associated with giving:
“friend”,“family”,“people”,“love”,“shared”,“girlfriend”,“boyfriend”,“boyfriend”,“sister”,“brother”
Only searching for my list of associated words, I was able to determine what people talked more about through the following plot:
Participants seem to be more fulfilled with giving, which makes sense considering the previous word clouds and plots from this analysis.
Affection and Achievement drive people’s happiness the most
As you get older, you start caring more about affection and bonding and less about achievements!
When you’re single, you are more driven by affection!
When you have children, your happiness is more driven by affection
Summer is the season of the year that drives happiness the most!
Countries from high happy indexes are more driven towards achievement and personal success, but materialism is not what makes people the happiest!
faith in humanity restored