Baby Names

Baby Names

One of the most complete datasets of US names comes to us from applications to the US Social Security Administration. It is a requirement upon birth in the US that a baby be issued a Social Security Number, which is linked to a name. Data.gov released a state and national list of these names for the public to explore and I thought this would be a fun way to find interesting aspects of baby names: aspects that might predict certain names, trends in time, and geographic spaces. I also couldn't resist doing a bit of sleuting regarding my own name, so the last section is bit of self love...

If you'd like to view the dataset to explore yourself, it can be found here: Data.gov

Exploring Celebrity

I'd always heard on television and in day to day conversations that Americans in particular love to name their children after the Rich and Famous. What I wanted to understand was: if a celebrity named his/her child an uncommon or unique name would we still follow suit and see a spike in the popularity of this name afterward?

This brings me to the candidate names: Apple, Blue, Willow, Jaden, & Ryder are the names of children from A list stars Gwyneth Paltro, Jay-Z, Will Smith x 2, and Kate Hudson. Did these names start a trend because of their famous parents? After a few seaches over the database I came to these 5 charts:

The left-handed region of any of the outlined sections is the year that each celebrity named his or her child that particular name and the shaded area includes a 10 year time span thereafter. What I'd hoped to capture was a spike in the number of babies named each respective name within that range of years.

Looking closely at the charts, for the name Apple there seems to be a plausible connection between the celebrity appearance of the name and a spike in the names occurance in the chart. Directly following the name and for some time after we see many more babies named Apple. For Jaden, Ryder and Willow it is difficult to say that the appearance of the name at that date may have caused the spike in names, because the name was already in an upward trend before the appearance of the celebrity name. Though, we might wonder if the appearance of the celebrity name increased the trend as I suspect it might have for Jaden and Ryder.

Exploring Television Characters

Another interesting source of popular names I had heard about and always wanted to investigate was the names of television characters on popular shows. Could these cause a trend in baby names as well? To test this I took two of the most watched television series in the last 50 years, Friends and The Cosby Show, and pulled the names of each of the characters from IMBD. For each name in the chart, I included the length of time that the show played live on television as the shaded region.

Looking at the plots above it seems as if the only names whose popularity is plausibly connected to a Friends character are Phoebe & Chandler. There is somewhat of a delay between the debut of the show and the spike that we see in the chart, but this could be due to the fact that the show hadn't reached the height of popularity to impact the mind of Americans until that point. Looking at the others, it seems the names popularity is likely part of larger trends that were already in place before the Friends characters debuted with these names. Next, we'll take a look at the Cosby plot below.

Looking at these character names we again only see a subset of these, Rudy and Theo, as being plausibly conneted to trends in baby names. So, what is it about these particular names that catches the attention of Americans over the others? Could it be that there are certain aspects of the characters within the show that impact the popularity of the names? Could it be that well loved characters catch the attention of popular conscience, as I suspect with Theo and Rudy? It might be interesting to quanitify these sorts of questions to test them in future examinations.

High Level Examination

Looking above at some of the charts you might notice that in certain spans of time, a name can reach massive heights of popularity when before it was largely ignored, or non existant. Below I wanted to see if there were any names that were the opposite, showing popularity across generations.

Looking at the charts we can see that the names James and John appear in the top five most popular names in five of the six generations. The names William, Mary, and Michael each appear 3 times respectively. It's intersting to note that all of the most popular names of any generation in the US since 1879 are names from Christian theology. It seems that American names, like our history for the most part, are deeply impacted by our founding as a Christian settlement.

James: Long Lasting?

Given that James is one of the most popular names since 1879, I feel a bit more justified in my interest in examaining it over time and space. Given that the name has been popular for so long, it might be interesting to see a trendline within each generation of how it fared, which I created below:

Within the Greatest Generation chart there is a massive increase in the number of persons named James in the population between 1912-1915. Why might this be? It may have to do with this fellow: James W. Gerard the embassador to Germany during the years before and through the US entry into World War I. Gerard, President Wilson and the Kaiser were involved in a series of discussions regarding the potential for US intervention in the war through diplomatic channels. After being formally asked to leave Germany in 1917, Gerard wrote a book about his experiences in Germany that was made into a feature film by Warner Brothers. He chroniced the tensions he experienced handling communications between then President Wilson and the Kaiser.

Unless there is another conquering hero named James in this generation, it seems likely that the overall trend in the name James is downward. Since the last spike in 1944, new babies named James have dropped to levels approaching those seen at the turn of the 20th century.

James & Geography

I fall into the group of people who love to wonder, "what was happening when...?" and in the context of babies named James the wonder involves my year of birth and United States. I created the chart below to see in which states were there other James being born and how many.

It's probably no surprise that given how common a name James is, the prevalence of the name seems to follow the population; California, Texas, New York and Florida have the largest numbers of babies named James and also the largest populations in the United States.

Closing

This is such an interesting dataset and there are so many questions that we might answer next. Are there certain names that used to be female/male only and that are now used by multiple genders? Are there certain names that are only popular in certain states and what are they? Are there certain parts of names, vowel sounds, or constructions, that might predict whether the US will "Like" a name? Until next time...