Musical Data Exploration

Sing My Name

If your name is Baby, then seemingly every song is about you. But what if your name isn’t Baby?

The film Baby Driver may be about bank heists, but the musical score carries both the action and the movie’s entire plot. According to the director, the scenes were loosely built around the soundtrack, instead of the other way around.

Flirty banter about songs even leads to the main character’s “meet-cute” moment. In the scene, Debora was lamenting the fact that her name was never used in songs, unlike her sister, Mary. Little did she know, she was complaining to someone named Baby. Upon learning his name, she laughed and said that he had both her and Mary beat, since every song is about Baby. She’s not wrong. Here are just 5 examples.

Baby, baby, where did our love go?”

Where Did Our Love Go

The Supremes

1964

Baby, just remember I gave you my heart”

Forever Your Girl

Paula Abdul

1989

Baby, I’m not always there when you call”

Always on Time

Ja Rule ft. Ashanti

2001

Baby, how you do that, make a grown man cry?”

OMG

Usher ft will.i.am

2010

Baby, I’m the one who put you up there.”

Without Me

Halsey

2018

There are hundreds more songs just like those played above. In an analysis of over 15,000 hit songs in the US, 1,832 of them referenced someone referred to as Baby. The next closest name, Jesus, was found in only 142 songs.

Of course, it’s worth noting that in many song-writing contexts, it is less likely that artists are singing to someone literally named Baby, and are more than likely using it as a pet name or term of endearment. But, Baby is a real name. Practically speaking, there’s no way to separate out references to someone named Baby from someone simply being called Baby.

Semantics aside, Debora’s point is an interesting one. Which names do artists sing about? There are many ways to answer this question and I decided to explore as many as I could. The sections of this story are self-contained and can be read in any order, so feel free to use the navigation on the left-hand side to jump around.

How were these numbers calculated?

This analysis includes over 15,000 songs that made it onto the Billboard Hot 100 at least once between its inception in the 1950’s and April 2019. Top songs for which we could not find the lyrics are not included in this analysis. Words were considered “names” if they met the following criteria: the word had been used as a name in the US at least 5,000 times since 1950 (according to the Social Security Administration) and the word is used as a proper noun in the sentence. A total of 5,316 of the analyzed songs contained at least one instance of a name. More details in the Methods section of this article.

You know how some songs are just absolute earworms and they get stuck in your head for days? Well, some of the songs in this section make me wonder if names can be earworms too. If your name is Mickey, Scotty, Bennie, or (if you’re willing to ignore that this is actually a drug-reference) Molly, you’re probably already intimately familiar with these tunes. Check out the most name-repetitive hits or search for your favorites below.

Songs with the Most Instances of the Same Name

Count of name mentions in a single song

Growing up, Mambo No. 5 was always a fun song to hear at parties, because so many of my friends were named in succession. Generally, it was a fun song to serenade them with. Though, with only 9 distinct names repeated several times, Lou Bega’s viral hit didn’t even crack the top 10 songs with the most number of names. The number one spot for that distinct honor goes to Billy Joel’s We Didn’t Start the Fire.

Songs that Contain the Most Unique Names

By count of names mentioned at least once

Some artists end up using lots of names in their songs, but spread them out across many hits. When we look at the artists who have used the most names in their songs, there are a few things worth noting. The top 5 name-dropping artists are all hip hop or rap musicians and the top 3 (Drake, Eminem, and Lil Wayne) use their own name in more songs than any other name in their repertoire. And, to be fair, since Kanye’s and Future’s (stage and legal) names are not used very frequently in the US (according to the Social Security Administration), they were excluded from the names we searched for.

Artists Who Use the Most Unique Names

By count of names that appear in a collection of songs at least once

Names mentioned in 1 3 6 9 12 songs

To the surprise of (probably) no one, “Baby” has been holding down the spot of most common name to sing about for the past 5 decades. There are also a few other interesting trends to point out here. Only a handful of the most popular names over time have been typically women’s names. This is certainly counter to my gut feeling that names are sung primarily by (presumably straight) men in love songs. We also see the rise in naming either oneself or other reputable artists becoming more common in the 2010s with mentions of Wayne, Drake, and Nicki breaking into the top 10.

Top 10 Most Common Names Per Decade

By count of songs that mention a name at least once

Back in 2014, Slate published an article about how the first names used in popular song titles disproportionately started with J. They went on to mention that compared to the actual population of the US, names that started with M, A, and J were more popular in songs than they were in society. Does that trend hold true when we look past song titles and into their lyrics? Short version: not so much. Names that start with A are much more common in society than in song lyrics, while names that start with J show up with similar frequency in both situations.

Looking at the opposite end of the word, names that end with A (e.g., Maria, Rhonda, Sara, Lisa etc.) are much less common in songs than society, whereas names that end with Y (e.g., Mary, Johnny, Jimmy, Billy etc.) are much more common in songs. This could be attributed to the fact that names in songs are used casually and thus tend to be nicknames.

Calculated as the difference between the percentage of names mentioned in at least one song that starts or ends with a letter and the percentage of names that have been used as names in the US at least 5,000 times since 1950 that start or end with that letter.

Methods

All songs used in this analysis were on the US Billboard Hot 100 list at least once since the chart’s inception in 1958. Song lyrics were obtained primarily through MetroLyrics. Many thanks to Colin Morris for providing lyric data for songs popular prior to 2017 (used in his project on repetition in pop music). Since these lyrics are user submitted, it is possibly (and even likely) that there are spelling errors that I was unable to compensate for. Over 15,000 songs that met the above criteria were analyzed, and of those, 5,195 contained a name (as defined below).

To identify names in song lyrics, I used US national-level baby name data from the Social Security Administration (downloaded from Kaggle). Since there were over 93,000 names in this dataset, I limited my search to just those names that have been used as names at least 5,000 times since the 1950’s. This resulted in 3,607 names. I then searched through the song lyrics for any instance of any name in this dataset. Using the R package udpipe, I tagged each name with a part of speech to eliminate uses of words that were used as verbs (e.g., Will the person vs. “I will do something”) or adjectives (e.g., “baby girl”). I intended to retain only names tagged as proper nouns, but since song lyrics are not always grammatically correct, part of speech tagging can be inaccurate. Thus, I also manually checked the data, removing instances of names that were not being used as proper nouns.

There were a few other manual corrections made. Last names presented as surnames (e.g., “Kennedy” in “John F. Kennedy”) were removed, but if the surname stood on its own (e.g. “I am the realest since Kennedy”) it could have been referring to a first name, so it was retained. Names that appeared to refer to a non-person (e.g., places, brands, drug references, colors, seasons, etc.) have been removed from the default view, but can be included in the analysis using the “Include Non-People” toggle in the left-pane. Double word names (e.g., Barbara Ann) were adjusted manually to count as a single entity rather than counting as “Barbara” and “Ann”.