Skip to content

A quick script to grok the last 132 years of baby naming data.

Notifications You must be signed in to change notification settings

binarybana/namesearch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

NameSearch

NameSearch is a quick script to mine the US Social Security Administration's dump of baby names given over the last 132 years in the United States.

Useful for searching for names if you want one that's not too uncommon, but not too popular either.

You'll need to download the data yourself and then run python play.py <names.zip> to unpack the data and put it in a compressed binary for easier loading.

Current Output

For the lazy, this is some of the kinds of things you can do with the data:

$ python play.py
Most 10 popular names over the last 132 years:
             count         year
name                           
Mary       4106851  1945.500000
Elizabeth  1572095  1945.500000
Patricia   1569225  1947.500000
Jennifer   1457441  1964.478723
Linda      1449318  1945.500000
Barbara    1431765  1945.500000
Margaret   1234391  1945.500000
Susan      1119336  1945.500000
Dorothy    1104599  1945.500000
Sarah      1045750  1945.500000

'Least' 10 popular names over the last 132 years:
          count  year
name                 
Gregorio      5  1951
Matelynn      5  2003
Matilee       5  2010
Aviara        5  2002
Mathie        5  1919
Mathison      5  1997
Matiah        5  1993
Maticia       5  1970
Matildia      5  1917
Curtney       5  1986

About

A quick script to grok the last 132 years of baby naming data.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published