Py DS_Engineer Lab Report #08
Amy Lin edited this page Jul 29, 2017 · 3 revisions
The data source is the 2016 national Popular Baby Names data from the Social Security Administration.
The occurrences of each character are counted, and the characters and their counts are then split into two parallel lists. These two lists are the parameters used to generate a random series of characters weighted by occurrence: the more often a letter shows up in the names, the higher its chance of being selected.
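The counting and weighted sampling steps could be sketched roughly as follows. This is a minimal illustration, not the lab's actual code: the function names `letter_weights` and `random_letters` are hypothetical, the short `sample_names` list is a hand-typed stand-in for the real data file, and lowercasing the names before counting is an assumption.

```python
import random
from collections import Counter


def letter_weights(names):
    """Count every letter across the names and return two parallel lists."""
    counts = Counter()
    for name in names:
        # Assumption: case is ignored and non-letters are skipped.
        counts.update(ch for ch in name.lower() if ch.isalpha())
    letters = list(counts.keys())
    weights = [counts[ch] for ch in letters]
    return letters, weights


def random_letters(letters, weights, length, rng=random):
    """Draw `length` letters, each weighted by its occurrence count."""
    return "".join(rng.choices(letters, weights=weights, k=length))


# Hypothetical stand-in for the names read from the data file.
sample_names = ["Emma", "Olivia", "Noah", "Liam"]
letters, weights = letter_weights(sample_names)
print(random_letters(letters, weights, length=4))
```

`random.choices` samples with replacement, so a frequent letter can appear several times in one draw, which matches the idea that common letters are selected more often.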
The nltk package is used to check whether the generated series of letters is a real word. If it is not, the program draws another series of letters and repeats until the result is a real word.
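The retry loop described above might look like the sketch below. It is written against a plain set of known words so that it runs without any downloads. In the lab that set would presumably come from nltk, for example `set(nltk.corpus.words.words())` after `nltk.download("words")`, but which nltk resource was actually used is not stated, so treat that as an assumption. The function name `generate_real_word` and the `max_tries` cap are also hypothetical additions. The cap only guards against looping forever when no real word is reachable.

```python
import random


def generate_real_word(letters, weights, length, vocabulary,
                       max_tries=100_000, rng=random):
    """Redraw weighted random letters until the result is in `vocabulary`.

    Returns the first matching word, or None if `max_tries` draws all fail.
    """
    for _ in range(max_tries):
        candidate = "".join(rng.choices(letters, weights=weights, k=length))
        if candidate in vocabulary:
            return candidate
    return None


# Hypothetical toy inputs; real weights would come from the name counts.
toy_vocabulary = {"ab", "ba"}
print(generate_real_word(["a", "b"], [3, 1], 2, toy_vocabulary))
```

Because most random letter sequences are not words, the number of redraws grows quickly with word length, so short lengths are the practical case for this approach.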