firstname-database
Conversion Bugs
Hi,
I needed a free first name database for https://github.com/mdecorde/BabyName and found http://www.heise.de/ct/ftp/07/17/182/. Then I found this repo and started comparing its output. I spotted a few bugs in this repo:
- a few lines with empty names (start with ;)
- the frequency value mapping in the Rakefile looks buggy: in "'A' => 1, 'B' => 2, 'C' => 4, 'D' => 5" nothing maps to 3
- names with spaces seem to be split into multiple names, which causes a bunch of duplicate name entries, e.g. "Maria de Jesus" is split into the names "Maria", "de" and "Jesus".
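
To make these three points easier to reproduce, here is a rough Ruby sketch of checks one could run. It is not the repo's actual code: the mapping literal is copied from the snippet quoted above, while the method names, the sample rows, and the assumption that the name is the first semicolon-separated field are mine and only for illustration.

```ruby
# Mapping fragment as quoted in this issue; the real Rakefile may contain more entries.
FREQUENCY_MAP = { 'A' => 1, 'B' => 2, 'C' => 4, 'D' => 5 }.freeze

# Returns the integers missing between the smallest and largest mapped value.
def missing_frequency_values(map)
  values = map.values
  (values.min..values.max).to_a - values
end

# Returns the name field of a row, assuming it is the first ';'-separated field.
# Splitting on ';' only (never on whitespace) keeps multi-word names in one piece.
def name_of(row)
  row.split(';', -1).first.to_s.strip
end

# Returns the rows whose name field is empty (e.g. rows that start with ';').
def rows_with_empty_name(rows)
  rows.select { |row| name_of(row).empty? }
end

# Hypothetical sample rows; the columns after the name are made up.
SAMPLE_ROWS = ['Maria de Jesus;F;3', ';M;1', 'Anna;F;5'].freeze
```

With the sample data above, `missing_frequency_values(FREQUENCY_MAP)` returns `[3]`, `rows_with_empty_name(SAMPLE_ROWS)` returns `[";M;1"]`, and `name_of(SAMPLE_ROWS.first)` returns `"Maria de Jesus"` as a single name.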