
Conversion Bugs

Open mwarning opened this issue 2 months ago • 0 comments

Hi,

I needed a free first name database for https://github.com/mdecorde/BabyName and found http://www.heise.de/ct/ftp/07/17/182/. Then I found this repo and started comparing its output. I spotted a few bugs in this repo:

  1. A few lines have empty names (they start with ;).
  2. The frequency value mapping in the Rakefile is kinda buggy: in "'A' => 1, 'B' => 2, 'C' => 4, 'D' => 5" nothing maps to 3.
  3. Names with spaces seem to be split into multiple names, which causes a bunch of duplicate name entries, e.g. "Maria de Jesus" is split into the names "Maria", "de" and "Jesus".
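
The three points above could be checked with something like the following rough sketch. It is not the repo's actual code: the helper names (`mapping_gaps`, `empty_name_lines`, `duplicate_names`) are made up for illustration, and the sample rows assume a `;`-separated output with the name in the first field, which is only inferred from the empty-name lines starting with `;`. Only the mapping literal is taken from the Rakefile as quoted above.

```ruby
# Mapping literal as quoted from the Rakefile in point 2.
FREQUENCY_MAP = { 'A' => 1, 'B' => 2, 'C' => 4, 'D' => 5 }.freeze

# Integers missing between the smallest and largest mapped value.
def mapping_gaps(map)
  values = map.values
  return [] if values.empty?
  (values.min..values.max).to_a - values
end

# First ';'-separated field of a row, stripped ("" when the field is missing).
def name_field(line)
  line.split(';', 2).first.to_s.strip
end

# 1-based numbers of the rows whose name field is empty (point 1).
def empty_name_lines(lines)
  lines.each_index.select { |i| name_field(lines[i]).empty? }.map { |i| i + 1 }
end

# Non-empty names that occur in more than one row (symptom of point 3).
def duplicate_names(lines)
  names = lines.map { |line| name_field(line) }.reject(&:empty?)
  names.group_by(&:itself).select { |_, rows| rows.size > 1 }.keys
end

# Hypothetical sample rows; the real column layout may differ.
sample = ['Maria;F;3', ';F;1', 'de;M;1', 'Maria;F;2', 'Jesus;M;4']

p mapping_gaps(FREQUENCY_MAP) # => [3]
p empty_name_lines(sample)    # => [2]
p duplicate_names(sample)     # => ["Maria"]
```

This only detects the symptoms in the generated output. Fixing points 1 and 3 would mean changing how the conversion extracts the name field from the source data, which is not shown here.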

mwarning, Oct 23 '25 15:10