generatedata icon indicating copy to clipboard operation
generatedata copied to clipboard

Help wanted: remaining countries that need name data

Open benkeen opened this issue 4 years ago • 11 comments

Hey folks!

The Names data type lets you generate country-specific names to make your data set look more realistic. But to do that, we need to actually supply that data. To do that, add a names.ts file into the client/src/plugins/countries/[countryname] folder, in the same structure as the other ones.

I suggest adding a bare minimum of 100 male, female & last names, but anything up to 1000 of each. Note: the source data has to be license-free.

Remaining

  • Colombia
  • Costa Rica
  • France
  • Indonesia
  • Ireland
  • Italy
  • Mexico
  • New Zealand
  • Pakistan
  • Peru
  • Poland
  • Russia
  • South Africa
  • South Korea
  • Sweden
  • UK
  • Ukraine

Done

  • ~Australia~
  • ~Austria~
  • ~Belgium~
  • ~Brazil~
  • ~Canada~
  • ~Chile~
  • ~China~
  • ~Germany~
  • ~India~
  • ~Netherlands~
  • ~Norway~
  • ~Singapore~
  • ~Turkey~
  • ~US~
  • ~Vietnam~

benkeen avatar Nov 14 '21 01:11 benkeen

Hi Ben, attached a Dutch name listing. 300 entries, comprised of the most popular last names, first names in the Netherlands. I added separate columns for last name prefixes and alphabetization, as this can be confusing in Dutch. Let me know if this works for your purposes? Thanks!

NL names.xlsx

rvanraamsdonk avatar Nov 24 '21 10:11 rvanraamsdonk

This is terrific, thanks @rvanraamsdonk! I'll convert that into code & add it to the next release.

benkeen avatar Nov 25 '21 06:11 benkeen

Glad you find it useful! Attached a cleaned-up version of the sheet, the previous one had two redundant tabs in, to avoid confusion.

NL.names.xlsx

rvanraamsdonk avatar Nov 25 '21 10:11 rvanraamsdonk

I just looked a bit closer and sorry, I didn't noticed the names weren't separated by gender. The last names are terrific, but unfortunately I'll need the first names separated. But no worries if it's a pain, I can track down an alternate source!

benkeen avatar Dec 03 '21 04:12 benkeen

Hi Ben,

I understand. Attached an updated version with a gender column added. Hope this works for you.

Kind regards, Robert

From: Ben Keen @.> Date: Friday, 3 December 2021 at 05:44 To: benkeen/generatedata @.> Cc: Robert van Raamsdonk (Contenu) @.>, Mention @.> Subject: Re: [benkeen/generatedata] Help wanted: remaining countries that need name data (Issue #710)

I just looked a bit closer and sorry, I didn't noticed the names weren't separated by gender. The last names are terrific, but unfortunately I'll need the first names separated. But no worries if it's a pain, I can track down an alternate source!

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://github.com/benkeen/generatedata/issues/710#issuecomment-985212657, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AFTWKK6YFNUQCOAW43MFFELUPBDL5ANCNFSM5H7IDGPA. Triage notifications on the go with GitHub Mobile for iOShttps://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Androidhttps://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

rvanraamsdonk avatar Dec 03 '21 09:12 rvanraamsdonk

Thanks @rvanraamsdonk! Possibly you responded to the thread, but github didn't attach the file? I checked the two attachements above but they're the same.

benkeen avatar Dec 05 '21 18:12 benkeen

Here's the updated version with the gender column added. NL.names with gender 031221.xlsx

rvanraamsdonk avatar Dec 06 '21 14:12 rvanraamsdonk

Looks perfect! Thanks again. I'll include this in the next version.

benkeen avatar Dec 07 '21 06:12 benkeen

Hey folks!

The Names data type lets you generate country-specific names to make your data set look more realistic. But to do that, we need to actually supply that data. To do that, add a names.ts file into the client/src/plugins/countries/[countryname] folder, in the same structure as the other ones.

I suggest adding a bare minimum of 100 male, female & last names, but anything up to 1000 of each. Note: the source data has to be license-free.

Remaining

  • Colombia
  • Costa Rica
  • France
  • Indonesia
  • Ireland
  • Italy
  • Mexico
  • New Zealand
  • Pakistan
  • Peru
  • Poland
  • Russia
  • South Africa
  • South Korea
  • Sweden
  • UK
  • Ukraine

Done

  • ~Australia~
  • ~Austria~
  • ~Belgium~
  • ~Brazil~
  • ~Canada~
  • ~Chile~
  • ~China~
  • ~Germany~
  • ~India~
  • ~Netherlands~
  • ~Norway~
  • ~Singapore~
  • ~Turkey~
  • ~US~
  • ~Vietnam~

Sir, I want to work on the Pakistan part if you allow me I will try to add a list of more than 300 names. Thank you..!!

sha0urya avatar Jun 14 '23 10:06 sha0urya

That'd be terrific, thanks @sha0urya!

benkeen avatar Jun 15 '23 01:06 benkeen

Hey! I added some name data for Mexico on #848. Hope it helps and thanks for making this amazing tool!

itaquito avatar Oct 24 '23 21:10 itaquito