haikunator icon indicating copy to clipboard operation
haikunator copied to clipboard

Increase the Noun and Adjective Library

Open stevenm111 opened this issue 4 years ago • 4 comments

Love the gem however the library was small, I have added 4186 Adjectives and 6720 Nouns happy to post the branch if wish for your consideration, it does dilute the quality of the Haiku but means you don't get repetition, perhaps we can set a Turk Machine to remove any negatives to improve quality of library?

Brings possible Haikus from 6,480,000 to 26,937,036,000 give a 3 digit number is used for third metum.

stevenm111 avatar Mar 10 '21 13:03 stevenm111

can you share your list? can you add the list to my fork?

codered avatar Mar 26 '21 02:03 codered

Sure

Its an extensive list so the quality of the Hauiku can drop, however it means theres no repetition. I've pulled out some offensive and senseless ones , could do with a human to edit and select.

haikunator.rb.txt

stevenm111 avatar Mar 26 '21 09:03 stevenm111

thanks @stevenm111 . I noticed a word as soon as I opened it (dipstick), which has some negative connotations. I think it would be best to provide a way to load wordlist into the gem for use. that way, it can easily scale.

codered avatar Mar 26 '21 21:03 codered

Just an observation in Steven's haikunator.rb.txt file. There are 1481 duplicate adjectives and 70 duplicate nouns. There are only 2,733 unique adjectives and 6,714 unique nouns. Hit the arrays with .uniq to clean up before use. ;)

Planning to use for ship names. You will want to cull the list. My first generation included 'sleepy rape' and 'angry crucifixion.'

https://gist.githubusercontent.com/Merovex/eb1fbff5c8594c5bb73b47065f481fd1/raw/caf22b252b850839de8636d86091fa1eaa6bc284/nouns.txt

This is a slightly purged list of the nouns (6587). I took out all of the dupes, some odd medical terms (including patented meds), and the offensive words I could find in 10 minutes of scanning.

Merovex avatar Jan 19 '22 00:01 Merovex