openbenches.org icon indicating copy to clipboard operation
openbenches.org copied to clipboard

Wordle - help needed!

Open edent opened this issue 6 years ago • 2 comments
trafficstars

We've been asked to create a Wordle or Word Cloud based on inscription data.

BUT! They would like to included common phrases. Like "in loving memory of", "always in our hearts", "together again".

When I've previously done word clouds, it has always been done on single words. Does anyone know I can create a "phrase" wordle? Or even just get stats out about common phrases?

edent avatar Sep 23 '19 10:09 edent

Selecting all the inscriptions from the DB is easy. SELECTION inscription WHERE 1 etc.

The text can then be dumped in a Word Cloud site. I can find lots of software which works on individual words, but none which will automatically find phrases.

edent avatar Sep 23 '19 10:09 edent

Just came across this and thought it would be an interesting task to attempt to find phrases. I built out some logic here: https://github.com/ssddanbrown/openbench-phrases A view of the current output can be seen here.

It's pretty dumb, and not currently suitable for running server-side due to high resource usage, but may be helpful as a first step depending on desired usage. It looks to the live API for data (With caching on subsequent runs) but could be easily adapted to DB usage; the fetch of the inscription list is contained to one function.

ssddanbrown avatar Jun 09 '22 16:06 ssddanbrown