natural icon indicating copy to clipboard operation
natural copied to clipboard

Allow for alternate storage backends

Open simonwex opened this issue 10 years ago • 7 comments

To allow for larger data sets, multi-process computation and managed backups, it'd be handy to be able to store data in a storage backend instead of in memory.

Possible back-ends:

  • in-memory (default)
  • memcached
  • Redis
  • PostgresSQL

I've got a need for this for a large-ish-scale classification project and will be working on it on my fork. If others would find value in this feature, I'm happy to discuss an approach that could eventually be merged into the main project. Thoughts?

simonwex avatar Mar 26 '15 07:03 simonwex

im totally open to this, it also comes at a time when im working on a corpus manager to handle algorithms that rely on large datasets which can be downloaded. The pilot algorithm for this new functionality is WordNet but i hope to add many more in the future. I currently index the wordnet files and save the index in a json file which is loaded into memory to make queries so it would be an interesting idea to have other storage options.

I'm not sure whether or not it fits as a piece of a natural language toolkit or not but im more than happy to flesh out the details and see if itll all make sense.

Thanks! -Ken

kkoch986 avatar Mar 26 '15 14:03 kkoch986

Hey, is there an update on this? Or any tips on how to store data on a classification in a back end?

simonv3 avatar Oct 30 '15 19:10 simonv3

:up:

zenati avatar Mar 26 '16 20:03 zenati

+1

bobalazek avatar Apr 01 '16 23:04 bobalazek

+1

guns2410 avatar Apr 17 '16 20:04 guns2410

+1 !!!

dannyfreed avatar Jan 27 '17 03:01 dannyfreed

+1

pantchox avatar Jun 19 '17 18:06 pantchox

duplicate with #681

Hugo-ter-Doest avatar Mar 01 '24 12:03 Hugo-ter-Doest

See #727

Hugo-ter-Doest avatar Mar 05 '24 20:03 Hugo-ter-Doest

Support for a number of storage methods has been added with #727

Hugo-ter-Doest avatar Mar 12 '24 18:03 Hugo-ter-Doest