
Running with a hosted model?

Open dcsan opened this issue 1 year ago • 1 comments

I'd like to include this technique in a larger project, but I'm worried about the requirement to download the model, as I'm using a containerized environment.

What would be involved in porting this code to use a hosted model, e.g. via HF? I assume it's not enough to have an embedding API?

Alternatively, could there be an installer that grabs the required files so they can be added to a Docker container?

dcsan avatar Jul 09 '24 09:07 dcsan
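
For readers following the hosted-model question above, here is a minimal sketch of how similarity-based chunking could sit on top of any embedding API. It is not this library's implementation. The function names, the threshold value, and the `embedFn` callback (a stand-in for whatever hosted embedding endpoint you use) are all hypothetical and exist only for illustration.

```javascript
// Cosine similarity between two equal-length numeric vectors.
function cosineSimilarity(a, b) {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  // Treat a zero vector as dissimilar to everything.
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Group consecutive sentences into chunks. A new chunk starts whenever the
// similarity between a sentence and the one before it falls below `threshold`.
// (Assumption: a simple adjacent-sentence rule, chosen only for this sketch.)
function splitByEmbeddings(sentences, vectors, threshold = 0.5) {
  if (sentences.length !== vectors.length) {
    throw new Error("sentences and vectors must have the same length");
  }
  const chunks = [];
  let current = [];
  for (let i = 0; i < sentences.length; i++) {
    if (i > 0 && cosineSimilarity(vectors[i - 1], vectors[i]) < threshold) {
      chunks.push(current.join(" "));
      current = [];
    }
    current.push(sentences[i]);
  }
  if (current.length > 0) chunks.push(current.join(" "));
  return chunks;
}

// `embedFn` is hypothetical: an async function that takes an array of strings
// and resolves to an array of vectors, e.g. a thin wrapper around a hosted API.
async function chunkWithHostedEmbeddings(sentences, embedFn, threshold = 0.5) {
  const vectors = await embedFn(sentences);
  return splitByEmbeddings(sentences, vectors, threshold);
}
```

Under these assumptions nothing needs to be downloaded into the container, because the only model-dependent step is the `embedFn` call. Whether the library's own chunking logic can be separated from its local model this cleanly is a question for the maintainer.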

I added a download-models tool that might help you get closer to your goal of pre-downloading/packaging the model files. https://github.com/jparkerweb/semantic-chunking?tab=readme-ov-file#-pre-downloading-models

jparkerweb avatar Jul 12 '24 03:07 jparkerweb

@jparkerweb Hey, could I run this on RunPod or Cloudflare Workers AI?

davevilela avatar Oct 21 '24 19:10 davevilela

I'm trying to use this library in a Cloudflare Worker, and I'm getting this error:

[Screenshot, 2024-10-22 at 15:44:26; image not available]

Here's an overview of my setup:

[Screenshot, 2024-10-22 at 15:43:05; image not available]

davevilela avatar Oct 22 '24 18:10 davevilela

Not the author, but I'd be surprised if it works in a Worker: it needs to download a lot of model data the first time it's run.

dcsan avatar Oct 22 '24 19:10 dcsan

@dcsan Hey! What would you recommend for deploying this?

davevilela avatar Oct 22 '24 20:10 davevilela

@dcsan This error occurs even after manually downloading the models locally. I have no clue how to solve this.

davevilela avatar Oct 22 '24 20:10 davevilela