johnbrisbin
The values for chunk size and overlap are in both ingest.py and privategpt.py:

```
chunk_size = 500
chunk_overlap = 50
```

Those values are in bytes, not tokens. In privategpt.py you...
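For anyone wondering where those numbers end up, here is a minimal sketch, assuming the langchain RecursiveCharacterTextSplitter (which counts characters, not model tokens); the file name is just an example:

```
from langchain.text_splitter import RecursiveCharacterTextSplitter

# chunk_size and chunk_overlap are measured in characters, not model tokens
splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=50)
chunks = splitter.split_text(open("some_document.txt").read())  # example file name
```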
MultiRetrievalQAChain imports ChatOpenAI for no apparent reason on line 16: `from langchain.chat_models import ChatOpenAI`. This errant import creates a chain of errors at runtime if no OpenAI modules are...
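For what it's worth, one way to defuse it (a sketch of the idea only, not how langchain is written today) is to make the import optional, so it only bites when ChatOpenAI is actually requested:

```
# Sketch only: defer the OpenAI dependency so a missing openai package
# only matters if ChatOpenAI is actually chosen as the default LLM.
try:
    from langchain.chat_models import ChatOpenAI
except ImportError:
    ChatOpenAI = None  # caller must supply its own default LLM
```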
Has there been a high-level decision that all generated Python scripts shall run in a Docker environment? If not, this is just documenting an introduced bug (assuming Docker is installed). If...
I encountered the same issue (too many tokens) in a short Arabic passage in the PaLM 2 Technical Report PDF, recently published by Google, where they extol how good it...
Once adding new documents without having to reload everything is working reliably, periodic persistence of the db would become an effective way of avoiding a massive loss of effort when a...
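Roughly what I have in mind, as a sketch: it assumes the langchain Chroma wrapper with a persist_directory (as in ingest.py), and load_documents() stands in for the existing loader:

```
from langchain.vectorstores import Chroma
from langchain.embeddings import HuggingFaceEmbeddings

CHECKPOINT_EVERY = 100  # flush the index to disk every N documents

db = Chroma(persist_directory="db", embedding_function=HuggingFaceEmbeddings())
for i, doc in enumerate(load_documents()):   # load_documents() is a placeholder
    db.add_documents([doc])
    if (i + 1) % CHECKPOINT_EVERY == 0:
        db.persist()                          # a crash now loses at most N docs
db.persist()
```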
You may find that any performance improvement you see will be very dependent on the type of media the document files are stored on. That is, attempts to read multiple...
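To make that concrete, a sketch with a configurable worker count (max_workers is a hypothetical knob): several readers usually help on an SSD, while a single spinning disk mostly just seek-thrashes, so max_workers=1 would be the safe default there.

```
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

def load_file(path: Path) -> str:
    return path.read_text(errors="ignore")

def load_all(folder: str, max_workers: int = 4) -> list:
    # Parallel reads pay off on SSD/NVMe; on a spinning disk use max_workers=1.
    paths = list(Path(folder).rglob("*.txt"))
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(load_file, paths))
```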
I tend to like the idea of some kind of installer, not because it is difficult right now (pretty close, though) but because it will only get more complex. Keeping...
Modal text interfaces are a pain, but I would take the traditional ^C over starting to pick commands out of innocent text. Soon enough there will be a proper UI to...
> chunk_size 500 requires too much memory, even 32GB can't fit in it, change to 200, which works fine on 16GB macbook m1

As you ingest more data you will...
That looks great. I have a few questions, though.

1. Rather than scraping nvidia-smi, have you considered using pycuda? It is simple to get free memory as a plain number...
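On point 1, this is roughly all it takes; pycuda.driver.mem_get_info() returns free and total bytes for the device behind the current context:

```
import pycuda.autoinit  # creates a context on the default CUDA device
import pycuda.driver as cuda

# (free_bytes, total_bytes) for the device behind the current context
free_bytes, total_bytes = cuda.mem_get_info()
print(f"free: {free_bytes / 2**20:.0f} MiB of {total_bytes / 2**20:.0f} MiB")
```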