Darrin Hodges
Also tried mpt-7b-instruct.ggmlv3.q8_0.bin and got: `gptj_model_load: invalid model file 'models/mpt-7b-instruct.ggmlv3.q8_0.bin' (bad vocab size 2007 != 4096)`
Have been getting similar errors with various models as per below; the error is the same: `gptj_model_load: invalid model file 'models/mpt-7b-instruct.ggmlv3.q8_0.bin' (bad vocab size 2007 != 4096)`
for reference, after running the requirements, I still had to install the following (on a clean environment):

- python -m pip install python-dotenv
- pip install tqdm
- pip install langchain...
> for reference, after running the requirements, I still had to install the following (on clean environment):
>
> * python -m pip install python-dotenv
> ...
> ```bash
> #!/bin/bash
> export LLAMA_CUBLAS=1
> source ~/anaconda3/bin/activate
> # check if venv virtual env exists
> if conda info --envs | grep -q "venv"
> then
>     echo...
> ```
this is where it fails:

```
g++ -I. -I./examples -O3 -std=c++11 -fPIC -DNDEBUG -Wall -Wextra -Wpedantic -Wcast-qual -Wno-unused-function -Wno-multichar -pthread -march=native -mtune=native -DGGML_USE_CUBLAS -I/usr/local/cuda/include -I/opt/cuda/include -I/targets/x86_64-linux/include -c llama.cpp -o llama.o...
```
ok, installing the latest NVIDIA toolkit (12.1) has allowed llama-cpp-python to build correctly; it seems the Ubuntu packages are somewhat out of date. I also had to edit /etc/security/limits.conf to raise the...
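For anyone else hitting the build failure: rather than a custom activation script, the cuBLAS build can also be forced through pip. A minimal sketch, assuming the CUDA 12.1 toolkit is installed under /usr/local/cuda-12.1 (the `CMAKE_ARGS`/`FORCE_CMAKE` variables are the ones llama-cpp-python documented for cuBLAS builds; adjust the path to your install):

```shell
# Make sure nvcc from the new toolkit is found first (path is an assumption).
export PATH=/usr/local/cuda-12.1/bin:$PATH

# Rebuild llama-cpp-python from source against cuBLAS.
CMAKE_ARGS="-DLLAMA_CUBLAS=on" FORCE_CMAKE=1 \
  pip install --force-reinstall --no-cache-dir llama-cpp-python
```

`--no-cache-dir` matters here: without it pip may reuse a previously built wheel that was compiled without GPU support.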
ok, got it working with n_batch 2000, not as fast as a previous poster but better than before

```
Using embedded DuckDB with persistence: data will be stored in: db...
```
where the previous result had:

```
llama_model_load_internal: [cublas] offloading 12 layers to GPU
llama_model_load_internal: [cublas] total VRAM used: 2722 MB
```

is it GPU dependent? the one used is 12GB, would a...
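From those log lines one can do a rough per-layer VRAM estimate to guess how many layers a 12GB card could take. This is back-of-envelope only: it ignores the KV cache and scratch buffers, so the real ceiling is lower.

```python
# Per-layer VRAM from the log above: 2722 MB reported for 12 offloaded layers.
total_vram_mb = 2722
offloaded_layers = 12
mb_per_layer = total_vram_mb / offloaded_layers  # ~227 MB per layer

# Naive ceiling for a 12 GB card, ignoring KV cache and scratch buffers.
budget_mb = 12 * 1024
max_layers = int(budget_mb // mb_per_layer)
print(f"{mb_per_layer:.1f} MB/layer, ~{max_layers} layers fit")
```

So on paper a 12GB card could hold far more than 12 layers of this model; in practice the usable number is lower once the context buffers are allocated.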
thanks DanielusG, I tried increasing the layers, the timings didn't change much

```
24 layers
llama_print_timings: load time = 16568.45 ms
llama_print_timings: sample time = 36.19 ms / 64 runs...
```