LaaZa comments

Results 113 comments of


                                            LaaZa

AssertionError: Torch not compiled with CUDA enabled

Okay, it appears that it is trying to load a normal model and not ggml because the model isn't named properly for textgen. Rename the `Pygmalion-7b-4bit-Q4_1-GGML.bin` to something like `ggml-Pygmalion-7b-4bit-Q4_1.bin`

AssertionError: Torch not compiled with CUDA enabled

You changed the folder name and not the .bin file?

Can't load llama13b-4bit.pt without using CPU ram, extremely slow ~ 20 seconds/token

Honestly, try to use a GGML model that runs on the cpu instead. 4 GB is just hopeless to run anyhing on the GPU. You can technically split the model...

Can't load llama13b-4bit.pt without using CPU ram, extremely slow ~ 20 seconds/token

Look for models here [Hugging Face models](https://huggingface.co/models?pipeline_tag=text-generation&sort=downloads&search=ggml) @oranda Doesn't sound like you are using textgen, so not really the issue for here. But eitherway, if you are using llama-cpp-python, try...

Set n_ctx for llama.cpp models when loading/reloading

Correct me if I'm wrong but doesn't textgen use specifically llama.cpp via llama-cpp-python, which are for LLaMA? Maybe it could be useful to be able to change this value anyway...

Why is your hypernetwork option in that location?

In the settings tab > user interface > quicksettings list. Change it to this list: `sd_model_checkpoint, sd_hypernetwork, sd_hypernetwork_strength` You can add other settings separated by commas and you can find...

Why is your hypernetwork option in that location?

@Shake128 does it look exactly like this? ![quicksettings](https://user-images.githubusercontent.com/6142286/198877894-bcbb9bc6-d9b5-4095-81d8-a2ddbe8c094c.png) For me the Apply and Restart buttons seem to stop working after using the restart from the UI. Make sure when you...

LaaZa

AssertionError: Torch not compiled with CUDA enabled

AssertionError: Torch not compiled with CUDA enabled

Can't load llama13b-4bit.pt without using CPU ram, extremely slow ~ 20 seconds/token

Can't load llama13b-4bit.pt without using CPU ram, extremely slow ~ 20 seconds/token

Set n_ctx for llama.cpp models when loading/reloading

Why is your hypernetwork option in that location?

Why is your hypernetwork option in that location?

[FEATURE] ADD Support DBRX

[BUG]Windows 11 operating system encountered an issue during source code installation

[BUG]Windows 11 operating system encountered an issue during source code installation