dblacknc comments

Results: 20 comments of dblacknc

ValueError: MPTForCausalLM does not support `device_map='auto'` yet.

Confirmed it now runs with --load-in-8bit

Output with --verbose prints previous response only after next prompt entered

The bug report is that response N is printed in the verbose log only after prompt N+1 is entered. Sorry for any confusion from my also mentioning the entire history is...

On interface restart, conn. refused then port is in use

If I run "netstat -an | fgrep :80" and watch for the TIME_WAIT connections to go away (no output), it'll then start. It has been a very long time since...
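The same check can be scripted instead of watching netstat by hand. The following is a minimal sketch, not code from the web UI itself: the helper name `port_is_free`, the default host, and the `reuse_addr` parameter are all hypothetical and only illustrate the idea. It tries to bind a TCP socket to the port and reports whether the bind succeeds. Setting `SO_REUSEADDR` is the usual way to let a restarted server bind on Linux while old connections are still in TIME_WAIT, but whether the interface sets that option is an assumption here.

```python
import socket


def port_is_free(port, host="127.0.0.1", reuse_addr=False):
    """Return True if a TCP socket can bind to (host, port) right now.

    Hypothetical helper for illustration only; not part of the web UI.
    """
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
        if reuse_addr:
            # Ask the OS to permit binding despite lingering TIME_WAIT entries.
            sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
        try:
            sock.bind((host, port))
        except OSError:
            # Typically "address already in use".
            return False
        return True
```

A restart script could poll `port_is_free(port)` in a loop with a short sleep and start the interface once it returns True.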

Crash with llava 4bit and --auto-devices

Looks like with --gpu-memory 7100MiB it starts pushing some layers to the CPU. I'm thinking part of the challenge is that with the llava extension active the GPU already has 1.8-2.0 GB...

Crash with llava 4bit and --auto-devices

The threshold is 7100: that pushes a few layers to the CPU, and 7200 doesn't. However, with 7200 (and above) it overruns the 12 GB VRAM with many prompts.

Crash with llava 4bit and --auto-devices

OK - confirmed, --pre_layer is allowing CPU offload to work with GPTQ. I found a couple of other related things: --auto-devices seems to be unconditionally enabled. I can omit it and...

Crash with llava 4bit and --auto-devices

OK - thanks for the explanation, and the pointer to the README for LLaVA for more info. Closing, as it looks like it's not a bug, and I'll assume for now the...

Crash with llava extension + --no-cache

This model: wojtab_llava-13b-v0-4bit-128g

API port in use

I reported the same in issue #1632

Error code 137 when loading RWKV-4-RAVEN model

I'm just trying RWKV and it's working well for me. Not running in a container though. I'm using an Ubuntu 22.04 KVM VM with 64 GB RAM and passing through...