brucethemoose comments

Results 100 comments of


                                            brucethemoose

trafficstars

Add Grammars

@lapp0 Could you post your multiprocessing branch, even if its incomplete? I've been trying to implement it myself, but it seems I can't get it quite right.

How to finetune and serve the finetuned model using VLLM like tools

SGlang support would be excellent, as it doesn't seem to work out of the box.

How to finetune and serve the finetuned model using VLLM like tools

Made an issue for it here as well: https://github.com/sgl-project/sglang/issues/297 SGlang already supports llava, so perhaps adding deepseek would not be as huge of a jump as vllm (which currently does...

Large Pages fails when allocating 8GB+ on Windows/Java 19.

Oh possibly related, see the last comment here (from a previous report I did): https://bugs.openjdk.org/browse/JDK-8292905

Request for Optimization, fixing issues with CUDA devices

Yeah, IDK what the memory requirements are but it maxxes out my 16GB of RAM and eats tons of swap. And I also noticed some models don't work as an...

Request for Optimization, fixing issues with CUDA devices

> > Yeah, IDK what the memory requirements are but it maxxes out my 16GB of RAM and eats tons of swap. > > And I also noticed some models...

Request for Optimization, fixing issues with CUDA devices

Merges are reasonably fast on CPU, thats not really an issue IMO since they are so infrequent. But being locked to torch 11 because of the cpu requirement kinda is...

Request for Optimization, fixing issues with CUDA devices

> > Merges are reasonably fast on CPU, thats not really an issue IMO. > > Being locked to torch 11 because of the cpu requirement kinda is though 🤔....

Request for Optimization, fixing issues with CUDA devices

(For reference a merge finishes in like 30 seconds on my 8C 4900HS running linux)

Are models being generated with the correct alpha?

playing with some merged models more, it *seems* like its properly respecting the alpha value.