Alex "mcmonkey" Goodwin
Alex "mcmonkey" Goodwin
Had the same error; redownloading from decapoda fixed it. Note that decapoda pushed an update ~15 days ago, so if you downloaded before then you have outdated weights and...
Ooo! I'm midway through making a LoRA Trainer PR for different features but I'll get to testing that ASAP. That looks to be, uh, actually possible to install and load...
Actually, the limitation here will be that it seems to have its own entirely separate model loading/formats/etc; it doesn't just build on the HuggingFace stuff or the GPTQ stuff.
Grab the PR @ https://github.com/oobabooga/text-generation-webui/pull/1098 and run `pip install git+https://github.com/huggingface/peft` to get an updated peft (both of these will be landing on the main branch soon; waiting on ooba to merge the...
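For anyone unsure how to test a PR branch locally, here's roughly what that looks like (the local branch name `lora-trainer-pr` is just a placeholder I picked):

```sh
# fetch the PR's head into a local branch and switch to it
git fetch origin pull/1098/head:lora-trainer-pr
git checkout lora-trainer-pr

# install the latest development peft straight from GitHub
pip install git+https://github.com/huggingface/peft
```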
Despite the coincidence in naming, `monkey-patch` does not actually relate to me lol, other than that I'm one of the people excited to use it because I like 4-bit...
At a glance, the perplexity code looks to be about right in terms of logic (haven't verified the small details). One thing you might add is to use the streaming...
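To illustrate the general shape of it (this is only a rough sketch using HF `datasets`/`transformers`, not the PR's actual code; the comment above is cut off, so the streaming-dataset part is my assumption, and `gpt2` is just a stand-in model):

```python
# Rough sketch: stream the eval set instead of materializing it all up front,
# accumulating token-weighted negative log-likelihood for perplexity.
import math
import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"
tokenizer = AutoTokenizer.from_pretrained("gpt2")  # placeholder model
model = AutoModelForCausalLM.from_pretrained("gpt2").to(device)
model.eval()

# streaming=True yields rows lazily rather than downloading/caching everything
dataset = load_dataset("wikitext", "wikitext-2-raw-v1", split="test", streaming=True)

nll_sum, token_count = 0.0, 0
for row in dataset:
    text = row["text"]
    if not text.strip():
        continue
    ids = tokenizer(text, return_tensors="pt").input_ids.to(device)
    if ids.shape[1] < 2:
        continue  # need at least one predicted token
    with torch.no_grad():
        # labels=ids makes the model return mean cross-entropy over n-1 tokens
        loss = model(ids, labels=ids).loss
    n_predicted = ids.shape[1] - 1
    nll_sum += loss.item() * n_predicted
    token_count += n_predicted

print("perplexity:", math.exp(nll_sum / token_count))
```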
I'm running as non-root on an Ubuntu install without issue. I have seen this error before, but it's usually resolved by closing my terminal window and opening a new one,...
Now the question is whether LLaVA or MiniGPT4 #1312 is better
Actual numbers, since I know some people will want them. Tested on an RTX 3090 on Linux, with ~1.3 GiB of VRAM in use while nothing is loaded. VRAM (inference): 800 tokens input, generating...
@janvarev `--auto-devices` / `--gpu-memory` don't work with 4-bit; you have to use `--pre_layer` ... and seemingly neither works _at all_ with `--monkey-patch`. Like it doesn't even process the pre_layer at...
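For reference, this is roughly the invocation I'm describing (model name and layer count are just examples; `--pre_layer` sets how many layers go on the GPU in GPTQ mode, with the rest offloaded to CPU):

```sh
# 4-bit GPTQ with CPU offloading: first 20 layers on GPU, the rest on CPU
python server.py --model llama-7b-4bit --wbits 4 --groupsize 128 --pre_layer 20

# with --monkey-patch added, the --pre_layer setting seemingly gets ignored entirely
python server.py --model llama-7b-4bit --wbits 4 --groupsize 128 --pre_layer 20 --monkey-patch
```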