Maya
> Do you have the compiled wheel for `quant_cuda-0.0.0-cp310-cp310-win_amd64` quant kernel?
>
> I'd love to test Pyg-6B-q4 capability but I absolutely despise installing MSVC build environment and it seems...
> It asked for a HF token (which I provided) and then it failed to quantize.

The `c4` dataset requires Hugging Face authorization; you can use `wikitext2` or `ptb` instead. I'm not...
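For reference, both alternatives can be downloaded anonymously with the `datasets` library, whereas `c4` (at the time) needed an authorized account. A minimal sketch, assuming the quantization script accepts the calibration dataset by name and loads it roughly like this:

```python
# Sketch: calibration data from public datasets, no HF token required.
# Dataset/config names follow the Hugging Face hub; the actual loader in
# the quantization script may wrap this differently.
from datasets import load_dataset

# wikitext2 downloads anonymously
traindata = load_dataset("wikitext", "wikitext-2-raw-v1", split="train")

# ptb is also public (uncomment to use instead)
# traindata = load_dataset("ptb_text_only", "penn_treebank", split="train")

print(traindata[0])
```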
https://github.com/oobabooga/text-generation-webui/pull/615 - new version, this PR is outdated for now
`setup()` is called when the UI is ready and the parameters from settings.json have been parsed. Global-scope statements are executed before this happens. Another thing is a rare case, but what if someone wants...
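To make the ordering concrete, here is a rough sketch of a hypothetical extension `script.py` (the `params` override behaviour is assumed from how settings.json is normally applied): anything at module scope runs at import time, before the overrides land, while `setup()` sees the final values.

```python
# script.py of a hypothetical extension (sketch only)

params = {
    "greeting": "hello",  # may be overridden by settings.json after import
}

# Module scope: runs at import time, BEFORE the settings.json overrides
# are applied, so this always sees the default value above.
print("at import:", params["greeting"])

def setup():
    # Called once the UI is ready and settings.json has been parsed,
    # so any overridden value is visible here.
    print("at setup:", params["greeting"])
```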
https://github.com/oobabooga/text-generation-webui/blob/34970ea3af8f88c501e58fef2fc5c489c8df2743/modules/GPTQ_loader.py#L100 There is a hardcoded sequence length in `_load_quant`. Does it work with context sizes over 2048? MPT-Storywriter should support contexts up to 65k tokens.
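If that line is the blocker, a hedged sketch of lifting the constant into a parameter could look like this (the real `_load_quant` takes more arguments; names and defaults here are illustrative only):

```python
# Sketch only: make the sequence length an argument instead of the
# hardcoded 2048 inside _load_quant. Not the actual GPTQ_loader code.
def _load_quant(model, checkpoint, wbits, groupsize, max_seq_len=2048):
    # ... existing weight loading / layer patching would go here ...
    model.seqlen = max_seq_len  # was: model.seqlen = 2048
    return model

class _Stub:  # stand-in for the real model object
    pass

m = _load_quant(_Stub(), "model.safetensors", 4, 128, max_seq_len=65536)
print(m.seqlen)  # 65536, e.g. for MPT-Storywriter-length contexts
```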
I don't want to run it, I want to quantize the model, i.e. convert it to 4-bit.
It's an issue with FlexGen; there are hardcoded model names and parameters in there. You can manually edit `site-packages/flexgen/opt_config.py` to support Nerys and other finetuned models.
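Roughly the kind of edit meant here, as a sketch (the real `opt_config.py` structure differs between FlexGen versions, and the assumption is that a Nerys finetune keeps the shape of its OPT-6.7B base model):

```python
# Sketch only: map a finetuned model name onto the stock OPT config it
# was trained from, so the name lookup doesn't fail.
def get_opt_config(name: str) -> dict:
    name = name.lower()
    if "nerys" in name:        # e.g. an OPT-6B Nerys finetune
        name = "opt-6.7b"      # reuse the base model's shape
    if name.endswith("opt-6.7b"):
        return {"num_hidden_layers": 32, "n_head": 32, "hidden_size": 4096}
    raise ValueError(f"unsupported model: {name}")

print(get_opt_config("facebook/opt-6.7b"))
print(get_opt_config("KoboldAI/OPT-6B-nerys-v2"))
```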
As a temporary workaround, I've found that disabling the max stack size for regex at the top of `src/unicode.cpp` works:

```c
#define _REGEX_MAX_STACK_COUNT 0
#include "unicode.h"
#include "unicode-data.h"
#include <...>
#include <...>
//...
```