Lukas Kreussel
I could give it a try, but I'm still kind of busy with the CUDA/OpenCL stuff, and I have no idea how I would implement performance metrics and logging correctly in...
LoRA files should always start with the `ggla` magic (see [here](https://github.com/ggerganov/llama.cpp/blob/master/convert-lora-to-ggml.py#L51)). Could you link to the LoRA file you were using? And could it be that you used a...
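For reference, a minimal sketch of such a magic check, assuming the magic is stored as a little-endian `u32` (the file path is just a placeholder):

```python
import struct

GGLA_MAGIC = 0x67676C61  # "ggla" interpreted as a little-endian u32

def is_ggla_file(path: str) -> bool:
    """Return True if the file starts with the `ggla` magic number."""
    with open(path, "rb") as f:
        header = f.read(4)
    if len(header) < 4:
        return False
    (magic,) = struct.unpack("<I", header)
    return magic == GGLA_MAGIC

print(is_ggla_file("adapter_model.bin"))  # hypothetical LoRA file path
```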
Shouldn't switching the backend be possible after https://github.com/ggerganov/llama.cpp/pull/2239 is implemented?
Well, you can calculate it: 13B parameters × 16 bits (2 bytes, f16) = 26 GB. Accelerate will probably try to page some of the layers if you exceed your 16 GB...
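As a quick sanity check of that arithmetic:

```python
params = 13e9          # 13B parameters
bytes_per_param = 2    # f16 = 16 bits = 2 bytes
gb = params * bytes_per_param / 1e9
print(f"{gb:.0f} GB")  # -> 26 GB
```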
What format is your model in? `rustformers` currently only supports `GGJT` models; `GGUF` support still needs some more time. The error you're getting means your model has an unknown file type.
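A small sketch to identify the container format from the file's magic number; the constants below are the GGML-family magics as I remember them from the llama.cpp headers, so double-check them against the source:

```python
import struct

# Known GGML-family magic numbers (little-endian u32)
MAGICS = {
    0x67676D6C: "GGML",
    0x67676D66: "GGMF",
    0x67676A74: "GGJT",
    0x46554747: "GGUF",
}

def model_format(path: str) -> str:
    """Read the first 4 bytes and map them to a known format name."""
    with open(path, "rb") as f:
        (magic,) = struct.unpack("<I", f.read(4))
    return MAGICS.get(magic, f"unknown (0x{magic:08X})")

print(model_format("ggml-model-q4_0.bin"))  # hypothetical model path
```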
This is the related PR in the rustformers repo https://github.com/rustformers/llm/pull/412
`llm-rs-python` is a wrapper around [rustformers/llm](https://github.com/rustformers/llm). If Falcon support lands in [rustformers/llm](https://github.com/rustformers/llm) via https://github.com/rustformers/llm/issues/293, I'll include it in the wrapper.
I don't think the current `langchain` wrapper supports async calls, but it shouldn't be too hard to add, as the `model.stream()` call already releases the GIL internally while generating tokens....
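A minimal sketch of how that could look: because `stream()` releases the GIL while generating, the blocking iterator can be driven from asyncio via the default thread-pool executor without stalling the event loop. The `model.stream(prompt)` call is taken from the comment above; everything else is an assumption, not the wrapper's actual API.

```python
import asyncio

async def astream(model, prompt: str):
    """Drive the blocking model.stream() generator from asyncio."""
    loop = asyncio.get_running_loop()
    it = iter(model.stream(prompt))  # assumed signature, see note above
    sentinel = object()
    while True:
        # next(it, sentinel) runs in a worker thread; the released GIL
        # lets generation proceed while the event loop stays responsive
        token = await loop.run_in_executor(None, next, it, sentinel)
        if token is sentinel:
            break
        yield token

# usage sketch:
# async for token in astream(model, "Hello"):
#     print(token, end="", flush=True)
```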
https://github.com/louisgv/local.ai/assets/65088241/6173b693-d020-4098-9f35-517a720f1044 @louisgv The token callback to the UI seems to slow down the feeding process significantly. (It should be instant.)
Currently this copies the CUDA DLLs next to the local.ai executable if the `cargo tauri dev` or `cargo tauri build` command is executed with the `--features cublas` flag. @louisgv Is...