Eric Buehler issues

Results 136 issues of


                                            Eric Buehler

Sliding window for phi3

Sliding window models do not properly slice KV cache

**Describe the bug** This affects models which use sliding window attention, but only when the sequence length is great enough (seq_len > sliding_window) to need the sliding window. This will...

bug

Default values in enum struct-variants

Hello everyone, Thank you for your great work here. Our project makes extensive use of struct enum variants. However, we have many variants which should have default values: some of...

enhancement

Support quantized models

Refs #24.

enhancement

Add automatic pypi upload and docker build on release

Update PyO3 to take dict

This increases compatibility with OpenAI and llama-cpp-python. I would appreciate any thoughts on this change. # Breaking This breaks any code which uses the chat completion API as it removes...

pyo3

breaking

backend

models

`broadcast_as` error when processing multiple tokens at once in quantized example

Hello all, Thanks for your great work here. We are implementing speculative decoding at mistral.rs, and were in the final stages of testing when we discovered some incredibly strange behavior....

Eric Buehler