flexorRegev
@fmmoret @Yard1 From what it looks like in this PR, there isn't anything inherent preventing quantized models + LoRA from being supported right now; it just wasn't...
I was also trying to run Yi and hit the same problem. @Yard1 can you elaborate on what's needed to support this? I'd be glad to work on this PR.
@Yard1 How does Gemma avoid this? It also has a huge vocab_size.
Update: I got Yi working with a Gemma-like adaptation :)
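For context, here is a hedged sketch of what a "Gemma-like adaptation" plausibly amounts to: declaring LoRA metadata on the Yi model class the way the Gemma implementation does, and leaving the embedding/lm_head modules out so the huge vocab_size never hits the LoRA vocab-size check. The attribute names follow vLLM's convention at the time (`supported_lora_modules`, `packed_modules_mapping`, etc.), but the class below is an illustration, not the actual patch:

```python
# Hypothetical sketch of a LoRA-enabled Yi model class, mirroring the
# Gemma pattern. Not the real vLLM code; names and values are assumptions.

class YiForCausalLMWithLoRA:
    # Linear projections that LoRA adapters are allowed to target.
    supported_lora_modules = [
        "qkv_proj", "o_proj", "gate_up_proj", "down_proj",
    ]
    # Fused modules and the original per-weight names they pack together.
    packed_modules_mapping = {
        "qkv_proj": ["q_proj", "k_proj", "v_proj"],
        "gate_up_proj": ["gate_proj", "up_proj"],
    }
    # embed_tokens / lm_head are deliberately *not* listed, so the large
    # vocabulary never has to pass the LoRA vocab-size limit.
    embedding_modules = {}
    embedding_padding_modules = []
```

The key observation is that if adapters never touch the vocabulary-sized matrices, vocab_size becomes irrelevant to the LoRA path.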
> 2. Ability to skip ahead if there is no choice between tokens (next token is dictated by a schema)

How would you think about creating this? Since the sampler...
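One way to think about the skip-ahead idea: when the schema's state machine permits exactly one next token, emit it directly and never run the model; only pay for a forward pass when there is a real choice. A toy sketch with a hypothetical FSM-style interface (not vLLM's actual sampler API):

```python
from typing import Callable

def generate_with_skip_ahead(
    allowed_tokens: Callable[[list[int]], set[int]],
    sample: Callable[[list[int], set[int]], int],
    eos: int,
    max_len: int = 32,
) -> tuple[list[int], int]:
    """Decode under a schema, skipping the model on forced steps.

    allowed_tokens(prefix) -> tokens the schema allows next (hypothetical).
    sample(prefix, allowed) -> model + sampler pick among allowed tokens.
    Returns (tokens, number_of_model_calls_saved).
    """
    out: list[int] = []
    saved = 0
    while len(out) < max_len:
        allowed = allowed_tokens(out)
        if len(allowed) == 1:
            tok = next(iter(allowed))  # forced by the schema: no forward pass
            saved += 1
        else:
            tok = sample(out, allowed)
        out.append(tok)
        if tok == eos:
            break
    return out, saved

# Toy schema: must emit the literal [7, 8, 9], then any of {1, 2}, then EOS(0).
def toy_allowed(prefix: list[int]) -> set[int]:
    literal = [7, 8, 9]
    if len(prefix) < len(literal):
        return {literal[len(prefix)]}
    if len(prefix) == len(literal):
        return {1, 2}
    return {0}

tokens, saved = generate_with_skip_ahead(toy_allowed, lambda p, a: min(a), eos=0)
print(tokens, saved)  # [7, 8, 9, 1, 0] 4 -- four steps never touched the model
```

The batching question is the hard part in a real engine: forced sequences finish their steps "for free", so they drift ahead of sequences that still need forward passes.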
A few questions here:
1. Why did you hardcode max_len at 512 tokens?
2. It seems like there are about 3 locations for the configuration of max_length; this causes...
Cool. Right now I added a parameter for the document length and I'm setting it upstream manually (not ideal, but working). I tried using a PNG and it looks pretty good...