AlpinDale comments

Results: 170 comments of AlpinDale

[Bug]: v0.6.3(.post1?) regression

Sorry I totally forgot to get back to you 😅 Seems like this can be solved by setting `--guided-decoding-backend outlines`. Not sure what it is about lmfe that breaks mistral...

feat: quadratic + cubic sampling

Thanks for the review @Yard1. It needs a bit more work, could probably use a mask and an unsqueeze for the factor/curve. I'll finish this up soon and add a...
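For readers unfamiliar with the idea, here is a minimal sketch of a per-request quadratic transform with a mask. It is not the code from the PR. The formula (pull every logit toward the row maximum by `factor * (logit - max)^2`) is the commonly described quadratic smoothing form and is an assumption here. The cubic/curve term is left out, and `quadratic_transform` and `factors` are hypothetical names. Plain lists keep it dependency-free; in a batched tensor version the per-request factor would be unsqueezed so that it broadcasts across the vocabulary dimension.

```python
def quadratic_transform(batch_logits, factors):
    """Apply quadratic smoothing row by row (illustrative sketch only).

    batch_logits: list of rows, one list of floats per request.
    factors: one smoothing factor per request; a factor <= 0 disables
             the transform for that request (this plays the role of the mask).
    """
    out = []
    for row, k in zip(batch_logits, factors):
        if k <= 0:
            # Masked-out request: logits pass through unchanged.
            out.append(list(row))
            continue
        peak = max(row)
        # Assumed formula: the top logit stays put, all others are pushed
        # down by the squared distance to the top logit, scaled by k.
        out.append([peak - k * (x - peak) ** 2 for x in row])
    return out


# Two requests in one batch: the first uses smoothing, the second does not.
print(quadratic_transform([[2.0, 1.0, 0.0], [2.0, 1.0, 0.0]], [0.5, 0.0]))
```

The mask matters because requests in the same batch can have different sampler settings, so rows without the sampler enabled have to come out untouched.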

feat: quadratic + cubic sampling

@Yard1 Should be done, let me know if there are any issues. Correctness tests pass for the new sampler.

feat: quadratic + cubic sampling

> @AlpinDale do you plan to continue this line of work? This sampler is quite outdated by now, so I'm not sure it'll be a useful addition anymore. There are...

feat: Classifier-Free Guidance (take 2)

It works, but will fail due to changed logit shapes when running without CFG.

[Usage]: GGUFed models on AMD GPUs

Hi. GGUF kernels should theoretically work on AMD, but it's untested as I don't have regular access to AMD compute. Multi-GPU should work fine on AMD. Tensor parallelism will split...

[Bug]: Generation sometimes slows to a crawl for all requests when there is a DRY sampler request

@gitzaidi I've added range in #855. I did play around with using Python lists, but I remember running into issues due to the batched nature of our samplers - we...

[Bug]: Generation sometimes slows to a crawl for all requests when there is a DRY sampler request

> @AlpinDale Thanks for your reactivity, very impressive! I will look into it then.
>
> Also, do we agree that, as of now, the implementation considers the same...

[Bug]: Generation sometimes slows to a crawl for all requests when there is a DRY sampler request

Fixed the sequence breaker ID issue at PygmalionAI/aphrodite-engine@c6e0ae0515a7b6364a152fbe132747d94713f6e4

[Bug]: Generation sometimes slows to a crawl for all requests when there is a DRY sampler request

DRY should be faster now, but it is still very slow. I'm attempting to write kernels to bypass this issue. Progress will be logged here: https://github.com/AlpinDale/dry_sampling_kernel
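To give a feel for why DRY is expensive, here is a rough single-sequence sketch of the matching step. It is not the Aphrodite implementation. It assumes the usual description of DRY: find earlier occurrences of the last token, measure how far the match extends backwards, and penalize the token that followed by `multiplier * base ** (match_len - allowed_length)`. The function name `dry_penalties` and the default values are illustrative assumptions, and `breakers` stands in for the sequence breaker token IDs.

```python
def dry_penalties(tokens, multiplier=0.8, base=1.75, allowed_length=2,
                  breakers=frozenset()):
    """Return {token_id: penalty} for tokens that would extend a repeat.

    Illustrative sketch only; parameter names and defaults are assumptions.
    """
    if not tokens or tokens[-1] in breakers:
        return {}
    last = len(tokens) - 1
    longest = {}  # follower token -> longest repeated suffix that precedes it
    for i in range(last):
        if tokens[i] != tokens[last]:
            continue
        # Walk backwards from both positions while the tokens keep matching;
        # a sequence breaker ends the match.
        n = 0
        while (i - n >= 0 and tokens[i - n] == tokens[last - n]
               and tokens[i - n] not in breakers):
            n += 1
        follower = tokens[i + 1]
        longest[follower] = max(longest.get(follower, 0), n)
    # Only repeats of at least allowed_length tokens are penalized.
    return {tok: multiplier * base ** (n - allowed_length)
            for tok, n in longest.items() if n >= allowed_length}


# The context ends in "1 2 3", which already appeared followed by 4,
# so token 4 is the one that gets penalized.
print(dry_penalties([1, 2, 3, 4, 1, 2, 3]))
```

The nested scan runs over the whole context for every sequence at every step, and its length differs from request to request, which is hard to express as batched tensor operations.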