Casper comments

Results 293 comments of


                                            Casper

How to use multiple GPU nodes during quantization

Hi @ghntd, at the moment, data parallelism is not implemented. I welcome any help on implementing this that demonstrates a speedup.

[Kernel] [Triton] [AMD] Add Triton implementation of awq_dequantize

@robertgshaw2-neuralmagic could you take a look? I think Triton implementations could be interesting to expand access to other hardware.

fix: handle dict format in response processing for web_search tools

LiteLLM should adhere to the responses API, making delegation there an impractical option. The parsing in DSPy incorporates specific assumptions about the structure of a response, which unfortunately fail to...

fix: handle dict format in response processing for web_search tools

I am still getting errors running the script I referenced in the issue after installing from your branch. It fixes some initial issue, but it seems not fully compatible with...

Support DeepSpeed AutoTP

New example is out for this. https://github.com/deepspeedai/DeepSpeedExamples/blob/592d28fa45c12613f39ed388e043be760707237c/training/tensor_parallel/train.py

Support DeepSpeed AutoTP

@winglian yes, but I seem to remember upgrading to latest accelerate fixed it

[Bug] Responses API always falls back to JSON mode which breaks with web search

> Hi, I ran your code but didn't hit the exact error. Can you share the full stack trace? I just tried making a new environment and reinstalled things, but...

[Bug] Responses API always falls back to JSON mode which breaks with web search

I accidentally deleted this when uploading my script and forgot about it. This line of code is the sole reason the script crashes immediately, but a little while later the...

[Bug] Responses API always falls back to JSON mode which breaks with web search

@bhaskargurram-ai Appreciate the dev experience getting better, but I am not sure this fixes the issue raised here. I am more concerned with not being able to use the web_search...

[Bug] Responses API always falls back to JSON mode which breaks with web search

@TomeHirata looks like the PR for this issue got abandoned. any plans to fix this issue for now?