Huang Xin
I'm getting this error after pulling the Docker image with CUDA 11.8 and installing vLLM:

```
from vllm import LLM, SamplingParams
Traceback (most recent call last):
  File "", line 1,...
```
> ```shell
> pip install vllm
> ```

Thanks, it resolved my issue.
Simply add the following code after the optimizer is created in `optimizers.py` to support gradient accumulation:

```python
if config.accumulate_gradient_steps > 1:
    optimizer = optax.MultiSteps(optimizer, config.accumulate_gradient_steps)
Hi @A9isha, I found two bugs in your conversion code. I have fixed them and validated the weights converted from the maxtext version of Llama3-8B against the HF one....
> If anyone has any ideas, I can try it out on a RTX 5090.
>
> Apparently:
>
> * I guess you need https://github.com/vllm-project/vllm/pull/12702/files
> * Adapt the...
I have added a conversion script for Gemma-3 and verified that the converted parameter weights match the official Hugging Face Gemma-3 weights.
I'm getting the exact same problem with my RTX 5090 setup.
You can check my commit for the conversion script [here](https://github.com/AI-Hypercomputer/maxtext/blob/f6ebc1662cb944bd7748fb350bba164b13479b68/MaxText/gemma2_orbax_to_hf.py).
> [@hxssgaa](https://github.com/hxssgaa) I made a quick test of the script, trying to convert a 2B Gemma2 model. However, I am seeing this error: `ValueError: Requested shape: (2048,) is not compatible...
@peregilk, no need to save the ckpt locally; you can just point the maxtext_checkpoint to the Google bucket checkpoint location. Sorry for the confusion here, I think I have changed...