aikitoria comments

Results 65 comments of


                                            aikitoria

Support Cohere Command-A (Cohere2ForCausalLM arch)

Btw, would it be possible to open source the kernels for FMHA, XQA, and "trtllmGen" ? Since MMHA is already open source, that would expose the whole library for people...

Support Cohere Command-A (Cohere2ForCausalLM arch)

@b8zhong I don't believe any special code is required? As the model architecture is supported by vLLM, you can simply launch the model with vLLM and it should work. Though...

"Trying to remove block n by 0 that is not in hash map" spam in release 0.17

This also occurs with trtllm-serve, not just run.py, which is much worse!

[FEATURE_REQUEST] Add swipes when regenerating stable diffusion image

Interesting, that's very useful! Still think this would be a nice addition though, where you could swipe to switch out images on individual messages that generated some

Preparing dataset for training CogVideo1.5 I2V

> Regarding CogVideoX1.5, it supports 768-1360 (long edge) and 768 short edge Is vertical video (i.e. 768x1360) meant to be supported? It always becomes blurry when I try.

[Feature] EPYC性能优化

Hi guys, has there been any progress towards multi-NUMA tensor parallel? I see sglang recently implemented this feature ( https://lmsys.org/blog/2025-07-14-intel-xeon-optimization/ ) but they do not have the partial offload solution...

[Feature] EPYC性能优化

Awesome, looking forward to it being released!

[Feature] EPYC性能优化

When you say in future, do you already have an estimate when this update will come?

do you think you could support HunyuanVideo the new video model from tehcent?

This would be awesome indeed! It seems the architecture is quite similar to flux aswell?

do you think you could support HunyuanVideo the new video model from tehcent?

Since HunyuanVideo I2V seemingly didn't turn out as good as expected, how about focusing the video support on Wan 2.1? 👀