Jiaxin Shan comments

Results 742 comments of


                                            Jiaxin Shan

[Mics]: modify stormservice scaling logic when diff > 0

@jiangxiaobin96 thanks for the change, I will double check the necessity

[Multi User] Support separate metadata for each namespace

I didn't find existing issue to track this story. If there's one, please let me know

[Multi User] Support separate metadata for each namespace

@numerology Yeah, If MLMD can add support for multi-tenancy. that would be great. Pipeline project can make corresponding changes. > I am assuming this one is talking about supporting multi-tenancy...

[RFC]: xDiT Video Generation API

1. why not use openai style like `POST /v1/videos` instead of `/generatevideo`? 2. what's the investigation result of google video generation API? 3. comfyui seems an additional layer on top...

How to install and deploy AIBrix on a single server?

@Alan-D-Chen I think you just need to follow this guidance. this page gives you everything you need. what's you followed like lambda cloud llama installation is not helpful. Seems you...

How to install and deploy AIBrix on a single server?

> However, aibrix/gpu-optimizer:v0.4.1 cannot be found at all with the server and the local PC. where did you find this image? did you follow the guidance exactly? or you fetch...

How to install and deploy AIBrix on a single server?

@Alan-D-Chen awesome work! > 之前是不是说要推出不使用 K8s 或者 minikube 的版本吗？现在好了吗？正常来说 AIbrix 是需要部署在数十台服务器上管理成百上千个GPU的，对吗？ it's not fully finished yet. I will keep you posted once it's done. the...

How to install and deploy AIBrix on a single server?

> And results: > @Alan-D-Chen this is awesome! but from the results perspective, I didn't see big difference between P/D and non P/D. Technically, the decoding latencieis for non P/D...

Add external-filter in Header for advanced routing

this looks like a very reasonable requirements. thanks for driving the efforts!

Autoscaling Benchmark Initial

![image](https://github.com/user-attachments/assets/4a30f938-55eb-4b00-9507-2d29c5b64d3a) Can these files be generated? If so, let's get ride of them in the source file