Jiaxin Shan
@jiangxiaobin96 Thanks for the change. I will double-check whether it is necessary.
I didn't find an existing issue to track this story. If there is one, please let me know.
@numerology Yeah, if MLMD can add support for multi-tenancy, that would be great. The Pipeline project can make the corresponding changes. > I am assuming this one is talking about supporting multi-tenancy...
1. Why not use an OpenAI-style endpoint like `POST /v1/videos` instead of `/generatevideo`? 2. What is the result of the investigation into Google's video generation API? 3. ComfyUI seems an additional layer on top...
@Alan-D-Chen I think you just need to follow this guidance; this page gives you everything you need. What you followed, like the Lambda Cloud llama installation, is not helpful. Seems you...
> However, aibrix/gpu-optimizer:v0.4.1 cannot be found at all with the server and the local PC. Where did you find this image? Did you follow the guidance exactly? Or you fetch...
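@Alan-D-Chen awesome work! > Didn't you say earlier that you would release a version that does not use K8s or minikube? Is it ready now? Normally, AIbrix needs to be deployed on dozens of servers to manage hundreds or thousands of GPUs, right? It's not fully finished yet. I will keep you posted once it's done. The...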
> And results: > @Alan-D-Chen this is awesome! But from the results perspective, I didn't see a big difference between P/D and non-P/D. Technically, the decoding latencies for non-P/D...
This looks like a very reasonable requirement. Thanks for driving the effort!
Can these files be generated? If so, let's get rid of them in the source.