Le Xu
Le Xu
### 🚀 Feature Description and Motivation Adding a workload generator that generate workload file based on [Azure inference trace](https://github.com/Azure/AzurePublicDataset/tree/master/data) workload trace format (e.g., csv file). ### Use Case The generator...
### 🚀 Feature Description and Motivation Adding a workload generator that generate workload file based on BurstGPT workload trace format (e.g., csv file). ### Use Case The generator should support...
### 🚀 Feature Description and Motivation Adding a workload generator that generate workload file based on internal workload trace format (e.g., csv file). ### Use Case The generator should support...
This PR reconfigures workload generator to adopt predefined load types.
## Pull Request Description This PR removes model name from the workload generator and client.
## Pull Request Description Supporting autoscaling experiment scripts
## Pull Request Description This branch enhance cache awareness for aibrix mock inference app.
## Pull Request Description This PR removes await from worker thread for async IO. ## Related Issues #903
### Summary The goal of this API is to expose a scalable, distributed video generation service for xDiT using models such as CogVideoX, ConsisID, and Latte. The API allows clients...