Le Xu

Results 9 issues of Le Xu

### 🚀 Feature Description and Motivation Adding a workload generator that generate workload file based on [Azure inference trace](https://github.com/Azure/AzurePublicDataset/tree/master/data) workload trace format (e.g., csv file). ### Use Case The generator...

kind/feature
area/benchmark
area/cli

### 🚀 Feature Description and Motivation Adding a workload generator that generate workload file based on BurstGPT workload trace format (e.g., csv file). ### Use Case The generator should support...

kind/feature
area/benchmark
area/cli

### 🚀 Feature Description and Motivation Adding a workload generator that generate workload file based on internal workload trace format (e.g., csv file). ### Use Case The generator should support...

kind/feature
area/benchmark
area/cli

This PR reconfigures workload generator to adopt predefined load types.

## Pull Request Description This PR removes model name from the workload generator and client.

## Pull Request Description Supporting autoscaling experiment scripts

## Pull Request Description This branch enhance cache awareness for aibrix mock inference app.

## Pull Request Description This PR removes await from worker thread for async IO. ## Related Issues #903

### Summary The goal of this API is to expose a scalable, distributed video generation service for xDiT using models such as CogVideoX, ConsisID, and Latte. The API allows clients...

area/inference-engine
area/orchestration