Jiaxin Shan issues

Results 271 issues of


                                            Jiaxin Shan

Define and validate Recommended AIBrix configurations for Production Use

### 🚀 Feature Description and Motivation Currently, many controller configuration values (e.g., resources, concurrency limits, resync periods, queue lengths) are set based on heuristics or "magic numbers" without empirical justification....

kind/documentation

area/installation

Infinistore failed to work in the scale testing

### 🐛 Describe the bug ![Image](https://github.com/user-attachments/assets/ed4e186a-b6bd-4d81-b01b-2964767eaf03) ![Image](https://github.com/user-attachments/assets/83b2a604-a65b-49f6-b4a8-22ca0cfbea16) ### Steps to Reproduce 8 vllm instances, 960 GB Infinistore single instance ### Expected behavior it should work as expected ### Environment -...

MacOS make docker-build-all take long time to finish

### 🐛 Describe the bug ![Image](https://github.com/user-attachments/assets/7f404777-167d-4520-9542-67cf89454933) ``` .PHONY: docker-build-all docker-build-all: make -j $(nproc) docker-build-controller-manager docker-build-gateway-plugins docker-build-runtime docker-build-metadata-service docker-build-kvcache-watcher ## Build all docker images ``` I cleaned up everything and rebuild...

area/cicd

Finalized the v0.3.0 manifest & installation

### 🚀 Feature Description and Motivation We are using two steps deployment in the past and `apply` can not be used due to long crds contents. ``` # Install nightly...

Runtime image build issue: ERROR: failed to solve: failed commit on ref "layer-sha256:209998..."

### 🐛 Describe the bug ![Image](https://github.com/user-attachments/assets/36b24f2c-41da-4c71-90e7-8514e5407288) ``` ERROR: failed to solve: failed commit on ref "layer-sha256:209998ede32af8f5bcd2d0b9d1d2ca17a41dcce7a0a9e2437aec0e8f557b323d": "layer-sha256:209998ede32af8f5bcd2d0b9d1d2ca17a41dcce7a0a9e2437aec0e8f557b323d" failed size validation: 1014 != 251: failed precondition ``` ### Steps to Reproduce...

kind/bug

area/cicd

Run compatibility test as CI jobs

### 🚀 Feature Description and Motivation This test should focus more on using openai python sdk to test different model endpoints and check compatible responses. if any gateway changes break...

Integrate the benchmark code into CI Job

### 🚀 Feature Description and Motivation We've seen many issues related to benchmark - https://github.com/vllm-project/aibrix/issues/1040 - https://github.com/vllm-project/aibrix/issues/1029 - https://github.com/vllm-project/aibrix/issues/1028 Can we integrate the code to CI system to make sure...

priority/critical-urgent

area/benchmark

area/cicd

Failed to build inifistore images from test pypi

### 🐛 Describe the bug We have an infinistore patch version to accept gid, this is now just released in test PYPI. I meet some issue to install it ![Image](https://github.com/user-attachments/assets/e0ab9b0b-e73c-47ad-bda5-df600ad5a971)...

priority/critical-urgent

area/installation

area/kv-cache

Failed to import transformers.generation.utils because of the following error

### 🐛 Describe the bug ``` ./benchmark.sh all + CONFIG_FILE=config/base.sh + [[ -f config/base.sh ]] + echo '[INFO] Loading configuration from config/base.sh' [INFO] Loading configuration from config/base.sh + source config/base.sh...

kind/bug

area/benchmark

[Performance]: 1P1D Disaggregation performance

### Proposal to improve performance I try to reproduce the P&D 1P1D benchmark to compare performance with chunked prefill https://github.com/vllm-project/vllm/blob/main/benchmarks/disagg_benchmarks/disagg_performance_benchmark.sh. TTFL is higher than what I expected. Because the overhead...

performance

stale