Jiaxin Shan
### Describe the bug Job ID: https://github.com/aibrix/aibrix/actions/runs/13360433659/job/37309320244?pr=687 ### Steps to Reproduce N/A ### Expected behavior The job should finish within 1 min ### Environment Default CI runner
### Feature Description and Motivation Metrics: requests, tokens (prefill, decode), latencies (e2e, TTFT, TPOT), resources (SM_ACTIVE). Measurement: requests per pod, standard deviation of requests, Gini coefficient. Recently,...
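As a rough illustration of the measurements listed in this issue, the sketch below computes the mean, standard deviation, and Gini coefficient of per-pod request counts. The function name `request_balance` and the input format (a plain list of counts) are hypothetical, the population form of the standard deviation is an assumption, and none of this is taken from the AIBrix codebase.

```python
import math


def request_balance(requests_per_pod):
    """Summarize how evenly requests are spread across pods.

    `requests_per_pod` is a hypothetical input: one non-negative
    request count per pod.
    """
    n = len(requests_per_pod)
    if n == 0:
        raise ValueError("need at least one pod")

    total = sum(requests_per_pod)
    mean = total / n

    # Population standard deviation (assumption: the pods are the
    # whole population, not a sample).
    variance = sum((x - mean) ** 2 for x in requests_per_pod) / n
    std = math.sqrt(variance)

    # Gini coefficient via the sorted-rank formula:
    #   G = 2 * sum(i * x_i) / (n * sum(x)) - (n + 1) / n
    # with x sorted ascending and i starting at 1.
    if total == 0:
        gini = 0.0  # no traffic at all: treat as perfectly even
    else:
        ordered = sorted(requests_per_pod)
        weighted = sum((i + 1) * x for i, x in enumerate(ordered))
        gini = 2 * weighted / (n * total) - (n + 1) / n

    return {"mean": mean, "std": std, "gini": gini}
```

A Gini coefficient of 0 means every pod received the same number of requests; with `n` pods the value approaches `(n - 1) / n` when a single pod receives all traffic.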
### Describe the bug ``` INFO 02-17 17:03:44 model_runner.py:1041] Loading model weights took 12.5708 GB INFO 02-17 17:03:44 vineyard_llm_cache.py:296] VineyardLLMCache async update: {'enable_async_update': True, 'min_inflight_tasks': 1, 'max_inflight_tasks': 8} INFO...
### Feature Description and Motivation Achieving efficient online LLM inference with SLO guarantees requires isolation among different clients. Besides OSDI'24 VTC, I did see some new...
### Feature Description and Motivation Let's create some tutorials for people who would like to try AIBrix on a single node ### Use Case n/a ### Proposed Solution _No response_
### Feature Description and Motivation Currently, the pod becomes ready immediately, but the application still takes a long time to load. During this window, requests to the model server will fail....
### Feature Description and Motivation We now have a Read the Docs website but lack a blog website. Let's create one to host blog posts, feature details, and company collaborations. ###...
### Describe the bug ``` ubuntu@158-101-17-114:~$ kubectl logs -f aibrix-redis-master-84769768cb-j5rfb -p -n aibrix-system 1:C 16 Feb 2025 18:46:20.187 * oO0OoO0OoO0Oo Redis is starting oO0OoO0OoO0Oo 1:C 16 Feb 2025...
### Describe the bug ### Steps to Reproduce Deploy AIBrix on Lambda Cloud instances ### Expected behavior The deployment should be stable and not experience fatal errors ### Environment nightly version