Lu Changqi comments

Results 17 comments of


                                            Lu Changqi

the problem of test

hello, have you solved it?

line 83, in call\n label_idx = self.class_to_ind[name]\nKeyError:

> > 您可以改用VOC_CLASSES = [（ > > 'feng'）] > > it doesnt work 您好！我遇到了同样的问题，您解决了吗？可以探讨一下！

RuntimeError: copy_if failed to synchronize: device-side assert triggered

> Excuse me, have you solved this problem? hello，have you solved the problem?Thank you!

[Bug]: Runtime error when running MLA models with "prefix caching enabled" and "chunked prefill disabled"

@ApostaC When using --enable-prefix-caching and encountering a cache hit, the same error occurs. However, with --enable-prefix-caching enabled, this error can be avoided, and the TTFT is reduced during cache hits....

[Bug]: Runtime error when running MLA models with "prefix caching enabled" and "chunked prefill disabled"

In this PR https://github.com/vllm-project/vllm/pull/13747#issuecomment-2687039690, I mentioned this bug. cc @LucasWilkinson

[Roadmap] vLLM Roadmap Q4 2024

@simon-mo hi，regarding the topic “KV cache offload to CPU and disk”, I previously implemented a version that stores kv cache in a local file(https://github.com/vllm-project/vllm/pull/8018). Of course, I also did relevant...

[Core] Enable Memory Tiering for vLLM

Hi! I have an idea. Can we support a key-value database similar to valkey (Redis over RDMA)? Among them, the key is the hash value of the token. In this...

[benchmark]: llama add tokens metrics

Ping! Could you give me some advice?

[Bug]: AssertionError, assert prefill_metadata.context_chunk_seq_tot is not None

mark

[Core][Kernel][Misc] Support external swapper for vllm

> pin_memory has a great impact on swapping blocks. > > more specifically > > benchmarks/kernels/benchmark_swap_blocks.py > > ``` > + from light_vllm.utils import is_pin_memory_available > + pin_memory = is_pin_memory_available()...