Jin Shang

32 comments by Jin Shang

> Or to try to track things down from the other end: Does the following, super minimal example, modified from @NomiChirps', reproduce the issue for you, @js8544, or you, @NomiChirps,...

Oh, I see. My system has clang 13.0.1, which doesn't include the fix @cpsauer mentioned above. So it's expected that headers are recognized as C files.

> Just to be clear, clang_d_ 13, right? If so, good news, since clangd 14 should resolve that issue. (Seems likely that we're dealing with two different root causes here.)...

I don't mind having two separate block tables. I chose to keep them in one because that requires minimal changes to existing components. In fact it only adds a...

I agree. The current indexing scheme (by computing paddings each time) is painful and hard to read.

> The issue of NaN may arise due to numerical overflow when using FP16 for computation in these models (T5-Large).

Yeah, I noticed that transformers' original implementation of T5 also suffers...

@zhuohan123 @WoosukKwon Would you guys mind taking a look at this PR? T5 seems to be working now.

@zhuohan123 Hi, I'm interested in implementing support for encoder-decoder models. Does it require any changes other than what's listed in https://vllm.readthedocs.io/en/latest/models/adding_model.html?

@WoosukKwon @zhuohan123 Hi, my team plans to work on T5 support. We would like to ask a few questions before we start. 1. Is the vLLM team currently working or...

Update: I'm very close to finishing this. I've run T5 with vllm successfully on my local machine. I think I will be able to submit a PR in the coming...