Bhuvanesh Sridharan
Bhuvanesh Sridharan
@symphonylyh Kindly, take a look if possible.
> Hi @ruckc, > > I was able to reproduce this issue by installing Poetry in an Ubuntu 20.04 container and creating a simple `pyproject.toml` file, such as the following:...
@zeroepoch Thanks for the prompt reply, Unfortunately, I am wanting tensorrt_llm compatible version which currently is `9.3.0.post12.dev1`. Even though tensorrt itself is correctly installed, the tensorrt-bindings and tensorrt-libs versions installed...
> The other problem I found from experimenting earlier is that Poetry is not "smart" enough to figure out the dependencies here. Since the libs and bindings packages are installed...
@Guangxuan-Xiao , Thanks a lot for the reply. Can't the same be done for Sliding Window with recomputation decoding without attention sinks? At each moment, we simply put a new...
@chenmoneygithub : Thanks for your quick response! ## Regarding performance of XML vs JSON - In a lot of our internal tests, we have found that nesting XMLs and adhering...
> This is very interesting, @Bhuvanesh09 . Thanks @chenmoneygithub for discussing it with @Bhuvanesh09 . > > Q: How will this handle Lists? Hi @okhat! I'm currently working on collecting...
Hi @chenmoneygithub and @okhat, Thank you for the discussion on this PR. I've conducted a series of experiments to provide data-driven evidence for the proposed changes, focusing on how different...
Hi @chenmoneygithub and @okhat, Hope you're having a good week. I'm just checking in on this pull request to see if you've had a chance to review the experimental data....
@chenmoneygithub : Great, thank you for the update! Please let me know if any questions come up during the review. Looking forward to your feedback.