Aaron Friel comments

Results: 199 comments of Aaron Friel

[RFC]: Reimplement and separate beam search on top of vLLM core

@youkaichao How well does the KV cache handle a block size of 1, in terms of compute or memory overhead?

[RFC]: Pinned Caching with Automatic Prefix Caching (Related to Anthropic Prompt Caching API)

Would you support pinned caching in the OpenAI chat completion API in the same manner as Anthropic? I'm not sure how you would support this for text completion APIs but...

[RFC]: Pinned Caching with Automatic Prefix Caching (Related to Anthropic Prompt Caching API)

@llsj14 The first thing that comes to mind is that explicit APIs add a new dimension to metering and cost analysis, but also security. If it's efficient to check if...

How to set a list of strings for a key in config-map

@jalmeroth thanks for reporting this. If I have this right, you have an @pulumi/actions job like so:

```
- uses: pulumi/actions@v3
  with:
    config-map:
      some-key:
        value:
          - some-value-a
          - some-value-b
```
...

Memory leak when lots of files are changed at once

@yzhang-gh, it would be great if you could identify what might cause this feedback loop with other extensions. The issues you refer to, and the two that link to this...

Scaling performance as sequences exceeds core count

I did some cursory research into the libraries available for cross-process IPC in Rust, and, well, essentially all of them add some significant degree of complexity to the implementation or...

Pulumi new fails with "not a valid zip"

## Root cause

The AI backend is throwing an error when generating the zip template. After extracting the program from the markdown of the conversation, the backend also parses the...

Pulumi new fails with "not a valid zip"

That leaves us with a few issues:

## Incorrect code generation

We can prompt and train the model to try to prevent this statistically. I'll open an issue and make...

Resource constructor result typing

Static methods also return `T` instead of `GuestT`, so this workaround doesn't work:

```wit
interface foo {
  resource bar {
    fallible-ctor: static func() -> result
  }
}
```

I think...

[RFC] Drop beam search support

As a user of guidance/AICI/other methods of constraining LLM output, disabling beam search can reduce the quality of outputs, for the reasons users describe above. We've noticed that across a wide...