Luca Beurer-Kellner

149 comments by Luca Beurer-Kellner

This is how caching is actually implemented with OpenAI. However, with `sample` I think caching does not apply, since it is typically not seeded. With the `seed` parameter, it could...
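The seed-dependence of caching can be sketched as follows. This is a hypothetical client-side cache, not LMQL's actual implementation; the `cache_key` / `cached_generate` names and the injected `generate` callable are made up for illustration.

```python
import hashlib
import json

# Hypothetical cache: a response is only reusable when the request is fully
# deterministic, i.e. when a seed is supplied alongside the other parameters.
_cache: dict = {}

def cache_key(prompt, params):
    # Without a seed, sampling is non-deterministic, so caching does not apply.
    if params.get("seed") is None:
        return None
    payload = json.dumps({"prompt": prompt, **params}, sort_keys=True)
    return hashlib.sha256(payload.encode()).hexdigest()

def cached_generate(prompt, params, generate):
    key = cache_key(prompt, params)
    if key is not None and key in _cache:
        return _cache[key]
    result = generate(prompt, params)  # fall through to the real API call
    if key is not None:
        _cache[key] = result
    return result
```

With a `seed` set, repeated identical requests hit the cache; without one, every call goes to the model.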

State handling looks good now, thanks. I just wanted to test with `"meta-llama/Llama-2-7b-chat-hf"`; however, it seems the start/end tag extraction does not work very well there. First of all, some...

The problem with the Llama template seems to be that it does not allow all tag interleavings that are expressible with our current `{:role}` syntax. If users specify e.g. two system...
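To illustrate the mismatch, here is a simplified sketch of Llama-2-style chat rendering (the `<<SYS>>`/`[INST]` tags follow the published Llama-2 chat format, but this renderer is a rough simplification): the template only has a single slot for a system block, so a second system message, while expressible with `{:role}` tags, has nowhere to go.

```python
def render_llama2(messages):
    """Simplified Llama-2 chat rendering: the template supports only a
    single system block, fused into the first user turn."""
    system_msgs = [m["content"] for m in messages if m["role"] == "system"]
    if len(system_msgs) > 1:
        # Two system blocks are expressible with {:role} syntax, but the
        # Llama-2 template has no position for a second one.
        raise ValueError("Llama-2 template supports at most one system message")
    sys_block = f"<<SYS>>\n{system_msgs[0]}\n<</SYS>>\n\n" if system_msgs else ""
    out, first_user = [], True
    for m in messages:
        if m["role"] == "user":
            prefix = sys_block if first_user else ""
            out.append(f"<s>[INST] {prefix}{m['content']} [/INST]")
            first_user = False
        elif m["role"] == "assistant":
            out.append(f" {m['content']} </s>")
    return "".join(out)
```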

Thanks for suggesting this, it is definitely a good idea. I will keep this issue open to track progress/work on this.

Marking this as a good first issue. A query request builder would just construct an LMQL string, that is then passed to the LMQL compiler to produce a callable LMQL...
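A minimal sketch of what such a builder could look like; the `LMQLQueryBuilder` class and its method names are invented for illustration, and the final compile step (handing the string to the LMQL compiler, e.g. via `lmql.query`) is assumed rather than shown.

```python
class LMQLQueryBuilder:
    """Hypothetical fluent builder that assembles an LMQL program as a string,
    to be handed to the LMQL compiler afterwards."""

    def __init__(self):
        self._decoder = "argmax"      # decoder clause, e.g. argmax/sample
        self._prompt_parts = []       # prompt strings, possibly with [VARS]
        self._model = None            # model identifier for the from-clause
        self._constraints = []        # where-clause constraints

    def decoder(self, name):
        self._decoder = name
        return self

    def prompt(self, text):
        self._prompt_parts.append(text)
        return self

    def model(self, name):
        self._model = name
        return self

    def where(self, constraint):
        self._constraints.append(constraint)
        return self

    def build(self):
        lines = [self._decoder]
        lines += [f'    "{p}"' for p in self._prompt_parts]
        if self._model:
            lines.append(f'from "{self._model}"')
        if self._constraints:
            lines.append("where " + " and ".join(self._constraints))
        return "\n".join(lines)
```

The resulting string would then be compiled into a callable query, which is the step the issue is about.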

I just pushed support for "openai/gpt-4-1106-preview" to `main`, which should now work out of the box. For other models that raise a similar issue, you can now also specify that...

Thanks for reporting this. Any more concrete way to reproduce this would be much appreciated. I have seen this kind of issue before and it typically is caused by unexpected...

Unfortunately, current chat templates are very different between models, which makes it hard to support them all under a unified abstraction. However, what is always possible is to simply include...
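The "include the template yourself" workaround can be sketched like this, using Llama-2's documented tags as an example (other models use different tags, so this prompt string is model-specific by design):

```python
# Embed the model's chat-template tags directly in the prompt string,
# instead of relying on a unified chat abstraction.
system = "You are a concise assistant."
user = "Summarize LMQL in one sentence."

prompt = (
    "<s>[INST] "
    f"<<SYS>>\n{system}\n<</SYS>>\n\n"
    f"{user} [/INST]"
)
```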

Thanks for reporting this. `@dataclass` constraints are still in preview and a work-in-progress, so this is valuable feedback. Thanks also for the links.

For the first one I don't get warnings about logit bias limitations, but the second one is due to the integer constraining. This is an API limitation from the OpenAI...
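The integer-constraining issue can be illustrated with a hypothetical logit-bias construction: restricting output to digit tokens needs one `logit_bias` entry per allowed token, and the API only accepts a limited number of such entries per request, so broader constraints cannot be expressed this way. The `token_ids_for` tokenizer callable and the `max_entries` cap below are stand-ins for illustration, not real API values.

```python
def digit_logit_bias(token_ids_for, max_entries):
    """Build an OpenAI-style logit_bias map that strongly favors digit
    tokens. Raises if the constraint needs more entries than the API
    accepts per request -- the limitation referred to above."""
    bias = {}
    for digit in "0123456789":
        for tid in token_ids_for(digit):
            bias[tid] = 100  # a large positive bias restricts sampling
    if len(bias) > max_entries:
        raise ValueError(
            f"constraint needs {len(bias)} logit_bias entries, "
            f"but the API allows only {max_entries}"
        )
    return bias
```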