Thomas Germer

Results 141 comments of Thomas Germer

> Is the lecture online? It is not, unfortunately. But you could ask if you can have the slides: https://dbs.cs.hhu.de/mitarbeiter.php?id=ludhim

I just noticed that I had an old and a new version of the slides. The new version included full citations: ![1](https://user-images.githubusercontent.com/18725165/109773633-ba746800-7bff-11eb-951f-21c2be1f8fa3.png) ![2](https://user-images.githubusercontent.com/18725165/109773636-bb0cfe80-7bff-11eb-86c8-2cec14d744d2.png) ![3](https://user-images.githubusercontent.com/18725165/109773638-bb0cfe80-7bff-11eb-831d-5577e01a201c.png) ![4](https://user-images.githubusercontent.com/18725165/109773640-bba59500-7bff-11eb-9939-67a83a7e6da6.png) ![5](https://user-images.githubusercontent.com/18725165/109773641-bba59500-7bff-11eb-9bdb-414844a0af3e.png) ![6](https://user-images.githubusercontent.com/18725165/109773642-bc3e2b80-7bff-11eb-938b-29185d8909eb.png)

I think there was some issue with literature where `m` was used differently depending on author. Not sure though if this was the exact issue. The derivation went something like...

> right now the workaround is to use the new `/apply-template` endpoint in llama-server, added in a recent commit. It's explained here: https://github.com/ggerganov/llama.cpp/tree/master/examples/server#post-apply-template-apply-chat-template-to-a-conversation Great! With this new `/apply-template` endpoint, we...

> The feature already exists in the form of custom [GBNF grammars](https://github.com/ggerganov/llama.cpp/blob/master/grammars/README.md)! Great! It works! ```python import requests url = "http://localhost:8080/v1/chat/completions" def prefix_using_grammar(): prefix = "```go\nfunc quacksort" data = {...

@ggerganov Could you please reopen this issue? [The grammar-workaround](https://github.com/ggml-org/llama.cpp/issues/11536#issuecomment-2643444612) works, but a more efficient solution is possible.

> this is solved by [#13174](https://github.com/ggml-org/llama.cpp/pull/13174) ~~Do you have an example how to use this? I can only see an example for `/apply-template`.~~ EDIT: It seems like assistant answers are...

Above the equation, it says: "Note that $\beta_\text{max}$ is defined as a small value". In addition, $\beta_t < \beta_\text{max}$ ("∵" means "because"). Therefore, $\beta_t$ is also a negligibly small value....

This issue is caused by Numba and has to be fixed there eventually. * https://github.com/numba/numba/issues/5520 * https://github.com/numba/numba/issues/5275 Until that happens, it might be possible to hide the warning by setting...