Xuan Son Nguyen comments

Results 73 comments of


Xuan Son Nguyen

Implement (properly) different chat templates in main.cpp

As I understand, your Possibility-2 uses prefix/postfix system which is discussed in #5922 . The idea lead to nowhere so I though that using postfix/prefix was a bad idea. After...

Implement (properly) different chat templates in main.cpp

> After a second thought (and also which what you suggested), my idea would be to firstly refactor the chat template system to support the postfix/prefix. This will save a...

Implement (properly) different chat templates in main.cpp

> Why not use a switch? You cannot use switch if inside the `if` statement you do some logics (for example `str_contains`). In other words, `switch` will be compiled into...

Implement (properly) different chat templates in main.cpp

> Or does C++ suffer from the same issue as C in a switch where it would need to be a integer or a character Yes, see: https://stackoverflow.com/questions/650162/why-cant-the-switch-statement-be-applied-to-strings > could...

Implement (properly) different chat templates in main.cpp

`llama.h` is C-style header file, so `std::map` cannot be placed there. We can place it in `llama.cpp` if needed. But anw it is code style and not the real implementation,...

llama : add T5 (encoder-decoder) support

@dranger003 Probably that's because GritLM uses 2 prompt templates, one is used only for text generation and one only for embedding. Can you try embedding with the template specified by...

Time Series Forecasting LLM Support

This model uses T5 architecture so we firstly need to support T5, see this issue: https://github.com/ggerganov/llama.cpp/issues/5763 Beside that, this model doesn't have a tokenizer (since time-series signal is converted directly...

Generic Chat templating code with text/json file based config; main chat updated to drive its in-prefix, in-suffix and reverse-prompt from same; chat-apply-template equivalent c-api to allow use by other codes also

At first lance, I'm not sure if it's a good idea to move the implementation completely into a separated JSON. While the good point is that it allows users to...

Generic Chat templating code with text/json file based config; main chat updated to drive its in-prefix, in-suffix and reverse-prompt from same; chat-apply-template equivalent c-api to allow use by other codes also

I've just have a look in detail for this PR. The idea seems ok (i.e. using input_prefix/input_suffix/antiprompt), but I still find the implementation is quite complicated IMO: - I still...

Generic Chat templating code with text/json file based config; main chat updated to drive its in-prefix, in-suffix and reverse-prompt from same; chat-apply-template equivalent c-api to allow use by other codes also

I understand the high level idea but sorry I really don't have time to look at the detailed implementation. While it's a good idea, IMO the chat template infrastructure should...