Austin

Showing 116 comments by Austin

You need to extract it from an existing model. The vocab models live in the `llama.cpp/models` path and follow the `ggml-vocab-` prefix format. If any of the existing vocabs...
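For example, a minimal sketch for listing the bundled vocab files, assuming a local `llama.cpp` checkout (the helper name and the `.gguf` filename example are illustrative):

```python
from pathlib import Path

def list_vocab_models(models_dir: str = "llama.cpp/models") -> list[Path]:
    """List the bundled vocab-only models, e.g. ggml-vocab-llama-bpe.gguf."""
    return sorted(Path(models_dir).glob("ggml-vocab-*"))

for path in list_vocab_models():
    print(path.name)
```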

This is interesting. The only issue I see is that it doesn't account for FIM (Fill-in-the-Middle). Otherwise it seems alright. Something to note is that this,...

> Or are you meaning coding related models and I dont know, if they have some fill-in-the-blank or is it fill-in-the-middle

Yes, [this is what I meant](https://arxiv.org/abs/2207.14255). One of the...
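For context, FIM-trained code models expect the prompt rearranged around sentinel tokens, as in the PSM (prefix-suffix-middle) layout from that paper. A minimal sketch; the exact sentinel strings vary per model, so the names below are illustrative only:

```python
# Sentinel tokens are model-specific; these strings are illustrative only.
FIM_PREFIX = "<|fim_prefix|>"
FIM_SUFFIX = "<|fim_suffix|>"
FIM_MIDDLE = "<|fim_middle|>"

def build_fim_prompt(prefix: str, suffix: str) -> str:
    """Assemble a PSM-style prompt; the model generates the missing middle."""
    return f"{FIM_PREFIX}{prefix}{FIM_SUFFIX}{suffix}{FIM_MIDDLE}"

prompt = build_fim_prompt("def add(a, b):\n    ", "\n    return result")
```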

I have a [pattern](https://github.com/teleprint-me/text-extraction/blob/main/text_extraction/logger.py#L13) we can use to centralize logger instantiation. The implementation is really simple and flexible.

```python
LOGGER_FORMAT = "%(asctime)s - %(filename)s:%(lineno)d - %(levelname)s - %(message)s"
...
```
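A sketch of what the rest of that pattern might look like, assuming a factory function built around the shared format (the name `get_default_logger` is hypothetical here; see the linked file for the real thing):

```python
import logging

LOGGER_FORMAT = "%(asctime)s - %(filename)s:%(lineno)d - %(levelname)s - %(message)s"

def get_default_logger(name: str, level: int = logging.INFO) -> logging.Logger:
    """Create (or fetch) a logger configured with the shared format."""
    logger = logging.getLogger(name)
    if not logger.handlers:  # avoid stacking duplicate handlers on repeat calls
        handler = logging.StreamHandler()
        handler.setFormatter(logging.Formatter(LOGGER_FORMAT))
        logger.addHandler(handler)
    logger.setLevel(level)
    return logger

logger = get_default_logger(__name__)
logger.info("centralized logger instantiation")
```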

No worries. It was just a suggestion; no need to use it if undesired. As for your questions: it's builtin, with no added dependency. You're already using it.

```python
import logging
```
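A minimal usage sketch with the stdlib alone, reusing the format string from above:

```python
import logging

logging.basicConfig(
    format="%(asctime)s - %(filename)s:%(lineno)d - %(levelname)s - %(message)s",
    level=logging.DEBUG,
)
logging.getLogger(__name__).debug("no extra dependency required")
```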

Maybe this one should be prioritized for now?

`<|eot_id|>` is [End of Turn](https://github.com/meta-llama/llama3/blob/main/llama/tokenizer.py#L70). Meta always includes the [templates in their source code](https://github.com/meta-llama/llama3/blob/main/llama/tokenizer.py#L202). You should always reference it as a guide.

> The end of each message is marked by...
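For reference, the layout that template encodes looks roughly like this when assembled by hand; a sketch only, so treat Meta's `tokenizer.py` as the authoritative version:

```python
def encode_message(role: str, content: str) -> str:
    """One Llama 3 chat turn: header, body, then the end-of-turn marker."""
    return f"<|start_header_id|>{role}<|end_header_id|>\n\n{content}<|eot_id|>"

prompt = (
    "<|begin_of_text|>"
    + encode_message("user", "Hello!")
    + "<|start_header_id|>assistant<|end_header_id|>\n\n"  # cue the model to reply
)
```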

Yeah, this is why I said templates should be the responsibility of the user. It's why I always use the completions endpoint and avoid any chat template enforcement. The problem...
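For example, with the llama.cpp server's `/completion` endpoint the fully rendered prompt is supplied by the user, so no server-side template is enforced. A sketch, assuming a server running locally on port 8080; the prompt string is whatever template you choose:

```python
import json
import urllib.request

# The prompt is rendered by the user, not by any server-side chat template.
payload = {
    "prompt": "<|start_header_id|>user<|end_header_id|>\n\nHello!<|eot_id|>",
    "n_predict": 128,
}
req = urllib.request.Request(
    "http://localhost:8080/completion",  # assumes a local llama.cpp server
    data=json.dumps(payload).encode(),
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(req) as resp:
    print(json.loads(resp.read())["content"])
```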

I think this is the middle-of-the-road solution, which is good. I keep reiterating it because the tokens are dictated by the tokenizer and the settings used...
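This is easy to see by inspecting a tokenizer directly; a sketch using Hugging Face `transformers` (the model id is just an example, and a gated one at that):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Meta-Llama-3-8B-Instruct")

# The special tokens, and thus the template markers, come from the tokenizer itself.
print(tokenizer.special_tokens_map)
print(tokenizer.apply_chat_template(
    [{"role": "user", "content": "Hello!"}],
    tokenize=False,
    add_generation_prompt=True,
))
```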

I'd love to have it automated; it would be great. I forget where I stated it, but I remember reiterating that this is similar to _"Hilbert's paradox of the Grand...