outlines icon indicating copy to clipboard operation
outlines copied to clipboard

Review/improve the custom tokenizers for transformers and llamacpp

Open RobinPicard opened this issue 4 months ago • 0 comments

The models Transformers and LlamaCpp include some workarounds through the use of an Outlines-defined tokenizer on top of the model's native tokenizer. Those have recently been the cause of bugs and feel a bit hacky. We should look at whether there's a way of improving that part of our codebase to ensure all edge cases are covered and have better maintainability.

RobinPicard avatar Aug 08 '25 15:08 RobinPicard