emmanuel

Results 2 issues of emmanuel

Currently, when trying to run constrained decoding with a new grammar, we are prompted with ``` Creating DFA mask store for LlamaTokenizerFast and custom, may take more than 10 minutes....

In the following example, the constrained model fails to generate a string that matches the specified grammar: ```python import traceback import lark import torch from transformers import AutoModelForCausalLM, AutoTokenizer from...