Andrew Wang comments

Results 11 comments of


                                            Andrew Wang

pipreqs not capturing all imports

Same issue

`build_regex_from_schema`: Implementation of `pattern` disagrees with JSON schema spec

This is probably a side effect of interregular, which has implicit anchoring: https://github.com/MegaIng/interegular/issues/10

[Feature] Add support for LLama 3.1 tool use

> btw @maxdebayser this is the chat template I have built. It's not an understatement to say I have spent 4+ hours simplifying it from the original, tuning the system...

RuntimeError: Cannot convert token

Same on the Nemotron models.

Why are \n generated in the output

They're whitespace tokens (\n, \t, \d, etc) which are typically allowed. You can disable them using the `CharacterLeveParserConfig`.

torch.export-friendly data-dependent assertions in misc.py, solvers.py

@rtqichen Bumping this. I am also running into similar issues with torch export, enabling this would be very useful

Support Open Models that allow OpenAI API-style tool use & "auto" tool choice

> if the tool use token is called, pull out the tool call JSON from the rest of the response (should be array) Given that guided decoding is not enabled...

Support Open Models that allow OpenAI API-style tool use & "auto" tool choice

I also wanted to ask a question about how `tool_call_ids` are going to be handled. I know HF says: > Note that we generate a random tool_call_id here. These IDs...

Support Open Models that allow OpenAI API-style tool use & "auto" tool choice

> @aw632 thats a great question for Hermes V2 we plan to pass back a dict for tool response as follows: > > {"tool_call_id": , "name": , "content": } >...

Support Open Models that allow OpenAI API-style tool use & "auto" tool choice

Btw, it looks like the stop reason/finish reason isn't accurately displaying "tool_calls" as the OpenAI API expects (this happens with named tool use too). I can open an issue for...