Roman Lutz comments

Results: 458 comments of Roman Lutz

[FEAT] Support JSON Schema in Responses

Do you expect a different schema per message sent, or just one schema for one target? I think that's the main decision point. If it's the latter then we can...

[FEAT] Support JSON Schema in Responses

Re: Response target not working with the scorer

We don't currently enforce a JSON format; we just use the other option (I don't recall the name offhand, but switching to...

[FEAT] Support JSON Schema in Responses

> The specific bug is that the Responses API doesn't use `response_format` in the body, but the alternative I show. There are also two options: `json_object` which doesn't enforce a...

FEAT export InterpolatedThresholder as a public object and update its API (e.g., rename an argument)

> Since we are taking a private object and making it public, we should review its API to make sure we like it. Adding to the list: tests!

FEAT export InterpolatedThresholder as a public object and update its API (e.g., rename an argument)

I know some people felt strongly about this at some point 🤣 Personally, I'm happy with it as is. The naming improvement is truly a small one if at all....

FEAT replace fastchat in GCG

Indeed! Have you come across options?

BUG Console printer only prints objective scores if include_auxiliary_scores is set to True

Given the difficulty with determining which one the objective score actually is, I am leaning towards just displaying all scores and getting rid of the flag as it's not actually...

BUG Console printer only prints objective scores if include_auxiliary_scores is set to True

Currently, I would expect the difference between `include_auxiliary_scores` being True and False to be:
- True: all scores are printed
- False: only the objective scores are printed

That's not...
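
The expected semantics described in this comment can be sketched as a small filter. This is only an illustration of the stated expectation, not the printer's actual code: the `Score` class, its `is_objective` field, and the `scores_to_print` helper are hypothetical names introduced here, and the sketch assumes each score already knows whether it is the objective score (which, per the other comment on this issue, is the hard part in practice).

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Score:
    # Hypothetical stand-in for a score record; not the library's real class.
    name: str
    value: float
    is_objective: bool  # Assumption: objective scores are already labeled.


def scores_to_print(scores: List[Score], include_auxiliary_scores: bool) -> List[Score]:
    """Select scores under the expected flag semantics.

    True  -> all scores are returned.
    False -> only the objective scores are returned.
    """
    if include_auxiliary_scores:
        return list(scores)
    return [s for s in scores if s.is_objective]


example_scores = [
    Score(name="objective", value=1.0, is_objective=True),
    Score(name="auxiliary", value=0.3, is_objective=False),
]

all_scores = scores_to_print(example_scores, include_auxiliary_scores=True)
objective_only = scores_to_print(example_scores, include_auxiliary_scores=False)
```

Under this reading, setting the flag to False still prints the objective scores, which is the behavior the issue title reports as missing.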

Improving the Greedy Coordinate Gradient implementation

From what I understand, nanoGCG is for causal models, or am I misremembering? Never mind, I just came to that conclusion from a cursory glance a few days ago because of...

FEAT Beam search

Please do! Nobody is on it to the best of my knowledge.