Piotr Mardziel comments

Results 27 comments of


                                            Piotr Mardziel

[BUG] Use Anthropic Claude 3 Opus as Feedback Provider

Hi; is the issue coming from litellm? Can you check whether litellm without the use of trulens operates as expected on the models you want to run and whether litellm's...

[BUG] Use Anthropic Claude 3 Opus as Feedback Provider

Looks like these models don't like having messages with only role of "system" and will complain if no "user" role is given.

[BUG] Use Anthropic Claude 3 Opus as Feedback Provider

That is, the problem is with the prompt we are giving to evaluate some of these eval metrics. Will look more into it.

[BUG] Use Anthropic Claude 3 Opus as Feedback Provider

Does not work: ```python input_dict = {'messages': [{'content': 'You are a RELEVANCE grader; providing the relevance ' 'of the given CONTEXT to the given QUESTION.\n' 'Respond only as a number...

"RuntimeError: cannot reuse already awaited coroutine"

Having trouble replicating on main. @kouskouss can you let me know if the issue persists on the main branch of trulens_eval and the versions of pip packages and python you...

"RuntimeError: cannot reuse already awaited coroutine"

Problem may be related to https://github.com/truera/trulens/issues/639 . Looking for a fix.

LLM: Standardized fields for LLM Security and protection [Discussion]

Do all of the attributes that end in "score" have a single consistent and agreed upon definition of how such scores are computed?