Brett Young
Brett Young
Hey, yeah I will try this out when I get a chance.
Yes, I can confirm your script does work for logging videos.
Im looking to run an experiment on with my own alignment method and compare it against the instruct model, so I need to start with the SFT model. The script...
Ah interesting. What's the difference?
Ok thanks. I think this would be useful to others if this was mentioned on the non hf model pages. (Maybe it already is an I missed it)
I have read the CLA Document and I hereby sign the CLA
any updates on this?
yeah, with claude 3.5 sonnet
yeah, using ` metrics = [ LLMContextRecall(), FactualCorrectness(), Faithfulness(), LLMContextPrecisionWithoutReference(), NoiseSensitivity(), ResponseRelevancy(), ContextEntityRecall() ] results = evaluate(dataset=eval_dataset, metrics=metrics, llm=evaluator_llm, run_config=my_run_config) ` with the LangchainLLMWrapper(ChatAnthropic(model="claude-3-5-sonnet-20240620")) model gives the error on some...
nice! Yeah a max_token error would be very helpful!