Karolina Źróbek
Results
1
issues of
Karolina Źróbek
New metric definitions for llama-3-3-70b as judge in Arena Hard benchmark * Added metric definitions for llama-3-3-70b as judge in Arena Hard benchmark supporting: * WML Inference Engine * Generic...