unitxt icon indicating copy to clipboard operation
unitxt copied to clipboard

🦄 Unitxt: a python library for getting data fired up and set for training and evaluation

Results 201 unitxt issues
Sort by recently updated
recently updated
newest added

Closes: https://github.com/IBM/unitxt/issues/1517

Safety benchmark comprised of AttaQ, ProvoQ, AirBench, and AILuminate, all with Granite Guardian as judge.

New metric definitions for llama-3-3-70b as judge in Arena Hard benchmark * Added metric definitions for llama-3-3-70b as judge in Arena Hard benchmark supporting: * WML Inference Engine * Generic...

Repeating #1945 here again. To the letter. It seems to me that github could not digest the late addition of `.github/actions/install-internal-pip/action.yml` so repeated here, when that action is already in.