Siniša Stanivuk

Results 2 issues of Siniša Stanivuk

Adding TruthfulQA benchmark for Serbian language (although it could easily be changed to Croatian). Dataset isn't mine, so the shoutout goes to @jon-tow! Here are a couple of examples to...

Add OZ Eval task for evaluating General Knowledge of LLMs in Serbian. More can be seen [DjMel/oz-eval](https://huggingface.co/datasets/DjMel/oz-eval).