Koan-Sin Tan

Results: 251 comments of Koan-Sin Tan

@farook-edev: for quantized models, it can generate repeating trailing patterns. @freedomtan: it appears to be a common issue; I'll ask around to see if we can get rid of that.

This phenomenon is called Neural Text Degeneration. There are many articles and papers discussing it, and generally, quantization can magnify Neural Text Degeneration.
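As a minimal sketch of the symptom discussed above, the following helper detects whether a generated string ends in a repeating trailing pattern. The function name and thresholds are illustrative only, not part of any benchmark or model code:

```python
def trailing_repetition(text: str, min_len: int = 2, min_repeats: int = 3):
    """Return the repeating unit if `text` ends with the same substring
    repeated at least `min_repeats` times; otherwise return None."""
    for unit_len in range(min_len, len(text) // min_repeats + 1):
        unit = text[-unit_len:]
        count = 0
        i = len(text) - unit_len
        # Walk backwards from the end, counting consecutive copies of `unit`.
        while i >= 0 and text[i:i + unit_len] == unit:
            count += 1
            i -= unit_len
        if count >= min_repeats:
            return unit
    return None

trailing_repetition("abcxyxyxyxy")   # ends with "xy" repeated 4 times
trailing_repetition("hello world")   # no trailing repetition
```

A check like this could be used to flag degenerated outputs from a quantized model for manual inspection.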

@freedomtan to verify @farook-edev's findings. If confirmed, we should report it to the Google folks (send a PR / file an issue). @farook-edev to check the 33 prompts provided by @AhmedTElthakeb for the...


> [@freedomtan](https://github.com/freedomtan) That's very curious... I redownloaded the [python code](https://github.com/google-research/google-research/tree/master/instruction_following_eval) and re-ran the strict comparison with Python 3.10 and Python 3.13.
>
> The difference I got from my original Google test...

we can use the mobile_open

language stemming: convert plurals and -ing forms back to the base form. e.g. "war" and "wars" count as "war" twice, and even "warrant" will be counted in Google's implementation (substring match, like strstr() in C). current: exact match now. strict...
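The difference between the two matching strategies mentioned above can be sketched as follows; the helper names are illustrative and this is not the actual eval code:

```python
import re

def substring_count(text: str, keyword: str) -> int:
    # strstr()-style counting: "wars" and "warrant" both match "war".
    return text.count(keyword)

def exact_word_count(text: str, keyword: str) -> int:
    # Exact-match counting: only the standalone word counts.
    return len(re.findall(rf"\b{re.escape(keyword)}\b", text))

text = "wars follow war; a warrant is not a war"
substring_count(text, "war")   # matches inside "wars", "war", "warrant", "war"
exact_word_count(text, "war")  # matches only the two standalone "war"s
```

Stemming would sit between these two extremes: "wars" would be reduced to "war" and counted, while "warrant" would not.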

@farook-edev let's test the performance model with the 33 prompts from IFEval with early stopping. Conditions: min_time, min_samples (default value = 1024? let's set it to 5), max_running (early stopping).
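A sketch of what the early-stopping condition could look like. The parameter names min_time, min_samples, and max_running come from the note above; the defaults and exact semantics are assumptions:

```python
import time

def should_stop(started_at: float, samples_done: int,
                min_time: float = 60.0, min_samples: int = 5,
                max_running: int = 1024) -> bool:
    """Stop once both minimums are met, or unconditionally at the cap.

    started_at is a time.monotonic() timestamp taken when the run began.
    """
    if samples_done >= max_running:   # hard cap on samples (assumed meaning)
        return True
    elapsed = time.monotonic() - started_at
    return elapsed >= min_time and samples_done >= min_samples
```

With min_samples lowered from 1024 to 5, a run over the 33 IFEval prompts can end as soon as the minimum runtime has elapsed, instead of looping the prompt set many times.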

It would be good if we can "group" the two datasets for one LLM (1B or 8B).

@freedomtan to check that the running order (the offline one should be the last to run) is kept unchanged.