Farook (EDev) Al-Sammarraie comments

Results 33 comments of


                                            Farook (EDev) Al-Sammarraie

Migrate from Bazel v6.3.2 to v8.4.2

@anhappdev There's a bunch of migration work I did in #1040 to get TF 2.18.0 to work, I suggest waiting until that one's merged since it'll resolve quite a few...

LLM Dataset Implementation

- [x] Datasets should provide token limits as an input

LLM Dataset Implementation

> [@farook-edev](https://github.com/farook-edev) Can you please share the dataset? So I can upload it to our CDN so that the app can download it automatically. Sorry for the delay, here are...

LLM Dataset Implementation

@freedomtan I found a way to use different dataset files per `normal`, `quick`, and `rapid` runs, but they all have to use the same dataset (AKA `mlperf::QuerySampleLibrary`). Please let me...

LLM Dataset Implementation

@anhappdev could you please upload this model to the CDN? https://drive.google.com/file/d/1cd06WLDIW7QSZVI4_9o7iAjmNl0VdHM7/view?usp=sharing Thanks!

LLM IFEval Dataset Implementation

Compare current implementation against google's example. @farook-edev

LLM IFEval Dataset Implementation

I've managed to compare the result evaluation done by our implementation against google's implementation. I found and fixed multiple bugs (I'll submit a PR with the details later). I also...

LLM IFEval Dataset Implementation

I've tested the tiny33 results for 1b and 8b. Here's what I noticed from the test: - discrepancy with (1127) did not reoccur, however, (1219) (in 8b) and (1129) (in...

LLM IFEval Dataset Implementation

@freedomtan That's very curious.. I redownloaded the [python code](https://github.com/google-research/google-research/tree/master/instruction_following_eval) and re-ran the strict comparison with python3.10 and python3.13 The difference I got from my original google test was this: 3.10...