lm-evaluation-harness Loglikelihood refactor

Loglikelihood refactor

Open anjor opened this issue 1 year ago • 3 comments

Not sure if utils.py is the best place for these functions. Open to starting a new utils file under models.

Dec 26 '23 13:12 anjor

Hey @anjor , thanks for taking this on, and I'm sorry it took so long to look at for me!

I agree with Lintang that the current refactor is a bit more confusing in terms of adding abstraction / redirection for someone reading the code.

I think an alternative way to execute this refactor would be to do something analogous to the BaseLM class in the old v0.3.0: implementing the skeleton code / outer loops of the 3 different request type functions in a subclass of LM that is an intermediate between the fully-abstract LM base class and a fully-implemented specific LM subclass. Then, things like HFLM could have the potential to offload some of the boilerplate to this shared location and just keep their specific machinery.

Jan 03 '24 19:01 haileyschoelkopf

All good, thanks for the comments.

Your suggestions sounds good, I will get that implemented. Any ideas on what to call the intermediate layer?

Jan 03 '24 23:01 anjor

TemplateLM perhaps?

Jan 05 '24 00:01 haileyschoelkopf

Sorry have been a bit occupied with some other stuff. Hoping to get to this soon.

Jan 11 '24 10:01 anjor

lm-evaluation-harness lm-evaluation-harness copied to clipboard

Loglikelihood refactor

lm-evaluation-harness
lm-evaluation-harness copied to clipboard