bocoel icon indicating copy to clipboard operation
bocoel copied to clipboard

Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few lines of modular code.

Results 6 bocoel issues
Sort by recently updated
recently updated
newest added

Currently, the factories are using enums. This isn't scalable.

refactor

Rather than using Bayesian, use MCTS to explore the search space.

enhancement

I have decided to archive the repo. I am not feeling particularly motivated to work on this, and it has been inactive for a while now. For: #26 #29 :...

wontfix

Since the optimizers mostly different from their sampling strategy, this change would make it more obvious.

enhancement
architecture

Hi RenChu! This project looks really neat! I've looked through the docs, but I was wondering if you have any preliminary tech report that explains a bit more about how...