Kevin M Jablonka
this is a bit conditional on what our timeline for refactoring the codebase is
If we limit ourselves to one question per file, stuff will be a lot simpler it seems
this seems to be reused in many places _Originally posted by @kjappelbaum in https://github.com/lamalab-org/chem-bench/pull/435#discussion_r1703126065_
it would be nice if we could filter based on the tags and in this way look at very many different subsets
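A minimal sketch of what tag-based filtering could look like. It assumes each question is a dict with a `tags` list; the names `filter_by_tags`, `require_all`, and the example records are hypothetical and not taken from the codebase.

```python
def filter_by_tags(questions, tags, require_all=False):
    """Return the questions that match the given tags.

    Assumption: each question is a dict with an optional "tags" list.
    With require_all=False a question matches if it has at least one
    of the tags; with require_all=True it must have all of them.
    """
    wanted = set(tags)
    selected = []
    for question in questions:
        question_tags = set(question.get("tags", []))
        if require_all:
            keep = wanted <= question_tags
        else:
            keep = bool(wanted & question_tags)
        if keep:
            selected.append(question)
    return selected


# Hypothetical example data, only for illustration.
questions = [
    {"name": "q1", "tags": ["organic", "safety"]},
    {"name": "q2", "tags": ["analytical"]},
    {"name": "q3", "tags": ["organic", "analytical"]},
]

organic = filter_by_tags(questions, ["organic"])
organic_and_analytical = filter_by_tags(
    questions, ["organic", "analytical"], require_all=True
)
```

Keeping the any/all switch in one function would make it easy to build many different subsets from the same tag list.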
- What scripts do I have to run to obtain a final report?
- Why does `all_correct` change between reports?
- [ ] ideally also a link on chembench.org
- [ ] there should also be one in the README
- UUIDs in the style of what we have for the chem-bench app might be nice (or what wandb does)
- https://github.com/nebbles/hruid-python
- https://github.com/orf/human_id
- If we do datetime +...