Dirk Groeneveld

Results: 200 comments by Dirk Groeneveld

Still not a big fan, but maybe add an API to the workspace that allows you to delete a cache entry, and then expose it in the CLI?

> the step in question was defined in a dependency of my project (catwalk) so I couldn’t tick the version

Why did you have to re-run it then?

The caveats in the description of this issue stand, but we can probably find sub-graphs of the whole execution graph that _always_ make sense to group together into one Beaker...

> These steps would also have to have matching step resource requirements.

Or we take the max of the resource requirements.
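A rough sketch of what "take the max" could look like when several steps are grouped into one job. The function name `merge_resource_requirements` and the resource keys (`gpus`, `cpus`, `memory_gib`) are hypothetical, chosen only for illustration; they are not taken from any existing API.

```python
from typing import Dict, Iterable


def merge_resource_requirements(
    requirements: Iterable[Dict[str, float]],
) -> Dict[str, float]:
    """Combine per-step requirements by taking the max of each resource.

    Hypothetical helper: a resource that a step does not mention is
    treated as a requirement of 0 for that step.
    """
    merged: Dict[str, float] = {}
    for req in requirements:
        for name, amount in req.items():
            merged[name] = max(merged.get(name, 0), amount)
    return merged


# Three steps that would run in the same job (made-up numbers).
grouped = merge_resource_requirements(
    [
        {"gpus": 1, "cpus": 4, "memory_gib": 16},
        {"gpus": 0, "cpus": 8, "memory_gib": 32},
        {"cpus": 2, "memory_gib": 64},
    ]
)
print(grouped)
```

The trade-off is that the grouped job reserves the largest request for its whole lifetime, so steps with small requirements hold on to resources they do not use.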

I'm closing this since it's a Docker issue, not an AllenNLP issue.

This is good, but I want to run that separately first to see if it makes a difference. We'll have to wait a bit to get cluster time.

Let's keep this on hold for a bit, but if it gets too long, we'll merge it as a separate config.

I just ran this on Beaker, and it said this:

```
RuntimeError: When using the full_megatron init, every module must have a type.
```

coming from `/home/dirkg/LLM/olmo/model.py:826`. Can you find...

I wouldn't mind putting in some reasonable lower bounds if it's just to keep the search times down. I wouldn't want to be too tight though, because that makes it...

I used this script: https://github.com/allenai/LLM/blob/2d4b62d6978e869f700a15268fa6302d78718a06/scripts/v1-mix-medium-on-lumi.sh The command line was just `sbatch scripts/v1-mix-medium-on-lumi.sh --load_path=`.