Dirk Groeneveld comments



How to re-cache a step after modifying it?

Still not a big fan, but maybe add an API to the workspace that allows you to delete a cache entry, and then expose it in the CLI?
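A minimal sketch of what such an API could look like, assuming a dictionary-backed workspace. `InMemoryWorkspace`, `delete_cache_entry`, and the `cache-tool delete` command are hypothetical names used for illustration only; they are not Tango's actual workspace API or CLI.

```python
import argparse


class InMemoryWorkspace:
    """Hypothetical stand-in for a workspace: maps step IDs to cached results."""

    def __init__(self):
        self._cache = {}

    def put(self, step_id, result):
        """Store a result for a step."""
        self._cache[step_id] = result

    def delete_cache_entry(self, step_id):
        """Remove the cached result for a step. Returns True if one existed."""
        if step_id in self._cache:
            del self._cache[step_id]
            return True
        return False


def main(argv, workspace):
    """Hypothetical CLI wrapper: `cache-tool delete <step_id>`."""
    parser = argparse.ArgumentParser(prog="cache-tool")
    subparsers = parser.add_subparsers(dest="command", required=True)
    delete = subparsers.add_parser("delete", help="delete one cache entry")
    delete.add_argument("step_id")
    args = parser.parse_args(argv)

    removed = workspace.delete_cache_entry(args.step_id)
    # Exit code 0 if an entry was removed, 1 if there was nothing to remove.
    return 0 if removed else 1
```

With this shape, deleting the entry forces the step to run again the next time it is requested, without having to change its version.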

How to re-cache a step after modifying it?

> the step in question was defined in a dependency of my project (catwalk) so I couldn’t tick the version

Why did you have to re-run it then?

Beaker Executor should execute multiple Tango steps in one Beaker experiment

The caveats in the description of this issue stand, but we can probably find sub-graphs of the whole execution graph that _always_ make sense to group together into one Beaker...

Beaker Executor should execute multiple Tango steps in one Beaker experiment

> These steps would also have to have matching step resource requirements.

Or we take the max of the resource requirements.
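Taking the max could be sketched as follows. The `Resources` class and its fields are assumptions made for illustration, not the executor's real resource type; the idea is only that a group of steps asks for the largest value of each resource that any step in the group needs.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Resources:
    """Hypothetical per-step resource requirements."""

    cpu_count: float = 1.0
    gpu_count: int = 0
    memory_gib: float = 1.0


def merge_resources(requirements):
    """Return the field-wise maximum over a non-empty collection of requirements."""
    requirements = list(requirements)
    if not requirements:
        raise ValueError("need at least one step to merge")
    return Resources(
        cpu_count=max(r.cpu_count for r in requirements),
        gpu_count=max(r.gpu_count for r in requirements),
        memory_gib=max(r.memory_gib for r in requirements),
    )


# Two steps grouped into one experiment: one is CPU/memory heavy, one needs a GPU.
grouped = merge_resources(
    [
        Resources(cpu_count=2, gpu_count=0, memory_gib=16),
        Resources(cpu_count=1, gpu_count=1, memory_gib=8),
    ]
)
```

The trade-off is that the grouped experiment may hold resources, such as a GPU, that some of its steps never use.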

New to AllenNLP. Trying to start the Demo.

I'm closing this since it's a Docker issue, not an AllenNLP issue.

Update Llama config to use Llama block and RoPE lower precision

This is good, but I want to run that separately first to see if it makes a difference. We'll have to wait a bit to get cluster time.

Update Llama config to use Llama block and RoPE lower precision

Let's keep this on hold for a bit, but if it gets too long, we'll merge it as a separate config.

Update Llama config to use Llama block and RoPE lower precision

I just ran this on Beaker, and it said this:

```
RuntimeError: When using the full_megatron init, every module must have a type.
```

coming from `/home/dirkg/LLM/olmo/model.py:826`. Can you find...

Add bounds on dependency versions

I wouldn't mind putting in some reasonable lower bounds if it's just to keep the search times down. I wouldn't want to be too tight though, because that makes it...

Configs for LUMI ablations

I used this script: https://github.com/allenai/LLM/blob/2d4b62d6978e869f700a15268fa6302d78718a06/scripts/v1-mix-medium-on-lumi.sh The command line was just `sbatch scripts/v1-mix-medium-on-lumi.sh --load_path=`.