Oleksiy Ostapenko
Hello, I am interested in the downstream 0-shot performance on the Super-NI test tasks. For the model downloaded from Hugging Face (https://huggingface.co/tloen/alpaca-lora-7b), I got 38 ROUGE-L points on the Super-NI test tasks....
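For reference, here is a minimal sketch of how a ROUGE-L score like the one above can be computed, assuming whitespace tokenization, lowercasing, and the plain F1 form of the metric. The function names `lcs_length` and `rouge_l_f1` are hypothetical helpers written for illustration; the official evaluation script may tokenize and normalize differently, so numbers from this sketch are not guaranteed to match it.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of two token lists."""
    # Row-by-row dynamic programming; only the previous row is kept.
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l_f1(prediction, reference):
    """ROUGE-L F1 between two strings (assumed: lowercase, split on whitespace)."""
    pred = prediction.lower().split()
    ref = reference.lower().split()
    if not pred or not ref:
        return 0.0
    lcs = lcs_length(pred, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(pred)
    recall = lcs / len(ref)
    return 2 * precision * recall / (precision + recall)


# Scores are often reported on a 0-100 scale, averaged over all test examples.
examples = [("the cat sat", "the cat sat"), ("a b c d", "a c")]
mean_score = 100 * sum(rouge_l_f1(p, r) for p, r in examples) / len(examples)
print(round(mean_score, 2))
```

Comparing per-example scores from a sketch like this against the scores from the evaluation script can help tell whether a gap comes from the model outputs or from the metric's preprocessing.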
Possible bug: in ewc_in_rl.py, even though I set max_steps=100 (line 303), it still runs for many more steps.
Hello, in the following code the results returned by `triton.ops.blocksparse.matmul` and `torch.einsum` do not match (please note that `layout` consists of all ones). My understanding is that both outputs should be the...