Oleksiy Ostapenko

Results 3 issues of Oleksiy Ostapenko

Hello, I am interested in the downstream 0-shot performance on the super-NI test tasks. For the model downloaded from Hugging Face (https://huggingface.co/tloen/alpaca-lora-7b), I got 38 ROUGE-L points on the super-NI test tasks....

Possible bug: in ewc_in_rl.py, even though I set max_steps=100 (line 303), it still runs for many more steps

bug

Hello, in the following code the results returned by `triton.ops.blocksparse.matmul` and `torch.einsum` do not match (please note that `layout` consists of all ones). My understanding is that both outputs should be the...