Grigor Nalbandyan comments

Repositories
Issues
Comments

Results 4 comments of


                                            Grigor Nalbandyan

Pruning MViT

Hi @VainF did you look at this issue?

Reproducing paper results

I am using 'yahma/llama-7b-hf'

Reproducing paper results

With the checkpoint you specified, I could replicate the metrics. Do you know what is the difference between those 2? I thought there is one LLama and the checkpoints should...

Reproducing paper results

I checked both the model and the tokenizer. Model weights and tokenizer.get_vocab() are the same, but there is the difference of special tokens - for baffo32 all three special tokens...