Yeu-Tong Lau

Results: 1 issue by Yeu-Tong Lau

**Describe the bug**
The output logits from transformer_lens and Hugging Face differ substantially for the Gemma-2-2b-it model.

**Code example**
```python
import torch
import transformer_lens
from transformers import AutoTokenizer, AutoModelForCausalLM

device =...
```

complexity-high
implementation-inaccuracy