Dmitrii Gavrilov
Dmitrii Gavrilov
> > My notebook: Windows 11 Pro 23H2 Intel i7-8750H GeForce GTX 1050Ti (Mobile) 32GB RAM (2666GHz) > > > > After I removed the mention of flash_atn in gemma.py,...
> > Не могли бы вы поделиться изменениями в main.py? > > model_path = "./models/models--mustafaaljadery--gemma-2B-10M" #tokenizer = AutoTokenizer.from_pretrained(model_path) tokenizer = AutoTokenizer.from_pretrained(model_name, cache_dir="./models") model = GemmaForCausalLM.from_pretrained( #model_path, model_name, cache_dir="./models", torch_dtype=torch.bfloat16 )...
> i can't get past GemmaModel.forward() got an unexpected keyword argument 'cache_position' I "solved" this problem, but it turned out not to be the end look at my issue of...