gemma_pytorch icon indicating copy to clipboard operation
gemma_pytorch copied to clipboard

The official PyTorch implementation of Google's Gemma models

Results 50 gemma_pytorch issues
Sort by recently updated
recently updated
newest added

While generating any text with a specified value of max_length, the generated text keeps repeating several times until the output spans the value of max_length. An example of the above...

type:support
stat:awaiting response

This is updated code should be more robust , maintainable and user-friendly.......

how to finetune with gemma model?

type:support
stat:awaiting response

Due to BPE vocabulary unable to dynamically expand after training, for finetuning, some BPE tokenizer based models such as Qwen reserved 2k extra unused tokens at the end for developers...

Question as the above title, some prompt it can work, for example, the default prompt " the meaning of the life", but the below prompt cannot work. "the self-attention is...

bug
stat:awaiting response

Will there be MPS support for the Gemma models? It would enable access to a larger community.

enhancement

It's really a wild potential to use gemma in web runtime, so please provide a web runtime version (wasm or typescript version will be better) for the users! Thanks a...

enhancement

Hi there :hugs: In collaboration with Google, we added the gemma.pytorch files to the Hugging Face Hub. This PR adds instructions to manually or programmatically download the models.