gemma_pytorch issues

Output with higher max_length is repetition of base text

7

When generating text with a specified max_length, the output repeats itself several times until it reaches max_length. An example of the above...

azrael05

type:support

stat:awaiting response
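The repetition described in this issue can be checked for mechanically once the output is split into tokens. The following is a minimal sketch, not part of gemma_pytorch: the helper names `trailing_loop_period` and `trim_trailing_loop` and their default thresholds are hypothetical, and the sketch only post-processes output; it does not address why the model loops.

```python
from typing import List, Optional, Sequence


def trailing_loop_period(tokens: Sequence, min_repeats: int = 3,
                         max_period: int = 50) -> Optional[int]:
    """Return the shortest period p such that the last p tokens occur
    at least `min_repeats` times back to back at the end of `tokens`,
    or None if no such loop is found. (Hypothetical helper.)"""
    n = len(tokens)
    for period in range(1, max_period + 1):
        span = period * min_repeats
        if span > n:
            break
        tail = tokens[n - span:]
        unit = tail[:period]
        # The tail is a loop if every element matches the unit cyclically.
        if all(tail[i] == unit[i % period] for i in range(span)):
            return period
    return None


def trim_trailing_loop(tokens: Sequence, min_repeats: int = 3,
                       max_period: int = 50) -> List:
    """Keep one copy of a detected trailing loop and drop the repeats."""
    period = trailing_loop_period(tokens, min_repeats, max_period)
    if period is None:
        return list(tokens)
    unit = list(tokens[len(tokens) - period:])
    end = len(tokens)
    # Walk backwards while the preceding block equals the loop unit.
    while end - 2 * period >= 0 and \
            list(tokens[end - 2 * period:end - period]) == unit:
        end -= period
    return list(tokens[:end])


if __name__ == "__main__":
    # Toy whitespace-tokenised output, not real model output.
    out = "the cat sat . the cat sat . the cat sat .".split()
    print(trailing_loop_period(out))
    print(" ".join(trim_trailing_loop(out)))
```

The sketch works on any sequence of comparable items, so it can be applied to token ids as well as to whitespace-split words.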

Update config.py

1

This updated code should be more robust, maintainable, and user-friendly...

Khajaamee455

How to fine-tune the Gemma model?

8

How to fine-tune the Gemma model?

runningabcd

type:support

stat:awaiting response

Are there reserved/unused tokens for developers?

A BPE vocabulary cannot be expanded dynamically after training. To support fine-tuning, some BPE-tokenizer-based models such as Qwen reserve 2k extra unused tokens at the end for developers...

Qubitium
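One way to investigate this question is to scan the vocabulary for placeholder pieces. The sketch below is an illustration under stated assumptions: the `<unusedN>` naming pattern and the toy vocabulary are hypothetical, and whether Gemma's vocabulary actually contains such pieces is what the issue asks, not something confirmed here.

```python
import re
from typing import Iterable, List, Pattern, Tuple

# Assumed naming convention for placeholder pieces; adjust it to match
# whatever the real vocabulary uses.
UNUSED_PATTERN = re.compile(r"^<unused\d+>$")


def find_unused_pieces(pieces: Iterable[str],
                       pattern: Pattern[str] = UNUSED_PATTERN
                       ) -> List[Tuple[int, str]]:
    """Return (token_id, piece) pairs whose text matches `pattern`.

    `pieces` is the vocabulary in id order. With the sentencepiece
    package it could be built as
        [sp.id_to_piece(i) for i in range(sp.get_piece_size())]
    where `sp` is a loaded SentencePieceProcessor.
    """
    return [(i, p) for i, p in enumerate(pieces) if pattern.match(p)]


if __name__ == "__main__":
    # Toy vocabulary for illustration only, not the Gemma vocabulary.
    toy_vocab = ["<pad>", "<eos>", "<bos>", "<unk>",
                 "hello", "<unused0>", "<unused1>", "world"]
    for token_id, piece in find_unused_pieces(toy_vocab):
        print(token_id, piece)
```

Any ids found this way could be repurposed for new special tokens during fine-tuning without resizing the embedding table, which is the use case the issue describes.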

Why do some prompts not work? hidden_states become NaN after GemmaModel.forward

8

As the title says: some prompts work, for example the default prompt " the meaning of the life", but the prompt below does not: "the self-attention is...

vupjing

bug

stat:awaiting response

MPS (Apple Silicon) Support

2

Will there be MPS support for the Gemma models? It would enable access to a larger community.

dsanmart

enhancement

A web-runtime version of Gemma is really needed and would be high value

2

There is huge potential in running Gemma in a web runtime, so please provide a web-runtime version (ideally WASM or TypeScript) for users! Thanks a...

Zwe1

enhancement

keras finetuning and inference examples uploaded

Add instructions to download from Hugging Face Hub
