Davide Macario
Davide Macario
When running `train_gpt2.py`, I get all 16 output tokens equal to "!" (token 0). Here is the complete output: ```text ❯ python3 train_gpt2.py using device: mps loading weights from pretrained...
Since I believe this project is being used a lot in academia/research, especially as a way to explain and showcase LLMs and the decoder-only Transformer architecture, I think it needs...
When running ```python python sample.py --init_from=gpt2 --num_samples=2 --max_new_tokens=100 ``` having set `device = 'mps'` on my M1 Pro MacBook (MacOS 14.4), with Torch 2.2.1 and 2.2.0, I get this output:...
The file `/litgpt/prompts.py` was missing the mapping between the `llama3` model name and the `Llama3` PromptStyle class in the variable `prompt_styles`.