
A minimal PyTorch re-implementation of OpenAI GPT (Generative Pretrained Transformer) training.

79 minGPT issues, sorted by recently updated

Generate an IG (Instagram) story image

I saw that there was no requirements.txt file, so I created one by going through all the .py files and collecting every non-built-in library they use.
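For reference, such a file would likely be short; torch is the core dependency, and regex plus requests appear in mingpt/bpe.py for fetching the GPT-2 BPE vocabulary. The exact entries below are an assumption from reading the imports, not the contributor's actual file:

```
torch
regex
requests
```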

https://github.com/karpathy/minGPT/blob/37baab71b9abea1b76ab957409a1cc2fbfba8a26/mingpt/model.py#L42 Why do we need an additional linear transformation after the MHA and before the MLP when the dimensions are the same? (I understand that this is how the initial...
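One way to see the role of that projection: after the per-head outputs are concatenated, each head occupies a disjoint slice of the embedding dimension, and the extra linear layer is what mixes information across heads before the residual add. A minimal sketch (the shapes here are illustrative, not the repo's config):

```python
import torch
import torch.nn as nn

n_embd = 48
# after multi-head attention, the per-head outputs are concatenated
# back into a (batch, seq, n_embd) tensor
y = torch.randn(1, 8, n_embd)

# c_proj mixes information across heads; without it, each head's output
# slice would only reach the MLP through its own disjoint channels
c_proj = nn.Linear(n_embd, n_embd)
out = c_proj(y)
print(out.shape)  # torch.Size([1, 8, 48])
```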

Hi, I am reading the GPT-2 paper and ran into a problem with the following phrase about the implementation: 'A modified initialization method is used to account for the accumulation on...
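For context, the "modified initialization" in GPT-2 scales the weights of the residual projections by 1/sqrt(N), where N is the number of residual layers, so the variance of the residual stream does not grow with depth. A hedged sketch of that scaling (the depth and width are illustrative; 2 * n_layer reflects two residual additions per block, one for attention and one for the MLP):

```python
import math
import torch
import torch.nn as nn

n_layer = 12  # illustrative depth, not a value taken from the repo
proj = nn.Linear(48, 48)

# GPT-2: scale residual-projection weights by 1/sqrt(N), where
# N = number of residual layers (2 per transformer block)
nn.init.normal_(proj.weight, mean=0.0, std=0.02 / math.sqrt(2 * n_layer))
print(proj.weight.std())  # close to 0.02 / sqrt(24)
```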

https://github.com/karpathy/minGPT/blob/37baab71b9abea1b76ab957409a1cc2fbfba8a26/mingpt/model.py#L64 I accidentally changed the softmax dimension to -2 instead of -1 and got incredibly low losses on both the training and validation set when using the [tiny_shakespeare](https://www.tensorflow.org/datasets/catalog/tiny_shakespeare) dataset. However,...
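A quick way to see why dim=-2 is wrong: attention weights should be normalized over the keys, so each query's row sums to 1; normalizing over dim=-2 instead distributes each key's column over the queries, the rows no longer sum to 1, and the resulting loss is not comparable to a correct model's. A small sketch with a causal mask:

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
T = 4
att = torch.randn(T, T)
mask = torch.tril(torch.ones(T, T)).bool()
att = att.masked_fill(~mask, float('-inf'))  # causal mask: no attending to the future

right = F.softmax(att, dim=-1)  # normalize over keys: each query's weights sum to 1
wrong = F.softmax(att, dim=-2)  # normalize over queries: rows no longer sum to 1

print(right.sum(dim=-1))
print(wrong.sum(dim=-1))
```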

Code, comments, and mathematics and physics books for developing CAX software, used to train miniGPT to be the AI assistant of our project, www.opendigitaltwin.top

DEMO: minGPT on tinygrad. Do not merge; demonstration only. https://github.com/ziliangpeng/tinyGPT # You like karpathy? You like geohot? You love tinyGPT! ❤️ ![image](https://github.com/karpathy/minGPT/assets/551900/72639756-e894-454c-8e1b-d4c122f196a7)

Note: the literal `<|endoftext|>` strings below were stripped by the page's HTML rendering; they are restored here based on the token IDs shown (50256 is GPT-2's end-of-text token).

```python
import torch
from mingpt.bpe import BPETokenizer

tokenizer = BPETokenizer()
print(tokenizer("<|endoftext|>"))
# tensor([[ 27, 91, 437, 1659, 5239, 91, 29]])
print(tokenizer.decode(torch.tensor([50256])))
# '<|endoftext|>'
print(tokenizer(tokenizer.decode(torch.tensor([50256]))))
# tensor([[ 27, 91, 437, 1659,...
```