makemore issues

LayerNorm eps value

Hi! thanks for this little piece of juicy code! Just for curiosity, I've noticed that in your implementation you are using `nn.LayerNorm` with the standard denominator constant `eps=1e-5`, whereas in...

guglielmocamporese

Similar stuff

To those in the know, is there possibly any newer alternatives that work better and do the same thing as this? I fear I'm missing out on something more effective....

busyammonia

Updated Installation Instructions for running locally

Updated Necessary package installation instructions before running the file $ pip install torch numpy tensorboard

pravintargaryen

Can these models also be used for classification?

3

If we had labels for these names, such as: ``` | name | is_palindrome | h_index | scrabble_score | |--------+---------------+---------+----------------| | anna | 1 | 4 | 4 | |...

hoosierEE

Simplify code for collecting + sorting of all possible characters.

2

Small code simplification: the line ``` chars = sorted(list(set(''.join(words)))) ``` can be simplified to ``` chars = sorted(set(''.join(words))) ``` because `sorted(...)` accepts any `iterable` and `set(...)` returns an `iterable`. I...

jonasreinsch

Added --input-file-encoding as a command line argument

1

I wanted to train the program on making more Swedish names. They contain special characters like Å and Ö, so I need to read the file using utf-8. On windows...

JohanNorberg

Question about MLP

[Here](https://github.com/karpathy/makemore/blob/988aa59e4d8fefa526d06f3b453ad116258398d4/makemore.py#L382) you are padding the tensor with special starting token. It looks strange to me that you are doing it inside the embedding. Isn't this strange? Aren't you supposed to...

isentropic

remove duplicate words from the dataset

hi thanks for your videos, just finished to watch the [first part](https://www.youtube.com/watch?v=PaCmpygFfXo) when I tried to intersect between the test & train datasets I noticed some names repeat in the...

iamdoron

'too' -> 'a'

I tried not to. I had to. It's the small things, right? Favoring verbosity, you could say "This is not meant to be too heavyweight a library" but that is...

johnnypeck

[Suggestion] Add a note about the training of Bengio et al. MLP

Hi @karpathy, thanks for that great repo! Maybe it would be better to note in your code that while you're training by [minimizing the CE loss](https://github.com/karpathy/makemore/blob/f61811b994280cb12ddae15ef5800baa2e3a1ca4/makemore.py#L392), Bengio actually **maximized** the...

OmriKaduri

makemore
makemore copied to clipboard

Metadata

LayerNorm eps value

Similar stuff

Updated Installation Instructions for running locally

Can these models also be used for classification?

Simplify code for collecting + sorting of all possible characters.

Added --input-file-encoding as a command line argument

Question about MLP

remove duplicate words from the dataset

'too' -> 'a'

[Suggestion] Add a note about the training of Bengio et al. MLP

← Metadata

Owner

Metadata

makemore makemore copied to clipboard

Metadata

← Metadata

Owner

Metadata

makemore
makemore copied to clipboard