minGPT
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
https://github.com/karpathy/minGPT/blob/37baab71b9abea1b76ab957409a1cc2fbfba8a26/projects/adder/adder.py#L118 https://github.com/karpathy/minGPT/blob/37baab71b9abea1b76ab957409a1cc2fbfba8a26/projects/adder/adder.py#L89
Adds a three-line method that uses `namedtuple` to create frozen configs for anyone who wants to avoid footguns. Checks off the 'todo' item in the config. Elements of the config are still...
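This is not the actual PR code, but a minimal sketch of the idea: converting a mutable attribute-style config into a stdlib `namedtuple`, which makes later accidental mutation raise an error. The `Config` class and `to_frozen` name here are illustrative assumptions, not minGPT's real API.

```python
from collections import namedtuple

class Config:
    """Stand-in for a mutable attribute-bag config (hypothetical)."""
    def __init__(self, **kwargs):
        self.__dict__.update(kwargs)

    def to_frozen(self):
        # Build a namedtuple type from the current attribute names,
        # then instantiate it with the current values. namedtuples are
        # immutable, so assignment to a field raises AttributeError.
        keys = sorted(self.__dict__)
        Frozen = namedtuple("FrozenConfig", keys)
        return Frozen(**self.__dict__)

cfg = Config(n_embd=48, n_head=3, block_size=6)
frozen = cfg.to_frozen()
print(frozen.n_embd)  # attribute access still works as before
```

After freezing, `frozen.n_embd = 64` raises `AttributeError`, which is exactly the footgun the PR description is guarding against.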
For example, chinese or japanese?
How can I run a trained model? Including projects/adder/model.pt. Running tests/test_huggingface_import.py directly reports: File ".\minGPT\master\mingpt\model.py", line 202, in from_...
It seems that the output of this block is simply reshaped from multiple heads. In the original "Attention Is All You Need" paper, it seems that there is another linear...
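For reference, the paper's multi-head attention does end with an extra linear projection (W^O) after the heads are concatenated, and minGPT applies one as well (`self.c_proj` in `CausalSelfAttention`). A numpy sketch of that tail end, with assumed shapes (B batch, nh heads, T tokens, hs head size, C = nh * hs):

```python
import numpy as np

B, nh, T, hs = 2, 3, 6, 16
C = nh * hs                                  # 48

y = np.random.randn(B, nh, T, hs)            # per-head attention outputs
# "Reshape from multiple heads": move the head axis next to the head
# size and merge them, i.e. concatenate the heads along the channel dim.
y = y.transpose(0, 2, 1, 3).reshape(B, T, C)

# The additional linear from the paper (W^O); in minGPT this is c_proj.
W_o = np.random.randn(C, C)
out = y @ W_o                                # final output projection
print(out.shape)  # (2, 6, 48)
```

So the reshape alone is not the whole story: the mixing across heads happens in this final projection.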
A generator REPL enables interaction with the adder model checkpoint.
Issue: Renaming transformer.h into transformer.l #94. URL: https://github.com/karpathy/minGPT/issues/94. Fix: Renamed transformer.h to transformer.l at line 270 of model.py and added comments.
In model.py, layer norm is implemented as: self.ln_1 = nn.LayerNorm(config.n_embd). If batch_size = 64, block_size = 6, and embedding_size = 48, then the shape of the input is [64, 6,...
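A sketch of what `nn.LayerNorm(config.n_embd)` computes on such an input, written in numpy under the shapes quoted above: with `normalized_shape = n_embd`, normalization happens over the last dimension only, so each of the 64 × 6 token vectors is normalized independently and the output shape is unchanged.

```python
import numpy as np

B, T, C = 64, 6, 48                      # batch_size, block_size, n_embd
x = np.random.randn(B, T, C)

mean = x.mean(axis=-1, keepdims=True)    # shape (64, 6, 1)
var = x.var(axis=-1, keepdims=True)      # shape (64, 6, 1)
eps = 1e-5
y = (x - mean) / np.sqrt(var + eps)      # shape unchanged: (64, 6, 48)
# PyTorch then applies a learnable scale and shift,
# both of shape (48,), broadcast over the first two dims.
print(y.shape)
```

This is why `nn.LayerNorm(config.n_embd)` takes only the embedding size: batch and sequence dimensions are irrelevant to the normalization.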