Michael Pang comments

Results 91 comments of Michael Pang

First "good" results

It looks like the model might be overfitting to the opening before it has a chance to learn basic chess principles like piece values. GM games might not be the...

First "good" results

Yes, sort of like AlphaZero :) That way we can also see where it plateaus and iterate faster. BTW can you comment on why you closed my PR? I can...

First "good" results

Are you referring to the "data format" issue I raised? I've already fixed all the stuff I mentioned (and a few other bugs) so after a bit more testing I...

First "good" results

Noted thanks! I hope we can share test cases too. The training loss is questionable (in my opinion) because it can just overfit the common openings, and we don't know...

No weights in ftp server?

Sure, I'll look around. Also, could you see if your company can let you use their Azure server for FTP? BTW if you change the keras.json to...

Potential collaboration

~~It looks like it's private?~~ https://github.com/crypt3lx2k/Zerofish

Problem in running run.py

You have to go up at least one directory, so that the chess_zero folder is in the working directory when you launch run.py.

Next step

We could just post all our training data publicly so anyone is free to experiment with different architectures on their hardware

How is the AI Dungeon able to generate text so quickly using the 1558M model in Collaboratory?

Comparing the two sampling implementations: https://github.com/minimaxir/gpt-2-simple/blob/ca6bc61d958fd4c474af9a412ace27279b88dd90/gpt_2_simple/src/sample.py#L41 https://github.com/AIDungeon/AIDungeon/blob/e87d13b6b011a88c2fb034617ffef2ee4fef98e3/generator/gpt2/src/sample.py#L56 There are some minor differences, like how initial loop variables are computed, but for the most part they're the same. Another source of...

How is the AI Dungeon able to generate text so quickly using the 1558M model in Collaboratory?

Lower temperature should lead to more coherent (but more repetitive) output. I believe they used a DGX-1 to fine-tune the 1.5B model.
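To illustrate the temperature point, here is a minimal sketch of temperature-scaled softmax in plain Python. It is not the code from either repo linked above; the function name `softmax_with_temperature` and the example logits are made up for illustration. Dividing the logits by a temperature below 1 sharpens the distribution, so the top token gets sampled more often, which is where the extra coherence and repetition come from.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Turn raw logits into probabilities after dividing by the temperature.

    Hypothetical helper for illustration only, not taken from gpt-2-simple
    or AIDungeon.
    """
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    # Subtract the max before exponentiating for numerical stability.
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Made-up logits for three candidate tokens.
example_logits = [2.0, 1.0, 0.0]

probs_default = softmax_with_temperature(example_logits, temperature=1.0)
probs_cold = softmax_with_temperature(example_logits, temperature=0.5)

print("T=1.0:", [round(p, 3) for p in probs_default])
print("T=0.5:", [round(p, 3) for p in probs_cold])
```

At the lower temperature the most likely token takes a larger share of the probability mass, so sampling picks it more often.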