Question about supervised training process

Open fastwalker1118 opened this issue 6 months ago • 1 comments

I’m experimenting with fine-tuning a KataGo model on my own games to capture my personal play style. My plan is to start from a pretrained network roughly at my strength and then run supervised learning over a directory of my human-played SGF files.

When I tried to convert my SGFs into the .npz format using ./katago writetrainingdata, I discovered that it was designed for AI self-play data (it seems to expect MCTS-generated Q-value targets or a rated-games CSV). Pointing it at pure human SGFs triggers assertion failures and other bugs. I haven’t found any branch, flag, or script in the repo that handles “human-only” SGF → .npz conversion. Am I overlooking something, or is the only path forward to patch the existing C++ data-writing code?

In addition, I would also appreciate any insights or experience on human supervised tuning in general (any problems i might run into after having npz training data)!

Jun 24 '25 00:06 fastwalker1118