ai-research-automation topic
List
ai-research-automation repositories
PostTrainBench
104
Stars
11
Forks
104
Watchers
PostTrainBench measures how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours