ai-research-automation topic

List ai-research-automation repositories

PostTrainBench

104
Stars
11
Forks
104
Watchers

PostTrainBench measures how well CLI agents like Claude Code or Codex CLI can post-train base LLMs on a single H100 GPU in 10 hours