test-time-scaling topics

A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-...

bobxwu

guided-decoding

large-language-models

llm

llms

compute-optimal-tts

277

Stars

24

Forks

277

Watchers

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

RyanLiu112

large-language-model

o1

process-reward-model

r1

GenPRM

91

Stars

2

Forks

91

Watchers

[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

RyanLiu112

large-language-model

o1

process-reward-model

r1

m1

47

Stars

3

Forks

47

Watchers

[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models

UCSC-VLAA

llm

medical

r1

reasoning

Awesome-Parallel-Reasoning

40

Stars

3

Forks

40

Watchers

Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.

PPPP-kaqiu

awesome-parallel-reasoning

large-language-models

r1

reasoning-models

MassGen

668

Stars

102

Forks

668

Watchers

🚀 MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agents to collaborate, reason, and produce high-quality results. | Jo...

massgen

agent

agentic-ai

autonomous-agents

cli

SGI-Bench

130

Stars

6

Forks

130

Watchers

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

InternScience

agent

ai

ai-scientist

ai4science