test-time-scaling topic

List test-time-scaling repositories

verdict

307
Stars
22
Forks
Watchers

Inference-time scaling for LLMs-as-a-judge.

learning-from-rewards-llm-papers

59
Stars
2
Forks
59
Watchers

A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-...

compute-optimal-tts

277
Stars
24
Forks
277
Watchers

Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".

GenPRM

91
Stars
2
Forks
91
Watchers

[AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".

m1

47
Stars
3
Forks
47
Watchers

[ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models

Awesome-Parallel-Reasoning

40
Stars
3
Forks
40
Watchers

Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.

MassGen

668
Stars
102
Forks
668
Watchers

🚀 MassGen is an open-source multi-agent scaling system that runs in your terminal, autonomously orchestrating frontier models and agents to collaborate, reason, and produce high-quality results. | Jo...

SGI-Bench

130
Stars
6
Forks
130
Watchers

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows