What is the purpose of ACT if turned off during evaluation / inference?

Open FreddieK opened this issue 5 months ago • 2 comments

I'm trying to wrap my head around this paper, and one thing I find confusing in the reference repo is that ACT is only active during training. Doesn't that negate the purpose of having adaptive compute at inference time? Isn't the point to show that the model learns to stop reasoning during inference once further reasoning won't improve the result?

FreddieK, Jul 28 '25 09:07

Since batched inference with ACT is complex and requires dynamically scheduling multiple sequences, in this repository we provide the simplest version that runs to the maximum number of steps, as this does not make the results worse.

imoneoi, Jul 29 '25 06:07
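
To make the trade-off in the reply above concrete, here is a minimal toy sketch in Python. It is not the repository's code: `toy_step`, `MAX_STEPS`, the `Seq` class, and the halting rule (`q_halt > q_continue`) are all hypothetical stand-ins chosen for illustration. It only shows why running every sequence to the maximum step count is simple to batch, while per-sequence halting leaves a shrinking, irregular set of active sequences that a real implementation would have to schedule dynamically.

```python
from dataclasses import dataclass

MAX_STEPS = 8  # hypothetical cap on the number of reasoning segments


@dataclass
class Seq:
    target: float         # toy "correct answer" the state converges to
    state: float = 0.0    # toy stand-in for the model's hidden state
    steps_used: int = 0


def toy_step(seq: Seq) -> tuple[float, float]:
    """One toy reasoning segment: move the state halfway to the target.

    Returns made-up (q_halt, q_continue) scores; halting is preferred
    once the remaining error drops below 0.05.
    """
    seq.state += 0.5 * (seq.target - seq.state)
    seq.steps_used += 1
    err = abs(seq.target - seq.state)
    return -err, err - 0.1


def run_fixed(targets):
    """Simple version: every sequence runs for exactly MAX_STEPS."""
    batch = [Seq(t) for t in targets]
    for _ in range(MAX_STEPS):
        for seq in batch:  # the batch keeps the same shape at every step
            toy_step(seq)
    return [round(s.state) for s in batch], [s.steps_used for s in batch]


def run_adaptive(targets):
    """ACT-style version: each sequence stops as soon as it chooses to halt."""
    batch = [Seq(t) for t in targets]
    active = list(batch)
    while active:
        still_running = []
        for seq in active:  # the active set shrinks unevenly over time
            q_halt, q_continue = toy_step(seq)
            if q_halt <= q_continue and seq.steps_used < MAX_STEPS:
                still_running.append(seq)
        active = still_running
    return [round(s.state) for s in batch], [s.steps_used for s in batch]


targets = [1.0, 3.0, 10.0]
fixed_preds, fixed_steps = run_fixed(targets)
adaptive_preds, adaptive_steps = run_adaptive(targets)
print(fixed_preds, fixed_steps)
print(adaptive_preds, adaptive_steps)
```

In this toy setup both versions produce the same predictions, and the adaptive one just uses fewer steps on the easier inputs. That mirrors the claim that running to the maximum number of steps does not make results worse, but it is a property of this toy example, not evidence about the actual model.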

Thanks for clarifying. Do you have plans to publish the dynamic scheduling version of the code?

narvind2003, Aug 05 '25 11:08