implement REFINER

Open sandys opened this issue 2 years ago • 0 comments

https://github.com/debjitpaul/refiner

https://arxiv.org/abs/2304.01904

LLMs Getting Fine-Grained Reasoning Feedback Improves Performance

-Critic model trained to provide feedback reasoning LLM uses to iteratively improve intermediate arguments -Easily beats baselines -Even w/ GPT3.5, critic significantly improves reasoner

Apr 26 '23 21:04 sandys