EdgeChains icon indicating copy to clipboard operation
EdgeChains copied to clipboard

implement REFINER

Open sandys opened this issue 2 years ago • 0 comments

https://github.com/debjitpaul/refiner

https://arxiv.org/abs/2304.01904

LLMs Getting Fine-Grained Reasoning Feedback Improves Performance

-Critic model trained to provide feedback reasoning LLM uses to iteratively improve intermediate arguments -Easily beats baselines -Even w/ GPT3.5, critic significantly improves reasoner

sandys avatar Apr 26 '23 21:04 sandys