self-correction topics

Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.

IAAR-Shanghai

attention-head

chain-of-thought

data-augmentation

decoding

learning-from-rewards-llm-papers

59

Stars

2

Forks

59

Watchers

A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward models and learning strategies across training, inference, and post-...

bobxwu

guided-decoding

large-language-models

llm

llms