Logic-RL-Lite

Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".