reasoning-models topic
xVerify
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
AI-Lawyer-RAG-with-Deepseek
AI Lawyer is an intelligent reasoning legal assistant powered by DeepSeek, Ollama RAG and LangChain, designed to streamline legal research and document analysis. By leveraging retrieval-augmented gen...
deep-searcher
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
acl2025-diverse-cot
Code for the 2025 ACL publication "Fine-Tuning on Diverse Reasoning Chains Drives Within-Inference CoT Refinement in LLMs"
Awesome-Parallel-Reasoning
Awesome-Parallel-Reasoning: Unlocking the reasoning potential of LLMs. Papers, Code, Resources & Survey.
Logic-RL-Lite
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
DeepEnlighten
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with the Social IQa dataset.
pts
Pivotal Token Search
Dynasor
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning models without training.
OmniCaptioner
Official Repository of OmniCaptioner