Memtensor Research Group
Memtensor Research Group
UHGEval
[ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.
Grimoire
Grimoire is All You Need for Enhancing Large Language Models
DATG
[ACL 2024]Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs
CRUD_RAG
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
xFinder
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
ICSFSurvey
Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasoning elevation🍓 and hallucination alleviation🍄.
CTGSurvey
Controllable Text Generation for Large Language Models: A Survey
NewsBench
[ACL 2024 Main] NewsBench: A Systematic Evaluation Framework for Assessing Editorial Capabilities of Large Language Models in Chinese Journalism
Awesome-Attention-Heads
An awesome repository & A comprehensive survey on interpretability of LLM attention heads.
xVerify
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations