ThoughtSource
ThoughtSource copied to clipboard
A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research group: https://samwald.info/
The COT of id 3 is partially the COT of id 2. Something is mixed up... { "id":"2" "ref_id":"" "question":"One pencil weighs 28.3 grams. How much do 5.0 pencils weigh?"...
https://github.com/kaistAI/CoT-Collection Dataset accompanying the paper "The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning", including 1.88M CoT rationales extracted across 1,060 tasks" - https://arxiv.org/abs/2305.14045
Loading datasets med_qa_open and the MMLU datasets does not work in source view. (It is working in thoughtsource view, so generating CoTs etc is working.)
do not correct for negative values when bootstrapping.
The feature with the star (key: "preferred") does not really work . (The primary I think the primary problem is, that it saves values as bool, when the have to...
In datasets are sometimes examples with 4 or 5 answer choices. I think what has been done is just to duplicate one of the answer choices to always have 5...
this is right: collection = Collection.from_json("...") this should throw an error in the second line: collection = Collection([dataset]) collection.from_json("...")
When items are created, we do not have to save the template every time. We can just define it as 'default' or standard and save it somewhere, e.g. in the...