ThoughtSource issues

mawps dataset cot incorrect

The COT of id 3 is partially the COT of id 2. Something is mixed up... { "id":"2" "ref_id":"" "question":"One pencil weighs 28.3 grams. How much do 5.0 pencils weigh?"...

KonstantinHebenstreit

New CoT Dataset Report: CoT-Collection

https://github.com/kaistAI/CoT-Collection Dataset accompanying the paper "The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning", including 1.88M CoT rationales extracted across 1,060 tasks" - https://arxiv.org/abs/2305.14045

chunhuizng

Source mode not working for new datasets

1

Loading datasets med_qa_open and the MMLU datasets does not work in source view. (It is working in thoughtsource view, so generating CoTs etc is working.)

KonstantinHebenstreit

krippendorff scores

do not correct for negative values when bootstrapping.

KonstantinHebenstreit

Annotator "star" feature does not work properly

The feature with the star (key: "preferred") does not really work . (The primary I think the primary problem is, that it saves values as bool, when the have to...

KonstantinHebenstreit

Add "_annotated" subscript to downloaded file from annotator

elmaestrobert

Annotator does not allow unselecting annotations

elmaestrobert

evaluation for duplicated answer choices

1

In datasets are sometimes examples with 4 or 5 answer choices. I think what has been done is just to duplicate one of the answer choices to always have 5...

KonstantinHebenstreit

throw error if from_json is used on already loaded collection

this is right: collection = Collection.from_json("...") this should throw an error in the second line: collection = Collection([dataset]) collection.from_json("...")

KonstantinHebenstreit

change saving of default template

When items are created, we do not have to save the template every time. We can just define it as 'default' or standard and save it somewhere, e.g. in the...

KonstantinHebenstreit

enhancement

ThoughtSource
ThoughtSource copied to clipboard

Metadata

mawps dataset cot incorrect

New CoT Dataset Report: CoT-Collection

Source mode not working for new datasets

krippendorff scores

Annotator "star" feature does not work properly

Add "_annotated" subscript to downloaded file from annotator

Annotator does not allow unselecting annotations

evaluation for duplicated answer choices

throw error if from_json is used on already loaded collection

change saving of default template

← Metadata

Owner

Metadata

ThoughtSource ThoughtSource copied to clipboard

Metadata

← Metadata

Owner

Metadata

ThoughtSource
ThoughtSource copied to clipboard