gsm8k-ScRel

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models

Results 8 gsm8k-ScRel issues

Do you have any plans to release the training data?

Hi, after completing SFT and multi-path reasoning, I have some questions about the data under the data/rft path in your GitHub repository. I would like to ask you how these...

It seems nobody has tried your 13b2-u13b version yet, so I may be the first. But I got 'RuntimeError: mat1 and mat2 shapes cannot be multiplied (111x5120 and 1x2560)' on...

Where can we find the file or folder named `./ckpt/gsm8k_sft_llama7b`?

Does the repo include code for augmenting GSM8K or MATH using the method described in the MuggleMath paper?

Can I ask some questions? In single_inference_7b_13b.py, the implementation of the `SupervisedDataset` class contains this code:
```
self.input_ids = data_dict["input_ids"] + data_dict["input_ids"][-100:]
self.labels = data_dict["labels"] + data_dict["labels"][-100:]
```
I...

How should ICL be implemented? I could not find any related information in the code.