gsm8k-ScRel
Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
Do you have any plans to release the training data?
Hi, after completing SFT and multipath reasoning, I have some questions about the data under the data/rft path in your GitHub repository. I would like to ask you how these...
It seems nobody has tried your 13b2-u13b version yet, so I may be the first. But I got 'RuntimeError: mat1 and mat2 shapes cannot be multiplied (111x5120 and 1x2560)' on...
Where can we find the file or folder named `./ckpt/gsm8k_sft_llama7b`?
Does the repo include code for augmenting GSM8K or MATH with the method described in the MuggleMath paper?
May I ask a question? In single_inference_7b_13b.py, the implementation of the `SupervisedDataset` class contains this code:
```
self.input_ids = data_dict["input_ids"] + data_dict["input_ids"][-100:]
self.labels = data_dict["labels"] + data_dict["labels"][-100:]
```
I...
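For readers following the issue above, here is a minimal sketch of what that kind of list concatenation does in plain Python. It assumes `input_ids` and `labels` are ordinary Python lists; the helper name `pad_with_tail` and the toy data are hypothetical and do not come from the repository. The sketch only shows the mechanics (the last 100 items get appended a second time) and does not claim to explain why the repository does this.

```python
def pad_with_tail(items, tail=100):
    """Return `items` followed by a copy of its last `tail` elements.

    Mirrors the pattern `x + x[-100:]` on a plain list (hypothetical helper).
    If the list has fewer than `tail` elements, the whole list is repeated.
    """
    return items + items[-tail:]


# Toy stand-in for tokenized examples (assumption: one small list per example).
input_ids = [[i] for i in range(250)]
padded = pad_with_tail(input_ids)

# The dataset grows by 100 entries, and the extra entries duplicate the tail.
print(len(input_ids), len(padded))
print(padded[250:] == input_ids[150:])
```

Because the same slice is applied to both `input_ids` and `labels`, the duplicated examples stay aligned with their labels.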
ICL implementation
How should ICL be implemented? I could not find anything related to it in the code.