过拟合

Results 27 comments of 过拟合

最后开源出来的Belle.train.json格式是已经将insttruct和input拼接到一起的格式了。

> Please give us the output of `accelerate env` and how you are creating your DataLoaders and Dataset (rough code will work) The key issue is not with the dataset...

You can reproduce my issue by doing the following: my dataset tokenizes all data during loading, which takes longer than 30 minutes. I then set the waiting time to be...

> ```shell - `Accelerate` version: 0.19.0.dev0 - Platform: Linux-4.19.96-x86_64-with-glibc2.10 - Python version: 3.8.13 - Numpy version: 1.22.4 - PyTorch version (GPU?): 2.0.0+cu117 (True) - System RAM: 503.82 GB - GPU...

> How are you creating your `Accelerator` object and the `Dataset`? Is it an `IterableDataset`? ```python class customer_dataset: def __init__(self,df): self.df = pd.read_csv(df) self.text = self.df['text'].tolist() self.all_data =tokenizer(self.text) # tokenizer...

> Hello @bestpredicts, as the config has `zero3_init_flag` set to True, it results in DeepSpeed using default timeout only. And you have mentioned the correct issue with respect to this....