zhangzai666
> Hello, may I ask where the pre-training dataset you used can be downloaded?
> > > Hello, may I ask where the pre-training dataset you used can be downloaded?
> >
> > The GLM page lists download links for some of the data, e.g. the wikitext data: https://s3.amazonaws.com/research.metamind.io/wikitext/wikitext-103-v1.zip
>
> Thank you for your answer. I would like to continue pre-training GLM-chinese, but I noticed that the dataset in the code is a .lazy file. Do you have an example of that format?
> Thank you very much for your answer. Is it this kind of format:
>
> {"title":"XXX","content":""}
> {"title":"XXX","content":""}
> {"title":"XXX","content":""}
>
> i.e. one JSON object per line?
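If it helps, here is a minimal sketch of writing and reading back such a one-JSON-object-per-line corpus (the file name and sample contents are made up for illustration, not taken from the repo):

```python
import json

corpus_path = "pretrain_corpus.jsonl"  # hypothetical file name

samples = [
    {"title": "XXX", "content": "First document body."},
    {"title": "XXX", "content": "Second document body."},
]

# Write one JSON object per line, keeping non-ASCII characters readable.
with open(corpus_path, "w", encoding="utf-8") as f:
    for sample in samples:
        f.write(json.dumps(sample, ensure_ascii=False) + "\n")

# Read it back line by line, which is how a JSON-lines corpus is usually consumed.
with open(corpus_path, encoding="utf-8") as f:
    docs = [json.loads(line) for line in f]

print(len(docs), docs[0]["title"])
```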
> ### Is there an existing issue for this?
>
> * [x] I have searched the existing issues
>
> ### Current Behavior
>
> The loss is printed at the first step; every step after that is nan.
>
> ### Expected Behavior
> ...
> > CUDA out of memory. Tried to allocate 6.28 GiB (GPU 1; 39.45 GiB total capacity; 31.41 GiB already allocated; 5.99 GiB free; 31.42 GiB reserved in total by...
> It has already been fixed; you can reload it.

Hello, thank you for your reply. I just tried loading chatyuanV2. When loading the vocabulary, you set the number of extra_id tokens to 0, so the tokenizer's vocab_size dropped by 100. But T5 pre-training needs extra_0 through extra_100, doesn't it? Shouldn't the model's embedding layer instead be enlarged to 32228 to accommodate those 100 mask tokens extra_0 through extra_100?
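For what it's worth, a minimal sketch of the alternative described above, using the Hugging Face transformers API (the checkpoint name is an assumption for illustration; substitute the actual ChatYuan-V2 path): keep the sentinel tokens in the tokenizer and resize the embedding matrix, rather than shrinking the vocabulary.

```python
from transformers import T5Tokenizer, T5ForConditionalGeneration

checkpoint = "ClueAI/ChatYuan-large-v2"  # assumed checkpoint name; replace with your own path

# Keep the 100 sentinel tokens (<extra_id_0> ... <extra_id_99>) that
# T5-style span corruption relies on during pre-training.
tokenizer = T5Tokenizer.from_pretrained(checkpoint, extra_ids=100)
model = T5ForConditionalGeneration.from_pretrained(checkpoint)

# Grow the embedding matrix to match the tokenizer instead of dropping sentinel ids.
model.resize_token_embeddings(len(tokenizer))

print(len(tokenizer), model.get_input_embeddings().weight.shape)
```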
Many thanks. Could I see a simple example of the dataset used for ChatYuan's unsupervised training, and what is used to mark the masks?
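I can't speak for the repo's actual preprocessing, but as a rough illustration of T5-style span corruption (the helper below is hypothetical, not the project's code): random spans of the input are replaced with <extra_id_*> sentinels, and the target spells out what each sentinel covered.

```python
import random

SENTINEL = "<extra_id_{}>"

def span_corrupt(words, mask_ratio=0.15, span_len=2, seed=0):
    """Toy T5-style span corruption: replace random word spans with sentinel
    tokens; the target lists each sentinel followed by the words it replaced."""
    rng = random.Random(seed)
    source, target = [], []
    i, sid = 0, 0
    while i < len(words):
        if rng.random() < mask_ratio:
            source.append(SENTINEL.format(sid))
            target.append(SENTINEL.format(sid))
            target.extend(words[i:i + span_len])
            sid += 1
            i += span_len
        else:
            source.append(words[i])
            i += 1
    target.append(SENTINEL.format(sid))  # closing sentinel, as in T5 targets
    return " ".join(source), " ".join(target)

# Use a higher ratio so this short example actually masks a span.
src, tgt = span_corrupt("今天 天气 很 好 我们 一起 去 公园 散步".split(), mask_ratio=0.3)
print(src)  # corrupted encoder input with sentinel placeholders
print(tgt)  # decoder target: each sentinel followed by the dropped words
```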
Ok, I really appreciate your answer