zhihao-chen comments

Results 11 comments of


                                            zhihao-chen

trafficstars

'BertModel' object has no attribute 'set_flag'

非常感谢，计划适配官方的sentence_transformers 和transformers吗

'BertModel' object has no attribute 'set_flag'

明白了，谢谢您的回复

how to use zero-offload?

@jeffra Why can't we use zero-offload? Can you tell me the specific reason? I can successfully enable zero-offload。

Adafactor is not a supported DeepSpeed Optimizer

ok，但我现在又遇到另一个问题： │ /root/work2/work2/chenzhihao/DeepSpeed/deepspeed/runtime/zero/partition_parameters.py:673 in │ │ __init__ │ │ │ │ 670 │ │ # If we are provided an already-allocated module to prepare. │ │ 671 │ │ if...

binascii.Error: Incorrect padding

Adam，在deepspeed_config.json中指定的 "optimizer": { "type": "Adam", "params": { "lr": 0.0004, "weight_decay": 0.01, "betas": [ 0.9, 0.98 ], "eps": 1e-6 }

binascii.Error: Incorrect padding

deepspeed=0.8.3 cuda=10.2 pytorch=1.12.1

群满了，求拉我入微信群，微信号dandanshanhu，我们也在训练大模型

+1 微信号：764140207

发现语义角色标注过程可能存在内存泄漏，请确认，谢谢！

我是本打算连续处理1百万条数据的，结果只到30万就爆了

按照示例运行报错

在提供的colab上能运行，但我的本地环境不行。我的环境配置： transformers=4.26.1 pytorch=1.12.1+cu102 你们要求的环境是怎样的

按照示例运行报错

升级了torch版本后可以了。但项目主页写的，torch==1.7，transformer==4.26.1