Zhenyi Lu

Results 3 issues of Zhenyi Lu

### Feature request When I use the model with `trust_remote_code=True`, I cannot directly change these remote codes because everytime I load model it will request new codes from remote hub....

zero3能够和模型并行一起用吗?我在尝试中使用 ``` config.use_flash = False config.tp_size = 4 config.ds_config = { "fp16": { "enabled": True }, "zero_allow_untested_optimizer": True, "zero_force_ds_cpu_optimizer": False, "zero_optimization": { "stage": 3, "offload_optimizer": { "device": "cpu", "pin_memory": False...

help wanted

用4卡A100-40G,加载llama-13B:报错如下 ``` torch.distributed.elastic.multiprocessing.errors.ChildFailedError: ============================================================ pretrain_llama.py FAILED ------------------------------------------------------------ Failures: ------------------------------------------------------------ Root Cause (first observed failure): [0]: time : 2023-06-28_17:45:54 host : gpu5.example.com rank : 1 (local_rank: 1) exitcode : 1 (pid:...