tanzhipeng

Results 5 comments of tanzhipeng

command: torchrun --nnodes=1 --nproc_per_node=1 scripts/train.py configs/opensora/train/16x256x256.py --data-path /home/tanzhipeng/open-sora/Open-Sora/datasets/travel.csv changes in code : ![image](https://github.com/hpcaitech/Open-Sora/assets/51696454/ac820ffd-6ea8-4938-8e5d-c2d70904988e)

When I set use_reentant = False,I got the same error. ![image](https://github.com/hpcaitech/Open-Sora/assets/51696454/e7fb3427-f0e5-4a87-90f6-78daa8b76096) ![image](https://github.com/hpcaitech/Open-Sora/assets/51696454/74540705-36fb-4be8-975a-0571f2f30613)

![image](https://github.com/hpcaitech/Open-Sora/assets/51696454/20f36281-a494-4735-a0ac-e95e826c1233) ![image](https://github.com/hpcaitech/Open-Sora/assets/51696454/16bb1cd0-ec1a-465d-b065-dba2d30761ce)

> 我是把acceleration下的空的init.py考到shardformer目录下的。 我按照你的操作,仍然会报No module named 'opensora.acceleration.shardformer'

> May I know your outputs of `nvidia-smi`? The error codes of `-4` are often flagged with Out-Of-Memory (OOM) issue. Hi, is there any update about this issue?