flybird11111
flybird11111
Thanks, I'll look into it.
Hi, I haven't been able to reproduce this issue. From the code, it seems that part_norm itself is of type float.
Hi, can you identify where it is getting stuck? I'm unable to reproduce this issue.
You can set enable_all_optimization to false. And configure other optimizations as needed.
Thank you for your feedback; we will address this issue as soon as possible.
Hi, could you please share the traceback? I'll look into this issue.
Hi, Please install apex from https://github.com/NVIDIA/apex, or set enable_fused_normlization to False.
Maybe the version of apex is not correct, can you have a try that "from apex.normalization import FusedRMSNorm"
> And I saw this prompt in `examples/language/llama2/scripts/benchmark_70B/3d.sh` > > ``` > # TODO: fix this > echo "3D parallel for LLaMA-2 is not ready yet" > ``` > >...
> > > And I saw this prompt in `examples/language/llama2/scripts/benchmark_70B/3d.sh` > > > ``` > > > # TODO: fix this > > > echo "3D parallel for LLaMA-2 is...