tmac.liang

Results 4 issues of tmac.liang

user: 最后一行的用户被忽略; crontab:不规范的空格导致panic listen:进程名为空时导致panic

**Describe the bug** we find cudaMemcpyAsync run too much time on torch profiler: 1. on aten::_local_scalar_dense ![image](https://github.com/microsoft/DeepSpeed/assets/16029804/95282d56-5940-40b9-9162-5a4fade75405) 2. on _has_inf_or_nan ![image](https://github.com/microsoft/DeepSpeed/assets/16029804/4768e574-6772-47fd-b332-43a3d167668b) **Expected behavior** reduce cudaMemcpyAsync op **ds_report output** ``` [2023-12-25...

bug
training

HI,Is there plan to upgrade megatron-lm,it can be support more feature and better performance

Long-text scenarios are quite common, and it would be of great help if they could be supported.

enhancement